Conserved domains on  [gi|50428933|ref|NP_001002259|]

caprin-2 isoform 1 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
620-935 1.18e-172

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C-terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C-terminal region of caprin-1 contains RGG motifs, which are characteristic of RNA-binding domains. It is possible that caprin-1 functions through an RNA-binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 509.72  E-value: 1.18e-172
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    620 LQDLMTQIQGTCNFMQESVLDFDKPS-SAIPTSQPPSATP-----GSPVASKEQNLSSQSDFLQEPLQATSSPVTCSSNA 693
Cdd:pfam12287    1 LQDLMAQIQGTYNFMQDSMLDFDKPSdSAIVSAQPPSQSPdlsqmVCPPASPEQRLSQQSDVLQQPEQTQVSPVSPSSNA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    694 ClvttdqASSGSETEFMTSET--PEAAIPP---GKQPSSLASPNPPMAKGSE-QGFQSPPASSSSVTINTAPFQAMQTVF 767
Cdd:pfam12287   81 C------ASSGSEYQFHTSEPpqPEAIDPIqssMSLPSELAPPSPPLSPASQpQVFQSKPASSSGINVNAAPFQSMQTVF 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    768 NVNAPLPPRKEQEIKE-SPYSPGYNQSFTTASTQTPPQCQLPSIHVEQTVhsqetAANYHPDGTIQVSNGSLAFYPAQTN 846
Cdd:pfam12287  155 NVNAPVPPRNEQELKEsSQYSSGYNQSFSSQSTQTVPQCQLPSEQLEQTV-----VGAYHPDGTIQVSNGHLAFYPAQTN 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    847 VFPRPTQPFVNSRGSVRGCTRGGRLITNSYRSPGGYK-GFDTYRG-LPSISNGNYSQLQFQAREYSGAPYSQRDNFQQCY 924
Cdd:pfam12287  230 GFPRPPQPFYNSRGSPRGGPRGGRGLMNGYRGPNGFKgGFDGYRGpFPNTPNGGYGQLQFQARDYSGTPYSQRDGYQQNY 309
                          330
                   ....*....|.
gi 50428933    925 KRGGTSGGPRA 935
Cdd:pfam12287  310 KRGGTQSGPRA 320
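
The summary line that precedes each alignment (Pssm-ID, Cd Length, Bit Score, E-value) has a regular layout, so it can be read programmatically. The following is a minimal sketch, assuming the line looks exactly like the ones on this page; the names `SUMMARY_RE`, `parse_summary`, and `model_coverage` are hypothetical helpers introduced for illustration and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern: field labels are copied from the summary lines on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional tag such as [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Turn one 'Pssm-ID: ...' summary line into a dictionary of typed values."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_from, model_to, cd_length):
    """Fraction of the domain model spanned by the aligned model positions."""
    return (model_to - model_from + 1) / cd_length


summary = parse_summary(
    "Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 509.72  E-value: 1.18e-172"
)
# The Cdd rows of the alignment above run from model position 1 to 320.
coverage = model_coverage(1, 320, summary["cd_length"])
```

Here the aligned model positions span the whole model, so `coverage` is 1.0; a partial hit would give a smaller fraction.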
Caprin-1_dimer pfam18293
Caprin-1 dimerization domain; This domain is found in human Caprin-1 protein. Caprin-1 plays a ...
200-315 1.79e-51

Caprin-1 dimerization domain; This domain is found in human Caprin-1 protein. Caprin-1 plays a role in many important biological processes, including cellular proliferation, innate immune response and synaptic plasticity. This domain is found in the highly conserved homologous region 1 (HR1) and is responsible for the tight homodimerization of Caprin-1.


Pssm-ID: 436391  Cd Length: 116  Bit Score: 176.25  E-value: 1.79e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    200 RREHMLKLEAEKKKLRTILQVQYVLQNLTQEHVQKDFKGGLNGAVYLPSKELDYLIKFSKLTCPERNESLSVEDQMEQSS 279
Cdd:pfam18293    1 KKEAQLKMQAELARLREVLQVQDVLNSLGSEDVRNDFLNGTNGAVKLTEEDLKQLDEFYKLVGPKRDEDTSFADQMQKAA 80
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 50428933    280 LYFWDLLEGSEKAVVGTTYKHLKDLLSKLLNSGYFE 315
Cdd:pfam18293   81 EHLWALLEGKEKPVAGTTYKELKELLDKILNCGYFD 116
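
A pairwise block like the one above can be reduced to a single identity figure. The sketch below is illustrative only: `alignment_identity` is a hypothetical helper, and it assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned region, so both kinds of column are skipped.

```python
def alignment_identity(query_row, subject_row):
    """Fraction of aligned columns in which query and model residues match.

    Assumptions (not confirmed by this page): '-' is a gap, and lowercase
    letters are unaligned residues; both are left out of the count.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # gap column
        if q.islower() or s.islower():
            continue  # assumed unaligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical / aligned if aligned else 0.0


# Last block of the pfam18293 alignment (query residues 280-315, model 81-116).
identity = alignment_identity(
    "LYFWDLLEGSEKAVVGTTYKHLKDLLSKLLNSGYFE",
    "EHLWALLEGKEKPVAGTTYKELKELLDKILNCGYFD",
)
```

Concatenating all blocks of one hit before calling the function gives the identity over the full interval rather than over a single 80-column row.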
C1q pfam00386
C1q domain; C1q is a subunit of the C1 enzyme complex that activates the serum complement ...
999-1124 6.60e-41

C1q domain; C1q is a subunit of the C1 enzyme complex that activates the serum complement system.


Pssm-ID: 395310 [Multi-domain]  Cd Length: 126  Bit Score: 146.66  E-value: 6.60e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    999 AFSAARTSNLAPGTlDQPIVFDLLLNNLGETFDLQLGRFNCPVNGTYVFIFHMLKLAvNVPLYVNLMKNEEVLVSAYAND 1078
Cdd:pfam00386    1 AFSAGRTTGLTAPN-EQPVRFDKVLTNIGGHYDPATGKFTCPVPGVYYFSYHITTVD-GKSLYVSLVKNGQEVVSFYDQP 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 50428933   1079 GAPDHETASNHAILQLFQGDQIWLRLH--RGAIYGSSWKYSTFSGYLL 1124
Cdd:pfam00386   79 QKGSLDVASGSVVLELQRGDEVWLQLTgyNGLYYDGSDTDSTFSGFLL 126
SCP-1 super family cl30946
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
88-540 3.41e-06

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


The actual alignment was detected with superfamily member pfam05483:

Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 51.26  E-value: 3.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933     88 QVNHSQHGESQRaLSPLQSTLSSAASPSQAYEtyiENGLICLKHKIRNIEKKKLKLEDyKDRLKSGEHLNPDQLEA-VEK 166
Cdd:pfam05483  286 ELIEKKDHLTKE-LEDIKMSLQRSMSTQKALE---EDLQIATKTICQLTEEKEAQMEE-LNKAKAAHSFVVTEFEAtTCS 360
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    167 YEEVLHNLEfaKELQKTFSGLSLDLLKAQKKAQRREHMLKL----EAEKKKLRTILQVQYVL--QNLTQEHVQKDFKGG- 239
Cdd:pfam05483  361 LEELLRTEQ--QRLEKNEDQLKIITMELQKKSSELEEMTKFknnkEVELEELKKILAEDEKLldEKKQFEKIAEELKGKe 438
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    240 --LNGAVYLPSKEL-DYLIKFSKLTCPERNESLSVED---QMEQSSLYFWDLLEGSEKAVVGTT--YKHLKDLLSKLLNS 311
Cdd:pfam05483  439 qeLIFLLQAREKEIhDLEIQLTAIKTSEEHYLKEVEDlktELEKEKLKNIELTAHCDKLLLENKelTQEASDMTLELKKH 518
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    312 GyfESIpvpKNAKEKevplEEEMLIQ----SEKKTQL-SKTESVKEseslmEFAQP------EIQPQEFLNRRYMTEVDY 380
Cdd:pfam05483  519 Q--EDI---INCKKQ----EERMLKQienlEEKEMNLrDELESVRE-----EFIQKgdevkcKLDKSEENARSIEYEVLK 584
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    381 SNKQGEEQPWEADYARK--PNLPKRWDMLTEPDGQEKKQESFKSWEASGKHQEVSKPAVSLEQRKQDTSKLRSTLPEEQK 458
Cdd:pfam05483  585 KEKQMKILENKCNNLKKqiENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKEIE 664
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    459 KQEISKSKpspsqwkqdtpkskagyVQEEQKKQETPKLWPVQLQKEQDPKKQTPkswTPSMQSEQNTTKSWTTPMCEEQD 538
Cdd:pfam05483  665 DKKISEEK-----------------LLEEVEKAKAIADEAVKLQKEIDKRCQHK---IAEMVALMEKHKHQYDKIIEERD 724

                   ..
gi 50428933    539 SK 540
Cdd:pfam05483  725 SE 726
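
Each hit in the list is introduced by three lines: name and accession, a truncated description, and the interval with its E-value. A minimal sketch of turning those lines into records follows; `DomainHit` and `parse_hit` are hypothetical names, and the `1e-10` cutoff in the usage lines is an arbitrary illustration, not a threshold used by the search itself.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    name: str
    accession: str
    description: str
    start: int
    end: int
    e_value: float


def parse_hit(name_line, description_line, interval_line):
    """Build a DomainHit from the three header lines of one entry."""
    # The accession is the last token; the name may itself contain spaces
    # (for example "SCP-1 super family cl30946").
    name, accession = name_line.rsplit(None, 1)
    interval, e_value = interval_line.split()
    start, end = interval.split("-")
    return DomainHit(
        name.strip(), accession, description_line.strip(),
        int(start), int(end), float(e_value),
    )


hits = [
    parse_hit("Caprin-1_C pfam12287",
              "Cytoplasmic activation/proliferation-associated protein-1 C term; ...",
              "620-935 1.18e-172"),
    parse_hit("SCP-1 super family cl30946",
              "Synaptonemal complex protein 1 (SCP-1); ...",
              "88-540 3.41e-06"),
]

# Keep only hits below an illustrative E-value cutoff.
strong_hits = [hit for hit in hits if hit.e_value < 1e-10]
```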
 
C1Q smart00110
Complement component C1q domain; Globular domain found in many collagens and eponymously in ...
993-1127 1.29e-32

Complement component C1q domain; Globular domain found in many collagens and eponymously in complement C1q. When part of full-length proteins, these domains form a 'bouquet' due to the multimerization of heterotrimers. The C1q fold is similar to that of tumour necrosis factor.


Pssm-ID: 128420  Cd Length: 135  Bit Score: 123.18  E-value: 1.29e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933     993 PQQMRVAFSAARTSNLAPGtlDQPIVFDLLLNNLGETFDLQLGRFNCPVNGTYVFIFHMLKLAVNVplYVNLMKNEEVLV 1072
Cdd:smart00110    3 KAQPRSAFSVIRSNRPPPP--GQPIRFDKVLYNQQGHYDPRTGKFTCPVPGVYYFSYHVESKGRNV--KVSLMKNGIQVM 78
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 50428933    1073 SAYANDGAPDHETASNHAILQLFQGDQIWLRLHR--GAIYGSSWKYSTFSGYLLYQD 1127
Cdd:smart00110   79 STYDEYQKGLYDVASGGALLQLRQGDQVWLELPDekNGLYAGEYVDSTFSGFLLFPD 135
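
The two C1q models on this page cover nearly the same stretch of the query (999-1124 for pfam00386 and 993-1127 for smart00110). A short sketch of how such overlaps between hit intervals might be measured is given below; `overlap_length` and `overlap_fraction` are hypothetical helpers, and intervals are treated as 1-based and inclusive, as they are printed in the hit list.

```python
def overlap_length(a, b):
    """Number of query residues shared by two inclusive (start, end) intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


def overlap_fraction(a, b):
    """Shared residues as a fraction of the shorter of the two intervals."""
    shorter = min(a[1] - a[0] + 1, b[1] - b[0] + 1)
    return overlap_length(a, b) / shorter


c1q_pfam = (999, 1124)    # pfam00386 interval from the hit list
c1q_smart = (993, 1127)   # smart00110 interval from the hit list

shared = overlap_length(c1q_pfam, c1q_smart)
fraction = overlap_fraction(c1q_pfam, c1q_smart)
```

With these two intervals the pfam00386 hit lies entirely inside the smart00110 hit, which is consistent with both models describing the same C1q domain.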
PTZ00121 PTZ00121
MAEBL; Provisional
130-569 2.03e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.90  E-value: 2.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   130 KHKIRNIEKK---KLKLEDYKDRL---KSGEHLNPDQLEAVEKYEEVLHNLEF---AKELQKTFSGLSLDLLKAQKKAQR 200
Cdd:PTZ00121 1456 AKKAEEAKKKaeeAKKADEAKKKAeeaKKADEAKKKAEEAKKKADEAKKAAEAkkkADEAKKAEEAKKADEAKKAEEAKK 1535
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   201 REHMLKLEaEKKKLRTILQVQYVLQnltQEHVQKdfkgglngavylpskeldylIKFSKLTCPERNESLSVEDQMEQssl 280
Cdd:PTZ00121 1536 ADEAKKAE-EKKKADELKKAEELKK---AEEKKK--------------------AEEAKKAEEDKNMALRKAEEAKK--- 1588
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   281 yfwdlLEGSEKAVVGTTYKHLKDLLSKLLNSGYFESIPVPKNAKEKEVPLEEEMLIQSEKKtQLSKTESVKESESLMEFA 360
Cdd:PTZ00121 1589 -----AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE-EKKKAEELKKAEEENKIK 1662
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   361 QPEIQPQEFLNRRYMTEVdySNKQGEEQPWEADYARKPNLPKRWDMLTEPDGQEKKQEsfkswEASGKHQEVSKPAVSLE 440
Cdd:PTZ00121 1663 AAEEAKKAEEDKKKAEEA--KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKA-----EELKKAEEENKIKAEEA 1735
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   441 QRK--QDTSKLRSTLPEEQKKQEISKSKPSPSQWKQDTPKSKAGYVQEEQKKQETPKlwpvqlQKEQDPKKQTPKSWTPS 518
Cdd:PTZ00121 1736 KKEaeEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR------RMEVDKKIKDIFDNFAN 1809
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 50428933   519 MQSEQNTTKSWTTPMCEEQDSKQPETPKS----WENNVESQKHSLTSQSQISPKS 569
Cdd:PTZ00121 1810 IIEGGKEGNLVINDSKEMEDSAIKEVADSknmqLEEADAFEKHKFNKNNENGEDG 1864
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
647-1012 3.37e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 3.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   647 AIPTSQPPSATPGSPVAskeqnlssqsdflQEPLQATSSPVTCSSNACLVTT-DQASSGSETEFMTSETPEAAIPPGKQP 725
Cdd:PHA03307   61 ACDRFEPPTGPPPGPGT-------------EAPANESRSTPTWSLSTLAPASpAREGSPTPPGPSSPDPPPPTPPPASPP 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   726 SSLASPNPPMAKGSEQGFQSPPASSSSvtintapfqamqtvfnvnAPLPPRKEQEIKESPYSPGYNQSFTTASTQTPpqc 805
Cdd:PHA03307  128 PSPAPDLSEMLRPVGSPGPPPAASPPA------------------AGASPAAVASDAASSRQAALPLSSPEETARAP--- 186
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   806 qlPSIHVEQTVHSQETAANYHP---DGTIQVSNGSLAFYPAQTNVFPRPTQPFVNSRGSVRGCTRGGRLITNSYRSPGGY 882
Cdd:PHA03307  187 --SSPPAEPPPSTPPAAASPRPprrSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPIT 264
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   883 KGFDTYRGLPSISNGNYSQLQFQA---REYSGAPYSQRDnfqqcyKRGGTSGGPRANSRAGWSDSSQVSSPERDNETFNS 959
Cdd:PHA03307  265 LPTRIWEASGWNGPSSRPGPASSSsspRERSPSPSPSSP------GSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRG 338
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 50428933   960 GDSGQGDSRSMTPVDVPVTNPAATilpvhvyPLPQQMRVAFSAARTSNLAPGT 1012
Cdd:PHA03307  339 AAVSPGPSPSRSPSPSRPPPPADP-------SSPRKRPRPSRAPSSPAASAGR 384
 
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
571-1001 2.75e-08

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 58.39  E-value: 2.75e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    571 GVATASLIPNDQLLPRklNTEPKDVPKPVHQPVGSSSTLPKDPVLRKEKLQDlmtqiqgtcNFMQESVLDFDKPSSAIPT 650
Cdd:pfam05109  447 GLPSSTHVPTNLTAPA--STGPTVSTADVTSPTPAGTTSGASPVTPSPSPRD---------NGTESKAPDMTSPTSAVTT 515
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    651 SQPPSATPGSPVASKEQNLSSqsdflqePLQATSSPVTCssnaclVTTDQASSGSETEFMTSETPEAAIPP-GK-QPSSL 728
Cdd:pfam05109  516 PTPNATSPTPAVTTPTPNATS-------PTLGKTSPTSA------VTTPTPNATSPTPAVTTPTPNATIPTlGKtSPTSA 582
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    729 ASPNPPMAKGSEQGFQSPPA---------SSSSVTINTAPFQAMQTVFNVNAPLPPRKEQEIKESPYSPGYNQSFTTAST 799
Cdd:pfam05109  583 VTTPTPNATSPTVGETSPQAnttnhtlggTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDN 662
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    800 QTPPQCQLPSIHveQTVHSQETAANYHPDGTIQVSNGSLAFYPAQTNVFPRPTQPFVNSRGSVRGCTRGGRLI-TNSYRS 878
Cdd:pfam05109  663 STSHMPLLTSAH--PTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKnATSPQA 740
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    879 PGGYKgfdtyRGLPSI-SNGNYSQLQFQAREYSGapYSQRDNFQQCYKRGGTSGGPRANSRAGWSDSSQVSSPERDNETF 957
Cdd:pfam05109  741 PSGQK-----TAVPTVtSTGGKANSTTGGKHTTG--HGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTF 813
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 50428933    958 NSGdsgqgdsrsmtpvdvPVTNPAATIlpvhvyPLPQQMRVAFS 1001
Cdd:pfam05109  814 TSP---------------PVTTAQATV------PVPPTSQPRFS 836
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
382-802 3.72e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.31  E-value: 3.72e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    382 NKQGEEQPWEADY---ARKPNLPKRWDMLTEPDGQEKKQESFKSWEASgkhQEVSKPAVSLEQRKQDTSKLRSTLPEEQK 458
Cdd:pfam03154  127 NDEGSSDPKDIDQdnrSTSPSIPSPQDNESDSDSSAQQQILQTQPPVL---QAQSGAASPPSPPPPGTTQAATAGPTPSA 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    459 KQEISKSKPSPSQwkqdTPKSKAGYVQEEQKKQETPKLWPVQLQKEQDPKKQTPKSWTPSMQSEQNTTKSW--------- 529
Cdd:pfam03154  204 PSVPPQGSPATSQ----PPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSlhgqmppmp 279
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    530 ---------------TTPMCEEQDSKQPETPKSWENNVESQKHsltsQSQISPKSWGVATASLIPNDQLLPRKLNTEPKD 594
Cdd:pfam03154  280 hslqtgpshmqhpvpPQPFPLTPQSSQSQVPPGPSPAAPGQSQ----QRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI 355
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    595 VPKPvhqpvgsSSTLPKDPVLRKEKLQdlmTQIQGTCNFMQESVLDFD---KPSSAIPTSQPPSATPgSPVaskeqNLSS 671
Cdd:pfam03154  356 KPPP-------TTPIPQLPNPQSHKHP---PHLSGPSPFQMNSNLPPPpalKPLSSLSTHHPPSAHP-PPL-----QLMP 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    672 QSDFLQEPlqATSSPVTCSSNACLVTTDQASSGSETEFMTSETPEAAIP--PGKQPSSLASPNPPMAKGSEQGFQSPPAS 749
Cdd:pfam03154  420 QSQQLPPP--PAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPfvPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 50428933    750 SSSVTINTAPFQAMQTVFNVNAPLPPRKEQEIKESPYSPGYNQSFTTASTQTP 802
Cdd:pfam03154  498 ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTP 550
PTZ00121 PTZ00121
MAEBL; Provisional
130-569 2.03e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.90  E-value: 2.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   130 KHKIRNIEKK---KLKLEDYKDRL---KSGEHLNPDQLEAVEKYEEVLHNLEF---AKELQKTFSGLSLDLLKAQKKAQR 200
Cdd:PTZ00121 1456 AKKAEEAKKKaeeAKKADEAKKKAeeaKKADEAKKKAEEAKKKADEAKKAAEAkkkADEAKKAEEAKKADEAKKAEEAKK 1535
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   201 REHMLKLEaEKKKLRTILQVQYVLQnltQEHVQKdfkgglngavylpskeldylIKFSKLTCPERNESLSVEDQMEQssl 280
Cdd:PTZ00121 1536 ADEAKKAE-EKKKADELKKAEELKK---AEEKKK--------------------AEEAKKAEEDKNMALRKAEEAKK--- 1588
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   281 yfwdlLEGSEKAVVGTTYKHLKDLLSKLLNSGYFESIPVPKNAKEKEVPLEEEMLIQSEKKtQLSKTESVKESESLMEFA 360
Cdd:PTZ00121 1589 -----AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE-EKKKAEELKKAEEENKIK 1662
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   361 QPEIQPQEFLNRRYMTEVdySNKQGEEQPWEADYARKPNLPKRWDMLTEPDGQEKKQEsfkswEASGKHQEVSKPAVSLE 440
Cdd:PTZ00121 1663 AAEEAKKAEEDKKKAEEA--KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKA-----EELKKAEEENKIKAEEA 1735
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   441 QRK--QDTSKLRSTLPEEQKKQEISKSKPSPSQWKQDTPKSKAGYVQEEQKKQETPKlwpvqlQKEQDPKKQTPKSWTPS 518
Cdd:PTZ00121 1736 KKEaeEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR------RMEVDKKIKDIFDNFAN 1809
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 50428933   519 MQSEQNTTKSWTTPMCEEQDSKQPETPKS----WENNVESQKHSLTSQSQISPKS 569
Cdd:PTZ00121 1810 IIEGGKEGNLVINDSKEMEDSAIKEVADSknmqLEEADAFEKHKFNKNNENGEDG 1864
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
647-1012 3.37e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 3.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   647 AIPTSQPPSATPGSPVAskeqnlssqsdflQEPLQATSSPVTCSSNACLVTT-DQASSGSETEFMTSETPEAAIPPGKQP 725
Cdd:PHA03307   61 ACDRFEPPTGPPPGPGT-------------EAPANESRSTPTWSLSTLAPASpAREGSPTPPGPSSPDPPPPTPPPASPP 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   726 SSLASPNPPMAKGSEQGFQSPPASSSSvtintapfqamqtvfnvnAPLPPRKEQEIKESPYSPGYNQSFTTASTQTPpqc 805
Cdd:PHA03307  128 PSPAPDLSEMLRPVGSPGPPPAASPPA------------------AGASPAAVASDAASSRQAALPLSSPEETARAP--- 186
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   806 qlPSIHVEQTVHSQETAANYHP---DGTIQVSNGSLAFYPAQTNVFPRPTQPFVNSRGSVRGCTRGGRLITNSYRSPGGY 882
Cdd:PHA03307  187 --SSPPAEPPPSTPPAAASPRPprrSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPIT 264
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   883 KGFDTYRGLPSISNGNYSQLQFQA---REYSGAPYSQRDnfqqcyKRGGTSGGPRANSRAGWSDSSQVSSPERDNETFNS 959
Cdd:PHA03307  265 LPTRIWEASGWNGPSSRPGPASSSsspRERSPSPSPSSP------GSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRG 338
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 50428933   960 GDSGQGDSRSMTPVDVPVTNPAATilpvhvyPLPQQMRVAFSAARTSNLAPGT 1012
Cdd:PHA03307  339 AAVSPGPSPSRSPSPSRPPPPADP-------SSPRKRPRPSRAPSSPAASAGR 384
PHA03247 PHA03247
large tegument protein UL36; Provisional
647-808 4.92e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 4.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   647 AIPTSQPPSATPGSPVASkeqnLSSQSDFLQEPLQATSSPVTCSSNACLVTTDQASSGSETEFMTSETPEAAIPPGKQPS 726
Cdd:PHA03247 2773 AAPAAGPPRRLTRPAVAS----LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933   727 SLaSPNPPMAKGSEqgFQSPPASSSSVTINTAPfqAMQTVFNVNAPLPPRKEQEIKESPYSPgynQSFTTASTQTPPQCQ 806
Cdd:PHA03247 2849 SL-PLGGSVAPGGD--VRRRPPSRSPAAKPAAP--ARPPVRRLARPAVSRSTESFALPPDQP---ERPPQPQAPPPPQPQ 2920

                  ..
gi 50428933   807 LP 808
Cdd:PHA03247 2921 PQ 2922
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
463-858 1.81e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 1.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    463 SKSKPSPSQWKQDTPKSKAGYVQEEQKKQETPKLWPVQLQKEQDPKKQTPKSWTPSMQSEQNTTKSWTTP---------- 532
Cdd:pfam03154   41 SSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGASDTEEPERATAKKSKTQEISRPnspsegeges 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    533 ----------------MCEEQDSKQPETPKSWENNVESQKHSLTSQSQISPKSWGVATASLIPnDQLLPRKLNTEPKDVP 596
Cdd:pfam03154  121 sdgrsvndegssdpkdIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASP-PSPPPPGTTQAATAGP 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    597 KPVHQPVGSSSTLP-KDPVLRKEKLQDLMTQIQGTCNFMQESVLDFDKPSSAIPTSQPPSATPGSPVASKE--------- 666
Cdd:pfam03154  200 TPSAPSVPPQGSPAtSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSlhgqmppmp 279
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    667 QNLSSQSDFLQEPLQATSSPVTCSSNACLV--TTDQASSGSETEFMTSETPEAAIPPGKQPSSLASPNPPMAK------- 737
Cdd:pfam03154  280 HSLQTGPSHMQHPVPPQPFPLTPQSSQSQVppGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMphikppp 359
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50428933    738 -------GSEQGFQSPPASSSSvtintAPFQamqtvFNVNAPLPPRKEQEIKESPYSPgynQSFTTASTQTPPQCQ-LPS 809
Cdd:pfam03154  360 ttpipqlPNPQSHKHPPHLSGP-----SPFQ-----MNSNLPPPPALKPLSSLSTHHP---PSAHPPPLQLMPQSQqLPP 426
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 50428933    810 IHVEQTVHSQetAANYHPDGTIQVSNGSLAFYPAQTnvfPRPTQPFVNS 858
Cdd:pfam03154  427 PPAQPPVLTQ--SQSLPPPAASHPPTSGLHQVPSQS---PFPQHPFVPG 470
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
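
The per-hit summary lines in this report (Pssm-ID, Cd Length, Bit Score, E-value) follow a regular layout, so they can be pulled out of a saved copy of the page and compared against the E-value threshold listed above. The following is a minimal sketch, not an NCBI tool: the function names `parse_hit_line` and `hits_below_threshold` are hypothetical, the regular expression assumes the exact line layout seen in this report, and treating the threshold as inclusive is an assumption.

```python
import re

# Assumed layout of a hit summary line, as it appears in this report:
# "Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 51.26  E-value: 3.41e-06"
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    match = HIT_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def hits_below_threshold(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (inclusive by assumption)."""
    parsed = (parse_hit_line(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] <= threshold]


# Two summary lines copied from the report above.
sample = [
    "Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 51.26  E-value: 3.41e-06",
    "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 1.81e-03",
]

for hit in hits_below_threshold(sample):
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Passing a stricter value, for example `hits_below_threshold(sample, threshold=1e-4)`, keeps only the first of the two sample hits.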

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.