NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1370455081|ref|XP_024306355|]
View 

period circadian protein homolog 3 isoform X5 [Homo sapiens]

Protein Classification

PAS and Period_C domain-containing protein( domain architecture ID 12888871)

protein containing domains PAS, Herpes_BLLF1, and Period_C

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Period_C super family cl13540
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1076-1178 1.15e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


The actual alignment was detected with superfamily member pfam12114:

Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.15e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081 1076 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1154
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 1370455081 1155 EELAKVYNWIQSQTVTQEIDIQAC 1178
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-377 1.79e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


:

Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 364
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 1370455081  365 SRKISFIIGRHKV 377
Cdd:cd00130     91 GEVIGLLGVVRDI 103
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
745-1056 3.02e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.24  E-value: 3.02e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  745 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 824
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  825 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 902
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  903 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 979
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1370455081  980 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1056
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1076-1178 1.15e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.15e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081 1076 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1154
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 1370455081 1155 EELAKVYNWIQSQTVTQEIDIQAC 1178
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-377 1.79e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 364
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 1370455081  365 SRKISFIIGRHKV 377
Cdd:cd00130     91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
285-373 6.90e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.90e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  285 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 362
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 1370455081  363 pWSRKISFIIG 373
Cdd:pfam08447   78 -ENGKPVRVIG 87
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
745-1056 3.02e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.24  E-value: 3.02e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  745 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 824
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  825 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 902
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  903 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 979
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1370455081  980 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1056
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-329 1.24e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.24e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 1370455081   285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 329
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
742-1057 7.35e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.64  E-value: 7.35e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  742 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 807
Cdd:PHA03307    64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  808 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSA 887
Cdd:PHA03307   144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  888 MsptldPPPSVTSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSES 966
Cdd:PHA03307   217 A-----SSPAPAPGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRP 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  967 SPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvl 1046
Cdd:PHA03307   283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP-- 359
                          330
                   ....*....|.
gi 1370455081 1047 STGSPPSESPS 1057
Cdd:PHA03307   360 ADPSSPRKRPR 370
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
274-374 3.23e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  274 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 353
Cdd:COG5805    169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                           90       100
                   ....*....|....*....|..
gi 1370455081  354 DSSWSSFVNP-WSRKISFIIGR 374
Cdd:COG5805    245 EAVIVPLIDTdGSVKGILVILR 266
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1076-1178 1.15e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.15e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081 1076 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1154
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 1370455081 1155 EELAKVYNWIQSQTVTQEIDIQAC 1178
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-377 1.79e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 364
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 1370455081  365 SRKISFIIGRHKV 377
Cdd:cd00130     91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
285-373 6.90e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.90e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  285 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 362
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 1370455081  363 pWSRKISFIIG 373
Cdd:pfam08447   78 -ENGKPVRVIG 87
PAS_11 pfam14598
PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), ...
275-377 2.99e-09

PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), which binds to an LXXLL motif in the C-terminal region of STAT6 (Signal transducer and activator of transcription 6).


Pssm-ID: 464214 [Multi-domain]  Cd Length: 110  Bit Score: 55.76  E-value: 2.99e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  275 FTTTHTPGCVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHppfEHSPI-RFCTQNGDYIIL 353
Cdd:pfam14598    4 FTTRHDIDGKIISCDTRAPFSLGYEKDELVGRSIYDLVHPQDLRTAKSHLREIIQTRGR---ATSPSyRLRLRDGDFLSV 80
                           90       100
                   ....*....|....*....|....
gi 1370455081  354 DSSWSSFVNPWSRKISFIIGRHKV 377
Cdd:pfam14598   81 HTKSKLFLNQNSNQQPFIMCTHTI 104
PAS pfam00989
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
272-371 3.85e-09

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya. This domain can bind gases (O2, CO and NO), FAD, 4-hydroxycinnamic acid and NAD+ (Matilla et.al., FEMS Microbiology Reviews, fuab043, 45, 2021, 1. https://doi.org/10.1093/femsre/fuab043).


Pssm-ID: 395786 [Multi-domain]  Cd Length: 113  Bit Score: 55.50  E-value: 3.85e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  272 KRIFTTTHTPGCV------FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKyAGHPPFEHSpIRFCT 345
Cdd:pfam00989    4 RAILESLPDGIFVvdedgrILYVNAAAEELLGLSREEVIGKSLLDLIPEEDDAEVAELLRQALL-QGEESRGFE-VSFRV 81
                           90       100
                   ....*....|....*....|....*.
gi 1370455081  346 QNGDYIILDSSWSSFVNPWSRKISFI 371
Cdd:pfam00989   82 PDGRPRHVEVRASPVRDAGGEILGFL 107
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
745-1056 3.02e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.24  E-value: 3.02e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  745 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 824
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  825 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 902
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  903 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 979
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1370455081  980 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1056
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-329 1.24e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.24e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 1370455081   285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 329
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
749-1059 2.10e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 55.31  E-value: 2.10e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  749 PDSSSSNTGSGPRRGAHQNAQPCCPSAA--------SSPHTSSPTFPPAAMVP-SQAPYLVPAFPLPAATSPGREYAAPG 819
Cdd:pfam05109  466 PTVSTADVTSPTPAGTTSGASPVTPSPSprdngtesKAPDMTSPTSAVTTPTPnATSPTPAVTTPTPNATSPTLGKTSPT 545
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  820 TAPEglhgLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPLLSPS-FLPCPFLGATASSAISPSMS----SAMSPTLDP 894
Cdd:pfam05109  546 SAVT----TPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNATSPTVGETSPQANTTNHTlggtSSTPVVTSP 621
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  895 PPSVTSqrreeekweAQSEGHPFITSRSSSPLQLNLLQ-EEMPRPSESpDQMRRNTCPQTEYQCVTGNNGSESSPATTGA 973
Cdd:pfam05109  622 PKNATS---------AVTTGQHNITSSSTSSMSLRPSSiSETLSPSTS-DNSTSHMPLLTSAHPTGGENITQVTPASTST 691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  974 --LSTGSP-PRenpshPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGS 1050
Cdd:pfam05109  692 hhVSTSSPaPR-----PGTTSQASGPGNSSTSTKPGEVNVTKGTPP-KNATSPQAPSGQKTAVPTVTSTGGKANSTTGGK 765

                   ....*....
gi 1370455081 1051 PPSESPSRT 1059
Cdd:pfam05109  766 HTTGHGART 774
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
742-1057 7.35e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.64  E-value: 7.35e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  742 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 807
Cdd:PHA03307    64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  808 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSA 887
Cdd:PHA03307   144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  888 MsptldPPPSVTSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSES 966
Cdd:PHA03307   217 A-----SSPAPAPGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRP 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  967 SPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvl 1046
Cdd:PHA03307   283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP-- 359
                          330
                   ....*....|.
gi 1370455081 1047 STGSPPSESPS 1057
Cdd:PHA03307   360 ADPSSPRKRPR 370
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
746-1058 3.45e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.71  E-value: 3.45e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  746 PEPPDSSSSNTGSGPRRGAHQNAQPCC-PSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPL-PAATSPGREYAAPGTAPE 823
Cdd:PHA03307    80 PANESRSTPTWSLSTLAPASPAREGSPtPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPvGSPGPPPAASPPAAGASP 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  824 GLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMsptldPPPSVTSQRR 903
Cdd:PHA03307   160 AAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA-----SSPAPAPGRS 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  904 EEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSESSPATTGALSTGSPPRE 982
Cdd:PHA03307   228 AADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRPGPASSSSSPRERSPSP 298
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1370455081  983 NPSHPTASALSTGSPPMKNPSHPTASALSTGSPpmknpSHPTAStlSMGLPPSRTPSHPTAtvLSTGSPPSESPSR 1058
Cdd:PHA03307   299 SPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS-----SSESSR--GAAVSPGPSPSRSPS--PSRPPPPADPSSP 365
PHA03247 PHA03247
large tegument protein UL36; Provisional
732-1057 3.57e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 3.57e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  732 SAGCRKGKHKRKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSP 811
Cdd:PHA03247  2656 PAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  812 GREYAAPGTaPEGlHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPS-FLPCPflgATASSAISPSMSSAMSP 890
Cdd:PHA03247  2736 PAAPAPPAV-PAG-PATPGGPARPARPPTT-------AGPPAPAPPAAPAAGPPrRLTRP---AVASLSESRESLPSPWD 2803
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  891 TLDPPPSVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQcvtgNNGSESSPAT 970
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP------PGPPPPSLPLGGSVAPGGDVR----RRPPSRSPAA 2873
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  971 TGALSTGSP----PRENPSHPTASaLSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSrtPSHPTATVL 1046
Cdd:PHA03247  2874 KPAAPARPPvrrlARPAVSRSTES-FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP--PLAPTTDPA 2950
                          330
                   ....*....|.
gi 1370455081 1047 STGSPPSESPS 1057
Cdd:PHA03247  2951 GAGEPSGAVPQ 2961
PHA03247 PHA03247
large tegument protein UL36; Provisional
748-1056 4.41e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  748 PPDSSSSNTGSGPRRGAHQNAQPccpsAASSPHTSSPTFPPAAmvPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGLHG 827
Cdd:PHA03247  2592 PPQSARPRAPVDDRGDPRGPAPP----SPLPPDTHAPDPPPPS--PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR 2665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  828 LPLSEGLQPYPAFPfpyldtfmtvflPDPPVCPLLSPSFLPCPFLGatassaispsmssamsptlDPPPSvtsQRREEEK 907
Cdd:PHA03247  2666 RARRLGRAAQASSP------------PQRPRRRAARPTVGSLTSLA-------------------DPPPP---PPTPEPA 2711
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  908 WEAQSEGHPfitsrssSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNNGSESSPATTGALSTGSPPRENPSHP 987
Cdd:PHA03247  2712 PHALVSATP-------LPPGPAAARQASPALPAAP-----APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1370455081  988 ----TASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPShPTATVLSTGSPPSESP 1056
Cdd:PHA03247  2780 prrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPPSLP 2851
PHA03247 PHA03247
large tegument protein UL36; Provisional
748-1054 8.29e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 8.29e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  748 PPDSSSSNTGSGPRRGAhQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPgreyaAPGTAPEGLHG 827
Cdd:PHA03247  2742 PAVPAGPATPGGPARPA-RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP-----ADPPAAVLAPA 2815
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  828 LPLSEGLQPYPAFPFPyldtfmTVFLPDPPVCPllsPSFLPCPFlgATASSAISPSMSSAMSPTLDPPPSVTSQRREEEK 907
Cdd:PHA03247  2816 AALPPAASPAGPLPPP------TSAQPTAPPPP---PGPPPPSL--PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  908 WEAQSEghpfiTSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYQCVTGNNG---SESSPATTGALSTGSPPRENP 984
Cdd:PHA03247  2885 RLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprPQPPLAPTTDPAGAGEPSGAV 2959
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1370455081  985 SHPTASALSTGSPPMKN----PSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSPPSE 1054
Cdd:PHA03247  2960 PQPWLGALVPGRVAVPRfrvpQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDD 3033
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
746-1059 8.54e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 8.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  746 PEPPDSSSSNTGSGPRRG-AHQNAQPCCPSAASSPHTSSPtFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 824
Cdd:PHA03307   114 PDPPPPTPPPASPPPSPApDLSEMLRPVGSPGPPPAASPP-AAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAE 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  825 LHGLPLSEGLQPYPafpfPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPPS---VTSQ 901
Cdd:PHA03307   193 PPPSTPPAAASPRP----PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPApitLPTR 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  902 RREEEKWEAQSEGHPFITSRSSSPlqlnllqEEMPRPSESPDQMRRNTCPQTeyqcVTGNNGSESSPATTGALSTGSPPR 981
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPR-------ERSPSPSPSSPGSGPAPSSPR----ASSSSSSSRESSSSSTSSSSESSR 337
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1370455081  982 ENPSHPtasalstGSPPMKNPSHPTASALSTGSPPMKN-PSHPTASTLSMGlPPSRTPSHPTATVLSTGSPPSESPSRT 1059
Cdd:PHA03307   338 GAAVSP-------GPSPSRSPSPSRPPPPADPSSPRKRpRPSRAPSSPAAS-AGRPTRRRARAAVAGRARRRDATGRFP 408
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
745-1057 3.21e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 3.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  745 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-HTSSPTFP-PAAMVPSQAPYLVPAFPLPAATSPGREYA-APGTA 821
Cdd:pfam03154  252 MTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPqPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQ 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  822 PEGLHGLPLSEglQPYPAFPFPyldtfMTVFLPDP--PVCPLLSPSFLPCPflgatasSAISPSMSSAMSPTLDPPPSVt 899
Cdd:pfam03154  332 SQLQSQQPPRE--QPLPPAPLS-----MPHIKPPPttPIPQLPNPQSHKHP-------PHLSGPSPFQMNSNLPPPPAL- 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  900 sqrreeEKWEAQSEGHPfiTSRSSSPLQLNLLQEEMPRPSESPDQMrrntcpqTEYQCVTGnngSESSPATTGALSTGSP 979
Cdd:pfam03154  397 ------KPLSSLSTHHP--PSAHPPPLQLMPQSQQLPPPPAQPPVL-------TQSQSLPP---PAASHPPTSGLHQVPS 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  980 preNPSHPTASALSTGSPPMKNPSHPTASALSTGS---PPMKNP---SHPTASTLSMGLPPSRTPSHPTATVLSTGS--P 1051
Cdd:pfam03154  459 ---QSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPgiqPPSSASvssSGPVPAAVSCPLPPVQIKEEALDEAEEPESppP 535

                   ....*.
gi 1370455081 1052 PSESPS 1057
Cdd:pfam03154  536 PPRSPS 541
PHA03247 PHA03247
large tegument protein UL36; Provisional
745-1057 6.64e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 6.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  745 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-------HTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAA 817
Cdd:PHA03247  2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  818 PGTAPE-----GLHGLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPlLSPSFLPCPFLGATASSAISPSMSSAMSPTL 892
Cdd:PHA03247  2704 PPPTPEpaphaLVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP-GGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  893 DPPPSVTSQrrEEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYQCVTGNNGSESSPATTG 972
Cdd:PHA03247  2783 LTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG 2860
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  973 ALSTGSPPRENPSHPTAsalstgsppmknPSHPTASALStgSPPMKNPSHPTASTlSMGLPPSRTPSHPTATVLSTGSPP 1052
Cdd:PHA03247  2861 DVRRRPPSRSPAAKPAA------------PARPPVRRLA--RPAVSRSTESFALP-PDQPERPPQPQAPPPPQPQPQPPP 2925

                   ....*
gi 1370455081 1053 SESPS 1057
Cdd:PHA03247  2926 PPQPQ 2930
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
746-1057 4.30e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 4.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  746 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLP-------AATSPGREYAAP 818
Cdd:pfam03154  185 SPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplqpmTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  819 GTAPEGLHGL--PLSEGLQ--------PYPAFPFPYLDTFMTVFLPDPPVCPLLSPSflpcpflgatassaispsmssAM 888
Cdd:pfam03154  265 PLPQPSLHGQmpPMPHSLQtgpshmqhPVPPQPFPLTPQSSQSQVPPGPSPAAPGQS---------------------QQ 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  889 SPTLDPPPSVTSQRR--EEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSE----SPDQMRRNTCPQTEYQ---CVT 959
Cdd:pfam03154  324 RIHTPPSQSQLQSQQppREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHlsgpSPFQMNSNLPPPPALKplsSLS 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  960 GNNGSESSPATTGALSTGSPPRENPSHPTASALSTGSPPmKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPS 1039
Cdd:pfam03154  404 THHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPP-PAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPP 482
                          330
                   ....*....|....*...
gi 1370455081 1040 HPTATVLSTGSPPSESPS 1057
Cdd:pfam03154  483 TSTSSAMPGIQPPSSASV 500
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
274-374 3.23e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  274 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 353
Cdd:COG5805    169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                           90       100
                   ....*....|....*....|..
gi 1370455081  354 DSSWSSFVNP-WSRKISFIIGR 374
Cdd:COG5805    245 EAVIVPLIDTdGSVKGILVILR 266
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
746-1056 3.56e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 3.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  746 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGL 825
Cdd:PHA03307   129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  826 HGLPLSEG-LQPYPAFP----FPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPP--SV 898
Cdd:PHA03307   209 RSSPISASaSSPAPAPGrsaaDDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPasSS 288
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  899 TSQRREEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPrPSESPDQMRRNTCPQTeyqcvTGNNGSES-SPATTGALSTG 977
Cdd:PHA03307   289 SSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSS-SSTSSSSESSRGAAVS-----PGPSPSRSpSPSRPPPPADP 362
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1370455081  978 SPPRENPshPTASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPtastlsmgLPPSRTPSHPTATVLSTGSPPSESP 1056
Cdd:PHA03307   363 SSPRKRP--RPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR--------FPAGRPRPSPLDAGAASGAFYARYP 431
PHA03379 PHA03379
EBNA-3A; Provisional
746-1056 9.33e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 40.43  E-value: 9.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  746 PEPPDSSSSNTGSGPRRGAHQN-AQPCCPSAASSPHTSSPTfPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 824
Cdd:PHA03379   425 PEVPQSLETATSHGSAQVPEPPpVHDLEPGPLHDQHSMAPC-PVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPAG 503
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  825 LHGLPLSEGLQPYPAFPF-PYLDTFMTV-FLPDP------PVCPLLSPSFLPCPflGATASSAISPSMSSAMSPTLDPPP 896
Cdd:PHA03379   504 PIVRPWEASLSQVPGVAFaPVMPQPMPVePVPVPtvalerPVCPAPPLIAMQGP--GETSGIVRVRERWRPAPWTPNPPR 581
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  897 SVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNLL--QEEMPRPSEsPDQMRRNTCPQTEYQCVTGNNG----------- 963
Cdd:PHA03379   582 SPSQMSVRDRLARLRAEAQPYQASVEVQPPQLTQVspQQPMEYPLE-PEQQMFPGSPFSQVADVMRAGGvpamqpqyfdl 660
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455081  964 SESSPATTGALST-------GSPPR--ENPSH---PTASALSTGSP--------PMKNPSHPtASALSTGSPPMKNPSHP 1023
Cdd:PHA03379   661 PLQQPISQGAPLAplrasmgPVPPVpaTQPQYfdiPLTEPINQGASaahflpqqPMEGPLVP-ERWMFQGATLSQSVRPG 739
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1370455081 1024 TASTLSMGLPPSRTPSHPTATVLSTGSPPSESP 1056
Cdd:PHA03379   740 VAQSQYFDLPLTQPINHGAPAAHFLHQPPMEGP 772
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH