NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1844084095|ref|NP_115571|]
View 

protein SON isoform B [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
170-460 2.12e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 2.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  170 PTRAFGPSETNESPAVVLEPPVVSMEVSEPHIleTLKPATK-TAELSVVSTSVISEQSEQSVAVMPEPSMTKILdsfAAA 248
Cdd:PHA03247  2704 PPPTPEPAPHALVSATPLPPGPAAARQASPAL--PAAPAPPaVPAGPATPGGPARPARPPTTAGPPAPAPPAAP---AAG 2778
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  249 PVPTTTLVLKSSEPVVTMSVeyqmksvlksveSTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDF 328
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESL------------PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  329 PESSAIEALRLPEQPVD--VPSEIADSSMTRPQElPELPKTTALELQESSVASAMELPGPPATSMPELQGPPVTPVLELP 406
Cdd:PHA03247  2847 PPSLPLGGSVAPGGDVRrrPPSRSPAAKPAAPAR-PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1844084095  407 GPSATPVPELPGPLSTPVPELPGP-----PATAVPE------LPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247  2926 PPQPQPPPPPPPRPQPPLAPTTDPagagePSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPA 2990
PHA03379 super family cl33730
EBNA-3A; Provisional
340-673 6.97e-07

EBNA-3A; Provisional


The actual alignment was detected with superfamily member PHA03379:

Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 54.68  E-value: 6.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  340 PEQPVDVPSeiadssmtrpqelPELPKttalELQESSVASAMELPGPPATSMPElQGPPVTPVLELPGPSA----TPVPE 415
Cdd:PHA03379   416 PRPPVEKPR-------------PEVPQ----SLETATSHGSAQVPEPPPVHDLE-PGPLHDQHSMAPCPVAqlppGPLQD 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  416 L-PGPLSTPVPELPGPPATAVPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVT-AAVELP 492
Cdd:PHA03379   478 LePGDQLPGVVQDGRPACAPVPAPAGPIVRPwEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPlIAMQGP 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  493 EQPA--VTVAMELTEQPVTTTELEQPVGMTTVEHP--GHPE--VTTATGLLGQPEATMV-----LELPGQPVATTaleLP 561
Cdd:PHA03379   558 GETSgiVRVRERWRPAPWTPNPPRSPSQMSVRDRLarLRAEaqPYQASVEVQPPQLTQVspqqpMEYPLEPEQQM---FP 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  562 GQP--SVTGVPELPGLPS---ATRALELSgQPVATGAlelPGPLMAAGALEFS--GQSGAAGALELLGQPLATGVLE--- 631
Cdd:PHA03379   635 GSPfsQVADVMRAGGVPAmqpQYFDLPLQ-QPISQGA---PLAPLRASMGPVPpvPATQPQYFDIPLTEPINQGASAahf 710
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1844084095  632 LPGQPGAPEL--PGQPVATVALEISVQSVVTTSELSTMTVSQSL 673
Cdd:PHA03379   711 LPQQPMEGPLvpERWMFQGATLSQSVRPGVAQSQYFDLPLTQPI 754
rne super family cl35953
ribonuclease E; Reviewed
1289-1481 8.39e-05

ribonuclease E; Reviewed


The actual alignment was detected with superfamily member PRK10811:

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.11  E-value: 8.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1289 PAMsAEPTVLASEPPVMSETAETFDSMRASGHVASEVSTSLLVPAVTTPVLAESILEPPAMAAPESSAMAVLESSAVTVL 1368
Cdd:PRK10811   834 PEM-ASGKVWIRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVV 912
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1369 ESSTVTVLESSTVTvlepsvvtvpePPVVAEPDYVTIPVPVVSALEPSVPVLEPAVSVLQPS----MIVSEPSVSVQEST 1444
Cdd:PRK10811   913 ETTHPEVIAAPVTE-----------QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAetaeVVVAEPEVVAQPAA 981
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1844084095 1445 VTVSEPAVTVSEQTQVIPTEVAIESTPMILESSIMSS 1481
Cdd:PRK10811   982 PVVAEVAAEVETVTAVEPEVAPAQVPEATVEHNHATA 1018
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
170-460 2.12e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 2.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  170 PTRAFGPSETNESPAVVLEPPVVSMEVSEPHIleTLKPATK-TAELSVVSTSVISEQSEQSVAVMPEPSMTKILdsfAAA 248
Cdd:PHA03247  2704 PPPTPEPAPHALVSATPLPPGPAAARQASPAL--PAAPAPPaVPAGPATPGGPARPARPPTTAGPPAPAPPAAP---AAG 2778
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  249 PVPTTTLVLKSSEPVVTMSVeyqmksvlksveSTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDF 328
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESL------------PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  329 PESSAIEALRLPEQPVD--VPSEIADSSMTRPQElPELPKTTALELQESSVASAMELPGPPATSMPELQGPPVTPVLELP 406
Cdd:PHA03247  2847 PPSLPLGGSVAPGGDVRrrPPSRSPAAKPAAPAR-PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1844084095  407 GPSATPVPELPGPLSTPVPELPGP-----PATAVPE------LPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247  2926 PPQPQPPPPPPPRPQPPLAPTTDPagagePSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPA 2990
PHA03379 PHA03379
EBNA-3A; Provisional
340-673 6.97e-07

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 54.68  E-value: 6.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  340 PEQPVDVPSeiadssmtrpqelPELPKttalELQESSVASAMELPGPPATSMPElQGPPVTPVLELPGPSA----TPVPE 415
Cdd:PHA03379   416 PRPPVEKPR-------------PEVPQ----SLETATSHGSAQVPEPPPVHDLE-PGPLHDQHSMAPCPVAqlppGPLQD 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  416 L-PGPLSTPVPELPGPPATAVPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVT-AAVELP 492
Cdd:PHA03379   478 LePGDQLPGVVQDGRPACAPVPAPAGPIVRPwEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPlIAMQGP 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  493 EQPA--VTVAMELTEQPVTTTELEQPVGMTTVEHP--GHPE--VTTATGLLGQPEATMV-----LELPGQPVATTaleLP 561
Cdd:PHA03379   558 GETSgiVRVRERWRPAPWTPNPPRSPSQMSVRDRLarLRAEaqPYQASVEVQPPQLTQVspqqpMEYPLEPEQQM---FP 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  562 GQP--SVTGVPELPGLPS---ATRALELSgQPVATGAlelPGPLMAAGALEFS--GQSGAAGALELLGQPLATGVLE--- 631
Cdd:PHA03379   635 GSPfsQVADVMRAGGVPAmqpQYFDLPLQ-QPISQGA---PLAPLRASMGPVPpvPATQPQYFDIPLTEPINQGASAahf 710
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1844084095  632 LPGQPGAPEL--PGQPVATVALEISVQSVVTTSELSTMTVSQSL 673
Cdd:PHA03379   711 LPQQPMEGPLvpERWMFQGATLSQSVRPGVAQSQYFDLPLTQPI 754
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
258-459 1.87e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 53.23  E-value: 1.87e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  258 KSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKVLEPSETlvvsseTPTEVYPEPSTSTTMDFPESSAIEAL 337
Cdd:NF033839   291 KPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEK------PKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  338 RLPEQPVDVPSEIADSSMTRPQELPELPKTT-----ALELQESSVASAMELPGPPATSMPELQGPPVTPVLELPGPSATP 412
Cdd:NF033839   365 VKPQPEKPKPEVKPQPETPKPEVKPQPEKPKpevkpQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKP 444
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1844084095  413 VPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAP 459
Cdd:NF033839   445 QPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQA 491
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
308-493 3.80e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 52.08  E-value: 3.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  308 VSSETPTE-VYPEPSTSTT--MDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTA---LELQESSVASAM 381
Cdd:NF033839   279 LTQDTPKEpGNKKPSAPKPgmQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVkpqLETPKPEVKPQP 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  382 ELPGPPATSMPELQGPPVTPVLELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQE--LPGLPAP 459
Cdd:NF033839   359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPevKPQPEKP 438
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1844084095  460 SMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPE 493
Cdd:NF033839   439 KPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPK 472
rne PRK10811
ribonuclease E; Reviewed
1289-1481 8.39e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.11  E-value: 8.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1289 PAMsAEPTVLASEPPVMSETAETFDSMRASGHVASEVSTSLLVPAVTTPVLAESILEPPAMAAPESSAMAVLESSAVTVL 1368
Cdd:PRK10811   834 PEM-ASGKVWIRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVV 912
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1369 ESSTVTVLESSTVTvlepsvvtvpePPVVAEPDYVTIPVPVVSALEPSVPVLEPAVSVLQPS----MIVSEPSVSVQEST 1444
Cdd:PRK10811   913 ETTHPEVIAAPVTE-----------QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAetaeVVVAEPEVVAQPAA 981
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1844084095 1445 VTVSEPAVTVSEQTQVIPTEVAIESTPMILESSIMSS 1481
Cdd:PRK10811   982 PVVAEVAAEVETVTAVEPEVAPAQVPEATVEHNHATA 1018
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
97-579 1.33e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 1.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095   97 TDPTDEIPTKKSKKHKKHKNKKKKKKKEKEKKYKRQPEESE----SKTKSHDDGNIDLESDSFLKfDSEPSAVALELPTR 172
Cdd:pfam03154   55 NDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGASDTEEPErataKKSKTQEISRPNSPSEGEGE-SSDGRSVNDEGSSD 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  173 AFGPSETNESPAVVLEPP----VVSMEVSEPHILETLKPATKTAELSVVSTSVISEQSEQSVAVMPEPSMTkildSFAAA 248
Cdd:pfam03154  134 PKDIDQDNRSTSPSIPSPqdneSDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAP----SVPPQ 209
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  249 PVPTTTLVLKSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDF 328
Cdd:pfam03154  210 GSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHM 289
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  329 PESsaiealrLPEQPVDVPSEIADS------SMTRPQELPELPKTTALELQESSVASAMELPGPPA-TSMPELQGPPVTP 401
Cdd:pfam03154  290 QHP-------VPPQPFPLTPQSSQSqvppgpSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPApLSMPHIKPPPTTP 362
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  402 VLELPGPSATPVP---ELPGPLSTPvPELPGPPA----TAVPELPGPSVTPVP-QL---SQELPGLPAPSMGLEPPQEVP 470
Cdd:pfam03154  363 IPQLPNPQSHKHPphlSGPSPFQMN-SNLPPPPAlkplSSLSTHHPPSAHPPPlQLmpqSQQLPPPPAQPPVLTQSQSLP 441
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  471 EPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPvTTTELEQPVGMTTVEHPGHPEVTTATGLLGQPEATmvleLPG 550
Cdd:pfam03154  442 PPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCP----LPP 516
                          490       500
                   ....*....|....*....|....*....
gi 1844084095  551 QPVATTALELPGQPSVTGVPELPGLPSAT 579
Cdd:pfam03154  517 VQIKEEALDEAEEPESPPPPPRSPSPEPT 545
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
312-524 1.07e-03

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 44.29  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  312 TPTEVYPEPSTSTTMdfPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPkttalelqESSVASAMELPG--PPAT 389
Cdd:TIGR01645  283 TPPDALLQPATVSAI--PAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATP--------SSSLPTDIGNKAvvSSAK 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  390 SMPELQG--PPVTPVLELPGPSATPVPELPGPLstPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPA------PSM 461
Cdd:TIGR01645  353 KEAEEVPplPQAAPAVVKPGPMEIPTPVPPPGL--AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVtfgaldDTL 430
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1844084095  462 GLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTElEQPVGMTTVEH 524
Cdd:TIGR01645  431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMN-SEDASLASQEG 492
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
320-711 3.60e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 3.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  320 PSTSTTMDFPESSAIEALRLPEQPVdvpSEIADSSMTRPQELPELPKTTALELQESSVASAMELPGPPATSMPELQGPPV 399
Cdd:pfam03154  149 PSPQDNESDSDSSAQQQILQTQPPV---LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTA 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  400 TPV-------------LELPGPSATPVPELPGPLSTPVPELPGPPA-TAVPELP-----GPSVTPVPQLSQELPGLPAPS 460
Cdd:pfam03154  226 APHtliqqtptlhpqrLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLhGQMPPMPhslqtGPSHMQHPVPPQPFPLTPQSS 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  461 MGLEPPQEVPEPPVMAQELPGLPLVTAAVELP----EQPAVTVAMELTE-QPVTTTELEQPVGMTTVEHPGHpevttatg 535
Cdd:pfam03154  306 QSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQqpprEQPLPPAPLSMPHiKPPPTTPIPQLPNPQSHKHPPH-------- 377
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  536 LLGQPEATMVLELPGQPVATTALELPG-QPSVTGVPELPGLPSATRALELSGQ-PVATGALELPGPLMAAgalefsgqsg 613
Cdd:pfam03154  378 LSGPSPFQMNSNLPPPPALKPLSSLSThHPPSAHPPPLQLMPQSQQLPPPPAQpPVLTQSQSLPPPAASH---------- 447
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  614 aagalellgqPLATGVLELPGQPGAPELPGQPVATVAleISVQSVVTTSELSTMTVSQS-LEVPSTTALESYNTVAQELP 692
Cdd:pfam03154  448 ----------PPTSGLHQVPSQSPFPQHPFVPGGPPP--ITPPSGPPTSTSSAMPGIQPpSSASVSSSGPVPAAVSCPLP 515
                          410
                   ....*....|....*....
gi 1844084095  693 TTLVGETSVTVGVDPLMAP 711
Cdd:pfam03154  516 PVQIKEEALDEAEEPESPP 534
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
170-460 2.12e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 2.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  170 PTRAFGPSETNESPAVVLEPPVVSMEVSEPHIleTLKPATK-TAELSVVSTSVISEQSEQSVAVMPEPSMTKILdsfAAA 248
Cdd:PHA03247  2704 PPPTPEPAPHALVSATPLPPGPAAARQASPAL--PAAPAPPaVPAGPATPGGPARPARPPTTAGPPAPAPPAAP---AAG 2778
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  249 PVPTTTLVLKSSEPVVTMSVeyqmksvlksveSTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDF 328
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESL------------PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  329 PESSAIEALRLPEQPVD--VPSEIADSSMTRPQElPELPKTTALELQESSVASAMELPGPPATSMPELQGPPVTPVLELP 406
Cdd:PHA03247  2847 PPSLPLGGSVAPGGDVRrrPPSRSPAAKPAAPAR-PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1844084095  407 GPSATPVPELPGPLSTPVPELPGP-----PATAVPE------LPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247  2926 PPQPQPPPPPPPRPQPPLAPTTDPagagePSGAVPQpwlgalVPGRVAVPRFRVPQPAPSREAPA 2990
PHA03379 PHA03379
EBNA-3A; Provisional
340-673 6.97e-07

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 54.68  E-value: 6.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  340 PEQPVDVPSeiadssmtrpqelPELPKttalELQESSVASAMELPGPPATSMPElQGPPVTPVLELPGPSA----TPVPE 415
Cdd:PHA03379   416 PRPPVEKPR-------------PEVPQ----SLETATSHGSAQVPEPPPVHDLE-PGPLHDQHSMAPCPVAqlppGPLQD 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  416 L-PGPLSTPVPELPGPPATAVPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVT-AAVELP 492
Cdd:PHA03379   478 LePGDQLPGVVQDGRPACAPVPAPAGPIVRPwEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPlIAMQGP 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  493 EQPA--VTVAMELTEQPVTTTELEQPVGMTTVEHP--GHPE--VTTATGLLGQPEATMV-----LELPGQPVATTaleLP 561
Cdd:PHA03379   558 GETSgiVRVRERWRPAPWTPNPPRSPSQMSVRDRLarLRAEaqPYQASVEVQPPQLTQVspqqpMEYPLEPEQQM---FP 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  562 GQP--SVTGVPELPGLPS---ATRALELSgQPVATGAlelPGPLMAAGALEFS--GQSGAAGALELLGQPLATGVLE--- 631
Cdd:PHA03379   635 GSPfsQVADVMRAGGVPAmqpQYFDLPLQ-QPISQGA---PLAPLRASMGPVPpvPATQPQYFDIPLTEPINQGASAahf 710
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1844084095  632 LPGQPGAPEL--PGQPVATVALEISVQSVVTTSELSTMTVSQSL 673
Cdd:PHA03379   711 LPQQPMEGPLvpERWMFQGATLSQSVRPGVAQSQYFDLPLTQPI 754
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
258-459 1.87e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 53.23  E-value: 1.87e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  258 KSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKVLEPSETlvvsseTPTEVYPEPSTSTTMDFPESSAIEAL 337
Cdd:NF033839   291 KPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEK------PKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  338 RLPEQPVDVPSEIADSSMTRPQELPELPKTT-----ALELQESSVASAMELPGPPATSMPELQGPPVTPVLELPGPSATP 412
Cdd:NF033839   365 VKPQPEKPKPEVKPQPETPKPEVKPQPEKPKpevkpQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKP 444
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1844084095  413 VPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAP 459
Cdd:NF033839   445 QPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQA 491
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
308-493 3.80e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 52.08  E-value: 3.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  308 VSSETPTE-VYPEPSTSTT--MDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTA---LELQESSVASAM 381
Cdd:NF033839   279 LTQDTPKEpGNKKPSAPKPgmQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVkpqLETPKPEVKPQP 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  382 ELPGPPATSMPELQGPPVTPVLELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQE--LPGLPAP 459
Cdd:NF033839   359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPevKPQPEKP 438
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1844084095  460 SMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPE 493
Cdd:NF033839   439 KPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPK 472
PHA03247 PHA03247
large tegument protein UL36; Provisional
340-647 3.21e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 3.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  340 PEQPVDVPSEIADSSMTRPQELPELPKT--TALELQESSVASAMELPGPPATSMPELQGPPVTPV---LELPGPSATPVP 414
Cdd:PHA03247  2570 PPRPAPRPSEPAVTSRARRPDAPPQSARprAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAanePDPHPPPTVPPP 2649
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  415 ELPGPLSTP--------------VPELPGPPATAVPELPGPSVTPVPQLS----QELPGLPAPSMGLEPPQEVPEPPVMA 476
Cdd:PHA03247  2650 ERPRDDPAPgrvsrprrarrlgrAAQASSPPQRPRRRAARPTVGSLTSLAdpppPPPTPEPAPHALVSATPLPPGPAAAR 2729
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  477 QELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTELEQPVGmTTVEHPGHPEVTTATGLLGQPEATMVLELPGQPVATT 556
Cdd:PHA03247  2730 QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP-PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPP 2808
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  557 ALELPGQPSVTGV----PELPGLPSATRALELSGQPVATGALELPGPLMAAGALEFSGQSGAAGAL----------ELLG 622
Cdd:PHA03247  2809 AAVLAPAAALPPAaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKpaaparppvrRLAR 2888
                          330       340
                   ....*....|....*....|....*
gi 1844084095  623 QPLATGVLELPGQPGAPELPGQPVA 647
Cdd:PHA03247  2889 PAVSRSTESFALPPDQPERPPQPQA 2913
rne PRK10811
ribonuclease E; Reviewed
1289-1481 8.39e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.11  E-value: 8.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1289 PAMsAEPTVLASEPPVMSETAETFDSMRASGHVASEVSTSLLVPAVTTPVLAESILEPPAMAAPESSAMAVLESSAVTVL 1368
Cdd:PRK10811   834 PEM-ASGKVWIRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVV 912
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1369 ESSTVTVLESSTVTvlepsvvtvpePPVVAEPDYVTIPVPVVSALEPSVPVLEPAVSVLQPS----MIVSEPSVSVQEST 1444
Cdd:PRK10811   913 ETTHPEVIAAPVTE-----------QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAetaeVVVAEPEVVAQPAA 981
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1844084095 1445 VTVSEPAVTVSEQTQVIPTEVAIESTPMILESSIMSS 1481
Cdd:PRK10811   982 PVVAEVAAEVETVTAVEPEVAPAQVPEATVEHNHATA 1018
PHA03247 PHA03247
large tegument protein UL36; Provisional
386-648 8.71e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 8.71e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  386 PPATSMPELQGPPVTPVLELPG-PSATPVPELPG--PLSTPVPELPG--PPATAVPELPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247  2569 PPPRPAPRPSEPAVTSRARRPDaPPQSARPRAPVddRGDPRGPAPPSplPPDTHAPDPPPPSPSPAANEPDPHPPPTVPP 2648
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  461 MGLEPPQEVPEPPVMAQELPGLPLVTAAVELPE-------QPAVTVAMELTEQPVTTTELEQPVGMTTVEHPGHPEVTTA 533
Cdd:PHA03247  2649 PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQrprrraaRPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA 2728
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  534 TGLLGQPEATMV----LELPGQPVATTALELPGQPSVTGVPELPGLPSATRALELSGQPVATGALELPG-PLMAAGALEF 608
Cdd:PHA03247  2729 RQASPALPAAPAppavPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESlPSPWDPADPP 2808
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1844084095  609 SGQSGAAGALELLGQPLATGVLELPGQPGAPELPGQPVAT 648
Cdd:PHA03247  2809 AAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
97-579 1.33e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 1.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095   97 TDPTDEIPTKKSKKHKKHKNKKKKKKKEKEKKYKRQPEESE----SKTKSHDDGNIDLESDSFLKfDSEPSAVALELPTR 172
Cdd:pfam03154   55 NDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGASDTEEPErataKKSKTQEISRPNSPSEGEGE-SSDGRSVNDEGSSD 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  173 AFGPSETNESPAVVLEPP----VVSMEVSEPHILETLKPATKTAELSVVSTSVISEQSEQSVAVMPEPSMTkildSFAAA 248
Cdd:pfam03154  134 PKDIDQDNRSTSPSIPSPqdneSDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAP----SVPPQ 209
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  249 PVPTTTLVLKSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDF 328
Cdd:pfam03154  210 GSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHM 289
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  329 PESsaiealrLPEQPVDVPSEIADS------SMTRPQELPELPKTTALELQESSVASAMELPGPPA-TSMPELQGPPVTP 401
Cdd:pfam03154  290 QHP-------VPPQPFPLTPQSSQSqvppgpSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPApLSMPHIKPPPTTP 362
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  402 VLELPGPSATPVP---ELPGPLSTPvPELPGPPA----TAVPELPGPSVTPVP-QL---SQELPGLPAPSMGLEPPQEVP 470
Cdd:pfam03154  363 IPQLPNPQSHKHPphlSGPSPFQMN-SNLPPPPAlkplSSLSTHHPPSAHPPPlQLmpqSQQLPPPPAQPPVLTQSQSLP 441
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  471 EPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPvTTTELEQPVGMTTVEHPGHPEVTTATGLLGQPEATmvleLPG 550
Cdd:pfam03154  442 PPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCP----LPP 516
                          490       500
                   ....*....|....*....|....*....
gi 1844084095  551 QPVATTALELPGQPSVTGVPELPGLPSAT 579
Cdd:pfam03154  517 VQIKEEALDEAEEPESPPPPPRSPSPEPT 545
PHA03378 PHA03378
EBNA-3B; Provisional
181-459 1.83e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.98  E-value: 1.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  181 ESPAVVLEPPVVSMEVSEPhILETLKPATKTAELSVVSTSViseqseqsvavmpEPSMTKILDSFAAAPVPTTTLVLKSS 260
Cdd:PHA03378   486 VTPVILHQPPAQGVQAHGS-MLDLLEKDDEDMEQRVMATLL-------------PPSPPQPRAGRRAPCVYTEDLDIESD 551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  261 EPVVTMSVEYQMKSV-----LKSVESTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDFPESSAIE 335
Cdd:PHA03378   552 EPASTEPVHDQLLPApglgpLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPI 631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  336 ALRL---------------PEQPVDVPSEIADSSMTRPQELPELPKTT--ALELQESSVASAMELPGPPATSMPELQGPP 398
Cdd:PHA03378   632 PMRPlrmqpitfnvlvfptPHQPPQVEITPYKPTWTQIGHIPYQPSPTgaNTMLPIQWAPGTMQPPPRAPTPMRPPAAPP 711
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1844084095  399 V------TPVLELPGPSATP-VPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSqelPGLPAP 459
Cdd:PHA03378   712 GraqrpaAATGRARPPAAAPgRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA---PGAPTP 776
rne PRK10811
ribonuclease E; Reviewed
1220-1381 2.58e-04

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 46.57  E-value: 2.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1220 ISEPSAVPTDYSVSASDPSVLVSEAAVTVPEPPPEpessiTLTPVESAVVAEEHEVVPERPVtcmVSETPAMSAEPTVLA 1299
Cdd:PRK10811   843 IRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAA-----AVEPVVSAPVVEAVAEVVEEPV---VVAEPQPEEVVVVET 914
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1300 SEPPVMSETA-ETFDSMRASGHVASEVSTSLLVPAVTTPVLAESILEPPAMAAPESSAMAVLESSAVTVLESSTVTVLES 1378
Cdd:PRK10811   915 THPEVIAAPVtEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETV 994

                   ...
gi 1844084095 1379 STV 1381
Cdd:PRK10811   995 TAV 997
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
369-452 3.92e-04

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 41.99  E-value: 3.92e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  369 ALELQESSVASAMELPGPPATSMPelqgPPVTPVLELPGPSATPVPELPGPlsTPVPELPGPPATAVPELPGPSVTPVPQ 448
Cdd:pfam12526   31 PPESAHPDPPPPVGDPRPPVVDTP----PPVSAVWVLPPPSEPAAPEPDLV--PPVTGPAGPPSPLAPPAPAQKPPLPPP 104

                   ....
gi 1844084095  449 LSQE 452
Cdd:pfam12526  105 RPQR 108
rne PRK10811
ribonuclease E; Reviewed
1271-1444 9.78e-04

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 44.65  E-value: 9.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1271 EEHEVVPERPVTcMVSETPAMSAEPTVLASEPPVMSETAETFDSMRASGHVASE--VSTSLLVPAVTTPVLAESILEPPA 1348
Cdd:PRK10811   851 QDVQVEEQREAE-EVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEpqPEEVVVVETTHPEVIAAPVTEQPQ 929
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1349 MAAPESSAMAvlessAVTVLESSTVTVLESSTVTVLEPSVVTVPEPpvvAEPDYVTIPVPVVSALEPSVPVLEPAVSVLQ 1428
Cdd:PRK10811   930 VITESDVAVA-----QEVAEHAEPVVEPQDETADIEEAAETAEVVV---AEPEVVAQPAAPVVAEVAAEVETVTAVEPEV 1001
                          170
                   ....*....|....*.
gi 1844084095 1429 PSMIVSEPSVSVQEST 1444
Cdd:PRK10811  1002 APAQVPEATVEHNHAT 1017
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
312-524 1.07e-03

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 44.29  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  312 TPTEVYPEPSTSTTMdfPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPkttalelqESSVASAMELPG--PPAT 389
Cdd:TIGR01645  283 TPPDALLQPATVSAI--PAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATP--------SSSLPTDIGNKAvvSSAK 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  390 SMPELQG--PPVTPVLELPGPSATPVPELPGPLstPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPA------PSM 461
Cdd:TIGR01645  353 KEAEEVPplPQAAPAVVKPGPMEIPTPVPPPGL--AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVtfgaldDTL 430
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1844084095  462 GLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTElEQPVGMTTVEH 524
Cdd:TIGR01645  431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMN-SEDASLASQEG 492
PHA03247 PHA03247
large tegument protein UL36; Provisional
318-525 1.44e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  318 PEPSTSTTMDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQEL----PELPKTTALElqessvASAMELPGPPATSMP- 392
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspagPLPPPTSAQP------TAPPPPPGPPPPSLPl 2852
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  393 ----------ELQGPPVTPVLELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMG 462
Cdd:PHA03247  2853 ggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1844084095  463 LEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPA-----VTVAMELTEQPVTTTELEQPVGMTTVEHP 525
Cdd:PHA03247  2933 PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAlvpgrVAVPRFRVPQPAPSREAPASSTPPLTGHS 3000
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
387-558 1.55e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 43.70  E-value: 1.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  387 PATSMPELQGPPVtPVLELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMGLEPP 466
Cdd:PRK07994   361 PAAPLPEPEVPPQ-SAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAK 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  467 QEVPEPPVMAQELPGL-----PLVTAAVELPEQPAVTVAMELTEQPVTTTELEQPVGMT----TVEHPGHPEVTTATGLL 537
Cdd:PRK07994   440 KSEPAAASRARPVNSAlerlaSVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKalkkALEHEKTPELAAKLAAE 519
                          170       180
                   ....*....|....*....|....*.
gi 1844084095  538 GQPE---ATMV--LELPGqPVATTAL 558
Cdd:PRK07994   520 AIERdpwAALVsqLGLPG-LVEQLAL 544
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
319-677 2.62e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.24  E-value: 2.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  319 EPSTSTTMDFPESSAIEALRLPeQPVDVPSEIADSSMTRPQEL---PELPKTTALELQESSVASAMELP-GPPATSMPEL 394
Cdd:PHA03307     1 SDNAPDLYDLIEAAAEGGEFFP-RPPATPGDAADDLLSGSQGQlvsDSAELAAVTVVAGAAACDRFEPPtGPPPGPGTEA 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  395 QGPPVTPVLELPGPSATPVPELPGPLSTP--VPELPGPPATAVPELPGPSvtPVPQLSQELPGLPAPSMGLEPPQEVPEP 472
Cdd:PHA03307    80 PANESRSTPTWSLSTLAPASPAREGSPTPpgPSSPDPPPPTPPPASPPPS--PAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  473 PVMAqelpglplVTAAVELPEQPAVTVAM-ELTEQPVTTTELEQPVGMTTVEHPGHPEVttatglLGQPEATMVLELPGQ 551
Cdd:PHA03307   158 SPAA--------VASDAASSRQAALPLSSpEETARAPSSPPAEPPPSTPPAAASPRPPR------RSSPISASASSPAPA 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  552 PVATTALELPGQPSVTGVPELPGLPSATRALELSGQPvatGALELPGPLMAA-----GALEFSGQSGAAGALELLGQPla 626
Cdd:PHA03307   224 PGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRP---APITLPTRIWEAsgwngPSSRPGPASSSSSPRERSPSP-- 298
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1844084095  627 tgvleLPGQPGAPELPGQPVAtVALEISVQSVVTTSELSTMTVSQSLEVPS 677
Cdd:PHA03307   299 -----SPSSPGSGPAPSSPRA-SSSSSSSRESSSSSTSSSSESSRGAAVSP 343
rne PRK10811
ribonuclease E; Reviewed
1262-1472 3.58e-03

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 42.72  E-value: 3.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1262 TPVESAVVAEEHEVVPERPVTCMVSETPAMSAEPTVLASEPPVMSETAETfdsmrasghvASEVSTSLLVPAVTTPVLAE 1341
Cdd:PRK10811   853 VQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVA----------EPQPEEVVVVETTHPEVIAA 922
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095 1342 SILEPPAMAAPESSAMAvlessAVTVLESSTVTvlesstvtvlepsvvtvpeppvvaEPDYVTIPVPVVSALEPsVPVLE 1421
Cdd:PRK10811   923 PVTEQPQVITESDVAVA-----QEVAEHAEPVV------------------------EPQDETADIEEAAETAE-VVVAE 972
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1844084095 1422 PAVSVLQPSMIVSEPSVSVQESTVtvsEPAVTVSEQTQVIPTEVAIESTPM 1472
Cdd:PRK10811   973 PEVVAQPAAPVVAEVAAEVETVTA---VEPEVAPAQVPEATVEHNHATAPM 1020
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
320-711 3.60e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 3.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  320 PSTSTTMDFPESSAIEALRLPEQPVdvpSEIADSSMTRPQELPELPKTTALELQESSVASAMELPGPPATSMPELQGPPV 399
Cdd:pfam03154  149 PSPQDNESDSDSSAQQQILQTQPPV---LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTA 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  400 TPV-------------LELPGPSATPVPELPGPLSTPVPELPGPPA-TAVPELP-----GPSVTPVPQLSQELPGLPAPS 460
Cdd:pfam03154  226 APHtliqqtptlhpqrLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLhGQMPPMPhslqtGPSHMQHPVPPQPFPLTPQSS 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  461 MGLEPPQEVPEPPVMAQELPGLPLVTAAVELP----EQPAVTVAMELTE-QPVTTTELEQPVGMTTVEHPGHpevttatg 535
Cdd:pfam03154  306 QSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQqpprEQPLPPAPLSMPHiKPPPTTPIPQLPNPQSHKHPPH-------- 377
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  536 LLGQPEATMVLELPGQPVATTALELPG-QPSVTGVPELPGLPSATRALELSGQ-PVATGALELPGPLMAAgalefsgqsg 613
Cdd:pfam03154  378 LSGPSPFQMNSNLPPPPALKPLSSLSThHPPSAHPPPLQLMPQSQQLPPPPAQpPVLTQSQSLPPPAASH---------- 447
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  614 aagalellgqPLATGVLELPGQPGAPELPGQPVATVAleISVQSVVTTSELSTMTVSQS-LEVPSTTALESYNTVAQELP 692
Cdd:pfam03154  448 ----------PPTSGLHQVPSQSPFPQHPFVPGGPPP--ITPPSGPPTSTSSAMPGIQPpSSASVSSSGPVPAAVSCPLP 515
                          410
                   ....*....|....*....
gi 1844084095  693 TTLVGETSVTVGVDPLMAP 711
Cdd:pfam03154  516 PVQIKEEALDEAEEPESPP 534
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
205-534 8.82e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 8.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  205 LKPATKTAELSVVSTSVISEQSEQSVAVMPEPSMTKILDSFAAAPVPTTTLVLKSSEPVVT-MSVEYQMKSVLKSVESTS 283
Cdd:pfam05109  396 LGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTnLTAPASTGPTVSTADVTS 475
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  284 PEPSKIMLVEPPVAKVLEP-------------SETLVVSSETPTEVYPEPSTSTTMDFPESSAIEALRlPEQPVDVPSEI 350
Cdd:pfam05109  476 PTPAGTTSGASPVTPSPSPrdngteskapdmtSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTS-PTSAVTTPTPN 554
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  351 ADSSMtrPQELPELPKTTALELQESSVASAMELPGPPATSmpelqgppvtPVLELPGPSA-TPVPELPGPLSTPVPELPG 429
Cdd:pfam05109  555 ATSPT--PAVTTPTPNATIPTLGKTSPTSAVTTPTPNATS----------PTVGETSPQAnTTNHTLGGTSSTPVVTSPP 622
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1844084095  430 PPATAVPELPGPSVTPVPQLSQELpglpAPSMGLEPPQEVPEPPVMAQelpgLPLVTAAVELPEQ------PAVTVAMEL 503
Cdd:pfam05109  623 KNATSAVTTGQHNITSSSTSSMSL----RPSSISETLSPSTSDNSTSH----MPLLTSAHPTGGEnitqvtPASTSTHHV 694
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1844084095  504 TE-----QPVTTTELEQPVGMTTVEHPGHPEVTTAT 534
Cdd:pfam05109  695 STsspapRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH