|
Name |
Accession |
Description |
Interval |
E-value |
| R3H_encore_like |
cd02642 |
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ... |
168-229 |
3.40e-27 |
|
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100071 Cd Length: 63 Bit Score: 104.99 E-value: 3.40e-27
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811 168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642 1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
|
|
| R3H |
smart00393 |
Putative single-stranded nucleic acids-binding domain; |
152-229 |
1.33e-14 |
|
Putative single-stranded nucleic acids-binding domain;
Pssm-ID: 214647 Cd Length: 79 Bit Score: 69.64 E-value: 1.33e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 152 IDLHEFLVNTLKKNPRDRMMLLKLEQEILDFINdNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTGKA--VIINKT 229
Cdd:smart00393 1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGPKrrVVISKK 79
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
419-768 |
1.64e-11 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 68.50 E-value: 1.64e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 419 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 496
Cdd:pfam09606 124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 497 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 576
Cdd:pfam09606 204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 577 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 651
Cdd:pfam09606 282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 652 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 726
Cdd:pfam09606 362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 109730811 727 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 768
Cdd:pfam09606 442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
|
|
| R3H |
pfam01424 |
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ... |
170-228 |
1.35e-10 |
|
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.
Pssm-ID: 460206 Cd Length: 60 Bit Score: 57.50 E-value: 1.35e-10
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109730811 170 MMLLKLEQEILDFINDNNNQFKkFPQMTSYHRMLLHRVAAYFGMDHNV--DQTGKAVIINK 228
Cdd:pfam01424 1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
437-816 |
2.73e-06 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 51.60 E-value: 2.73e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 437 PVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFgqmSLSRQGSTEAADPSSALFQP--PLISQHPQQASFI 514
Cdd:PHA03379 447 PVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVV---QDGRPACAPVPAPAGPIVRPweASLSQVPGVAFAP 523
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 515 MAS---AGQPLPTSNYSTSSHAPPtqqvLPPQGYMQPP------QQIQVSYYPPGQYPNSNQQYRPLshPVAYSPQRGQQ 585
Cdd:PHA03379 524 VMPqpmPVEPVPVPTVALERPVCP----APPLIAMQGPgetsgiVRVRERWRPAPWTPNPPRSPSQM--SVRDRLARLRA 597
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 586 LPQASQQP-GLQPMMSNQQQTayqgmlgvQQPQNQGLLSNQRSSMGGQMQGL--VVQYTPLPSYQVPvgSDSQNVVQPSF 662
Cdd:PHA03379 598 EAQPYQASvEVQPPQLTQVSP--------QQPMEYPLEPEQQMFPGSPFSQVadVMRAGGVPAMQPQ--YFDLPLQQPIS 667
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 663 QQPMLVPASQSVQGGLPTGGVPVYYSMIPPAQQ--NGTSPSVGFLQPPGS-------EQYQMPQSPSPCSPPQMSQQYSG 733
Cdd:PHA03379 668 QGAPLAPLRASMGPVPPVPATQPQYFDIPLTEPinQGASAAHFLPQQPMEgplvperWMFQGATLSQSVRPGVAQSQYFD 747
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 734 VSPSGPgvvvMQLNVPNGPQAPQNPSMVQWshckyysVEQRGQKPGDLYSPdGSPQANAQMGSSPVTSPTQSPAPSPVTS 813
Cdd:PHA03379 748 LPLTQP----INHGAPAAHFLHQPPMEGPW-------VPEQWMFQGAPPSQ-GTDVVQHQLDALGYVLHVLNHPGVPVSP 815
|
...
gi 109730811 814 LSN 816
Cdd:PHA03379 816 AVN 818
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
474-643 |
1.62e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.57 E-value: 1.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 474 LSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASfimasAGQPLptsNYstsshaPPTQQVLPPQGYMQPPQQIQ 553
Cdd:TIGR01628 371 LQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQF-----NGQPL---GW------PRMSMMPTPMGPGGPLRPNG 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 554 VSYYPPGQYPNSNQQYRPLSHPVA---YSPQ-RGQQLPQASQQPGLQPMMSNQQQTAYQgMLGVQQPQnqgllsNQRSSM 629
Cdd:TIGR01628 437 LAPMNAVRAPSRNAQNAAQKPPMQpvmYPPNyQSLPLSQDLPQPQSTASQGGQNKKLAQ-VLASATPQ------MQKQVL 509
|
170
....*....|....
gi 109730811 630 GGQMQGLVVQYTPL 643
Cdd:TIGR01628 510 GERLFPLVEAIEPA 523
|
|
| Amelin |
smart00817 |
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ... |
485-620 |
4.83e-03 |
|
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.
Pssm-ID: 214832 [Multi-domain] Cd Length: 411 Bit Score: 40.64 E-value: 4.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 485 RQGSTEAADPSSALFQPPLISQHPQQAsfimasagqPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPN 564
Cdd:smart00817 82 REHETQQYEYSLPVHPPPLPSQPSLQP---------QQPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPPLQQAELPM 152
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811 565 SNQQYRPLSHP-------VAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQG 620
Cdd:smart00817 153 IPPQVAPSDKPpqtelplYDFADPQNPLLFQIAHLMSRGPMPQNKQQHLYPGLFYMSYGANQL 215
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| R3H_encore_like |
cd02642 |
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ... |
168-229 |
3.40e-27 |
|
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100071 Cd Length: 63 Bit Score: 104.99 E-value: 3.40e-27
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811 168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642 1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
|
|
| R3H |
smart00393 |
Putative single-stranded nucleic acids-binding domain; |
152-229 |
1.33e-14 |
|
Putative single-stranded nucleic acids-binding domain;
Pssm-ID: 214647 Cd Length: 79 Bit Score: 69.64 E-value: 1.33e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 152 IDLHEFLVNTLKKNPRDRMMLLKLEQEILDFINdNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTGKA--VIINKT 229
Cdd:smart00393 1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGPKrrVVISKK 79
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
419-768 |
1.64e-11 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 68.50 E-value: 1.64e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 419 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 496
Cdd:pfam09606 124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 497 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 576
Cdd:pfam09606 204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 577 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 651
Cdd:pfam09606 282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 652 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 726
Cdd:pfam09606 362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 109730811 727 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 768
Cdd:pfam09606 442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
|
|
| R3H |
cd02325 |
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most ... |
172-228 |
3.74e-11 |
|
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100064 Cd Length: 59 Bit Score: 59.16 E-value: 3.74e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 109730811 172 LLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG--KAVIINK 228
Cdd:cd02325 1 REEREEELEAFAKDAAGKSLELPPMNSYERKLIHDLAEYYGLKSESEGEGpnRRVVITK 59
|
|
| R3H |
pfam01424 |
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ... |
170-228 |
1.35e-10 |
|
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.
Pssm-ID: 460206 Cd Length: 60 Bit Score: 57.50 E-value: 1.35e-10
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109730811 170 MMLLKLEQEILDFINDNNNQFKkFPQMTSYHRMLLHRVAAYFGMDHNV--DQTGKAVIINK 228
Cdd:pfam01424 1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
439-667 |
1.08e-07 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 56.20 E-value: 1.08e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 439 PALQPSPQPVQFSPSSCPQVLLPVSPPQQ-----------YNMAEDLSNPFGQMSLSRQGSTEAADPSSALFQPPLISQH 507
Cdd:pfam09770 107 PAARAAQSSAQPPASSLPQYQYASQQSQQpskpvrtgyekYKEPEPIPDLQVDASLWGVAPKKAAAPAPAPQPAAQPASL 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 508 PQQASFIM------------ASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQvsyyPPGQYPNSNQQYRPLSHP 575
Cdd:pfam09770 187 PAPSRKMMsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ----QPQQQPQQPQQHPGQGHP 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 576 VAYspqrgQQLPQASQQPGLQPMMSNQQQTAYQGMLGV-QQP----QNQGLLSNQRSSMGGQMQGlVVQYTPLPSYQVPV 650
Cdd:pfam09770 263 VTI-----LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVpVQPtqilQNPNRLSAARVGYPQNPQP-GVQPAPAHQAHRQQ 336
|
250
....*....|....*..
gi 109730811 651 GSDSQNVVQPSFQQPML 667
Cdd:pfam09770 337 GSFGRQAPIITHPQQLA 353
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
437-816 |
2.73e-06 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 51.60 E-value: 2.73e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 437 PVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFgqmSLSRQGSTEAADPSSALFQP--PLISQHPQQASFI 514
Cdd:PHA03379 447 PVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVV---QDGRPACAPVPAPAGPIVRPweASLSQVPGVAFAP 523
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 515 MAS---AGQPLPTSNYSTSSHAPPtqqvLPPQGYMQPP------QQIQVSYYPPGQYPNSNQQYRPLshPVAYSPQRGQQ 585
Cdd:PHA03379 524 VMPqpmPVEPVPVPTVALERPVCP----APPLIAMQGPgetsgiVRVRERWRPAPWTPNPPRSPSQM--SVRDRLARLRA 597
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 586 LPQASQQP-GLQPMMSNQQQTayqgmlgvQQPQNQGLLSNQRSSMGGQMQGL--VVQYTPLPSYQVPvgSDSQNVVQPSF 662
Cdd:PHA03379 598 EAQPYQASvEVQPPQLTQVSP--------QQPMEYPLEPEQQMFPGSPFSQVadVMRAGGVPAMQPQ--YFDLPLQQPIS 667
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 663 QQPMLVPASQSVQGGLPTGGVPVYYSMIPPAQQ--NGTSPSVGFLQPPGS-------EQYQMPQSPSPCSPPQMSQQYSG 733
Cdd:PHA03379 668 QGAPLAPLRASMGPVPPVPATQPQYFDIPLTEPinQGASAAHFLPQQPMEgplvperWMFQGATLSQSVRPGVAQSQYFD 747
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 734 VSPSGPgvvvMQLNVPNGPQAPQNPSMVQWshckyysVEQRGQKPGDLYSPdGSPQANAQMGSSPVTSPTQSPAPSPVTS 813
Cdd:PHA03379 748 LPLTQP----INHGAPAAHFLHQPPMEGPW-------VPEQWMFQGAPPSQ-GTDVVQHQLDALGYVLHVLNHPGVPVSP 815
|
...
gi 109730811 814 LSN 816
Cdd:PHA03379 816 AVN 818
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
417-593 |
9.50e-06 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 49.70 E-value: 9.50e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 417 PALPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFGQMSLSRQGSTEAADPSS 496
Cdd:PRK10263 336 PVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQP 415
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 497 AlfQPPLISQHPQQASFIMASAGQPlPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYpnsnQQYRPLSHPV 576
Cdd:PRK10263 416 A--QQPYYAPAPEQPAQQPYYAPAP-EQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLY----QQPQPVEQQP 488
|
170
....*....|....*..
gi 109730811 577 AYSPQRGQQLPQASQQP 593
Cdd:PRK10263 489 VVEPEPVVEETKPARPP 505
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
410-710 |
5.42e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.45 E-value: 5.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 410 QQQQQQLPALPPTPQHQPPlnnhmisqPVPALQPSPQPVQFSPSSCPQVLLPVSPP--QQYNMAEDLSNPFGQMSLSRQG 487
Cdd:pfam03154 170 QPPVLQAQSGAASPPSPPP--------PGTTQAATAGPTPSAPSVPPQGSPATSQPpnQTQSTAAPHTLIQQTPTLHPQR 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 488 STEAADPSSALFQPPLISQHPQQA--SFIMASAGQPLPTSNYSTSSHAPptqQVLPPQGYMQPPQ--QIQVSYYPPGQYP 563
Cdd:pfam03154 242 LPSPHPPLQPMTQPPPPSQVSPQPlpQPSLHGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQssQSQVPPGPSPAAP 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 564 NSNQQ--YRPLSHPVAYSPQ--RGQQLPQASQQ------------PGLQPMMSNQQQTAYQGMLGVQQPQN--------- 618
Cdd:pfam03154 319 GQSQQriHTPPSQSQLQSQQppREQPLPPAPLSmphikpppttpiPQLPNPQSHKHPPHLSGPSPFQMNSNlppppalkp 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 619 -QGLLSNQRSSMGGQMQGLVVQYTPLPS--YQVPVGSDSQNVVQPSFQQPmlvPASQSVQGG----------LPTGGVPV 685
Cdd:pfam03154 399 lSSLSTHHPPSAHPPPLQLMPQSQQLPPppAQPPVLTQSQSLPPPAASHP---PTSGLHQVPsqspfpqhpfVPGGPPPI 475
|
330 340
....*....|....*....|....*
gi 109730811 686 YYSMIPPAQqngTSPSVGFLQPPGS 710
Cdd:pfam03154 476 TPPSGPPTS---TSSAMPGIQPPSS 497
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
325-587 |
6.12e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 6.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 325 PRPWSS----TDSDGSVRSMRPPVTKASSfsgisilTRGDSIGSSKGGSAGRLSRPGMALGAPEVC---NQVTSPQSVRG 397
Cdd:PHA03247 2712 PHALVSatplPPGPAAARQASPALPAAPA-------PPAVPAGPATPGGPARPARPPTTAGPPAPAppaAPAAGPPRRLT 2784
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 398 LLPCTAQQQQQQQQQQQQLPALPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQvllPVSPPQQYNMAEDLSNP 477
Cdd:PHA03247 2785 RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG---PPPPSLPLGGSVAPGGD 2861
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 478 FGQMSLSRQGSTEAADPS----SALFQPPLisqHPQQASFIMASAGQPLPTSnySTSSHAPPTQQVLPPQGYMQPPQQIQ 553
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPArppvRRLARPAV---SRSTESFALPPDQPERPPQ--PQAPPPPQPQPQPPPPPQPQPPPPPP 2936
|
250 260 270
....*....|....*....|....*....|....
gi 109730811 554 VSYYPPGQyPNSNQQYRPLSHPVAYSPQRGQQLP 587
Cdd:PHA03247 2937 PRPQPPLA-PTTDPAGAGEPSGAVPQPWLGALVP 2969
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
494-645 |
7.95e-05 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 47.00 E-value: 7.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 494 PSSALFQPPLI-SQHPQQasfimasagQPLPTSNYSTSSH-APPTQQVLPPQGYMQPPQQiqvsYYPPGQYPNSNQQYRP 571
Cdd:PRK10263 740 PHEPLFTPIVEpVQQPQQ---------PVAPQQQYQQPQQpVAPQPQYQQPQQPVAPQPQ----YQQPQQPVAPQPQYQQ 806
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109730811 572 LSHPVAYSPQRGQ-QLPQASQQPGLQPmmsnQQQTAyqgmlgvqqPQNQGLLSNQRSSMGGQMQGLVVQYTPLPS 645
Cdd:PRK10263 807 PQQPVAPQPQYQQpQQPVAPQPQYQQP----QQPVA---------PQPQDTLLHPLLMRNGDSRPLHKPTTPLPS 868
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
474-643 |
1.62e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.57 E-value: 1.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 474 LSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASfimasAGQPLptsNYstsshaPPTQQVLPPQGYMQPPQQIQ 553
Cdd:TIGR01628 371 LQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQF-----NGQPL---GW------PRMSMMPTPMGPGGPLRPNG 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 554 VSYYPPGQYPNSNQQYRPLSHPVA---YSPQ-RGQQLPQASQQPGLQPMMSNQQQTAYQgMLGVQQPQnqgllsNQRSSM 629
Cdd:TIGR01628 437 LAPMNAVRAPSRNAQNAAQKPPMQpvmYPPNyQSLPLSQDLPQPQSTASQGGQNKKLAQ-VLASATPQ------MQKQVL 509
|
170
....*....|....
gi 109730811 630 GGQMQGLVVQYTPL 643
Cdd:TIGR01628 510 GERLFPLVEAIEPA 523
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
417-704 |
7.93e-04 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 43.50 E-value: 7.93e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 417 PALPPTPQHQPPLNNHMISQPVPALQpsPQPVQFSPSScpqvllPVSPPQQYNMAEDLS--NPFGQMSLSRQGSTEAADP 494
Cdd:PHA03377 672 PATQSTPPRPSWLPSVFVLPSVDAGR--AQPSEESHLS------SMSPTQPISHEEQPRyeDPDDPLDLSLHPDQAPPPS 743
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 495 SSALFQPPLISQHPQqasfimasagqplptSNYSTSSHAPPTQqvLPPQGYMQPP-QQIQVSYYPPGQYP-NSNQQYRPL 572
Cdd:PHA03377 744 HQAPYSGHEEPQAQQ---------------APYPGYWEPRPPQ--APYLGYQEPQaQGVQVSSYPGYAGPwGLRAQHPRY 806
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 573 SHPVAYSPQR----GQQLPQASQQPGLQPmmsnqqqtAYQGMLGVQQPQNQGLLSNQRSSMGGQMQGLVVQYTPLPsyQV 648
Cdd:PHA03377 807 RHSWAYWSQYpghgHPQGPWAPRPPHLPP--------QWDGSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPYS--QT 876
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 109730811 649 PVGSDSQNVVQPSFQQPMlvpasQSVQGGLPTGGVPVYYSMIPPAQQNGTS-PSVGF 704
Cdd:PHA03377 877 LVSSSAPSWSSPQPRAPI-----RPIPTRFPPPPMPLQDSMAVGCDSSGTAcPSMPF 928
|
|
| G_path_suppress |
pfam15991 |
G-protein pathway suppressor; This family of proteins inhibits G-protein- and ... |
440-599 |
1.16e-03 |
|
G-protein pathway suppressor; This family of proteins inhibits G-protein- and mitogen-activated protein kinase-mediated signal transduction.
Pssm-ID: 464961 [Multi-domain] Cd Length: 272 Bit Score: 41.83 E-value: 1.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 440 ALQPSPQPVQFSPSS-CPQVLLPVSPPQQYNMAEDLSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASFIMASA 518
Cdd:pfam15991 103 ALQQAAAQVFLPQLSmQGQPHHQQHPGPQVGVLKRTRSPSPPVQQQAYYKQPAFSPGYAEHGQQKHDDGRRGYDVARFGS 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 519 GQPLPTSNYSTSSHAPPTQQVLPP-QGYMQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAY---SPQRGQQLPQA--SQQ 592
Cdd:pfam15991 183 WNKSTAQYPPSGQLFYPTHQYLPPpQTQGQADARLQTIYPQPGYALPLQQQYEHANQPSPFvssSPLKQMQSPKAgpGPQ 262
|
....*..
gi 109730811 593 PGLQPMM 599
Cdd:pfam15991 263 PMQLSVL 269
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
488-687 |
1.63e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.38 E-value: 1.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 488 STEAADPSSALFQPPLISQHPQqasfimaSAGQPLPTSNYSTSSHAPPtqqvlpPQGYMQPPQ--QIQVSYYPPGQYPNS 565
Cdd:PRK10263 336 PVEPVTQTPPVASVDVPPAQPT-------VAWQPVPGPQTGEPVIAPA------PEGYPQQSQyaQPAVQYNEPLQQPVQ 402
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 566 NQQyrPLSHPVAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQgLLSNQRSsmggqmqglvvqYTPLPS 645
Cdd:PRK10263 403 PQQ--PYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQS-TFAPQST------------YQTEQT 467
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 109730811 646 YQVPVGSDSQNVVQPSFQQPMLVPASQSVQGGLPTGGvPVYY 687
Cdd:PRK10263 468 YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARP-PLYY 508
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
324-711 |
3.05e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.61 E-value: 3.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 324 EPRPWSSTDSDGSVRSMrPPVTKASSfsgISILTRGDSIGSSKGGSAGRLSRPGMAL---GAPEvcnqvtsPQSVRGLLP 400
Cdd:PRK10263 536 EPEPIKSSLKAPSVAAV-PPVEAAAA---VSPLASGVKKATLATGAAATVAAPVFSLansGGPR-------PQVKEGIGP 604
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 401 ctaqqqqqqqqqqqqlpALPPTPQHQPPLNNHMISQPV--PALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAED-LSNP 477
Cdd:PRK10263 605 -----------------QLPRPKRIRVPTRRELASYGIklPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDeLARQ 667
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 478 FGQMSLSRQGST-------EAADPSSALfQPPLISQ--HPQQASFimaSAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQP 548
Cdd:PRK10263 668 FAQTQQQRYGEQyqhdvpvNAEDADAAA-EAELARQfaQTQQQRY---SGEQPAGANPFSLDDFEFSPMKALLDDGPHEP 743
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 549 pqqiqvsYYPPGQYPNSNQQYRPLSHPVAYSPQRgqqlPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQrss 628
Cdd:PRK10263 744 -------LFTPIVEPVQQPQQPVAPQQQYQQPQQ----PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ--- 809
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 629 mggqmqglvvQYTPLPSYQVPvgsDSQNVVQPSFQQPMLVPASQSVQgglptggvpvyySMIPP-AQQNGTS-------- 699
Cdd:PRK10263 810 ----------PVAPQPQYQQP---QQPVAPQPQYQQPQQPVAPQPQD------------TLLHPlLMRNGDSrplhkptt 864
|
410
....*....|....
gi 109730811 700 --PSVGFLQPPGSE 711
Cdd:PRK10263 865 plPSLDLLTPPPSE 878
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
581-711 |
3.79e-03 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 40.95 E-value: 3.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 581 QRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQ----PQNQGLLSNQRSSMGGQMQGLVVQYTPLPSYQVPvGSDSQN 656
Cdd:TIGR01628 374 QFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQfngqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP-SRNAQN 452
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 109730811 657 VVQPSFQQPMlvPASQSVQGGLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSE 711
Cdd:TIGR01628 453 AAQKPPMQPV--MYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQ 505
|
|
| Amelin |
smart00817 |
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ... |
485-620 |
4.83e-03 |
|
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.
Pssm-ID: 214832 [Multi-domain] Cd Length: 411 Bit Score: 40.64 E-value: 4.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 485 RQGSTEAADPSSALFQPPLISQHPQQAsfimasagqPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPN 564
Cdd:smart00817 82 REHETQQYEYSLPVHPPPLPSQPSLQP---------QQPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPPLQQAELPM 152
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811 565 SNQQYRPLSHP-------VAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQG 620
Cdd:smart00817 153 IPPQVAPSDKPpqtelplYDFADPQNPLLFQIAHLMSRGPMPQNKQQHLYPGLFYMSYGANQL 215
|
|
| PRK14948 |
PRK14948 |
DNA polymerase III subunit gamma/tau; |
380-593 |
5.64e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237862 [Multi-domain] Cd Length: 620 Bit Score: 40.72 E-value: 5.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 380 LGAPEVCNQVTSPQSVRGLLPCTAQQQQQQQQQQQQLPALPPTPQHQPPLNN-HMISQPVPALQPSPQPVQfspsscpqv 458
Cdd:PRK14948 360 LPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKAsPPIPVPAEPTEPSPTPPA--------- 430
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 459 lLPVSPPQQYNMAE-------DLSNPFGQMSLSRQGSTEAADPSSALFQ------------PPLIS-------QHPQQAS 512
Cdd:PRK14948 431 -NAANAPPSLNLEElwqqilaKLELPSTRMLLSQQAELVSLDSNRAVIAvspnwlgmvqsrKPLLEqafakvlGRSIKLN 509
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 513 FIMASAGQPlptsnySTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAYSPQRGQQLPQASQQ 592
Cdd:PRK14948 510 LESQSGSAS------NTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEE 583
|
.
gi 109730811 593 P 593
Cdd:PRK14948 584 P 584
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
480-708 |
6.39e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 40.40 E-value: 6.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 480 QMSLSRQGSTEAADPSSALfQPPLISQHPQQASFIMASAGQPLPTS--NYSTSSHAPPTQ--QVL---PPQGYMQPPQQI 552
Cdd:pfam09770 99 QVRFNRQQPAARAAQSSAQ-PPASSLPQYQYASQQSQQPSKPVRTGyeKYKEPEPIPDLQvdASLwgvAPKKAAAPAPAP 177
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 553 QVSYYPPGQYPNSNQ---------QYRplshpvAYSPQRGQQLPQASQQP--GLQPMMSNQQQTAYQGMLGVQQPQNQGL 621
Cdd:pfam09770 178 QPAAQPASLPAPSRKmmsleeveaAMR------AQAKKPAQQPAPAPAQPpaAPPAQQAQQQQQFPPQIQQQQQPQQQPQ 251
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 622 LSNQRSSMGGQMQGL---------VVQYTPLPSYQVPVGSDSQNVVQPS--FQQPMLVPASQSVQGGLPTGGVPVyysmi 690
Cdd:pfam09770 252 QPQQHPGQGHPVTILqrpqspqpdPAQPSIQPQAQQFHQQPPPVPVQPTqiLQNPNRLSAARVGYPQNPQPGVQP----- 326
|
250
....*....|....*...
gi 109730811 691 PPAQQNGTSPSVGFLQPP 708
Cdd:pfam09770 327 APAHQAHRQQGSFGRQAP 344
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
516-836 |
8.87e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.14 E-value: 8.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 516 ASAGQPLPTSNYSTSShAPPTQQVLPPQgymQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAYSPQRGQQLPQASQQPGL 595
Cdd:pfam03154 144 TSPSIPSPQDNESDSD-SSAQQQILQTQ---PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPN 219
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 596 QPMMSNQQQTAYQGMLGVQQPQnqglLSNQRSSMggqmqglvvqyTPLPSYQVPVGSDSQNVVQPSFQQPMlVPASQSVQ 675
Cdd:pfam03154 220 QTQSTAAPHTLIQQTPTLHPQR----LPSPHPPL-----------QPMTQPPPPSQVSPQPLPQPSLHGQM-PPMPHSLQ 283
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 676 GGLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQMSQQYSGVSPSGPGVVVM-QLNVPNGPQA 754
Cdd:pfam03154 284 TGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMpHIKPPPTTPI 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811 755 PQNPSMVQWSHCKYYSVEQRGQKPGDLYSPDG-----------SPQAN----------AQMGSSPVTSP--TQSPAPSPV 811
Cdd:pfam03154 364 PQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAlkplsslsthhPPSAHppplqlmpqsQQLPPPPAQPPvlTQSQSLPPP 443
|
330 340
....*....|....*....|....*....
gi 109730811 812 TSLSNVCTGLSPLPVLTPFPR----PGGP 836
Cdd:pfam03154 444 AASHPPTSGLHQVPSQSPFPQhpfvPGGP 472
|
|
|