|
Name |
Accession |
Description |
Interval |
E-value |
| R3H_encore_like |
cd02642 |
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ... |
168-229 |
4.31e-27 |
|
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100071 Cd Length: 63 Bit Score: 104.61 E-value: 4.31e-27
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968624 168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642 1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
|
|
| R3H |
smart00393 |
Putative single-stranded nucleic acids-binding domain; |
152-229 |
3.03e-14 |
|
Putative single-stranded nucleic acids-binding domain;
Pssm-ID: 214647 Cd Length: 79 Bit Score: 68.87 E-value: 3.03e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 152 IDLHEFLVNTLKKNPRDRMMLLKLEQEILDFINdNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTGKA--VIINKT 229
Cdd:smart00393 1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGPKrrVVISKK 79
|
|
| SUZ |
pfam12752 |
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ... |
250-303 |
2.26e-12 |
|
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.
Pssm-ID: 463689 [Multi-domain] Cd Length: 56 Bit Score: 62.72 E-value: 2.26e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 568968624 250 EFQQRFILKRDDASMD--RDDNQMRVPLQDGRRSKSIEEREEEYQRVRERIFARET 303
Cdd:pfam12752 1 PPPKMKILRRPSSGSSssSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
451-800 |
8.12e-12 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 69.27 E-value: 8.12e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 451 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 528
Cdd:pfam09606 124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 529 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 608
Cdd:pfam09606 204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 609 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 683
Cdd:pfam09606 282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 684 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 758
Cdd:pfam09606 362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 568968624 759 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 800
Cdd:pfam09606 442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
|
|
| R3H |
pfam01424 |
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ... |
170-228 |
2.46e-10 |
|
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.
Pssm-ID: 460206 Cd Length: 60 Bit Score: 57.12 E-value: 2.46e-10
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568968624 170 MMLLKLEQEILDFINDNNNQFKkFPQMTSYHRMLLHRVAAYFGMDHNV--DQTGKAVIINK 228
Cdd:pfam01424 1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
469-848 |
2.67e-06 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 51.60 E-value: 2.67e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 469 PVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFgqmSLSRQGSTEAADPSSALFQP--PLISQHPQQASFI 546
Cdd:PHA03379 447 PVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVV---QDGRPACAPVPAPAGPIVRPweASLSQVPGVAFAP 523
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 547 MAS---AGQPLPTSNYSTSSHAPPtqqvLPPQGYMQPP------QQIQVSYYPPGQYPNSNQQYRPLshPVAYSPQRGQQ 617
Cdd:PHA03379 524 VMPqpmPVEPVPVPTVALERPVCP----APPLIAMQGPgetsgiVRVRERWRPAPWTPNPPRSPSQM--SVRDRLARLRA 597
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 618 LPQASQQP-GLQPMMSNQQQTayqgmlgvQQPQNQGLLSNQRSSMGGQMQGL--VVQYTPLPSYQVPvgSDSQNVVQPSF 694
Cdd:PHA03379 598 EAQPYQASvEVQPPQLTQVSP--------QQPMEYPLEPEQQMFPGSPFSQVadVMRAGGVPAMQPQ--YFDLPLQQPIS 667
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 695 QQPMLVPASQSVQGGLPTGGVPVYYSMIPPAQQ--NGTSPSVGFLQPPGS-------EQYQMPQSPSPCSPPQMSQQYSG 765
Cdd:PHA03379 668 QGAPLAPLRASMGPVPPVPATQPQYFDIPLTEPinQGASAAHFLPQQPMEgplvperWMFQGATLSQSVRPGVAQSQYFD 747
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 766 VSPSGPgvvvMQLNVPNGPQAPQNPSMVQWshckyysVEQRGQKPGDLYSPdGSPQANAQMGSSPVTSPTQSPAPSPVTS 845
Cdd:PHA03379 748 LPLTQP----INHGAPAAHFLHQPPMEGPW-------VPEQWMFQGAPPSQ-GTDVVQHQLDALGYVLHVLNHPGVPVSP 815
|
...
gi 568968624 846 LSN 848
Cdd:PHA03379 816 AVN 818
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
506-675 |
1.79e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.57 E-value: 1.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 506 LSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASfimasAGQPLptsNYstsshaPPTQQVLPPQGYMQPPQQIQ 585
Cdd:TIGR01628 371 LQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQF-----NGQPL---GW------PRMSMMPTPMGPGGPLRPNG 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 586 VSYYPPGQYPNSNQQYRPLSHPVA---YSPQ-RGQQLPQASQQPGLQPMMSNQQQTAYQgMLGVQQPQnqgllsNQRSSM 661
Cdd:TIGR01628 437 LAPMNAVRAPSRNAQNAAQKPPMQpvmYPPNyQSLPLSQDLPQPQSTASQGGQNKKLAQ-VLASATPQ------MQKQVL 509
|
170
....*....|....
gi 568968624 662 GGQMQGLVVQYTPL 675
Cdd:TIGR01628 510 GERLFPLVEAIEPA 523
|
|
| Amelin |
smart00817 |
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ... |
517-652 |
3.80e-03 |
|
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.
Pssm-ID: 214832 [Multi-domain] Cd Length: 411 Bit Score: 41.03 E-value: 3.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 517 RQGSTEAADPSSALFQPPLISQHPQQAsfimasagqPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPN 596
Cdd:smart00817 82 REHETQQYEYSLPVHPPPLPSQPSLQP---------QQPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPPLQQAELPM 152
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968624 597 SNQQYRPLSHP-------VAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQG 652
Cdd:smart00817 153 IPPQVAPSDKPpqtelplYDFADPQNPLLFQIAHLMSRGPMPQNKQQHLYPGLFYMSYGANQL 215
|
|
| PRK06462 |
PRK06462 |
asparagine synthetase A; Reviewed |
140-306 |
4.37e-03 |
|
asparagine synthetase A; Reviewed
Pssm-ID: 235808 [Multi-domain] Cd Length: 335 Bit Score: 40.39 E-value: 4.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 140 RDSSQEYTDSTGIDLHEFLVNTLKKnprdrmmllKLEQEILDFINDNNNQFKKFPQMTsyhrmllHRVAAYFGMDHNVDQ 219
Cdd:PRK06462 136 EGADLDEVMDLIEDLIKYLVKELLE---------EHEDELEFFGRDLPHLKRPFKRIT-------HKEAVEILNEEGCRG 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 220 TGKAVIinktssTRIPEQRFSEHIKD--------EKNTEFQQRFILKRDDASMDRDdnqMRVPLQDGRRSkSIEEREEEY 291
Cdd:PRK06462 200 IDLEEL------GSEGEKSLSEHFEEpfwiidipKGSREFYDREDPERPGVLRNYD---LLLPEGYGEAV-SGGEREYEY 269
|
170
....*....|....*
gi 568968624 292 QRVRERIfaRETGQN 306
Cdd:PRK06462 270 EEIVERI--REHGVD 282
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| R3H_encore_like |
cd02642 |
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ... |
168-229 |
4.31e-27 |
|
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100071 Cd Length: 63 Bit Score: 104.61 E-value: 4.31e-27
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968624 168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642 1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
|
|
| R3H |
smart00393 |
Putative single-stranded nucleic acids-binding domain; |
152-229 |
3.03e-14 |
|
Putative single-stranded nucleic acids-binding domain;
Pssm-ID: 214647 Cd Length: 79 Bit Score: 68.87 E-value: 3.03e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 152 IDLHEFLVNTLKKNPRDRMMLLKLEQEILDFINdNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTGKA--VIINKT 229
Cdd:smart00393 1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGPKrrVVISKK 79
|
|
| SUZ |
pfam12752 |
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ... |
250-303 |
2.26e-12 |
|
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.
Pssm-ID: 463689 [Multi-domain] Cd Length: 56 Bit Score: 62.72 E-value: 2.26e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 568968624 250 EFQQRFILKRDDASMD--RDDNQMRVPLQDGRRSKSIEEREEEYQRVRERIFARET 303
Cdd:pfam12752 1 PPPKMKILRRPSSGSSssSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
451-800 |
8.12e-12 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 69.27 E-value: 8.12e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 451 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 528
Cdd:pfam09606 124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 529 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 608
Cdd:pfam09606 204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 609 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 683
Cdd:pfam09606 282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 684 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 758
Cdd:pfam09606 362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 568968624 759 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 800
Cdd:pfam09606 442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
|
|
| R3H |
cd02325 |
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most ... |
172-228 |
3.86e-11 |
|
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100064 Cd Length: 59 Bit Score: 59.16 E-value: 3.86e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 568968624 172 LLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG--KAVIINK 228
Cdd:cd02325 1 REEREEELEAFAKDAAGKSLELPPMNSYERKLIHDLAEYYGLKSESEGEGpnRRVVITK 59
|
|
| R3H |
pfam01424 |
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ... |
170-228 |
2.46e-10 |
|
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.
Pssm-ID: 460206 Cd Length: 60 Bit Score: 57.12 E-value: 2.46e-10
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568968624 170 MMLLKLEQEILDFINDNNNQFKkFPQMTSYHRMLLHRVAAYFGMDHNV--DQTGKAVIINK 228
Cdd:pfam01424 1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
471-699 |
9.66e-08 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 56.20 E-value: 9.66e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 471 PALQPSPQPVQFSPSSCPQVLLPVSPPQQ-----------YNMAEDLSNPFGQMSLSRQGSTEAADPSSALFQPPLISQH 539
Cdd:pfam09770 107 PAARAAQSSAQPPASSLPQYQYASQQSQQpskpvrtgyekYKEPEPIPDLQVDASLWGVAPKKAAAPAPAPQPAAQPASL 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 540 PQQASFIM------------ASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQvsyyPPGQYPNSNQQYRPLSHP 607
Cdd:pfam09770 187 PAPSRKMMsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ----QPQQQPQQPQQHPGQGHP 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 608 VAYspqrgQQLPQASQQPGLQPMMSNQQQTAYQGMLGV-QQP----QNQGLLSNQRSSMGGQMQGlVVQYTPLPSYQVPV 682
Cdd:pfam09770 263 VTI-----LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVpVQPtqilQNPNRLSAARVGYPQNPQP-GVQPAPAHQAHRQQ 336
|
250
....*....|....*..
gi 568968624 683 GSDSQNVVQPSFQQPML 699
Cdd:pfam09770 337 GSFGRQAPIITHPQQLA 353
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
469-848 |
2.67e-06 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 51.60 E-value: 2.67e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 469 PVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFgqmSLSRQGSTEAADPSSALFQP--PLISQHPQQASFI 546
Cdd:PHA03379 447 PVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVV---QDGRPACAPVPAPAGPIVRPweASLSQVPGVAFAP 523
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 547 MAS---AGQPLPTSNYSTSSHAPPtqqvLPPQGYMQPP------QQIQVSYYPPGQYPNSNQQYRPLshPVAYSPQRGQQ 617
Cdd:PHA03379 524 VMPqpmPVEPVPVPTVALERPVCP----APPLIAMQGPgetsgiVRVRERWRPAPWTPNPPRSPSQM--SVRDRLARLRA 597
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 618 LPQASQQP-GLQPMMSNQQQTayqgmlgvQQPQNQGLLSNQRSSMGGQMQGL--VVQYTPLPSYQVPvgSDSQNVVQPSF 694
Cdd:PHA03379 598 EAQPYQASvEVQPPQLTQVSP--------QQPMEYPLEPEQQMFPGSPFSQVadVMRAGGVPAMQPQ--YFDLPLQQPIS 667
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 695 QQPMLVPASQSVQGGLPTGGVPVYYSMIPPAQQ--NGTSPSVGFLQPPGS-------EQYQMPQSPSPCSPPQMSQQYSG 765
Cdd:PHA03379 668 QGAPLAPLRASMGPVPPVPATQPQYFDIPLTEPinQGASAAHFLPQQPMEgplvperWMFQGATLSQSVRPGVAQSQYFD 747
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 766 VSPSGPgvvvMQLNVPNGPQAPQNPSMVQWshckyysVEQRGQKPGDLYSPdGSPQANAQMGSSPVTSPTQSPAPSPVTS 845
Cdd:PHA03379 748 LPLTQP----INHGAPAAHFLHQPPMEGPW-------VPEQWMFQGAPPSQ-GTDVVQHQLDALGYVLHVLNHPGVPVSP 815
|
...
gi 568968624 846 LSN 848
Cdd:PHA03379 816 AVN 818
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
449-625 |
9.62e-06 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 50.08 E-value: 9.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 449 PALPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFGQMSLSRQGSTEAADPSS 528
Cdd:PRK10263 336 PVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQP 415
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 529 AlfQPPLISQHPQQASFIMASAGQPlPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYpnsnQQYRPLSHPV 608
Cdd:PRK10263 416 A--QQPYYAPAPEQPAQQPYYAPAP-EQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLY----QQPQPVEQQP 488
|
170
....*....|....*..
gi 568968624 609 AYSPQRGQQLPQASQQP 625
Cdd:PRK10263 489 VVEPEPVVEETKPARPP 505
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
442-742 |
3.78e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.84 E-value: 3.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 442 QQQQQQLPALPPTPQHQPPlnnhmisqPVPALQPSPQPVQFSPSSCPQVLLPVSPP--QQYNMAEDLSNPFGQMSLSRQG 519
Cdd:pfam03154 170 QPPVLQAQSGAASPPSPPP--------PGTTQAATAGPTPSAPSVPPQGSPATSQPpnQTQSTAAPHTLIQQTPTLHPQR 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 520 STEAADPSSALFQPPLISQHPQQA--SFIMASAGQPLPTSNYSTSSHAPptqQVLPPQGYMQPPQ--QIQVSYYPPGQYP 595
Cdd:pfam03154 242 LPSPHPPLQPMTQPPPPSQVSPQPlpQPSLHGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQssQSQVPPGPSPAAP 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 596 NSNQQ--YRPLSHPVAYSPQ--RGQQLPQASQQ------------PGLQPMMSNQQQTAYQGMLGVQQPQN--------- 650
Cdd:pfam03154 319 GQSQQriHTPPSQSQLQSQQppREQPLPPAPLSmphikpppttpiPQLPNPQSHKHPPHLSGPSPFQMNSNlppppalkp 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 651 -QGLLSNQRSSMGGQMQGLVVQYTPLPS--YQVPVGSDSQNVVQPSFQQPmlvPASQSVQGG----------LPTGGVPV 717
Cdd:pfam03154 399 lSSLSTHHPPSAHPPPLQLMPQSQQLPPppAQPPVLTQSQSLPPPAASHP---PTSGLHQVPsqspfpqhpfVPGGPPPI 475
|
330 340
....*....|....*....|....*
gi 568968624 718 YYSMIPPAQqngTSPSVGFLQPPGS 742
Cdd:pfam03154 476 TPPSGPPTS---TSSAMPGIQPPSS 497
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
357-619 |
6.30e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 6.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 357 PRPWSS----TDSDGSVRSMRPPVTKASSfsgisilTRGDSIGSSKGGSAGRLSRPGMALGAPEVC---NQVTSPQSVRG 429
Cdd:PHA03247 2712 PHALVSatplPPGPAAARQASPALPAAPA-------PPAVPAGPATPGGPARPARPPTTAGPPAPAppaAPAAGPPRRLT 2784
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 430 LLPCTAQQQQQQQQQQQQLPALPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQvllPVSPPQQYNMAEDLSNP 509
Cdd:PHA03247 2785 RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG---PPPPSLPLGGSVAPGGD 2861
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 510 FGQMSLSRQGSTEAADPS----SALFQPPLisqHPQQASFIMASAGQPLPTSnySTSSHAPPTQQVLPPQGYMQPPQQIQ 585
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPArppvRRLARPAV---SRSTESFALPPDQPERPPQ--PQAPPPPQPQPQPPPPPQPQPPPPPP 2936
|
250 260 270
....*....|....*....|....*....|....
gi 568968624 586 VSYYPPGQyPNSNQQYRPLSHPVAYSPQRGQQLP 619
Cdd:PHA03247 2937 PRPQPPLA-PTTDPAGAGEPSGAVPQPWLGALVP 2969
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
526-677 |
8.12e-05 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 47.00 E-value: 8.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 526 PSSALFQPPLI-SQHPQQasfimasagQPLPTSNYSTSSH-APPTQQVLPPQGYMQPPQQiqvsYYPPGQYPNSNQQYRP 603
Cdd:PRK10263 740 PHEPLFTPIVEpVQQPQQ---------PVAPQQQYQQPQQpVAPQPQYQQPQQPVAPQPQ----YQQPQQPVAPQPQYQQ 806
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568968624 604 LSHPVAYSPQRGQ-QLPQASQQPGLQPmmsnQQQTAyqgmlgvqqPQNQGLLSNQRSSMGGQMQGLVVQYTPLPS 677
Cdd:PRK10263 807 PQQPVAPQPQYQQpQQPVAPQPQYQQP----QQPVA---------PQPQDTLLHPLLMRNGDSRPLHKPTTPLPS 868
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
506-675 |
1.79e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.57 E-value: 1.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 506 LSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASfimasAGQPLptsNYstsshaPPTQQVLPPQGYMQPPQQIQ 585
Cdd:TIGR01628 371 LQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQF-----NGQPL---GW------PRMSMMPTPMGPGGPLRPNG 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 586 VSYYPPGQYPNSNQQYRPLSHPVA---YSPQ-RGQQLPQASQQPGLQPMMSNQQQTAYQgMLGVQQPQnqgllsNQRSSM 661
Cdd:TIGR01628 437 LAPMNAVRAPSRNAQNAAQKPPMQpvmYPPNyQSLPLSQDLPQPQSTASQGGQNKKLAQ-VLASATPQ------MQKQVL 509
|
170
....*....|....
gi 568968624 662 GGQMQGLVVQYTPL 675
Cdd:TIGR01628 510 GERLFPLVEAIEPA 523
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
449-736 |
7.43e-04 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 43.50 E-value: 7.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 449 PALPPTPQHQPPLNNHMISQPVPALQpsPQPVQFSPSScpqvllPVSPPQQYNMAEDLS--NPFGQMSLSRQGSTEAADP 526
Cdd:PHA03377 672 PATQSTPPRPSWLPSVFVLPSVDAGR--AQPSEESHLS------SMSPTQPISHEEQPRyeDPDDPLDLSLHPDQAPPPS 743
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 527 SSALFQPPLISQHPQqasfimasagqplptSNYSTSSHAPPTQqvLPPQGYMQPP-QQIQVSYYPPGQYP-NSNQQYRPL 604
Cdd:PHA03377 744 HQAPYSGHEEPQAQQ---------------APYPGYWEPRPPQ--APYLGYQEPQaQGVQVSSYPGYAGPwGLRAQHPRY 806
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 605 SHPVAYSPQR----GQQLPQASQQPGLQPmmsnqqqtAYQGMLGVQQPQNQGLLSNQRSSMGGQMQGLVVQYTPLPsyQV 680
Cdd:PHA03377 807 RHSWAYWSQYpghgHPQGPWAPRPPHLPP--------QWDGSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPYS--QT 876
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 568968624 681 PVGSDSQNVVQPSFQQPMlvpasQSVQGGLPTGGVPVYYSMIPPAQQNGTS-PSVGF 736
Cdd:PHA03377 877 LVSSSAPSWSSPQPRAPI-----RPIPTRFPPPPMPLQDSMAVGCDSSGTAcPSMPF 928
|
|
| G_path_suppress |
pfam15991 |
G-protein pathway suppressor; This family of proteins inhibits G-protein- and ... |
472-631 |
1.22e-03 |
|
G-protein pathway suppressor; This family of proteins inhibits G-protein- and mitogen-activated protein kinase-mediated signal transduction.
Pssm-ID: 464961 [Multi-domain] Cd Length: 272 Bit Score: 41.83 E-value: 1.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 472 ALQPSPQPVQFSPSS-CPQVLLPVSPPQQYNMAEDLSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASFIMASA 550
Cdd:pfam15991 103 ALQQAAAQVFLPQLSmQGQPHHQQHPGPQVGVLKRTRSPSPPVQQQAYYKQPAFSPGYAEHGQQKHDDGRRGYDVARFGS 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 551 GQPLPTSNYSTSSHAPPTQQVLPP-QGYMQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAY---SPQRGQQLPQA--SQQ 624
Cdd:pfam15991 183 WNKSTAQYPPSGQLFYPTHQYLPPpQTQGQADARLQTIYPQPGYALPLQQQYEHANQPSPFvssSPLKQMQSPKAgpGPQ 262
|
....*..
gi 568968624 625 PGLQPMM 631
Cdd:pfam15991 263 PMQLSVL 269
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
520-719 |
1.64e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.76 E-value: 1.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 520 STEAADPSSALFQPPLISQHPQqasfimaSAGQPLPTSNYSTSSHAPPtqqvlpPQGYMQPPQ--QIQVSYYPPGQYPNS 597
Cdd:PRK10263 336 PVEPVTQTPPVASVDVPPAQPT-------VAWQPVPGPQTGEPVIAPA------PEGYPQQSQyaQPAVQYNEPLQQPVQ 402
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 598 NQQyrPLSHPVAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQgLLSNQRSsmggqmqglvvqYTPLPS 677
Cdd:PRK10263 403 PQQ--PYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQS-TFAPQST------------YQTEQT 467
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 568968624 678 YQVPVGSDSQNVVQPSFQQPMLVPASQSVQGGLPTGGvPVYY 719
Cdd:PRK10263 468 YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARP-PLYY 508
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
356-743 |
3.09e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.61 E-value: 3.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 356 EPRPWSSTDSDGSVRSMrPPVTKASSfsgISILTRGDSIGSSKGGSAGRLSRPGMAL---GAPEvcnqvtsPQSVRGLLP 432
Cdd:PRK10263 536 EPEPIKSSLKAPSVAAV-PPVEAAAA---VSPLASGVKKATLATGAAATVAAPVFSLansGGPR-------PQVKEGIGP 604
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 433 ctaqqqqqqqqqqqqlpALPPTPQHQPPLNNHMISQPV--PALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAED-LSNP 509
Cdd:PRK10263 605 -----------------QLPRPKRIRVPTRRELASYGIklPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDeLARQ 667
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 510 FGQMSLSRQGST-------EAADPSSALfQPPLISQ--HPQQASFimaSAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQP 580
Cdd:PRK10263 668 FAQTQQQRYGEQyqhdvpvNAEDADAAA-EAELARQfaQTQQQRY---SGEQPAGANPFSLDDFEFSPMKALLDDGPHEP 743
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 581 pqqiqvsYYPPGQYPNSNQQYRPLSHPVAYSPQRgqqlPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQrss 660
Cdd:PRK10263 744 -------LFTPIVEPVQQPQQPVAPQQQYQQPQQ----PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ--- 809
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 661 mggqmqglvvQYTPLPSYQVPvgsDSQNVVQPSFQQPMLVPASQSVQgglptggvpvyySMIPP-AQQNGTS-------- 731
Cdd:PRK10263 810 ----------PVAPQPQYQQP---QQPVAPQPQYQQPQQPVAPQPQD------------TLLHPlLMRNGDSrplhkptt 864
|
410
....*....|....
gi 568968624 732 --PSVGFLQPPGSE 743
Cdd:PRK10263 865 plPSLDLLTPPPSE 878
|
|
| Amelin |
smart00817 |
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ... |
517-652 |
3.80e-03 |
|
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.
Pssm-ID: 214832 [Multi-domain] Cd Length: 411 Bit Score: 41.03 E-value: 3.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 517 RQGSTEAADPSSALFQPPLISQHPQQAsfimasagqPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPN 596
Cdd:smart00817 82 REHETQQYEYSLPVHPPPLPSQPSLQP---------QQPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPPLQQAELPM 152
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568968624 597 SNQQYRPLSHP-------VAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQG 652
Cdd:smart00817 153 IPPQVAPSDKPpqtelplYDFADPQNPLLFQIAHLMSRGPMPQNKQQHLYPGLFYMSYGANQL 215
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
613-743 |
4.10e-03 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 40.95 E-value: 4.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 613 QRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQ----PQNQGLLSNQRSSMGGQMQGLVVQYTPLPSYQVPvGSDSQN 688
Cdd:TIGR01628 374 QFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQfngqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP-SRNAQN 452
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 568968624 689 VVQPSFQQPMlvPASQSVQGGLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSE 743
Cdd:TIGR01628 453 AAQKPPMQPV--MYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQ 505
|
|
| PRK06462 |
PRK06462 |
asparagine synthetase A; Reviewed |
140-306 |
4.37e-03 |
|
asparagine synthetase A; Reviewed
Pssm-ID: 235808 [Multi-domain] Cd Length: 335 Bit Score: 40.39 E-value: 4.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 140 RDSSQEYTDSTGIDLHEFLVNTLKKnprdrmmllKLEQEILDFINDNNNQFKKFPQMTsyhrmllHRVAAYFGMDHNVDQ 219
Cdd:PRK06462 136 EGADLDEVMDLIEDLIKYLVKELLE---------EHEDELEFFGRDLPHLKRPFKRIT-------HKEAVEILNEEGCRG 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 220 TGKAVIinktssTRIPEQRFSEHIKD--------EKNTEFQQRFILKRDDASMDRDdnqMRVPLQDGRRSkSIEEREEEY 291
Cdd:PRK06462 200 IDLEEL------GSEGEKSLSEHFEEpfwiidipKGSREFYDREDPERPGVLRNYD---LLLPEGYGEAV-SGGEREYEY 269
|
170
....*....|....*
gi 568968624 292 QRVRERIfaRETGQN 306
Cdd:PRK06462 270 EEIVERI--REHGVD 282
|
|
| PRK14948 |
PRK14948 |
DNA polymerase III subunit gamma/tau; |
412-625 |
5.70e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237862 [Multi-domain] Cd Length: 620 Bit Score: 40.72 E-value: 5.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 412 LGAPEVCNQVTSPQSVRGLLPCTAQQQQQQQQQQQQLPALPPTPQHQPPLNN-HMISQPVPALQPSPQPVQfspsscpqv 490
Cdd:PRK14948 360 LPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKAsPPIPVPAEPTEPSPTPPA--------- 430
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 491 lLPVSPPQQYNMAE-------DLSNPFGQMSLSRQGSTEAADPSSALFQ------------PPLIS-------QHPQQAS 544
Cdd:PRK14948 431 -NAANAPPSLNLEElwqqilaKLELPSTRMLLSQQAELVSLDSNRAVIAvspnwlgmvqsrKPLLEqafakvlGRSIKLN 509
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 545 FIMASAGQPlptsnySTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAYSPQRGQQLPQASQQ 624
Cdd:PRK14948 510 LESQSGSAS------NTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEE 583
|
.
gi 568968624 625 P 625
Cdd:PRK14948 584 P 584
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
512-740 |
6.14e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 40.79 E-value: 6.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 512 QMSLSRQGSTEAADPSSALfQPPLISQHPQQASFIMASAGQPLPTS--NYSTSSHAPPTQ--QVL---PPQGYMQPPQQI 584
Cdd:pfam09770 99 QVRFNRQQPAARAAQSSAQ-PPASSLPQYQYASQQSQQPSKPVRTGyeKYKEPEPIPDLQvdASLwgvAPKKAAAPAPAP 177
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 585 QVSYYPPGQYPNSNQ---------QYRplshpvAYSPQRGQQLPQASQQP--GLQPMMSNQQQTAYQGMLGVQQPQNQGL 653
Cdd:pfam09770 178 QPAAQPASLPAPSRKmmsleeveaAMR------AQAKKPAQQPAPAPAQPpaAPPAQQAQQQQQFPPQIQQQQQPQQQPQ 251
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 654 LSNQRSSMGGQMQGL---------VVQYTPLPSYQVPVGSDSQNVVQPS--FQQPMLVPASQSVQGGLPTGGVPVyysmi 722
Cdd:pfam09770 252 QPQQHPGQGHPVTILqrpqspqpdPAQPSIQPQAQQFHQQPPPVPVQPTqiLQNPNRLSAARVGYPQNPQPGVQP----- 326
|
250
....*....|....*...
gi 568968624 723 PPAQQNGTSPSVGFLQPP 740
Cdd:pfam09770 327 APAHQAHRQQGSFGRQAP 344
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
548-868 |
7.32e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.52 E-value: 7.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 548 ASAGQPLPTSNYSTSShAPPTQQVLPPQgymQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAYSPQRGQQLPQASQQPGL 627
Cdd:pfam03154 144 TSPSIPSPQDNESDSD-SSAQQQILQTQ---PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPN 219
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 628 QPMMSNQQQTAYQGMLGVQQPQnqglLSNQRSSMggqmqglvvqyTPLPSYQVPVGSDSQNVVQPSFQQPMlVPASQSVQ 707
Cdd:pfam03154 220 QTQSTAAPHTLIQQTPTLHPQR----LPSPHPPL-----------QPMTQPPPPSQVSPQPLPQPSLHGQM-PPMPHSLQ 283
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 708 GGLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQMSQQYSGVSPSGPGVVVM-QLNVPNGPQA 786
Cdd:pfam03154 284 TGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMpHIKPPPTTPI 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568968624 787 PQNPSMVQWSHCKYYSVEQRGQKPGDLYSPDG-----------SPQAN----------AQMGSSPVTSP--TQSPAPSPV 843
Cdd:pfam03154 364 PQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAlkplsslsthhPPSAHppplqlmpqsQQLPPPPAQPPvlTQSQSLPPP 443
|
330 340
....*....|....*....|....*....
gi 568968624 844 TSLSNVCTGLSPLPVLTPFPR----PGGP 868
Cdd:pfam03154 444 AASHPPTSGLHQVPSQSPFPQhpfvPGGP 472
|
|
|