|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
3-76 |
1.34e-20 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. :
Pssm-ID: 464173 Cd Length: 78 Bit Score: 86.84 E-value: 1.34e-20
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952723901 3 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSVESSSG--PKREEIMESILFKCSDFVVVQFK 76
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
144-205 |
7.76e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain. :
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 7.76e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952723901 144 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 205
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
664-990 |
2.74e-06 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 2.74e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 664 PKPSTTPTSPRPQAQPSPSMVGHQQPAPVYTqPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQ 743
Cdd:PHA03247 2708 PEPAPHALVSATPLPPGPAAARQASPALPAA-PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 744 RQEQHHQS--AMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHV-PHYQSQHPHVYSP----VIQGNARMMAP 816
Cdd:PHA03247 2787 AVASLSESreSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTaPPPPPGPPPPSLPlggsVAPGGDVRRRP 2866
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 817 PAHAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQyahPNATLHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQ 896
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP---PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 897 AAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfptaqqtvftihPSHVQPA-YTNPPHMAHVPQAHVQSGMV 975
Cdd:PHA03247 2944 APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA------------PSREAPAsSTPPLTGHSLSRVSSWASSL 3011
|
330
....*....|....*.
gi 1952723901 976 PSH-PTAHAPMMLMTT 990
Cdd:PHA03247 3012 ALHeETDPPPVSLKQT 3027
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
645-660 |
5.00e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains. :
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 5.00e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
3-76 |
1.34e-20 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 86.84 E-value: 1.34e-20
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952723901 3 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSVESSSG--PKREEIMESILFKCSDFVVVQFK 76
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
144-205 |
7.76e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 7.76e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952723901 144 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 205
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
664-990 |
2.74e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 2.74e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 664 PKPSTTPTSPRPQAQPSPSMVGHQQPAPVYTqPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQ 743
Cdd:PHA03247 2708 PEPAPHALVSATPLPPGPAAARQASPALPAA-PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 744 RQEQHHQS--AMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHV-PHYQSQHPHVYSP----VIQGNARMMAP 816
Cdd:PHA03247 2787 AVASLSESreSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTaPPPPPGPPPPSLPlggsVAPGGDVRRRP 2866
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 817 PAHAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQyahPNATLHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQ 896
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP---PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 897 AAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfptaqqtvftihPSHVQPA-YTNPPHMAHVPQAHVQSGMV 975
Cdd:PHA03247 2944 APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA------------PSREAPAsSTPPLTGHSLSRVSSWASSL 3011
|
330
....*....|....*.
gi 1952723901 976 PSH-PTAHAPMMLMTT 990
Cdd:PHA03247 3012 ALHeETDPPPVSLKQT 3027
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
645-803 |
1.48e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 49.03 E-value: 1.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 645 RKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLypiPMTPM 724
Cdd:TIGR01628 367 RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLR---PNGLA 438
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 725 PVNQaktyrAGKVPNMPQQRQEQHHQSAMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQ------ 798
Cdd:TIGR01628 439 PMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQvlgerl 513
|
....*
gi 1952723901 799 HPHVY 803
Cdd:TIGR01628 514 FPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
745-991 |
8.43e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 46.57 E-value: 8.43e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 745 QEQHHQSAMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNA--RMMAPP 817
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVApkKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 818 AHAQPGLVSSSATQYG----------------AHEQTHAMYVSTGSLAQQYAHPNATLHPHTPHPQPSATPTGQQQSQHG 881
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQ 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 882 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqNSFPTAQQTVFTIHPSHVQPAytnPPH 961
Cdd:pfam09770 255 QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAH 330
|
250 260 270
....*....|....*....|....*....|
gi 1952723901 962 MAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 991
Cdd:pfam09770 331 QAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
645-660 |
5.00e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 5.00e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
3-76 |
1.34e-20 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 86.84 E-value: 1.34e-20
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952723901 3 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSVESSSG--PKREEIMESILFKCSDFVVVQFK 76
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
144-205 |
7.76e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 7.76e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952723901 144 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 205
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
664-990 |
2.74e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 2.74e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 664 PKPSTTPTSPRPQAQPSPSMVGHQQPAPVYTqPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQ 743
Cdd:PHA03247 2708 PEPAPHALVSATPLPPGPAAARQASPALPAA-PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 744 RQEQHHQS--AMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHV-PHYQSQHPHVYSP----VIQGNARMMAP 816
Cdd:PHA03247 2787 AVASLSESreSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTaPPPPPGPPPPSLPlggsVAPGGDVRRRP 2866
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 817 PAHAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQyahPNATLHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQ 896
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALP---PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 897 AAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfptaqqtvftihPSHVQPA-YTNPPHMAHVPQAHVQSGMV 975
Cdd:PHA03247 2944 APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA------------PSREAPAsSTPPLTGHSLSRVSSWASSL 3011
|
330
....*....|....*.
gi 1952723901 976 PSH-PTAHAPMMLMTT 990
Cdd:PHA03247 3012 ALHeETDPPPVSLKQT 3027
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
658-1014 |
8.95e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.98 E-value: 8.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 658 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 737
Cdd:PRK07764 400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 738 PNMPQQRQEQHHQSAMMHPASAAGPPIVATPPAySTQYVAYSPQQFPNqpLVQHVPHYQ-------SQHPHVYSpvIQGN 810
Cdd:PRK07764 472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA-GADDAATLRERWPE--ILAAVPKRSrktwailLPEATVLG--VRGD 546
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 811 ARMMAppaHAQPGLVSSSATQYGA-------HEQTHA-----MYVSTGSLAQQYAHPNA----TLHPHTPHPQPSATPTG 874
Cdd:PRK07764 547 TLVLG---FSTGGLARRFASPGNAevlvtalAEELGGdwqveAVVGPAPGAAGGEGPPApassGPPEEAARPAAPAAPAA 623
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 875 QQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPTAQQTV 944
Cdd:PRK07764 624 PAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP 703
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1952723901 945 FTIHPSHVQPAYTNPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSALQPIPVSTTA 1014
Cdd:PRK07764 704 APAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAPAAAPPPSPPS 780
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
664-941 |
1.23e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.55 E-value: 1.23e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 664 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 735
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 736 KVPNMPQQRQEQHHQSAMMHPASAAGPPIV-ATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 814
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRrLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 815 APPAHAQPGLVSSSATQYGAHEQTHAMYVSTGSLA------QQYAHPNATLHP---HTPHPQPSATPTGQQQSQHGGSHP 885
Cdd:PHA03247 2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPpvrRLARPAVSRSTESFALPPDQPERP 2908
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*.
gi 1952723901 886 APSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQsPQNSFPTAQ 941
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE-PSGAVPQPW 2963
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
645-803 |
1.48e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 49.03 E-value: 1.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 645 RKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLypiPMTPM 724
Cdd:TIGR01628 367 RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLR---PNGLA 438
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 725 PVNQaktyrAGKVPNMPQQRQEQHHQSAMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQ------ 798
Cdd:TIGR01628 439 PMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQvlgerl 513
|
....*
gi 1952723901 799 HPHVY 803
Cdd:TIGR01628 514 FPLVE 518
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
659-1023 |
6.27e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 6.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 659 RSFSQPKPSTTPTSP-------RPQAQPSPSM----VGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVN 727
Cdd:PHA03247 2566 RSVPPPRPAPRPSEPavtsrarRPDAPPQSARprapVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 728 QAKTYRAGKVPNMPQQRQEQHhQSAMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVI 807
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG 2724
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 808 QGNARMMAPPAHAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQYAHPNAtlhPHTPHPQPSATPTGQQQSQHGGSHPAP 887
Cdd:PHA03247 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA---PAAGPPRRLTRPAVASLSESRESLPSP 2801
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 888 SPVQHHQHQAAQALHLASPQQQSAiyhaGLAPTPPSMTPASNTQSPQNS------------------FPTAQQTVFTIH- 948
Cdd:PHA03247 2802 WDPADPPAAVLAPAAALPPAASPA----GPLPPPTSAQPTAPPPPPGPPppslplggsvapggdvrrRPPSRSPAAKPAa 2877
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 949 PSHV------QPAYTNPPHMAHVPQAHVQSGMVPSHPTAhaPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFPYMTHP 1022
Cdd:PHA03247 2878 PARPpvrrlaRPAVSRSTESFALPPDQPERPPQPQAPPP--PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955
|
.
gi 1952723901 1023 S 1023
Cdd:PHA03247 2956 S 2956
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
745-991 |
8.43e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 46.57 E-value: 8.43e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 745 QEQHHQSAMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNA--RMMAPP 817
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVApkKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 818 AHAQPGLVSSSATQYG----------------AHEQTHAMYVSTGSLAQQYAHPNATLHPHTPHPQPSATPTGQQQSQHG 881
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQ 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 882 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqNSFPTAQQTVFTIHPSHVQPAytnPPH 961
Cdd:pfam09770 255 QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAH 330
|
250 260 270
....*....|....*....|....*....|
gi 1952723901 962 MAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 991
Cdd:pfam09770 331 QAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
681-907 |
3.02e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.08 E-value: 3.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 681 PSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPgVQPLYPIPMTPMPVNQaktyragkvPNMPQQRQeqhhqsammhPASAA 760
Cdd:PRK10263 309 PLLNGAPITEPVAVAAAATTATQSWAAPVEP-VTQTPPVASVDVPPAQ---------PTVAWQPV----------PGPQT 368
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 761 GPPIVATPPAystqyvAYSPQQFPNQPLVQHVPHYQSQHPHvyspviqgnarmmAPPAHAQPGLVSSSATQYGAHEQTHA 840
Cdd:PRK10263 369 GEPVIAPAPE------GYPQQSQYAQPAVQYNEPLQQPVQP-------------QQPYYAPAAEQPAQQPYYAPAPEQPA 429
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1952723901 841 MYvstGSLAQQYAHPNATLHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQ 907
Cdd:PRK10263 430 QQ---PYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE 493
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
643-890 |
3.83e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.64 E-value: 3.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 643 QVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSPG 712
Cdd:pfam09770 99 QVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKKA 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 713 VQPLYPIPMTPMPVNQAKTYRagKV--------------PNMPQQRQEQHHQsammhpasaagpPIVATPPAYSTQYVAY 778
Cdd:pfam09770 171 AAPAPAPQPAAQPASLPAPSR--KMmsleeveaamraqaKKPAQQPAPAPAQ------------PPAAPPAQQAQQQQQF 236
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 779 SPQQFPNQPLVQHVPHYQSQHPHVYSPVIQgnARMMAPPahAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQYAHPNAT 858
Cdd:pfam09770 237 PPQIQQQQQPQQQPQQPQQHPGQGHPVTIL--QRPQSPQ--PDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAA 312
|
250 260 270
....*....|....*....|....*....|...
gi 1952723901 859 LHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 890
Cdd:pfam09770 313 RVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
662-934 |
8.07e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.54 E-value: 8.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 662 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLYPiPMTPMPVNQAktyragkvPNM 740
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPLQQ-PVQPQQPYYA--------PAA 412
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 741 PQQRQEQHHQSAMMHPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgNARMMAPPAHA 820
Cdd:PRK10263 413 EQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-EPLYQQPQPVE 485
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 821 QPGLVSSSATQYGAHEQTHAMYVSTgSLAQQYAHPNATLHP-HTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQ 899
Cdd:PRK10263 486 QQPVVEPEPVVEETKPARPPLYYFE-EVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPL 564
|
250 260 270
....*....|....*....|....*....|....*...
gi 1952723901 900 ALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 934
Cdd:PRK10263 565 ASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
651-1009 |
3.97e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 3.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 651 PNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSmvghqqpapvytqpvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAK 730
Cdd:PHA03247 2601 PVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS-----------------------PAANEPDPHPPPTVPPPERPRDDPA 2657
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 731 TYRAGKvpnmpqqrqeQHHQSAMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGN 810
Cdd:PHA03247 2658 PGRVSR----------PRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAA 2727
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 811 ARMMAPPAHAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQYAHPNAtlhPHTPHPQPSATPTGQQQSQHGGSHPAPSPV 890
Cdd:PHA03247 2728 ARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA---PAAGPPRRLTRPAVASLSESRESLPSPWDP 2804
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 891 QHHQHQAAQALHLASPQQQSaiyhAGLAPTPPSMTPASnTQSPQNSFPTAQQTVFTIHP----SHVQPAYTNPPHMAHVP 966
Cdd:PHA03247 2805 ADPPAAVLAPAAALPPAASP----AGPLPPPTSAQPTA-PPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPA 2879
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 1952723901 967 QAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIP 1009
Cdd:PHA03247 2880 RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
645-660 |
5.00e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 5.00e-03
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
606-1017 |
6.36e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.52 E-value: 6.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 606 NTEHKRGPEVTSQGVQTSSPGCKQEKDDKEEKKDAAEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 685
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 686 HQQPAPVYTQPvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQ-----QRQEQHHQSAMMHPASAA 760
Cdd:pfam03154 207 PPQGSPATSQP---------PNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQppppsQVSPQPLPQPSLHGQMPP 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 761 GPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAppahAQPGLVSSSATQYGAHEQTha 840
Cdd:pfam03154 278 MPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI----HTPPSQSQLQSQQPPREQP-- 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 841 myVSTGSLAQQYAHPNATLH-PHTPHPQPSATP---TGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAG 916
Cdd:pfam03154 345 --LPPAPLSMPHIKPPPTTPiPQLPNPQSHKHPphlSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQ 422
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 917 LAPTPPSMTPASnTQSPQNSFPTAQQ-TVFTIHPSHVQPAYTNPPHMAHVPqahvQSGMVPSHPTAHAPMMLMTTQPPgg 995
Cdd:pfam03154 423 QLPPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGPPTSTSSAMPGIQPP-- 495
|
410 420
....*....|....*....|..
gi 1952723901 996 pqAALAQSALQPIPVSTTAHFP 1017
Cdd:pfam03154 496 --SSASVSSSGPVPAAVSCPLP 515
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
803-981 |
8.40e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.45 E-value: 8.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 803 YSPVIQGNArMMAPPAHAQpglVSSSATQYGAHE-----QTHAMYVSTGSLAQQYAHPNATLHPHTPHP----QPSATPT 873
Cdd:PRK10263 307 YDPLLNGAP-ITEPVAVAA---AATTATQSWAAPvepvtQTPPVASVDVPPAQPTVAWQPVPGPQTGEPviapAPEGYPQ 382
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 874 GQQQSQHGGSHPAP--------SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPTAQQTVF 945
Cdd:PRK10263 383 QSQYAQPAVQYNEPlqqpvqpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTY 462
|
170 180 190
....*....|....*....|....*....|....*.
gi 1952723901 946 TIHPSHVQPAYTNPPHMAhvPQAHVQSGMVPSHPTA 981
Cdd:PRK10263 463 QTEQTYQQPAAQEPLYQQ--PQPVEQQPVVEPEPVV 496
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
710-805 |
9.70e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.07 E-value: 9.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952723901 710 SPGVQPLYPIPMTPMPVNQaktYRAGKVPNMPQQRQEQHHQSAMMHPASAAGPPIVATPPAYST--QYVAYSPQ-QFPNQ 786
Cdd:PRK10263 746 TPIVEPVQQPQQPVAPQQQ---YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQpqQPVAPQPQyQQPQQ 822
|
90
....*....|....*....
gi 1952723901 787 PLVQHVPHYQSQHPHVYSP 805
Cdd:PRK10263 823 PVAPQPQYQQPQQPVAPQP 841
|
|
|