|
Name |
Accession |
Description |
Interval |
E-value |
| Nucleic_acid_bd |
pfam13820 |
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... |
48-195 |
1.01e-59 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 201.50 E-value: 1.01e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462580004 128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIFflgiiRMNNPATVMIPPGGNV 195
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLI-----SDALPLHLRLAESGEY 143
|
|
| Med15 super family |
cl26621 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
537-862 |
1.01e-08 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development. The actual alignment was detected with superfamily member pfam09606:
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 60.02 E-value: 1.01e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 537 IFSLACSKSGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMN 616
Cdd:pfam09606 47 ILHVRDMSKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTAS 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 617 NQQAGTsGVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNP 689
Cdd:pfam09606 122 NLLASL-GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQM 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 690 PSQNLGPS-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIM 768
Cdd:pfam09606 201 PPQMGVPGmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGP 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 769 RGPTPNMQgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQ 848
Cdd:pfam09606 281 GQPMGPPG----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWN 353
|
330
....*....|....
gi 2462580004 849 QTNMVPPHVQAMQG 862
Cdd:pfam09606 354 PGNFGGLGANPMQR 367
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
1080-1326 |
8.71e-07 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 8.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1080 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1159
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1160 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1228
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1229 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1308
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
|
250
....*....|....*...
gi 2462580004 1309 qAPSNLTMNPSNFATPQT 1326
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
190-491 |
3.99e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 3.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 190 PPGGNVSSSMMAPGPNPELQPRtPRPASQSDamdPLLSGLHIQQQSHPSGSLAP-----PHHPMQPVSVNRQMNPANFPQ 264
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPA-PHALVSAT---PLPPGPAAARQASPALPAAPappavPAGPATPGGPARPARPPTTAG 2765
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 265 LQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHqqqqpqgirPQFTAPTQVPVPPGWNQLPSGALQPPPaqgsLGTMT 344
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS---------PWDPADPPAAVLAPAAALPPAASPAGP----LPPPT 2832
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 345 ANQGWKKAPLPGPMQQQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNPGQFTAPQMKSL-QGGPSRVPTPL 423
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALpPDQPERPPQPQ 2912
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462580004 424 QQPHLTNKSPASSPssfqqgsPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNPMVQQGNVP 491
Cdd:PHA03247 2913 APPPPQPQPQPPPP-------PQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
|
|
|
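Each hit in this listing carries a one-line summary of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, optionally with a `[Multi-domain]` tag. The following is a minimal sketch of how such a line could be parsed; the function name `parse_summary` and the returned dictionary keys are illustrative assumptions, not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" tag is optional (absent for the pfam13820 hit).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 60.02 E-value: 1.01e-08"
    print(parse_summary(line))
```

Parsing the E-value as a float makes it straightforward to sort or filter hits by significance once all summary lines have been collected.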
|
Name |
Accession |
Description |
Interval |
E-value |
| Nucleic_acid_bd |
pfam13820 |
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... |
48-195 |
1.01e-59 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 201.50 E-value: 1.01e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462580004 128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIFflgiiRMNNPATVMIPPGGNV 195
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLI-----SDALPLHLRLAESGEY 143
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
537-862 |
1.01e-08 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 60.02 E-value: 1.01e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 537 IFSLACSKSGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMN 616
Cdd:pfam09606 47 ILHVRDMSKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTAS 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 617 NQQAGTsGVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNP 689
Cdd:pfam09606 122 NLLASL-GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQM 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 690 PSQNLGPS-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIM 768
Cdd:pfam09606 201 PPQMGVPGmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGP 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 769 RGPTPNMQgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQ 848
Cdd:pfam09606 281 GQPMGPPG----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWN 353
|
330
....*....|....
gi 2462580004 849 QTNMVPPHVQAMQG 862
Cdd:pfam09606 354 PGNFGGLGANPMQR 367
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1080-1326 |
8.71e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 8.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1080 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1159
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1160 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1228
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1229 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1308
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
|
250
....*....|....*...
gi 2462580004 1309 qAPSNLTMNPSNFATPQT 1326
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
190-491 |
3.99e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 3.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 190 PPGGNVSSSMMAPGPNPELQPRtPRPASQSDamdPLLSGLHIQQQSHPSGSLAP-----PHHPMQPVSVNRQMNPANFPQ 264
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPA-PHALVSAT---PLPPGPAAARQASPALPAAPappavPAGPATPGGPARPARPPTTAG 2765
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 265 LQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHqqqqpqgirPQFTAPTQVPVPPGWNQLPSGALQPPPaqgsLGTMT 344
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS---------PWDPADPPAAVLAPAAALPPAASPAGP----LPPPT 2832
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 345 ANQGWKKAPLPGPMQQQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNPGQFTAPQMKSL-QGGPSRVPTPL 423
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALpPDQPERPPQPQ 2912
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462580004 424 QQPHLTNKSPASSPssfqqgsPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNPMVQQGNVP 491
Cdd:PHA03247 2913 APPPPQPQPQPPPP-------PQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
636-779 |
6.51e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.49 E-value: 6.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 636 QPQQGPPSQ-LMGMHQQIVPSQGQMVQQQGTLNPQNPMilsraqLMPQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQ 714
Cdd:TIGR01628 384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462580004 715 MMAPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkQQFNTQNQSNVMPGPAQIMRGPTPNMQGNM 779
Cdd:TIGR01628 458 PMQPVMYPPNYQSLPLSQDLP--------------QPQSTASQGGQNKKLAQVLASATPQMQKQV 508
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1131-1426 |
8.77e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 8.77e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1131 SNSRKMVYQESPQNPSSSPLAEmaslpeASGSEAPSVPGGpnnMPSHVVLPQNQLM--MTGPK------PGPSPLSATQG 1202
Cdd:pfam05109 414 TTTHKVIFSKAPESTTTSPTLN------TTGFAAPNTTTG---LPSSTHVPTNLTApaSTGPTvstadvTSPTPAGTTSG 484
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1203 ATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSE--ISLSPERLNASIAGLF 1276
Cdd:pfam05109 485 ASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSavTTPTPNATSPTPAVTT 564
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1277 P-PQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGATKRASPSNSRR 1355
Cdd:pfam05109 565 PtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462580004 1356 SSPGSSRKTTPSPGRQN---SKAPKLTLASQTNAALLQNVElPRNVLVSPTPLANP-PVPGSFPNNSGlnPQNST 1426
Cdd:pfam05109 645 SLRPSSISETLSPSTSDnstSHMPLLTSAHPTGGENITQVT-PASTSTHHVSTSSPaPRPGTTSQASG--PGNSS 716
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
385-523 |
8.84e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.26 E-value: 8.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 385 SQQASQAHTNFPQMSNPGQFTAPQMKSLQGGPSRVPTPLQQPHLTNKSPASSPSSFQ----QGSPASSPTVNQTQQQMGP 460
Cdd:pfam09770 211 AQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQrpqsPQPDPAQPSIQPQAQQFHQ 290
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462580004 461 RPPQNNPLP-QGFQQP-VSSPGRNPMVQQGnvPPNFMVMQQQPPNQGPQSLHPGLGEKSEPSNLA 523
Cdd:pfam09770 291 QPPPVPVQPtQILQNPnRLSAARVGYPQNP--QPGVQPAPAHQAHRQQGSFGRQAPIITHPQQLA 353
|
|
|
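The alignment rows above pair a query segment (`gi …`) with a domain model segment (`Cdd:…`). As a rough illustration, the sketch below counts identical residues over one block of such a pair. It assumes that uppercase letters mark aligned residues, while lowercase letters and `-` mark unaligned residues and gaps; that reading of the notation, and the function name `aligned_identity`, are assumptions made for this example.

```python
def aligned_identity(query, subject):
    """Count identical and aligned columns in one alignment block.

    A column counts as aligned only when both rows hold an uppercase
    residue; lowercase letters and '-' are skipped (assumed convention).
    Returns a tuple (identical, aligned).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    # First block of the pfam13820 hit (query residues 48-127), copied from the listing.
    query = "IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV"
    model = "TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV"
    identical, aligned = aligned_identity(query, model)
    print(f"{identical} identical of {aligned} aligned columns")
```

This is only a per-block count; it is not the statistic behind the reported bit score or E-value, which come from the position-specific scoring matrix of the domain model.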
|
Name |
Accession |
Description |
Interval |
E-value |
| Nucleic_acid_bd |
pfam13820 |
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... |
48-195 |
1.01e-59 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 201.50 E-value: 1.01e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462580004 128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIFflgiiRMNNPATVMIPPGGNV 195
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLI-----SDALPLHLRLAESGEY 143
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
537-862 |
1.01e-08 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 60.02 E-value: 1.01e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 537 IFSLACSKSGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMN 616
Cdd:pfam09606 47 ILHVRDMSKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTAS 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 617 NQQAGTsGVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNP 689
Cdd:pfam09606 122 NLLASL-GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQM 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 690 PSQNLGPS-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIM 768
Cdd:pfam09606 201 PPQMGVPGmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGP 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 769 RGPTPNMQgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQ 848
Cdd:pfam09606 281 GQPMGPPG----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWN 353
|
330
....*....|....
gi 2462580004 849 QTNMVPPHVQAMQG 862
Cdd:pfam09606 354 PGNFGGLGANPMQR 367
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1080-1326 |
8.71e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 8.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1080 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1159
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1160 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1228
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1229 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1308
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
|
250
....*....|....*...
gi 2462580004 1309 qAPSNLTMNPSNFATPQT 1326
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
636-806 |
3.44e-06 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 51.96 E-value: 3.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 636 QPQQGPPSQLMGMHQQIVPSQGQMVQQQGTLNPQnpmilsraQLMPQGQMMVNPPSQNLGPSPQRmtppkqmLSQQGPQM 715
Cdd:pfam09770 209 KPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQ--------QQQPQQQPQQPQQHPGQGHPVTI-------LQRPQSPQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 716 MAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQsnvMPGPAQIMRGPTPNMQgnmvqftGQMSGQMLPQQG 795
Cdd:pfam09770 274 PDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVG---YPQNPQPGVQPAPAHQ-------AHRQQGSFGRQA 343
|
170
....*....|.
gi 2462580004 796 PVNNSPSQVMG 806
Cdd:pfam09770 344 PIITHPQQLAQ 354
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1109-1583 |
1.13e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 1.13e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1109 PASVPPSPDkQRMPMPVNTPlgsnsrkmvyqeSPQNPSSSPLAEMASLPEASGSeaPSVPGGPNNMPSHVV----LPQNQ 1184
Cdd:PHA03247 2557 PAAPPAAPD-RSVPPPRPAP------------RPSEPAVTSRARRPDAPPQSAR--PRAPVDDRGDPRGPAppspLPPDT 2621
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1185 LMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLS 1264
Cdd:PHA03247 2622 HAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1265 PERLNASIAGLFPPQINIPLPPRPNLNRgfdqQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGA 1344
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAAR----QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1345 TKRASPSNSRRSSPGSSRKTTPS---PGRQNSKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGLN 1421
Cdd:PHA03247 2778 GPPRRLTRPAVASLSESRESLPSpwdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1422 PQNSTVSVAAVGGVVEDNKESLNVPQDSdcqnsqsrkeqvnIELKAVPAQEVKMVVPEDQSKKDGQPSDPNK--LPSVEE 1499
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR-------------LARPAVSRSTESFALPPDQPERPPQPQAPPPpqPQPQPP 2924
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1500 NKNLVSPAMREAPTSLSQL---LDNSGAPNVTIKPPGLTDLEVTPPVVSGEDLKKASVIPTLQDLSSSKEPSNSLNLPHS 1576
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLaptTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004
|
....*..
gi 2462580004 1577 NELCSSL 1583
Cdd:PHA03247 3005 SSWASSL 3011
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
190-491 |
3.99e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 3.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 190 PPGGNVSSSMMAPGPNPELQPRtPRPASQSDamdPLLSGLHIQQQSHPSGSLAP-----PHHPMQPVSVNRQMNPANFPQ 264
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPA-PHALVSAT---PLPPGPAAARQASPALPAAPappavPAGPATPGGPARPARPPTTAG 2765
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 265 LQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHqqqqpqgirPQFTAPTQVPVPPGWNQLPSGALQPPPaqgsLGTMT 344
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS---------PWDPADPPAAVLAPAAALPPAASPAGP----LPPPT 2832
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 345 ANQGWKKAPLPGPMQQQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNPGQFTAPQMKSL-QGGPSRVPTPL 423
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALpPDQPERPPQPQ 2912
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462580004 424 QQPHLTNKSPASSPssfqqgsPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNPMVQQGNVP 491
Cdd:PHA03247 2913 APPPPQPQPQPPPP-------PQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
953-1514 |
4.72e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 4.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 953 DLNTPDTRPAgleeadqpPLPGEQGINLDNSGPKLPEFSNRP--PGYPS-QPVEQRPLQQMPPQLMQHVAPPPQPPQQQP 1029
Cdd:PHA03247 2565 DRSVPPPRPA--------PRPSEPAVTSRARRPDAPPQSARPraPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1030 QPQLPQQQQPPPPSQPQSQQQqqqqqqmmmmlmmqqdPKSVRLP--VSQNVHPPRGPLNPDSQRMPMQQSGSVPVmVSLQ 1107
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPA----------------PGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSL-TSLA 2699
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1108 GPASVPPSPDKQRMPMPVNTPL--GSNSRKMVYQESPQNPSSSPLAEMASLPEASGSEA-PSVPGGPNNmPSHVVLPQNQ 1184
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLppGPAAARQASPALPAAPAPPAVPAGPATPGGPARPArPPTTAGPPA-PAPPAAPAAG 2778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1185 LMMTGPKPGPSPLSATQGATPQQP---PVNSLPSSHGHHFPNVAAPTQTSRPKTpnraSPRPYYPQTPNNRPPSTEPSEI 1261
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWdpaDPPAAVLAPAAALPPAASPAGPLPPPT----SAQPTAPPPPPGPPPPSLPLGG 2854
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1262 SLSPERLNASIAGLFPPQINIPLPPRPNLNRgfdqqglnpTTLKAIGQAPSNLTMNPSNFATPQTHKLDsvvvnSGKQSN 1341
Cdd:PHA03247 2855 SVAPGGDVRRRPPSRSPAAKPAAPARPPVRR---------LARPAVSRSTESFALPPDQPERPPQPQAP-----PPPQPQ 2920
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1342 SGATKRASPSNSRRSSPGSSRKTTPSPGRQNSKAPKLTLASQTNAALLQ-NVELPRNVLVSPTPLANPPVPGSFPNNSGL 1420
Cdd:PHA03247 2921 PQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPAPSREAPASSTPPLTGHS 3000
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1421 NPQNSTVSVAAVGGVVEDN-----KESLNVPQDSDCQNSQSRKEQVNIELkavpaqevkmvvpeDQSKKDGQPSDPNKLP 1495
Cdd:PHA03247 3001 LSRVSSWASSLALHEETDPppvslKQTLWPPDDTEDSDADSLFDSDSERS--------------DLEALDPLPPEPHDPF 3066
|
570
....*....|....*....
gi 2462580004 1496 SVEENKNLVSPAMREAPTS 1514
Cdd:PHA03247 3067 AHEPDPATPEAGARESPSS 3085
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
636-779 |
6.51e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.49 E-value: 6.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 636 QPQQGPPSQ-LMGMHQQIVPSQGQMVQQQGTLNPQNPMilsraqLMPQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQ 714
Cdd:TIGR01628 384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462580004 715 MMAPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkQQFNTQNQSNVMPGPAQIMRGPTPNMQGNM 779
Cdd:TIGR01628 458 PMQPVMYPPNYQSLPLSQDLP--------------QPQSTASQGGQNKKLAQVLASATPQMQKQV 508
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1131-1426 |
8.77e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 8.77e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1131 SNSRKMVYQESPQNPSSSPLAEmaslpeASGSEAPSVPGGpnnMPSHVVLPQNQLM--MTGPK------PGPSPLSATQG 1202
Cdd:pfam05109 414 TTTHKVIFSKAPESTTTSPTLN------TTGFAAPNTTTG---LPSSTHVPTNLTApaSTGPTvstadvTSPTPAGTTSG 484
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1203 ATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSE--ISLSPERLNASIAGLF 1276
Cdd:pfam05109 485 ASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSavTTPTPNATSPTPAVTT 564
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1277 P-PQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGATKRASPSNSRR 1355
Cdd:pfam05109 565 PtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462580004 1356 SSPGSSRKTTPSPGRQN---SKAPKLTLASQTNAALLQNVElPRNVLVSPTPLANP-PVPGSFPNNSGlnPQNST 1426
Cdd:pfam05109 645 SLRPSSISETLSPSTSDnstSHMPLLTSAHPTGGENITQVT-PASTSTHHVSTSSPaPRPGTTSQASG--PGNSS 716
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
643-794 |
1.91e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.95 E-value: 1.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 643 SQLMGMHQQIVPSQGQMVQqqgtLNPQNPMILSRAQLMPQGQMMVNPPSQNLGPSPQRMtPPKQMLSQQGPQMMAPhnqm 722
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPM----GSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462580004 723 MGPQGQVLLQQNPMIEQimtNQMQGNKQQFNTQNQSNVMPGPAQimrGPTPNMQGNMvQFTGQMSGQMLPQQ 794
Cdd:TIGR01628 440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQN-KKLAQVLASATPQM 504
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
439-678 |
5.23e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.03 E-value: 5.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 439 SFQQGSPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPVSS-------PGRNPMVQQ-----GNVPPNFMVMQQQPPNQGP 506
Cdd:pfam09770 103 NRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTgyekykePEPIPDLQVdaslwGVAPKKAAAPAPAPQPAAQ 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 507 QSLHPGLGEK---------------SEPSNLAVAWPQITFREQIAIFSLACSKSGQANPNFMQGQVPSTTATTPGNSGAP 571
Cdd:pfam09770 183 PASLPAPSRKmmsleeveaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHP 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 572 --QLQANQNVQHAGGQGAGPPQNQMQVSHGPPNMMQPslMGIHGNMNNQQAGTSGVPQvnlsNMQGQPQQGPPSQlmgmH 649
Cdd:pfam09770 263 vtILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP--TQILQNPNRLSAARVGYPQ----NPQPGVQPAPAHQ----A 332
|
250 260
....*....|....*....|....*....
gi 2462580004 650 QQIVPSQGQMVQQQgtLNPQNPMILSRAQ 678
Cdd:pfam09770 333 HRQQGSFGRQAPII--THPQQLAQLSEEE 359
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
385-523 |
8.84e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.26 E-value: 8.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 385 SQQASQAHTNFPQMSNPGQFTAPQMKSLQGGPSRVPTPLQQPHLTNKSPASSPSSFQ----QGSPASSPTVNQTQQQMGP 460
Cdd:pfam09770 211 AQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQrpqsPQPDPAQPSIQPQAQQFHQ 290
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462580004 461 RPPQNNPLP-QGFQQP-VSSPGRNPMVQQGnvPPNFMVMQQQPPNQGPQSLHPGLGEKSEPSNLA 523
Cdd:pfam09770 291 QPPPVPVQPtQILQNPnRLSAARVGYPQNP--QPGVQPAPAHQAHRQQGSFGRQAPIITHPQQLA 353
|
|
| SOBP |
pfam15279 |
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ... |
1104-1318 |
1.18e-03 |
|
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, and in developing mouse brain at embryonic day 14. In postnatal and adult mouse brain, SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.
Pssm-ID: 464609 [Multi-domain] Cd Length: 325 Bit Score: 42.88 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1104 VSLQGPASVPPSPDKQRMPMPVNTPLGS--NSRKMVYQESPQNPSSSPLAEMASLPEASGSEAPSVPGGPNNMPSHVVLP 1181
Cdd:pfam15279 91 ESVSPGPSSSASPSSSPTSSNSSKPLISvaSSSKLLAPKPHEPPSLPPPPLPPKKGRRHRPGLHPPLGRPPGSPPMSMTP 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 1182 QNQLMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHhfPNVAAPTQTSRPKTPNRASPRPYYPQT-PNNRPP-----S 1255
Cdd:pfam15279 171 RGLLGKPQQHPPPSPLPAFMEPSSMPPPFLRPPPSIPQ--PNSPLSNPMLPGIGPPPKPPRNLGPPSnPMHRPPfsphhP 248
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462580004 1256 TEPSEISLSPERLNASIAGLFPPQINIPLPPrpnLNRGFDQQGLNPTTLKAIGQAPSNLTMNP 1318
Cdd:pfam15279 249 PPPPTPPGPPPGLPPPPPRGFTPPFGPPFPP---VNMMPNPPEMNFGLPSLAPLVPPVTVLVP 308
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
190-483 |
2.83e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 2.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 190 PPGGNVS----SSMMAPGPNPELQPRTPRPASQSDAMDPLlsglhiqqqshpsGSLAPPHHPMQPVSvnrqmnPANFPQL 265
Cdd:PHA03247 2656 PAPGRVSrprrARRLGRAAQASSPPQRPRRRAARPTVGSL-------------TSLADPPPPPPTPE------PAPHALV 2716
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 266 QQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQVPVPPGWNQLPSGALQPPPAQGSLGTMTA 345
Cdd:PHA03247 2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRE 2796
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 346 NQGWKKAPLPGPMQQqLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP-GQFTAPQMKSLQGGPSR--VPTP 422
Cdd:PHA03247 2797 SLPSPWDPADPPAAV-LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlGGSVAPGGDVRRRPPSRspAAKP 2875
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462580004 423 LQQPHLTNKS---PASSPSSFQQGSPASSPT-VNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNP 483
Cdd:PHA03247 2876 AAPARPPVRRlarPAVSRSTESFALPPDQPErPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
204-492 |
6.02e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 6.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 204 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLAPPHHPMQPVSVNRQMNPANFPQLQQQQ 269
Cdd:PHA03247 2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 270 QQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQVPV-----PPGWNQLPSGAlqPPPAQGSLGTMT 344
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLtsladPPPPPPTPEPA--PHALVSATPLPP 2723
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462580004 345 ANQGWKKAPLPGPMQQQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNPGQFTAPQMKSLQGGPSRVPTPlq 424
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-- 2801
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462580004 425 qphltnkspaSSPSSFQQGSPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPvSSPGRNPMVQQGNVPP 492
Cdd:PHA03247 2802 ----------WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP-PGPPPPSLPLGGSVAP 2858
|
|
|
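Each hit in the listing above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", with an optional "[Multi-domain]" tag. The sketch below shows one way to pull those fields into a record for sorting or filtering. It assumes the line layout is exactly as it appears in this listing; the function name `parse_hit_summary` and the dictionary keys are hypothetical and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" tag is optional (assumption based on the hits shown).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the summary fields of one hit as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Example using the PABP-1234 summary line from the listing.
hit = parse_hit_summary(
    "Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.95 E-value: 1.91e-04"
)
print(hit)
```

Records parsed this way can be sorted by `e_value` or `bit_score`, for example to separate the lower-scoring hits (E-values around 1e-03 to 1e-04 here) from stronger ones.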