| Name | Accession | Description | Interval | E-value |
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 507-1031 | 3.56e-14 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 78.44 E-value: 3.56e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 654 KYKTTQSPKIPHSK--------PDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 725
Cdd:PHA03247 2713 HALVSATPLPPGPAaarqaspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 726 VPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTTIVPITDLERVTDLETPvafrtEAPGTTLVPAvv 805
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTSAQPTAPPPPPGPPPPS-----LPLGGSVAPG-- 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 806 lEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREPvtlrtetwvtTKAPKTPKRTRRPRPKPQT 885
Cdd:PHA03247 2860 -GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER----------PPQPQAPPPPQPQPQPPPP 2926
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 886 TPTPETPLTKPVAATDLEPSALSTEV--PATVVLATALTP-VTLRTKAPKTTTLAPnvqRTRRPHPRPKTTASTGVSESK 962
Cdd:PHA03247 2927 PQPQPPPPPPPRPQPPLAPTTDPAGAgePSGAVPQPWLGAlVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSR 3003
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 963 sVSDDLELVAFSTESPQKTIAPRQTTSMPPKLK-----------TPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTS 1031
Cdd:PHA03247 3004 -VSSWASSLALHEETDPPPVSLKQTLWPPDDTEdsdadslfdsdSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARES 3082
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1302-1393 | 2.67e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.66 E-value: 2.67e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1302 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1379
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1907118335 1380 LGEGPASNTVAFST 1393
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 1303-1383 | 8.63e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.08 E-value: 8.63e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1303 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1380
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1907118335 1381 GEG 1383
Cdd:smart00060 81 GEG 83
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 1303-1386 | 1.07e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 48.18 E-value: 1.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1303 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1379
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1907118335 1380 LGEGPAS 1386
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 991-1305 | 1.94e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.55 E-value: 1.94e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 991 PPKLKTPHSRMPAKEPVPKEPlhttSKPKMPPSPEVADTTSAPLETRGIPLIP-VISPRPSQEELqtAMEETDQSTQELF 1069
Cdd:PHA03247 2483 PAEARFPFAAGAAPDPGGGGP----PDPDAPPAPSRLAPAILPDEPVGEPVHPrMLTWIRGLEEL--ASDDAGDPPPPLP 2556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1070 TTKIPRTTELAKTTqaphrlhtapvrPRIPGRPHGrPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVD 1149
Cdd:PHA03247 2557 PAAPPAAPDRSVPP------------PRPAPRPSE-PAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPD 2620
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1150 SHATRKPGSVSGTRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKE 1229
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD 2700
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907118335 1230 PTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1305
Cdd:PHA03247 2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 384-757 | 3.43e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.61 E-value: 3.43e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154 171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154 251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154 327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSEPP 678
Cdd:pfam03154 407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118335 679 LASTTKKVRRPRPKPQTTPHPEVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 757
Cdd:pfam03154 484 STSSAMPGIQPPSSASVSSSGPVPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
|
|
| PspC_subgroup_2 | NF033839 | pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... | 491-702 | 6.68e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.46 E-value: 6.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839 286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839 365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1907118335 648 THRQRTKYKTTQSPKIPHSKPDLGPitsEPPLASTTKKVRRPRPKPQTTPHPEVP 702
Cdd:NF033839 431 VKPQPEKPKPEVKPQPEKPKPEVKP---QPETPKPEVKPQPEKPKPEVKPQPEKP 482
|
|
| FN3 | COG3401 | Fibronectin type 3 domain [General function prediction only]; | 1287-1398 | 1.17e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.53 E-value: 1.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1287 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1366
Cdd:COG3401 220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
|
90 100 110
....*....|....*....|....*....|...
gi 1907118335 1367 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1398
Cdd:COG3401 295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 116-195 | 1.76e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.63 E-value: 1.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1907118335 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 114-195 | 2.53e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.06 E-value: 2.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69
|
....*.
gi 1907118335 190 YEFGVK 195
Cdd:smart00060 70 YEFRVR 75
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 114-195 | 2.64e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 41.33 E-value: 2.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71
|
....
gi 1907118335 192 FGVK 195
Cdd:cd00063 72 FRVR 75
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 307-575 | 4.55e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 4.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247 2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247 2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 464 RTAEqPRATLAPiealfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTK 543
Cdd:PHA03247 2893 RSTE-SFALPPD--------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
|
250 260 270
....*....|....*....|....*....|....*..
gi 1907118335 544 KPGHHRLRRPKTTRSP----EVPKSKPALE-PATVTP 575
Cdd:PHA03247 2958 AVPQPWLGALVPGRVAvprfRVPQPAPSREaPASSTP 2994
|
|
| Not5 | COG5665 | CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; | 428-769 | 8.64e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 40.80 E-value: 8.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665 208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665 288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 565 KPALEPATVTPEILV-PKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaiVTTTDIEPVITRTKASVTTLAPKP 643
Cdd:COG5665 364 DLATPVSPTPPETSVdKKVSPDSATSSTKSEKEGGTASSPMPPNIAIGAKDD----VDATDPSQEAKEYTKNAPMTPEAD 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 644 PRPRThrqrtkykTTQSPKIPHSKPDLGPItsepplaSTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT 723
Cdd:COG5665 440 SAPES--------SVRTEASPSAGSDLEPE-------NTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDS 504
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1907118335 724 TLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 769
Cdd:COG5665 505 TPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
|
|
|
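Each hit above carries a statistics line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …` and a query interval such as `507-1031`. The following is a minimal sketch, not part of the original output, of how those two pieces could be read programmatically. The names `HitStats`, `parse_stats`, and `interval_length` are hypothetical helpers introduced only for illustration, and the regular expression assumes the field order shown in the listing.

```python
import re
from typing import NamedTuple


class HitStats(NamedTuple):
    """Numeric fields from one statistics line (hypothetical container)."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order seen in the listing:
# Pssm-ID, (optional bracketed tag), Cd Length, Bit Score, E-value.
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*([\d.eE+-]+)"
)


def parse_stats(line: str) -> HitStats:
    """Extract the numeric fields from a statistics line."""
    match = _STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return HitStats(
        pssm_id=int(match.group(1)),
        cd_length=int(match.group(2)),
        bit_score=float(match.group(3)),
        e_value=float(match.group(4)),
    )


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = interval.split("-")
    return int(end) - int(start) + 1


if __name__ == "__main__":
    stats = parse_stats(
        "Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 "
        "Bit Score: 78.44 E-value: 3.56e-14"
    )
    print(stats, interval_length("507-1031"))
```

Intervals are treated as inclusive at both ends, matching the residue numbering printed at the start and end of each alignment block.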
| Name | Accession | Description | Interval | E-value |
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 507-1031 | 3.56e-14 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 78.44 E-value: 3.56e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 654 KYKTTQSPKIPHSK--------PDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 725
Cdd:PHA03247 2713 HALVSATPLPPGPAaarqaspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 726 VPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTTIVPITDLERVTDLETPvafrtEAPGTTLVPAvv 805
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTSAQPTAPPPPPGPPPPS-----LPLGGSVAPG-- 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 806 lEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREPvtlrtetwvtTKAPKTPKRTRRPRPKPQT 885
Cdd:PHA03247 2860 -GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER----------PPQPQAPPPPQPQPQPPPP 2926
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 886 TPTPETPLTKPVAATDLEPSALSTEV--PATVVLATALTP-VTLRTKAPKTTTLAPnvqRTRRPHPRPKTTASTGVSESK 962
Cdd:PHA03247 2927 PQPQPPPPPPPRPQPPLAPTTDPAGAgePSGAVPQPWLGAlVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSR 3003
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 963 sVSDDLELVAFSTESPQKTIAPRQTTSMPPKLK-----------TPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTS 1031
Cdd:PHA03247 3004 -VSSWASSLALHEETDPPPVSLKQTLWPPDDTEdsdadslfdsdSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARES 3082
|
|
| PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 509-775 | 9.30e-13 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 73.57 E-value: 9.30e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449 542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 580 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 659
Cdd:PTZ00449 620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 660 SPKIPHS-KPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 738
Cdd:PTZ00449 656 SPKPPKSpKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
|
250 260 270
....*....|....*....|....*....|....*..
gi 1907118335 739 KPKTTRSPAASPTELVPTPVfEPVTPLKEDPVTTIVP 775
Cdd:PTZ00449 735 PIGDPDAEQPDDIEFFTPPE-EERTFFHETPADTPLP 770
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 545-1052 | 8.89e-11 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 67.27 E-value: 8.89e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 611
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 612 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRP----RTHRQRTKYKTTqSPKIPHSKPDlGPITSEPPLASTTKKVR 687
Cdd:PHA03247 2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPavtsRARRPDAPPQSA-RPRAPVDDRG-DPRGPAPPSPLPPDTHA 2623
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 688 RPRPKPQTTPHP-EVPHTILVPATSLEPFIITEAPGTTLVPK---LPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVT 763
Cdd:PHA03247 2624 PDPPPPSPSPAAnEPDPHPPPTVPPPERPRDDPAPGRVSRPRrarRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 764 PLK--EDPVTTIVPITDLERVtdletPVAFRTEAPGTTLVPAVvlepvtlrPEVQVTTLAPQKTQKKHRPSPKPkpvpsp 841
Cdd:PHA03247 2704 PPPtpEPAPHALVSATPLPPG-----PAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTA------ 2764
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 842 evTESKPVLPRVREPVTLRTETwVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPSALSTEVPATVVLATAL 921
Cdd:PHA03247 2765 --GPPAPAPPAAPAAGPPRRLT-RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 922 TPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGVSESKSVSDDLELVAFSTES---PQKTIAPRQTTSMPPKLKTPH 998
Cdd:PHA03247 2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalPPDQPERPPQPQAPPPPQPQP 2921
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118335 999 SRMPAKEPVPKEPlhTTSKPKMPPSPEvADTTSAPLETRGIP------LIP---------VISPRPSQE 1052
Cdd:PHA03247 2922 QPPPPPQPQPPPP--PPPRPQPPLAPT-TDPAGAGEPSGAVPqpwlgaLVPgrvavprfrVPQPAPSRE 2987
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 448-775 | 2.40e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 65.73 E-value: 2.40e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 448 TATRDPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 527
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 528 ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKP-ALEPATVTPEILVPkivpkPPQKPKATRRPEVPQVKPAHE 606
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPaAALPPAASPAGPLP-----PPTSAQPTAPPPPPGPPPPSL 2850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 607 PvTFGSEAPAlaivtttdiEPVITRTKASVTTLAP-KPPRPRTHRqrtkykttqspkiphskpdlgpiTSEPPLASTTKK 685
Cdd:PHA03247 2851 P-LGGSVAPG---------GDVRRRPPSRSPAAKPaAPARPPVRR-----------------------LARPAVSRSTES 2897
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 686 VRRPRPKPQTTPHPEVPHTILVPATSLEPfiiteaPGTTLVPKLPQQPDYPhPKPKTTRSPAASPTELVPTPVFEPVTPL 765
Cdd:PHA03247 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPP------PQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQPWLGALVPG 2970
|
330
....*....|
gi 1907118335 766 KEDPVTTIVP 775
Cdd:PHA03247 2971 RVAVPRFRVP 2980
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1302-1393 | 2.67e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.66 E-value: 2.67e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1302 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1379
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1907118335 1380 LGEGPASNTVAFST 1393
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 659-1248 | 3.36e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 62.26 E-value: 3.36e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 659 QSPKIPHSKPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTlvPKLPQQPdyphp 738
Cdd:PHA03247 2487 RFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPP--PPLPPAA----- 2559
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 739 kpkttrsPAASPTELVPTPVFEPVTPlkEDPVTTIVPITDL-ERVTDLETPVAFRTEAPGTTlvPAVVLEPVTLRPEVQV 817
Cdd:PHA03247 2560 -------PPAAPDRSVPPPRPAPRPS--EPAVTSRARRPDApPQSARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPP 2628
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 818 TTLAPQKTQKkhrPSPKPKPVPSPEVTESKPVLPRVREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPV 897
Cdd:PHA03247 2629 PSPSPAANEP---DPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP 2705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 898 AATDLEPSALSTEVPATVVLATA--LTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGVSESKSVSddlelvafST 975
Cdd:PHA03247 2706 PTPEPAPHALVSATPLPPGPAAArqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP--------AA 2777
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 976 ESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTSApletrgIPLIPVISPRPSQEELQ 1055
Cdd:PHA03247 2778 GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA------QPTAPPPPPGPPPPSLP 2851
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1056 TameETDQSTQELFTTKIPrttelaktTQAPHRLHTAPVRPRIpgRPHGRPALNKTTTrpdktkprgtshkngvgtgtKQ 1135
Cdd:PHA03247 2852 L---GGSVAPGGDVRRRPP--------SRSPAAKPAAPARPPV--RRLARPAVSRSTE--------------------SF 2898
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1136 APKPPSPGRnasvdshaTRKPGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHP 1215
Cdd:PHA03247 2899 ALPPDQPER--------PPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVP 2969
|
570 580 590
....*....|....*....|....*....|...
gi 1907118335 1216 IGTATARPGAEQKEPTAPASEEEFGTTTDFSSS 1248
Cdd:PHA03247 2970 GRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 1303-1383 | 8.63e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.08 E-value: 8.63e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1303 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1380
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1907118335 1381 GEG 1383
Cdd:smart00060 81 GEG 83
|
|
| PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 381-766 | 1.08e-07 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 57.01 E-value: 1.08e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 381 KRFPEFPEAKTAFPLEKPRGSWASSEEPwVVPGAKTSEdSRVVQPQTATYDVISSSTTSDETEIEI---------HTATR 451
Cdd:PTZ00449 494 KKLAPIEEEDSDKHDEPPEGPEASGLPP-KAPGDKEGE-EGEHEDSKESDEPKEGGKPGETKEGEVgkkpgpakeHKPSK 571
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 452 DPILDSVP-----PKTSRTAEQPRATLAPIEAlfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpAPE 526
Cdd:PTZ00449 572 IPTLSKKPefpkdPKHPKDPEEPKKPKRPRSA-----------QRPTRPKSPKLPELLDIPKSPKRPESPKSPK---RPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 527 PETRPSAQttkaprktkkpghhrlRRPKTTRSPEVPKSKPAlepatvtpeilvpkivPKPPQKPKATRRPEVPQVKPAHE 606
Cdd:PTZ00449 638 PPQRPSSP----------------ERPEGPKIIKSPKPPKS----------------PKPPFDPKFKEKFYDDYLDAAAK 685
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 607 PVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRThrqrtkykttqsPKIPHSKPdlgpitSEPPLASTTKK 685
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRD------------EEFPFEPI------GDPDAEQPDDI 747
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 686 VRRPRPKPQTTPHPEvphtilVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVTP 764
Cdd:PTZ00449 748 EFFTPPEEERTFFHE------TPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSLP 814
|
..
gi 1907118335 765 LK 766
Cdd:PTZ00449 815 KK 816
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 1303-1386 | 1.07e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 48.18 E-value: 1.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1303 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1379
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1907118335 1380 LGEGPAS 1386
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
991-1305 |
1.94e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.55 E-value: 1.94e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 991 PPKLKTPHSRMPAKEPVPKEPlhttSKPKMPPSPEVADTTSAPLETRGIPLIP-VISPRPSQEELqtAMEETDQSTQELF 1069
Cdd:PHA03247 2483 PAEARFPFAAGAAPDPGGGGP----PDPDAPPAPSRLAPAILPDEPVGEPVHPrMLTWIRGLEEL--ASDDAGDPPPPLP 2556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1070 TTKIPRTTELAKTTqaphrlhtapvrPRIPGRPHGrPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVD 1149
Cdd:PHA03247 2557 PAAPPAAPDRSVPP------------PRPAPRPSE-PAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPD 2620
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1150 SHATRKPGSVSGTRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKE 1229
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD 2700
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907118335 1230 PTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1305
Cdd:PHA03247 2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
384-757 |
3.43e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.61 E-value: 3.43e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154 171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154 251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154 327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSEPP 678
Cdd:pfam03154 407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118335 679 LASTTKKVRRPRPKPQTTPHPEVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 757
Cdd:pfam03154 484 STSSAMPGIQPPSSASVSSSGPVPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
491-702 |
6.68e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.46 E-value: 6.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839 286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839 365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1907118335 648 THRQRTKYKTTQSPKIPHSKPDLGPitsEPPLASTTKKVRRPRPKPQTTPHPEVP 702
Cdd:NF033839 431 VKPQPEKPKPEVKPQPEKPKPEVKP---QPETPKPEVKPQPEKPKPEVKPQPEKP 482
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
517-711 |
9.26e-05 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 47.35 E-value: 9.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377 414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRT------KYKTTQSPKIPHSK 667
Cdd:PHA03377 489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGfqrsgrRQKRATPPKVSPSD 559
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907118335 668 --------PDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATS 711
Cdd:PHA03377 560 rgppkaspPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASG 611
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1287-1398 |
1.17e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.53 E-value: 1.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1287 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1366
Cdd:COG3401 220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
|
90 100 110
....*....|....*....|....*....|...
gi 1907118335 1367 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1398
Cdd:COG3401 295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1287-1441 |
1.57e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.15 E-value: 1.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1287 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1364
Cdd:COG3401 314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118335 1365 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1441
Cdd:COG3401 388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
116-195 |
1.76e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.63 E-value: 1.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1907118335 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
420-746 |
1.96e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 46.23 E-value: 1.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 420 SRVVQPQTATYDVISSSTTSDE---------TEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 489
Cdd:PRK10263 297 NRATQPEYDEYDPLLNGAPITEpvavaaaatTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 490 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:PRK10263 376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 568 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 637
Cdd:PRK10263 456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 638 tlAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSEPPLASTTKkvrrprpkpQTTPHPEVPHTILVPATSLepfii 717
Cdd:PRK10263 525 --AWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK---------KATLATGAAATVAAPVFSL----- 588
|
330 340
....*....|....*....|....*....
gi 1907118335 718 teAPGTTLVPKLPQQPDYPHPKPKTTRSP 746
Cdd:PRK10263 589 --ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
114-195 |
2.53e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.06 E-value: 2.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69
|
....*.
gi 1907118335 190 YEFGVK 195
Cdd:smart00060 70 YEFRVR 75
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
114-195 |
2.64e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 41.33 E-value: 2.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71
|
....
gi 1907118335 192 FGVK 195
Cdd:cd00063 72 FRVR 75
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
943-1304 |
4.11e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.31 E-value: 4.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 943 TRRPHPRPKTTASTGVSESKSVSDDLELVAFSTESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEPlhTTSKPKMPP 1022
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEP--DPHPPPTVP 2647
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1023 SPEVADTTSAPLETR-----GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPrttelaKTTQAPHRLHTAPVRPR 1097
Cdd:PHA03247 2648 PPERPRDDPAPGRVSrprraRRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP------PPTPEPAPHALVSATPL 2721
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1098 IPGRPHGRPALNKTTTRPdktKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRHSSTRPV 1177
Cdd:PHA03247 2722 PPGPAAARQASPALPAAP---APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL 2798
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1178 SPERRPLPPNNVTgkPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGttTDFSSSPTKETDPLG 1257
Cdd:PHA03247 2799 PSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG--GDVRRRPPSRSPAAK 2874
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 1907118335 1258 KPRFIGPHVRYIPKPENKPCSIT-----DSVRRFPTEEATEGNATSPPQNPP 1304
Cdd:PHA03247 2875 PAAPARPPVRRLARPAVSRSTESfalppDQPERPPQPQAPPPPQPQPQPPPP 2926
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
515-626 |
4.54e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 44.80 E-value: 4.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
|
90 100 110
....*....|....*....|....*....|..
gi 1907118335 595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950 434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
559-821 |
1.15e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.92 E-value: 1.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 559 PEVPKSKPALEPATVTPEILVPKIVPKPPQKPKA-----TRRPEVPQVKPA-HEPVTFGSEAPAlaiVTTTdiEPVITrt 632
Cdd:PRK10263 302 PEYDEYDPLLNGAPITEPVAVAAAATTATQSWAApvepvTQTPPVASVDVPpAQPTVAWQPVPG---PQTG--EPVIA-- 374
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 633 kasvttlapkpPRPRTHRQRTKYKttqSPKIPHSKPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATS- 711
Cdd:PRK10263 375 -----------PAPEGYPQQSQYA---QPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEq 440
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 712 --LEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVtPLKEDPVTTIVPITDLERVtdlETPV 789
Cdd:PRK10263 441 pvAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE-PVVEETKPARPPLYYFEEV---EEKR 516
|
250 260 270
....*....|....*....|....*....|....
gi 1907118335 790 AFRTE--APGTTLVPAVVLEPVTLRPEVQVTTLA 821
Cdd:PRK10263 517 AREREqlAAWYQPIPEPVKEPEPIKSSLKAPSVA 550
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
490-618 |
1.57e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 42.93 E-value: 1.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994 370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118335 569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994 450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
491-765 |
1.61e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.22 E-value: 1.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154 172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPrt 648
Cdd:pfam03154 252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQP-- 297
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 649 hrqrtkykttqSPKIPHSKPDLGPITSEPPLASTTKKvRRPRPKPQTTPHPEVP--HTILVPATSLEPFIitEAPGTTLV 726
Cdd:pfam03154 298 -----------FPLTPQSSQSQVPPGPSPAAPGQSQQ-RIHTPPSQSQLQSQQPprEQPLPPAPLSMPHI--KPPPTTPI 363
|
250 260 270
....*....|....*....|....*....|....*....
gi 1907118335 727 PKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 765
Cdd:pfam03154 364 PQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
458-770 |
1.74e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 42.91 E-value: 1.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003 372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003 451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTKAsvttlAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPIT 674
Cdd:PRK07003 531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAR-----AAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAAT 605
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 675 SEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRSPAAsPTELV 754
Cdd:PRK07003 606 GDAPPNGAARAEQAAESRGAPPPWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADAPAP-PVDTR 681
|
330
....*....|....*.
gi 1907118335 755 PTPvfePVTPLkeDPV 770
Cdd:PRK07003 682 PLP---PAIPL--DAI 692
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
480-628 |
1.89e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 42.87 E-value: 1.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950 351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118335 560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950 427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
531-775 |
2.46e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 2.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 531 PSAQTTKAPRKTKKPGHHRlrrpKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPqvkPAHEPVTF 610
Cdd:PHA03247 255 PAPPPVVGEGADRAPETAR----GATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA---PAGDAEEE 327
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 611 GSEAPALAIVTTtdiepvITRTKASVTTLAPKPPRPrthrqrtkyktTQSPkiPHSKPDLGPITSEPPLASTTKKVRRpr 690
Cdd:PHA03247 328 DDEDGAMEVVSP------LPRPRQHYPLGFPKRRRP-----------TWTP--PSSLEDLSAGRHHPKRASLPTRKRR-- 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 691 pkpqTTPHPEVPHTiLVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPV 770
Cdd:PHA03247 387 ----SARHAATPFA-RGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEP 461
|
....*
gi 1907118335 771 TTIVP 775
Cdd:PHA03247 462 APDDP 466
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
469-600 |
3.32e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.28 E-value: 3.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764 371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118335 545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764 449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
553-774 |
3.45e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 41.84 E-value: 3.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 553 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 627
Cdd:PLN03209 330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 628 VITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHskPDLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTILV 707
Cdd:PLN03209 407 PAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARY--EDLKPPTS-------------PSPTAPTGVSPSVSSTSSV 471
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907118335 708 PATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIV 774
Cdd:PLN03209 472 PAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
515-612 |
3.59e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 41.85 E-value: 3.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954 385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
|
90
....*....|....*...
gi 1907118335 595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954 451 PRNVASGKPG---VDLGS 465
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
307-575 |
4.55e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 4.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247 2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247 2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 464 RTAEqPRATLAPiealfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTK 543
Cdd:PHA03247 2893 RSTE-SFALPPD--------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
|
250 260 270
....*....|....*....|....*....|....*..
gi 1907118335 544 KPGHHRLRRPKTTRSP----EVPKSKPALE-PATVTP 575
Cdd:PHA03247 2958 AVPQPWLGALVPGRVAvprfRVPQPAPSREaPASSTP 2994
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
515-717 |
4.61e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 41.35 E-value: 4.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPE--ILVPKIVPKP 586
Cdd:PRK14086 87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPEpgAWPRAADDYG 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 587 PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHS 666
Cdd:PRK14086 167 WQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRG 241
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1907118335 667 KPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 717
Cdd:PRK14086 242 GPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
448-750 |
5.91e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 41.31 E-value: 5.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 448 TATRDPILDSVPPKTSRTAEQPRATLAPiealfesrnveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 527
Cdd:PHA03307 60 AACDRFEPPTGPPPGPGTEAPANESRST-------------PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 528 ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPE------VPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQV 601
Cdd:PHA03307 127 PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPaavasdAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 602 KPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSEPPLAS 681
Cdd:PHA03307 207 PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPAS 286
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118335 682 TTKKVRRPRPKPQ---------TTPHPEVPHTILVPATSLE-PFIITEAPGTTLVPklPQQPDYPHPKPKTTRSPAASP 750
Cdd:PHA03307 287 SSSSPRERSPSPSpsspgsgpaPSSPRASSSSSSSRESSSSsTSSSSESSRGAAVS--PGPSPSRSPSPSRPPPPADPS 363
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
491-682 |
6.45e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.01 E-value: 6.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323 383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323 463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 1907118335 641 PKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSE---PPLAST 682
Cdd:PRK12323 543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDgdwPALAAR 587
|
|
| COG3979 |
COG3979 |
Chitodextrinase [Carbohydrate transport and metabolism]; |
1299-1398 |
7.48e-03 |
|
Chitodextrinase [Carbohydrate transport and metabolism];
Pssm-ID: 443178 [Multi-domain] Cd Length: 369 Bit Score: 40.53 E-value: 7.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1299 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1377
Cdd:COG3979 2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
|
90 100
....*....|....*....|.
gi 1907118335 1378 nplgeGPASNTVAFSTESADP 1398
Cdd:COG3979 72 -----DAAGNVSAASGTSTAM 87
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
451-539 |
8.40e-03 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 39.60 E-value: 8.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 451 RDPIlDSVPPKTSRTAEQP--------RATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVK 522
Cdd:PRK11633 50 RDEP-DMMPAATQALPTQPpegaaeavRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAP 128
|
90
....*....|....*..
gi 1907118335 523 PAPEPETRPSAQTTKAP 539
Cdd:PRK11633 129 PAPKPEPKPVVEEKAAP 145
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
428-769 |
8.64e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 40.80 E-value: 8.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665 208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665 288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 565 KPALEPATVTPEILV-PKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaiVTTTDIEPVITRTKASVTTLAPKP 643
Cdd:COG5665 364 DLATPVSPTPPETSVdKKVSPDSATSSTKSEKEGGTASSPMPPNIAIGAKDD----VDATDPSQEAKEYTKNAPMTPEAD 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 644 PRPRThrqrtkykTTQSPKIPHSKPDLGPItsepplaSTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT 723
Cdd:COG5665 440 SAPES--------SVRTEASPSAGSDLEPE-------NTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDS 504
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1907118335 724 TLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 769
Cdd:COG5665 505 TPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
467-870 |
9.34e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.82 E-value: 9.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 467 EQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPstpkrQSTPKPPRVKPAPEP-ETRPSAQTTKAPR----- 540
Cdd:PHA03378 345 EAVRLPDDPIIVEDDDESEEIESECDPDEDKSGAEALASIP-----QTLPDPPTVYGRPKVfARKADLKSTKKCRaivtd 419
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 541 ---------KTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKP--PQKPKATrrpevPQVKPA--HEP 607
Cdd:PHA03378 420 psvikaieeEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQPLEGPTGPLSVQAPlePWQPLPH-----PQVTPVilHQP 494
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 608 VTFGSEAP-ALAIVTTTDIEPVITRTKAsvTTLAPKPPRPRTHRQ-----------RTKYKTTQSPKIPH--SKPDLGPI 673
Cdd:PHA03378 495 PAQGVQAHgSMLDLLEKDDEDMEQRVMA--TLLPPSPPQPRAGRRapcvytedldiESDEPASTEPVHDQllPAPGLGPL 572
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 674 TSEPPLASTTKKVRRPRPKPQTTPHPeVPHtilvPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTEL 753
Cdd:PHA03378 573 QIQPLTSPTTSQLASSAPSYAQTPWP-VPH----PSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLV 647
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 754 VPTPVFEP-VTPLKEDPVTTIVPITDLERVTDLETPVAFRTEAPGTTLVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPS 832
Cdd:PHA03378 648 FPTPHQPPqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPP 727
|
410 420 430
....*....|....*....|....*....|....*...
gi 1907118335 833 PKPKPVPSPEVTESKPVLPRVREPVTLRTETWVTTKAP 870
Cdd:PHA03378 728 AAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 765
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
491-778 |
9.39e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 40.37 E-value: 9.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PHA03369 362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPT 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 571 ATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaivtTTDIEPVITRTKASVTTLAPKPPRPRTHR 650
Cdd:PHA03369 442 NPYVMPISMANMVYPGHPQEHGHERKRKRGGELKEELIETLKLVK------KLKEEQESLAKELEATAHKSEIKKIAESE 515
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 651 QRTKYKTTQSPKI-PHSKPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT------ 723
Cdd:PHA03369 516 FKNAGAKTAAANIePNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTaealag 595
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118335 724 ---TLVPKLPQQPDYPHpkpktTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPITD 778
Cdd:PHA03369 596 aieTLLTQASAQPAGLS-----LPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPSLE 648
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1016-1303 |
9.69e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 40.44 E-value: 9.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1016 SKPKMPP-SPEVADTTSAPLETRGIPLIPVISPRPSQEElqTAMEETDQSTQELFTTKIPRTTELAKTTQAPhrlhtAPV 1094
Cdd:PTZ00449 492 SKKKLAPiEEEDSDKHDEPPEGPEASGLPPKAPGDKEGE--EGEHEDSKESDEPKEGGKPGETKEGEVGKKP-----GPA 564
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1095 RPRipgRPHGRPALNKTTTRPDKTKprgtshkngvgtgtkqAPKPPSPGRnasvdshATRKPGSVSGTRRPPIPHRHSST 1174
Cdd:PTZ00449 565 KEH---KPSKIPTLSKKPEFPKDPK----------------HPKDPEEPK-------KPKRPRSAQRPTRPKSPKLPELL 618
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1175 RPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLkatlhpigTATARPGAEQKEPTAPASEEEFGTTTDFSSSPTKETD 1254
Cdd:PTZ00449 619 DIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKI--------IKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETK 690
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1907118335 1255 PLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEGNATSPPQNP 1303
Cdd:PTZ00449 691 TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDP 739
|
|
|