Conserved domains on  [gi|1034632399|ref|XP_016861595|]

target of Nesh-SH3 isoform X11 [Homo sapiens]

Protein Classification

fibronectin type III domain-containing protein (domain architecture ID 10414115)

Fibronectin type III (FN3) domain-containing protein; may be involved in specific interactions with other molecules through its FN3 domain.

Gene Ontology:  GO:0005515
PubMed:  8981327

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
635-1202 7.82e-15

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 7.82e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  635 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFET 710
Cdd:PHA03247  2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  711 EAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSA--PTTTTKRTR 788
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdpPPPPPTPEP 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  789 RPHPKPKTTPHPEVPQT--KLVPATILEPVLRTEASGTT--------AAPKVPQRTHRPHPK--PKTTLSPEELQTELVP 856
Cdd:PHA03247  2711 APHALVSATPLPPGPAAarQASPALPAAPAPPAVPAGPAtpggparpARPPTTAGPPAPAPPaaPAAGPPRRLTRPAVAS 2790
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  857 ATIFEPVSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATvlePVTLRPE 936
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG---DVRRRPP 2867
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  937 ASTTlASKTSQRTRRPRLRTKTTPRPEAPESKPVPtaelkpvtlrtetwvttqaPKTSQRTRRPRPKTKTTPSPEVPQTK 1016
Cdd:PHA03247  2868 SRSP-AAKPAAPARPPVRRLARPAVSRSTESFALP-------------------PDQPERPPQPQAPPPPQPQPQPPPPP 2927
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1017 LVPSTDLEPGtlRTEAPktmvvttvLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTSSPEVPQNKSvSVTGFEPVVHST 1096
Cdd:PHA03247  2928 QPQPPPPPPP--RPQPP--------LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP-SREAPASSTPPL 2996
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1097 DAPGTTFVSDVLESVTLSTESPKETIAPAKTdyvyptakapLWPEEPKTEVVESITYVSEPPETTLET-SPLPSQSiTLP 1175
Cdd:PHA03247  2997 TGHSLSRVSSWASSLALHEETDPPPVSLKQT----------LWPPDDTEDSDADSLFDSDSERSDLEAlDPLPPEP-HDP 3065
                          570       580
                   ....*....|....*....|....*..
gi 1034632399 1176 SPDEPQTePAPKQTPRAPPKPKTSPRP 1202
Cdd:PHA03247  3066 FAHEPDP-ATPEAGARESPSSQFGPPP 3091
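Each hit in this record is introduced by a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be read into numeric fields. The field labels come from the record itself; the regular expression and the function name `parse_summary` are illustrative assumptions, not part of any NCBI tool.

```python
import re

# Assumed pattern for the per-hit summary line as it is printed in this record.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of a 'Pssm-ID: ...' line, or None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Summary line copied from the PHA03247 hit above.
line = "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 7.82e-15"
print(parse_summary(line))
```

Keeping the E-value as a float makes it straightforward to rank hits or filter them against a chosen threshold.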
PHA03247 super family cl33720
large tegument protein UL36; Provisional
382-800 1.00e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 1.00e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  382 PLSTLAPKSLPEFPEAKTPFPFE-KPRGTLASSEKPWIVPTAKISED----SKVLQPQTATYD----VFSSPTTSDEPEI 452
Cdd:PHA03247  2608 PRGPAPPSPLPPDTHAPDPPPPSpSPAANEPDPHPPPTVPPPERPRDdpapGRVSRPRRARRLgraaQASSPPQRPRRRA 2687
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  453 SDSYTATSDRILDsiPPKTSRTLE-QPRATLAPSETPFVPQKL-EIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRT 530
Cdd:PHA03247  2688 ARPTVGSLTSLAD--PPPPPPTPEpAPHALVSATPLPPGPAAArQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  531 KPERTTSAGTITPkiskspePTWTTPAPGKTQFISLKPKIPLSPEvthtkPAPEPQTLLPSQSTigpETPGTKPSTTLAP 610
Cdd:PHA03247  2766 PPAPAPPAAPAAG-------PPRRLTRPAVASLSESRESLPSPWD-----PADPPAAVLAPAAA---LPPAASPAGPLPP 2830
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  611 RKTKRPGRRPRPRPRPKTTPSPE--VPKSKPALEPATIQPEPLVPTTASKPSER-------PKTTHRPDAPQIQPGSKPP 681
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpavSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  682 KQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPsmtIVPTTDIEPVtvrteatvttlapktsqrtrtrrprpkhkttPRPE 761
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP---LAPTTDPAGA-------------------------------GEPS 2956
                          410       420       430
                   ....*....|....*....|....*....|....*....
gi 1034632399  762 TLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHP 800
Cdd:PHA03247  2957 GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1532-1623 1.18e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 56.74  E-value: 1.18e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1532 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1609
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1034632399 1610 LGEGPVSNTVAFST 1623
Cdd:cd00063     80 GGESPPSESVTVTT 93
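The paired rows above can be compared column by column. The sketch below counts aligned columns, identical residues, and gap columns for one query/subject row pair. It assumes that "-" marks a gap and that lowercase letters mark residues outside the aligned core, which are compared here without regard to case; the function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query_row, subject_row):
    """Count aligned, identical, and gapped columns in two equal-length alignment rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    identical = aligned = gaps = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            gaps += 1          # a column with a gap on either side
            continue
        aligned += 1           # both rows carry a residue in this column
        if q.upper() == s.upper():
            identical += 1
    return {
        "columns": len(query_row),
        "aligned": aligned,
        "identical": identical,
        "gaps": gaps,
    }


# Final segment of the cd00063 alignment above (query 1610-1623, Cdd 80-93).
stats = alignment_stats("LGEGPVSNTVAFST", "GGESPPSESVTVTT")
print(stats)
```

Concatenating the row pairs of a hit before calling the function gives figures for the whole alignment rather than one 80-column segment.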
PRK10263 super family cl35903
DNA translocase FtsK; Provisional
1133-1385 1.11e-05

DNA translocase FtsK; Provisional


The actual alignment was detected with superfamily member PRK10263:

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.47  E-value: 1.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1133 TAKAPLW--PEEPKTEVVESITYVSEPPETTLETSPLPSqsitlPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPV 1210
Cdd:PRK10263   327 TTATQSWaaPVEPVTQTPPVASVDVPPAQPTVAWQPVPG-----PQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1211 PkvPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPVLQPVTfrfEPPKTTIAPLETRGiPFIPMISPSPsq 1290
Cdd:PRK10263   402 Q--PQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQA---EEQQSTFAPQSTYQ-TEQTYQQPAA-- 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1291 eelQTTLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSD---------KPHIRPVLNRTTTRPTRPKPS-- 1359
Cdd:PRK10263   474 ---QEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRARereqlaawyQPIPEPVKEPEPIKSSLKAPSva 550
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1034632399 1360 ------GMPSGNGVGTGVKQAPRPSGADRNVS 1385
Cdd:PRK10263   551 avppveAAAAVSPLASGVKKATLATGAAATVA 582
FN3 super family cl21522
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
123-202 6.21e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


The actual alignment was detected with superfamily member pfam00041:

Pssm-ID: 473895 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 6.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1034632399  200 GVK 202
Cdd:pfam00041   72 RVQ 74
FN3 super family cl27307
Fibronectin type 3 domain [General function prediction only];
1349-1628 6.99e-04

Fibronectin type 3 domain [General function prediction only];


The actual alignment was detected with superfamily member COG3401:

Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.22  E-value: 6.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1349 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1428
Cdd:COG3401     48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1429 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1505
Cdd:COG3401    128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1506 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1584
Cdd:COG3401    208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1034632399 1585 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1628
Cdd:COG3401    283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
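Several of the intervals listed above cover the same stretch of the query protein. A small sketch for finding such overlaps follows, treating each interval as inclusive on both ends, as the residue numbering in the alignments suggests. The hit labels and coordinates are copied from the list above; the helper name `overlap` is an illustrative assumption.

```python
from itertools import combinations

# (label, start, end) for the hits in the list above; intervals are inclusive.
hits = [
    ("PHA03247 (cl33720)", 635, 1202),
    ("PHA03247 (cl33720)", 382, 800),
    ("FN3 cd00063", 1532, 1623),
    ("PRK10263 (cl35903)", 1133, 1385),
    ("FN3 (cl21522)", 123, 202),
    ("FN3 (cl27307)", 1349, 1628),
]


def overlap(a, b):
    """Number of residues shared by two inclusive (start, end) intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


# Report every pair of hits that shares at least one query residue.
for (name_a, s_a, e_a), (name_b, s_b, e_b) in combinations(hits, 2):
    shared = overlap((s_a, e_a), (s_b, e_b))
    if shared:
        print(f"{name_a} {s_a}-{e_a} overlaps {name_b} {s_b}-{e_b} by {shared} residues")
```

Overlapping hits from different models are expected in a record like this one, since the proline-rich region matches several multi-domain models and the FN3 region is covered by more than one FN3 model.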
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
635-1202 7.82e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 7.82e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  635 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFET 710
Cdd:PHA03247  2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  711 EAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSA--PTTTTKRTR 788
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdpPPPPPTPEP 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  789 RPHPKPKTTPHPEVPQT--KLVPATILEPVLRTEASGTT--------AAPKVPQRTHRPHPK--PKTTLSPEELQTELVP 856
Cdd:PHA03247  2711 APHALVSATPLPPGPAAarQASPALPAAPAPPAVPAGPAtpggparpARPPTTAGPPAPAPPaaPAAGPPRRLTRPAVAS 2790
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  857 ATIFEPVSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATvlePVTLRPE 936
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG---DVRRRPP 2867
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  937 ASTTlASKTSQRTRRPRLRTKTTPRPEAPESKPVPtaelkpvtlrtetwvttqaPKTSQRTRRPRPKTKTTPSPEVPQTK 1016
Cdd:PHA03247  2868 SRSP-AAKPAAPARPPVRRLARPAVSRSTESFALP-------------------PDQPERPPQPQAPPPPQPQPQPPPPP 2927
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1017 LVPSTDLEPGtlRTEAPktmvvttvLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTSSPEVPQNKSvSVTGFEPVVHST 1096
Cdd:PHA03247  2928 QPQPPPPPPP--RPQPP--------LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP-SREAPASSTPPL 2996
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1097 DAPGTTFVSDVLESVTLSTESPKETIAPAKTdyvyptakapLWPEEPKTEVVESITYVSEPPETTLET-SPLPSQSiTLP 1175
Cdd:PHA03247  2997 TGHSLSRVSSWASSLALHEETDPPPVSLKQT----------LWPPDDTEDSDADSLFDSDSERSDLEAlDPLPPEP-HDP 3065
                          570       580
                   ....*....|....*....|....*..
gi 1034632399 1176 SPDEPQTePAPKQTPRAPPKPKTSPRP 1202
Cdd:PHA03247  3066 FAHEPDP-ATPEAGARESPSSQFGPPP 3091
PHA03247 PHA03247
large tegument protein UL36; Provisional
382-800 1.00e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 1.00e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  382 PLSTLAPKSLPEFPEAKTPFPFE-KPRGTLASSEKPWIVPTAKISED----SKVLQPQTATYD----VFSSPTTSDEPEI 452
Cdd:PHA03247  2608 PRGPAPPSPLPPDTHAPDPPPPSpSPAANEPDPHPPPTVPPPERPRDdpapGRVSRPRRARRLgraaQASSPPQRPRRRA 2687
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  453 SDSYTATSDRILDsiPPKTSRTLE-QPRATLAPSETPFVPQKL-EIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRT 530
Cdd:PHA03247  2688 ARPTVGSLTSLAD--PPPPPPTPEpAPHALVSATPLPPGPAAArQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  531 KPERTTSAGTITPkiskspePTWTTPAPGKTQFISLKPKIPLSPEvthtkPAPEPQTLLPSQSTigpETPGTKPSTTLAP 610
Cdd:PHA03247  2766 PPAPAPPAAPAAG-------PPRRLTRPAVASLSESRESLPSPWD-----PADPPAAVLAPAAA---LPPAASPAGPLPP 2830
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  611 RKTKRPGRRPRPRPRPKTTPSPE--VPKSKPALEPATIQPEPLVPTTASKPSER-------PKTTHRPDAPQIQPGSKPP 681
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpavSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  682 KQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPsmtIVPTTDIEPVtvrteatvttlapktsqrtrtrrprpkhkttPRPE 761
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP---LAPTTDPAGA-------------------------------GEPS 2956
                          410       420       430
                   ....*....|....*....|....*....|....*....
gi 1034632399  762 TLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHP 800
Cdd:PHA03247  2957 GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1532-1623 1.18e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 56.74  E-value: 1.18e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1532 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1609
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1034632399 1610 LGEGPVSNTVAFST 1623
Cdd:cd00063     80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1533-1613 1.62e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.31  E-value: 1.62e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  1533 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1610
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1034632399  1611 GEG 1613
Cdd:smart00060   81 GEG 83
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1133-1385 1.11e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.47  E-value: 1.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1133 TAKAPLW--PEEPKTEVVESITYVSEPPETTLETSPLPSqsitlPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPV 1210
Cdd:PRK10263   327 TTATQSWaaPVEPVTQTPPVASVDVPPAQPTVAWQPVPG-----PQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1211 PkvPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPVLQPVTfrfEPPKTTIAPLETRGiPFIPMISPSPsq 1290
Cdd:PRK10263   402 Q--PQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQA---EEQQSTFAPQSTYQ-TEQTYQQPAA-- 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1291 eelQTTLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSD---------KPHIRPVLNRTTTRPTRPKPS-- 1359
Cdd:PRK10263   474 ---QEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRARereqlaawyQPIPEPVKEPEPIKSSLKAPSva 550
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1034632399 1360 ------GMPSGNGVGTGVKQAPRPSGADRNVS 1385
Cdd:PRK10263   551 avppveAAAAVSPLASGVKKATLATGAAATVA 582
fn3 pfam00041
Fibronectin type III domain;
1533-1616 4.64e-05

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 43.56  E-value: 4.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1533 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1609
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1034632399 1610 LGEGPVS 1616
Cdd:pfam00041   79 GGEGPPS 85
fn3 pfam00041
Fibronectin type III domain;
123-202 6.21e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 6.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1034632399  200 GVK 202
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
124-202 6.44e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 6.44e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399   124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 1034632399   200 GVK 202
Cdd:smart00060   73 RVR 75
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1349-1628 6.99e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.22  E-value: 6.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1349 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1428
Cdd:COG3401     48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1429 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1505
Cdd:COG3401    128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1506 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1584
Cdd:COG3401    208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1034632399 1585 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1628
Cdd:COG3401    283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1527-1671 8.80e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.22  E-value: 8.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1527 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1604
Cdd:COG3401    324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034632399 1605 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1671
Cdd:COG3401    398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
123-202 3.08e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.63  E-value: 3.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                   ..
gi 1034632399  201 VK 202
Cdd:cd00063     74 VR 75
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
565-1009 5.09e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 5.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  565 SLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEP- 643
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTq 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  644 ATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTksvsePVPFETEAPSMtivpttdi 723
Cdd:pfam03154  223 STAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPM-----PHSLQTGPSHM-------- 289
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  724 ePVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETlqtkldfgpiTPGTSSAPTTTTKRTRRPHPKPKTTPHPEVP 803
Cdd:pfam03154  290 -QHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHT----------PPSQSQLQSQQPPREQPLPPAPLSMPHIKPP 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  804 QtklvpatilepvlrteasgTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIFEPVSPIK-EAPGTTFVPVTDLEP 882
Cdd:pfam03154  359 P-------------------TTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLStHHPPSAHPPPLQLMP 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  883 VTFRTEIPATT--LATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTlRPEASTTLASKTSQRTRRPRLRTKTTP 960
Cdd:pfam03154  420 QSQQLPPPPAQppVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGG-PPPITPPSGPPTSTSSAMPGIQPPSSA 498
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034632399  961 RPEAPESKP-VPTAELKPVTLRTETWVTTQAPKT---SQRTRRPRPKTKTTPS 1009
Cdd:pfam03154  499 SVSSSGPVPaAVSCPLPPVQIKEEALDEAEEPESpppPPRSPSPEPTVVNTPS 551
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
635-1202 7.82e-15

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 7.82e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  635 PKSKPALEPATiqPEPLVPTT--ASKPSERPKTT--HRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFET 710
Cdd:PHA03247  2553 PPLPPAAPPAA--PDRSVPPPrpAPRPSEPAVTSraRRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  711 EAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSA--PTTTTKRTR 788
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdpPPPPPTPEP 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  789 RPHPKPKTTPHPEVPQT--KLVPATILEPVLRTEASGTT--------AAPKVPQRTHRPHPK--PKTTLSPEELQTELVP 856
Cdd:PHA03247  2711 APHALVSATPLPPGPAAarQASPALPAAPAPPAVPAGPAtpggparpARPPTTAGPPAPAPPaaPAAGPPRRLTRPAVAS 2790
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  857 ATIFEPVSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATvlePVTLRPE 936
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG---DVRRRPP 2867
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  937 ASTTlASKTSQRTRRPRLRTKTTPRPEAPESKPVPtaelkpvtlrtetwvttqaPKTSQRTRRPRPKTKTTPSPEVPQTK 1016
Cdd:PHA03247  2868 SRSP-AAKPAAPARPPVRRLARPAVSRSTESFALP-------------------PDQPERPPQPQAPPPPQPQPQPPPPP 2927
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1017 LVPSTDLEPGtlRTEAPktmvvttvLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTSSPEVPQNKSvSVTGFEPVVHST 1096
Cdd:PHA03247  2928 QPQPPPPPPP--RPQPP--------LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP-SREAPASSTPPL 2996
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1097 DAPGTTFVSDVLESVTLSTESPKETIAPAKTdyvyptakapLWPEEPKTEVVESITYVSEPPETTLET-SPLPSQSiTLP 1175
Cdd:PHA03247  2997 TGHSLSRVSSWASSLALHEETDPPPVSLKQT----------LWPPDDTEDSDADSLFDSDSERSDLEAlDPLPPEP-HDP 3065
                          570       580
                   ....*....|....*....|....*..
gi 1034632399 1176 SPDEPQTePAPKQTPRAPPKPKTSPRP 1202
Cdd:PHA03247  3066 FAHEPDP-ATPEAGARESPSSQFGPPP 3091
PHA03247 PHA03247
large tegument protein UL36; Provisional
778-1250 9.10e-15

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 9.10e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  778 SAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVP-ATILEPV-------------LRTEASG-------TTAAPKVPQR--- 833
Cdd:PHA03247  2490 FAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPdEPVGEPVhprmltwirgleeLASDDAGdpppplpPAAPPAAPDRsvp 2569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  834 THRPHPKPKTTLSPEELQTELVP---ATIFEPVSPIKEAPGTTfvPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPK 910
Cdd:PHA03247  2570 PPRPAPRPSEPAVTSRARRPDAPpqsARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  911 TtpsPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLR---------TKTTPRPEAPESKPVPTAELKPVTLR 981
Cdd:PHA03247  2648 P---PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARptvgsltslADPPPPPPTPEPAPHALVSATPLPPG 2724
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  982 TETWVTTQAPKTSQRTRRPRPKTKTTP----SPEVPQTKLVPSTDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPE-TTL 1056
Cdd:PHA03247  2725 PAAARQASPALPAAPAPPAVPAGPATPggpaRPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSpWDP 2804
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1057 APKTQRTRRPRPRPKTTSSPEVPQNKSVSVTGFEPVVHSTD-APGTTFVSDVLESVTLSTESPKETIAPAKTDYVYPTAK 1135
Cdd:PHA03247  2805 ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPpPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1136 APLWPEEPKTEVVESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPK--- 1212
Cdd:PHA03247  2885 RLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQpwl 2964
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|.
gi 1034632399 1213 ---VPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVS 1250
Cdd:PHA03247  2965 galVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005
PHA03247 PHA03247
large tegument protein UL36; Provisional
478-980 1.71e-12

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.71e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  478 PRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPE--RTTSAGTITPKISKSPEPTWTT 555
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGdpRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  556 PAPGKTQfiSLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVP 635
Cdd:PHA03247  2631 PSPAANE--PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  636 KSKPalePATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPP--KQLLPKPQTTAEPD--MPPTKSVSEPVPFETE 711
Cdd:PHA03247  2709 EPAP---HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPggPARPARPPTTAGPPapAPPAAPAAGPPRRLTR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  712 APSMTIVPTTDIEP-----------VTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDfGPITPGtssAP 780
Cdd:PHA03247  2786 PAVASLSESRESLPspwdpadppaaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-GSVAPG---GD 2861
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  781 TTTTKRTRRPHPKPKTTPHPEV-----PQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEE------ 849
Cdd:PHA03247  2862 VRRRPPSRSPAAKPAAPARPPVrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprpqp 2941
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  850 -LQTELVPATIFEPvSPIKEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAP-----ETKPV 923
Cdd:PHA03247  2942 pLAPTTDPAGAGEP-SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSlalheETDPP 3020
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034632399  924 PATVLEpvTLRPEASTTLASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PHA03247  3021 PVSLKQ--TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATP 3075
PHA03247 PHA03247
large tegument protein UL36; Provisional
382-800 1.00e-09

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 1.00e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  382 PLSTLAPKSLPEFPEAKTPFPFE-KPRGTLASSEKPWIVPTAKISED----SKVLQPQTATYD----VFSSPTTSDEPEI 452
Cdd:PHA03247  2608 PRGPAPPSPLPPDTHAPDPPPPSpSPAANEPDPHPPPTVPPPERPRDdpapGRVSRPRRARRLgraaQASSPPQRPRRRA 2687
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  453 SDSYTATSDRILDsiPPKTSRTLE-QPRATLAPSETPFVPQKL-EIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRT 530
Cdd:PHA03247  2688 ARPTVGSLTSLAD--PPPPPPTPEpAPHALVSATPLPPGPAAArQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  531 KPERTTSAGTITPkiskspePTWTTPAPGKTQFISLKPKIPLSPEvthtkPAPEPQTLLPSQSTigpETPGTKPSTTLAP 610
Cdd:PHA03247  2766 PPAPAPPAAPAAG-------PPRRLTRPAVASLSESRESLPSPWD-----PADPPAAVLAPAAA---LPPAASPAGPLPP 2830
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  611 RKTKRPGRRPRPRPRPKTTPSPE--VPKSKPALEPATIQPEPLVPTTASKPSER-------PKTTHRPDAPQIQPGSKPP 681
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpavSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  682 KQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPsmtIVPTTDIEPVtvrteatvttlapktsqrtrtrrprpkhkttPRPE 761
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP---LAPTTDPAGA-------------------------------GEPS 2956
                          410       420       430
                   ....*....|....*....|....*....|....*....
gi 1034632399  762 TLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHP 800
Cdd:PHA03247  2957 GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
1532-1623 1.18e-09

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 56.74  E-value: 1.18e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1532 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQMTNQTFSTVENLKPNTSYEFQVKPKNP 1609
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1034632399 1610 LGEGPVSNTVAFST 1623
Cdd:cd00063     80 GGESPPSESVTVTT 93
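
Each hit in this listing carries a summary line with the same four fields (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch, assuming that layout holds for every hit, of how such a line could be parsed and how the fraction of the domain model covered by an alignment could be computed. The names `parse_hit_summary` and `model_coverage` and the regular expression are hypothetical and written for illustration; they are not part of any NCBI tool.

```python
import re

# Assumed layout of a hit summary line, as it appears in this listing.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Parse one 'Pssm-ID: ...' summary line into a dict of typed fields."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not a hit summary line: %r" % line)
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],  # e.g. "Multi-domain"; None if the bracket is absent
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def model_coverage(model_from, model_to, cd_length):
    """Fraction of the domain model spanned by the aligned model positions."""
    return (model_to - model_from + 1) / cd_length


# Summary line of the cd00063 hit above; its alignment spans model positions 2-93.
line = "Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 56.74  E-value: 1.18e-09"
hit = parse_hit_summary(line)
coverage = model_coverage(2, 93, hit["cd_length"])
print(hit)
print(round(coverage, 3))
```

The coverage here uses the model coordinates printed at the start and end of the `Cdd:` alignment rows, not the query interval, so gaps inside the alignment are not subtracted.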
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
1533-1613 1.62e-07

Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.31  E-value: 1.62e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  1533 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQMTNQTFS-TVENLKPNTSYEFQVKPKNPL 1610
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1034632399  1611 GEG 1613
Cdd:smart00060   81 GEG 83
PRK10263 PRK10263
DNA translocase FtsK; Provisional
701-1292 1.86e-06

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 53.17  E-value: 1.86e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  701 SVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTTLAPKTSQRTRtrrprpKHKTTPRPETLQTKLDFGPitpgTSSAP 780
Cdd:PRK10263   315 PITEPVAVAAAATTATQSWAAPVEPVT--QTPPVASVDVPPAQPTV------AWQPVPGPQTGEPVIAPAP----EGYPQ 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  781 TTTTKRTRRPHPKPKTTPHPEVPQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIF 860
Cdd:PRK10263   383 QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTY 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  861 EPVSPIKE--APGTTFVPVTDLEPVTFRTEIPATTlATKTSKRTRPPRPRPKTTPSPQ----APETKPVPATVLEPVTLR 934
Cdd:PRK10263   463 QTEQTYQQpaAQEPLYQQPQPVEQQPVVEPEPVVE-ETKPARPPLYYFEEVEEKRAREreqlAAWYQPIPEPVKEPEPIK 541
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  935 PEASTTLASKTSqrtrrprlrtkttPRPEAPESKPVpTAELKPVTLRTETWVTTQAPKTSQRTR-RPRPKTKTTPSPEVP 1013
Cdd:PRK10263   542 SSLKAPSVAAVP-------------PVEAAAAVSPL-ASGVKKATLATGAAATVAAPVFSLANSgGPRPQVKEGIGPQLP 607
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1014 QtklvpstdlepgtlrteaPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTS----------SPEVPQNKS 1083
Cdd:PRK10263   608 R------------------PKRIRVPTRRELASYGIKLPSQRAAEEKAREAQRNQYDSGDQynddeidamqQDELARQFA 669
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1084 VSVTGFEPVVHSTDAPGTTFVSDVLESVTLStespKETIAPAKTDYV--YPTAKAPLWPEEPKTEVVESItyVSEPPETT 1161
Cdd:PRK10263   670 QTQQQRYGEQYQHDVPVNAEDADAAAEAELA----RQFAQTQQQRYSgeQPAGANPFSLDDFEFSPMKAL--LDDGPHEP 743
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1162 LETsPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQ-----PVPKVPQRVTAKPKTSPSPEVSYTTPAP 1236
Cdd:PRK10263   744 LFT-PIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPqyqqpQQPVAPQPQYQQPQQPVAPQPQYQQPQQ 822
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034632399 1237 kdvllPHKPYPEVSQSEPVLQPvtfrfEPPKTTIAPLETRG------------IPFIPMISPSPSQEE 1292
Cdd:PRK10263   823 -----PVAPQPQYQQPQQPVAP-----QPQDTLLHPLLMRNgdsrplhkpttpLPSLDLLTPPPSEVE 880
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1133-1385 1.11e-05

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.47  E-value: 1.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1133 TAKAPLW--PEEPKTEVVESITYVSEPPETTLETSPLPSqsitlPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPV 1210
Cdd:PRK10263   327 TTATQSWaaPVEPVTQTPPVASVDVPPAQPTVAWQPVPG-----PQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1211 PkvPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPVLQPVTfrfEPPKTTIAPLETRGiPFIPMISPSPsq 1290
Cdd:PRK10263   402 Q--PQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQA---EEQQSTFAPQSTYQ-TEQTYQQPAA-- 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1291 eelQTTLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSD---------KPHIRPVLNRTTTRPTRPKPS-- 1359
Cdd:PRK10263   474 ---QEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRARereqlaawyQPIPEPVKEPEPIKSSLKAPSva 550
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1034632399 1360 ------GMPSGNGVGTGVKQAPRPSGADRNVS 1385
Cdd:PRK10263   551 avppveAAAAVSPLASGVKKATLATGAAATVA 582
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1098-1359 2.11e-05

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 49.69  E-value: 2.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1098 APGTTFVSDVLESVTLSTESPKETIAPAKTDYVYPTAKaplwPEEPKTEVVESITYVSEPPEttletspLPSQSitlPSP 1177
Cdd:PTZ00449   523 APGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKK----PGPAKEHKPSKIPTLSKKPE-------FPKDP---KHP 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1178 DEPQtEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTA--KPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPv 1255
Cdd:PTZ00449   589 KDPE-EPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESpkSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPP- 666
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1256 lqpvtfrFEPPKTTiapletrgiPFIPMISPSPSQEELQTTLEETDQSTQEPFTTKIPRTTELAKTTQAPhrfYTTVRPR 1335
Cdd:PTZ00449   667 -------FDPKFKE---------KFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRP---LPPKLPR 727
                          250       260
                   ....*....|....*....|....
gi 1034632399 1336 TSDKPHIRPvlnrttTRPTRPKPS 1359
Cdd:PTZ00449   728 DEEFPFEPI------GDPDAEQPD 745
PHA03247 PHA03247
large tegument protein UL36; Provisional
1136-1497 2.65e-05

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 2.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1136 APLWPEEPKTEVVESityvsEPPETTLETSPLPSQSitlPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVP---K 1212
Cdd:PHA03247  2591 APPQSARPRAPVDDR-----GDPRGPAPPSPLPPDT---HAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvS 2662
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1213 VPQRVTAK--PKTSPSPEVSYTTPAPKDVLLP---------------HKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLeT 1275
Cdd:PHA03247  2663 RPRRARRLgrAAQASSPPQRPRRRAARPTVGSltsladpppppptpePAPHALVSATPLPPGPAAARQASPALPAAPA-P 2741
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1276 RGIPFIPMISPSPSQEELQTTLEETDQST--QEPFTTKIPRTTELAKTTQAPHRFYTTVRPRTSDKPHIRPVLNRTTTRP 1353
Cdd:PHA03247  2742 PAVPAGPATPGGPARPARPPTTAGPPAPAppAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1354 TRPKPSGMPSGNGVGTGVKQAPRPSGADRNV--SVDSTHPTKKPGTRRPplppRPTHPRRKPLPPNNVTGKPGSAGIISS 1431
Cdd:PHA03247  2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRPPSRS----PAAKPAAPARPPVRRLARPAVSRSTES 2897
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034632399 1432 GPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVR 1497
Cdd:PHA03247  2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
915-1363 2.88e-05

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 49.30  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  915 PQAPETKPVPATVlePVTLRPEASTTLASKTSQRTRRPRlRTKTTPRPEAPEsKPVPTAELKPVTLRTETwVTTQAPKTS 994
Cdd:PTZ00449   511 PEGPEASGLPPKA--PGDKEGEEGEHEDSKESDEPKEGG-KPGETKEGEVGK-KPGPAKEHKPSKIPTLS-KKPEFPKDP 585
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  995 QRTRRPRpKTKTTPSPEVPQTKLVPSTDLEPGTLrteapktmvvttvlepdtfrtKFPETTLAPKTQRTRRPRPRPKTTS 1074
Cdd:PTZ00449   586 KHPKDPE-EPKKPKRPRSAQRPTRPKSPKLPELL---------------------DIPKSPKRPESPKSPKRPPPPQRPS 643
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1075 SPEVPQNKSVSVTGFEPvvhstdapgttfvsdvlesvtlstESPKETIAPAKTDYVYPT-AKAPLWPEEPKTEVVESITY 1153
Cdd:PTZ00449   644 SPERPEGPKIIKSPKPP------------------------KSPKPPFDPKFKEKFYDDyLDAAAKSKETKTTVVLDESF 699
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1154 VSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSP----------RPRIPQTQPVPKVPQRVTAKPKT 1223
Cdd:PTZ00449   700 ESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDiefftppeeeRTFFHETPADTPLPDILAEEFKE 779
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1224 spsPEVSYTTPAPKDVLL----PHKPYPEVSQSEPVLQPVTFRFEPPKTTIAPLETRgiPFIPMISPSPSQEELQTTLEE 1299
Cdd:PTZ00449   780 ---EDIHAETGEPDEAMKrpdsPSEHEDKPPGDHPSLPKKRHRLDGLALSTTDLESD--AGRIAKDASGKIVKLKRSKSF 854
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034632399 1300 TDQSTQEPFTTKIPRTTELA-----------KTTQAPHRFYTTVRPRTSDKPHIRPVLNRTTTRPTRPKPSGMPS 1363
Cdd:PTZ00449   855 DDLTTVEEAEEMGAEARKIVvdddgteaddeDTHPPEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPS 929
fn3 pfam00041
Fibronectin type III domain;
1533-1616 4.64e-05

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 43.56  E-value: 4.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1533 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQMTNQTFS-TVENLKPNTSYEFQVKPKNP 1609
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1034632399 1610 LGEGPVS 1616
Cdd:pfam00041   79 GGEGPPS 85
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
533-980 7.37e-05

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 47.76  E-value: 7.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  533 ERTTSAGTITPKISKSPEPTWTT-----PAPGKTQFISLKPKIPLSPEVTHTKPAP-EPQTLLPSQSTIGPETPGTKPST 606
Cdd:PTZ00449   533 EHEDSKESDEPKEGGKPGETKEGevgkkPGPAKEHKPSKIPTLSKKPEFPKDPKHPkDPEEPKKPKRPRSAQRPTRPKSP 612
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  607 TLaprktkrpgrrprprprpktTPSPEVPKSKPALEPATIQPEPLVPTtaskpseRPKTTHRPDAPQIQPGSKPPKQllP 686
Cdd:PTZ00449   613 KL--------------------PELLDIPKSPKRPESPKSPKRPPPPQ-------RPSSPERPEGPKIIKSPKPPKS--P 663
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  687 KPqttaepdmpptksvsepvPFEteaPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPrpetLQTK 766
Cdd:PTZ00449   664 KP------------------PFD---PKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTP----FTTP 718
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  767 LDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLVPATILEPVLRTEASGTTAAPKVPQRTHRPHPKPKTTLS 846
Cdd:PTZ00449   719 RPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS 798
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  847 PEELQTElvpATIFEPVSPIKEAPGTTF-VPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVpa 925
Cdd:PTZ00449   799 PSEHEDK---PPGDHPSLPKKRHRLDGLaLSTTDLESDAGRIAKDASGKIVKLKRSKSFDDLTTVEEAEEMGAEARKI-- 873
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034632399  926 tVLEPVTLRPEASTTLASKTSQ----RTRRPRLRTKTTPRPEAPESKPVPTAELKPVTL 980
Cdd:PTZ00449   874 -VVDDDGTEADDEDTHPPEEKHksevRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSII 931
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1159-1253 8.20e-05

Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 47.50  E-value: 8.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1159 ETTLETSPLPSQ-SITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPK 1237
Cdd:PRK14950   357 EALLVPVPAPQPaKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAI 436
                           90
                   ....*....|....*.
gi 1034632399 1238 DVLLPHKPYPEVSQSE 1253
Cdd:PRK14950   437 PVDEKPKYTPPAPPKE 452
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1113-1447 5.18e-04

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 5.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1113 LSTESPKETIAPAKTDYVYPTAKAPLWPE-EPKTEVV--ESITYVSEPPETTLETSPLPSQSITLPSPDEPQTEPAPKQT 1189
Cdd:PHA03307    44 VSDSAELAAVTVVAGAAACDRFEPPTGPPpGPGTEAPanESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1190 PRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAPKDVLLPHKPYPEVSQSEPVLQPVTFRFEPPKTT 1269
Cdd:PHA03307   124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAAS 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1270 IAPLETRGIPFIPMISPSPS---QEELQTTLEETDQSTQEPFTTKIPRTTELAKTTQAPHRFYTTVR------PRTSDKP 1340
Cdd:PHA03307   204 PRPPRRSSPISASASSPAPApgrSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWeasgwnGPSSRPG 283
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1341 HIRPvlnRTTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVT 1420
Cdd:PHA03307   284 PASS---SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPA 360
                          330       340
                   ....*....|....*....|....*..
gi 1034632399 1421 GKPGSAGIISSGPITTPPLRSTPRPTG 1447
Cdd:PHA03307   361 DPSSPRKRPRPSRAPSSPAASAGRPTR 387
fn3 pfam00041
Fibronectin type III domain;
123-202 6.21e-04

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 6.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  123 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPND-RFYTIRYREKDKEKKWIFQICPATET--IVENLKPNTVYEF 199
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1034632399  200 GVK 202
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
124-202 6.44e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 6.44e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399   124 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPNDRFYTIRYREKDKEKKWIFQICPA----TETIVENLKPNTVYEF 199
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 1034632399   200 GVK 202
Cdd:smart00060   73 RVR 75
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1349-1628 6.99e-04



Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.22  E-value: 6.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1349 TTTRPTRPKPSGMPSGNGVGTGVKQAPRPSGADRNVSVDSTHPTKKPGTRRPPLPPRPTHPRRKPLPPNNVTGKPGSAGI 1428
Cdd:COG3401     48 TKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATT 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1429 ISSGPITTPPLRSTPRPTGTPLERIETDIKQPTVPASGEELENITDFSSSPTRETDPLGKPRFKGPHVRYIQKPDNS--- 1505
Cdd:COG3401    128 ATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTyyy 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1506 -PCSITDSVKRFPKEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQMT 1584
Cdd:COG3401    208 rVAATDTGGESAPSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATV 282
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1034632399 1585 NQTFSTVENLKPNTSYEFQVKPKNPLG-EGPVSNTVAFSTESADP 1628
Cdd:COG3401    283 TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
PRK11633 PRK11633
cell division protein DedD; Provisional
1134-1237 8.06e-04



Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 43.07  E-value: 8.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1134 AKAPLWP---EEPKTEVVESITYV--SEPPETTLE-----TSPLPSQSITLPSPDEPQTEPAPKqtPRAPPKPKtsPRPR 1203
Cdd:PRK11633    39 AAIPLVPkpgDRDEPDMMPAATQAlpTQPPEGAAEavragDAAAPSLDPATVAPPNTPVEPEPA--PVEPPKPK--PVEK 114
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1034632399 1204 iPQTQPVPKVPQRVTAKPKTSPSPEVSyTTPAPK 1237
Cdd:PRK11633   115 -PKPKPKPQQKVEAPPAPKPEPKPVVE-EKAAPT 146
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1527-1671 8.80e-04



Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.22  E-value: 8.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1527 TSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqmtNQTFSTVENLKPNTSYEFQV 1604
Cdd:COG3401    324 LTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGLTPGTTYYYKV 397
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034632399 1605 KPKNPLG-EGPVSNTVAFSTESADPRVSEPVSAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1671
Cdd:COG3401    398 TAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
639-707 1.30e-03



Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.64  E-value: 1.30e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034632399  639 PALEPATIQPEPLVPTtASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVP 707
Cdd:PRK14950   362 PVPAPQPAKPTAAAPS-PVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
831-1213 1.38e-03



Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.52  E-value: 1.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  831 PQRTHRPHPKPKTTLSPEELQTELVPATIFEPVSPikEAPGTTFVPVTDLEPVTFRTEIPATTLATKTSKRTRPPRPRPK 910
Cdd:PTZ00449   563 PAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKP--KRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQ 640
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  911 TTPSPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTprpeapeskpVPTAELKPVTLRTETWVTTQA 990
Cdd:PTZ00449   641 RPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKET----------KTTVVLDESFESILKETLPET 710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  991 PKTSQRTRRPRPktkttpsPEVPQTKLVPSTDLEPGTlrteAPKTMVVTTVLEPDTFRTKFPETT----LAPKTQRTRRP 1066
Cdd:PTZ00449   711 PGTPFTTPRPLP-------PKLPRDEEFPFEPIGDPD----AEQPDDIEFFTPPEEERTFFHETPadtpLPDILAEEFKE 779
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1067 RPRPKTTSSPEVPQNKSVSVTGFEPVvHSTDAPGTTFVSDVLESVTLSTESPKETIAPAKTDyvyPTAKaPLWPEEPKTe 1146
Cdd:PTZ00449   780 EDIHAETGEPDEAMKRPDSPSEHEDK-PPGDHPSLPKKRHRLDGLALSTTDLESDAGRIAKD---ASGK-IVKLKRSKS- 853
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034632399 1147 vVESITYVSEPPETTLETSPLPSQSITLPSPDEpQTEPA------------PKQTPRAPPKPKTSPRPRIPQTQPVPKV 1213
Cdd:PTZ00449   854 -FDDLTTVEEAEEMGAEARKIVVDDDGTEADDE-DTHPPeekhksevrrrrPPKKPSKPKKPSKPKKPKKPDSAFIPSI 930
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
657-1028 1.82e-03



Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.14  E-value: 1.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  657 SKPSERPKTTHRPDAPQIQPGSKPPKqllpkPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTvrTEATVTT 736
Cdd:PTZ00449   537 SKESDEPKEGGKPGETKEGEVGKKPG-----PAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRS--AQRPTRP 609
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  737 LAPKTSQRTRTRRPRPKHKTTPRPETlqtkldfgPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEV-PQTK-LVPATILE 814
Cdd:PTZ00449   610 KSPKLPELLDIPKSPKRPESPKSPKR--------PPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFdPKFKeKFYDDYLD 681
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  815 PVLRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVP------ATIFEPV-SPIKEAPGTTFVPVTDLEPVTFRT 887
Cdd:PTZ00449   682 AAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPklprdeEFPFEPIgDPDAEQPDDIEFFTPPEEERTFFH 761
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  888 EIPATTLATKTSKRTRPPRPRPKTTPSPQAPETKPvpatvLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTPRpEAPES 967
Cdd:PTZ00449   762 ETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRP-----DSPSEHEDKPPGDHPSLPKKRHRLDGLALSTTDL-ESDAG 835
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  968 KPVPTAELKPVTLR-----------------------------------TETWVTTQAPKTSQRTRRPRPKTKTTPSPEV 1012
Cdd:PTZ00449   836 RIAKDASGKIVKLKrsksfddlttveeaeemgaearkivvdddgteaddEDTHPPEEKHKSEVRRRRPPKKPSKPKKPSK 915
                          410
                   ....*....|....*.
gi 1034632399 1013 PQTKLVPSTDLEPGTL 1028
Cdd:PTZ00449   916 PKKPKKPDSAFIPSII 931
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
890-1058 2.31e-03



Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 2.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  890 PATTLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTLRPEA-----STTLASKTSQRTRRPRLRTKTTPRPEA 964
Cdd:PRK12323   374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAaparrSPAPEALAAARQASARGPGGAPAPAPA 453
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  965 PESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPS-PEVPQTKLVPS-TDLEPGTLRTEAPKTMVVTTVL 1042
Cdd:PRK12323   454 PAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPwEELPPEFASPApAQPDAAPAGWVAESIPDPATAD 533
                          170
                   ....*....|....*.
gi 1034632399 1043 EPDTFRTKFPETTLAP 1058
Cdd:PRK12323   534 PDDAFETLAPAPAAAP 549
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1157-1234 3.01e-03



Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.46  E-value: 3.01e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034632399 1157 PPETTLETSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQRVTAKPKTSPSPEVSYTTP 1234
Cdd:PRK14971   391 QPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGP 468
PHA03247 PHA03247
large tegument protein UL36; Provisional
264-587 3.05e-03



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  264 DSAKSPEKAPlggvilvHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVP 343
Cdd:PHA03247  2703 PPPPTPEPAP-------HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  344 RSTKPTTSSALDVSETTLVL----SKRTPETLQTILIPQFELPLSTLAPKSLPEFPEAKTPFPFEKPRGTLASSEKP--W 417
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESReslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggS 2855
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  418 IVPTAKISEDSKVLQPQTatydvfsSPTTSDEPEISdsytatsdRILDSIPPKTSRTLEQPRATLAPSETPFVPQKleif 497
Cdd:PHA03247  2856 VAPGGDVRRRPPSRSPAA-------KPAAPARPPVR--------RLARPAVSRSTESFALPPDQPERPPQPQAPPP---- 2916
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  498 tsPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKperTTSAGTITPKIsksPEPTWTTPAPGKTQFISLKpkiplSPEVT 577
Cdd:PHA03247  2917 --PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD---PAGAGEPSGAV---PQPWLGALVPGRVAVPRFR-----VPQPA 2983
                          330
                   ....*....|
gi 1034632399  578 HTKPAPEPQT 587
Cdd:PHA03247  2984 PSREAPASST 2993
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
123-202 3.08e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.63  E-value: 3.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  123 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpndrFYTIRYREKDKE--KKWIFQICPATETIVENLKPNTVYEFG 200
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                   ..
gi 1034632399  201 VK 202
Cdd:cd00063     74 VR 75
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
565-1009 5.09e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 5.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  565 SLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEP- 643
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTq 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  644 ATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTksvsePVPFETEAPSMtivpttdi 723
Cdd:pfam03154  223 STAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPM-----PHSLQTGPSHM-------- 289
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  724 ePVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETlqtkldfgpiTPGTSSAPTTTTKRTRRPHPKPKTTPHPEVP 803
Cdd:pfam03154  290 -QHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHT----------PPSQSQLQSQQPPREQPLPPAPLSMPHIKPP 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  804 QtklvpatilepvlrteasgTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIFEPVSPIK-EAPGTTFVPVTDLEP 882
Cdd:pfam03154  359 P-------------------TTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLStHHPPSAHPPPLQLMP 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  883 VTFRTEIPATT--LATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTlRPEASTTLASKTSQRTRRPRLRTKTTP 960
Cdd:pfam03154  420 QSQQLPPPPAQppVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGG-PPPITPPSGPPTSTSSAMPGIQPPSSA 498
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034632399  961 RPEAPESKP-VPTAELKPVTLRTETWVTTQAPKT---SQRTRRPRPKTKTTPS 1009
Cdd:pfam03154  499 SVSSSGPVPaAVSCPLPPVQIKEEALDEAEEPESpppPPRSPSPEPTVVNTPS 551
rne PRK10811
ribonuclease E; Reviewed
1164-1338 5.29e-03



Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 41.56  E-value: 5.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1164 TSPLPSQSITLPSPDEPQTEPAPKQTPRAPPKPKTSPRPRIPQTQPVPKVPQrVTAKPKTSPSPEVSYTTPAPKDVLLPH 1243
Cdd:PRK10811   848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPV-VVAEPQPEEVVVVETTHPEVIAAPVTE 926
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1244 KPYPEVSQSEPVLQPVTFRFEP---------PKTTIAPLETRGIPFIPMISPSPSQEELQTTLEETDQSTQEPFTTKIPR 1314
Cdd:PRK10811   927 QPQVITESDVAVAQEVAEHAEPvvepqdetaDIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVAPAQV 1006
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1034632399 1315 TTELAKT-------TQAPHRFYTTVRPRTSD 1338
Cdd:PRK10811  1007 PEATVEHnhatapmTRAPAPEYVPEAPRHSD 1037
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
1178-1258 5.84e-03



Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.47  E-value: 5.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1178 DEPQTEPAPKQTPRAPPKPKTSPRP-RIPQTQPVPKVPQRVTAKPKTSPSPEVSYT----TPAPKDVLLPHKPYPEVSQS 1252
Cdd:PRK14954   374 VRNDGGVAPSPAGSPDVKKKAPEPDlPQPDRHPGPAKPEAPGARPAELPSPASAPTpeqqPPVARSAPLPPSPQASAPRN 453

                   ....*.
gi 1034632399 1253 EPVLQP 1258
Cdd:PRK14954   454 VASGKP 459
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
568-794 7.52e-03



Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 7.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  568 PKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRktkrpgrrprprprpKTTPSPEVPKSKPALEPATIQ 647
Cdd:PRK12323   374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR---------------AVAAAPARRSPAPEALAAARQ 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  648 PEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVT 727
Cdd:PRK12323   439 ASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034632399  728 VRTEAtvttlapktsqrtrtrrprpkhKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKP 794
Cdd:PRK12323   519 AGWVA----------------------ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
955-1236 8.67e-03



Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 40.71  E-value: 8.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  955 RTKTTPRPEAPES--KPVPTAELKPVTLRTETwVTTQAPKTSQrTRRPRPKTKTTPSPEV----PQTKLVPSTDLEPGTL 1028
Cdd:pfam17823  113 RALAAAASSSPSSaaQSLPAAIAALPSEAFSA-PRAAACRANA-SAAPRAAIAAASAPHAaspaPRTAASSTTAASSTTA 190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1029 RTEAPKTMVVTTvlePDTFRTKFPETTLAPKTQRTRRPRPRPKTTSSPEVPQNKSVSVTGFEPVVHSTDAPGTTFVSDVL 1108
Cdd:pfam17823  191 ASSAPTTAASSA---PATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAA 267
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399 1109 ESVTLSTESPKeTIAPAKTDYVYPTAKAPLWPEEPKTEVVESITYVSEPPETTlETSPLPSQSITLPSPDEPQ------- 1181
Cdd:pfam17823  268 GTINMGDPHAR-RLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNT-AGEPTPSPSNTTLEPNTPKsvastnl 345
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034632399 1182 ----TEPAPKQTPRAPPKP--KTSPRPRI----PQTQPVPKVPQRVTAKPKTSPSPEVSYTTPAP 1236
Cdd:pfam17823  346 avvtTTKAQAKEPSASPVPvlHTSMIPEVeatsPTTQPSPLLPTQGAAGPGILLAPEQVATEATA 410
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
582-1021 9.86e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 40.67  E-value: 9.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  582 APEPQTLLPSQSTIGPETPGTK---PSTTLAPRKTKRPGRRPRPRPRPKTTpSPEVPKSKPALEPATIQPEPLVPTTASK 658
Cdd:pfam05109  424 APESTTTSPTLNTTGFAAPNTTtglPSSTHVPTNLTAPASTGPTVSTADVT-SPTPAGTTSGASPVTPSPSPRDNGTESK 502
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  659 PSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEP--DMPPTKSVSEPVPFETEAPSMTIVPTTDiepvtvrteATVTT 736
Cdd:pfam05109  503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTlgKTSPTSAVTTPTPNATSPTPAVTTPTPN---------ATIPT 573
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  737 LApKTSqrtrtrrprpkhkttprpetlqtkldfgPITPGTSSAPTTTTKRTRRPHPKPKTTPHPevpqtklVPATILEPV 816
Cdd:pfam05109  574 LG-KTS----------------------------PTSAVTTPTPNATSPTVGETSPQANTTNHT-------LGGTSSTPV 617
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  817 LRTEASGTTAAPKVPQRTHRPHPKPKTTLSPEELQTELVPATIFEPVSpikEAPGTTFVPVTDLEPVTFRTEIPATTLAT 896
Cdd:pfam05109  618 VTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPASTSTHHV 694
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034632399  897 KTSkRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTK------------TTPRPEA 964
Cdd:pfam05109  695 STS-SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTggkansttggkhTTGHGAR 773
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034632399  965 PESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVPQTKLVPST 1021
Cdd:pfam05109  774 TSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPT 830
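
The alignment blocks above pair a query row (gi 1034632399) with a subject row (Cdd:...). The following is a minimal sketch of how such a pair of rows could be compared column by column. It assumes, without confirmation from this page, that uppercase letters mark aligned residues, lowercase letters mark unaligned residues, and dashes mark gaps. The function name alignment_identity is hypothetical and is not part of any NCBI tool.

```python
def alignment_identity(query_row, subject_row):
    """Return (aligned_columns, identical_columns) for two equal-length rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        # Assumption: a column counts as aligned only when both characters
        # are uppercase letters; lowercase residues and '-' gaps are skipped.
        if q.isalpha() and s.isalpha() and q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return aligned, identical


# Last three columns of the pfam00041 hit shown earlier (query 200-202).
print(alignment_identity("GVK", "RVQ"))  # prints (3, 1)
```

Only the residue strings are passed in; the row labels and the start and end coordinates would need to be stripped first.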
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
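
Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search used an E-value threshold of 0.01. As a rough sketch, such a line could be parsed and checked against the threshold as shown below. The names parse_hit_summary and passes_threshold are hypothetical, and treating the threshold as inclusive is an assumption, since this page does not say whether the cutoff is strict.

```python
import re

# Pattern for summary lines as they appear on this page (layout assumed stable).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    # Assumption: a hit is reported when its E-value is at or below the threshold.
    return hit["e_value"] <= threshold


line = "Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 6.21e-04"
hit = parse_hit_summary(line)
print(hit["pssm_id"], hit["bit_score"], passes_threshold(hit))
```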

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.