NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907118335|ref|XP_036015887|]
View 

target of Nesh-SH3 isoform X25 [Mus musculus]

Protein Classification

fibronectin type III domain-containing protein( domain architecture ID 10440918)

fibronectin type III (FN3) domain-containing protein similar to human Target of Nesh-SH3 (Tarsh) and Drosophila melanogaster cytokine receptor (protein domeless)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
507-1031 3.56e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 78.44  E-value: 3.56e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247  2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247  2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  654 KYKTTQSPKIPHSK--------PDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 725
Cdd:PHA03247  2713 HALVSATPLPPGPAaarqaspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  726 VPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTTIVPITDLERVTDLETPvafrtEAPGTTLVPAvv 805
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTSAQPTAPPPPPGPPPPS-----LPLGGSVAPG-- 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  806 lEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREPvtlrtetwvtTKAPKTPKRTRRPRPKPQT 885
Cdd:PHA03247  2860 -GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER----------PPQPQAPPPPQPQPQPPPP 2926
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  886 TPTPETPLTKPVAATDLEPSALSTEV--PATVVLATALTP-VTLRTKAPKTTTLAPnvqRTRRPHPRPKTTASTGVSESK 962
Cdd:PHA03247  2927 PQPQPPPPPPPRPQPPLAPTTDPAGAgePSGAVPQPWLGAlVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSR 3003
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  963 sVSDDLELVAFSTESPQKTIAPRQTTSMPPKLK-----------TPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTS 1031
Cdd:PHA03247  3004 -VSSWASSLALHEETDPPPVSLKQTLWPPDDTEdsdadslfdsdSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARES 3082
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1302-1393 2.67e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.66  E-value: 2.67e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1302 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1379
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118335 1380 LGEGPASNTVAFST 1393
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
991-1305 1.94e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 1.94e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  991 PPKLKTPHSRMPAKEPVPKEPlhttSKPKMPPSPEVADTTSAPLETRGIPLIP-VISPRPSQEELqtAMEETDQSTQELF 1069
Cdd:PHA03247  2483 PAEARFPFAAGAAPDPGGGGP----PDPDAPPAPSRLAPAILPDEPVGEPVHPrMLTWIRGLEEL--ASDDAGDPPPPLP 2556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1070 TTKIPRTTELAKTTqaphrlhtapvrPRIPGRPHGrPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVD 1149
Cdd:PHA03247  2557 PAAPPAAPDRSVPP------------PRPAPRPSE-PAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPD 2620
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1150 SHATRKPGSVSGTRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKE 1229
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD 2700
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907118335 1230 PTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1305
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
fn3 pfam00041
Fibronectin type III domain;
116-195 1.76e-04

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118335  193 GVK 195
Cdd:pfam00041   72 RVQ 74
PHA03247 super family cl33720
large tegument protein UL36; Provisional
307-575 4.55e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 4.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  464 RTAEqPRATLAPiealfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTK 543
Cdd:PHA03247  2893 RSTE-SFALPPD--------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 1907118335  544 KPGHHRLRRPKTTRSP----EVPKSKPALE-PATVTP 575
Cdd:PHA03247  2958 AVPQPWLGALVPGRVAvprfRVPQPAPSREaPASSTP 2994
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
507-1031 3.56e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 78.44  E-value: 3.56e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247  2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247  2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  654 KYKTTQSPKIPHSK--------PDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 725
Cdd:PHA03247  2713 HALVSATPLPPGPAaarqaspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  726 VPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTTIVPITDLERVTDLETPvafrtEAPGTTLVPAvv 805
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTSAQPTAPPPPPGPPPPS-----LPLGGSVAPG-- 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  806 lEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREPvtlrtetwvtTKAPKTPKRTRRPRPKPQT 885
Cdd:PHA03247  2860 -GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER----------PPQPQAPPPPQPQPQPPPP 2926
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  886 TPTPETPLTKPVAATDLEPSALSTEV--PATVVLATALTP-VTLRTKAPKTTTLAPnvqRTRRPHPRPKTTASTGVSESK 962
Cdd:PHA03247  2927 PQPQPPPPPPPRPQPPLAPTTDPAGAgePSGAVPQPWLGAlVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSR 3003
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  963 sVSDDLELVAFSTESPQKTIAPRQTTSMPPKLK-----------TPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTS 1031
Cdd:PHA03247  3004 -VSSWASSLALHEETDPPPVSLKQTLWPPDDTEdsdadslfdsdSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARES 3082
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1302-1393 2.67e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.66  E-value: 2.67e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1302 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1379
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118335 1380 LGEGPASNTVAFST 1393
Cdd:cd00063     80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1303-1383 8.63e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 8.63e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  1303 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1380
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1907118335  1381 GEG 1383
Cdd:smart00060   81 GEG 83
fn3 pfam00041
Fibronectin type III domain;
1303-1386 1.07e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.18  E-value: 1.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1303 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1379
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1907118335 1380 LGEGPAS 1386
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
991-1305 1.94e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 1.94e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  991 PPKLKTPHSRMPAKEPVPKEPlhttSKPKMPPSPEVADTTSAPLETRGIPLIP-VISPRPSQEELqtAMEETDQSTQELF 1069
Cdd:PHA03247  2483 PAEARFPFAAGAAPDPGGGGP----PDPDAPPAPSRLAPAILPDEPVGEPVHPrMLTWIRGLEEL--ASDDAGDPPPPLP 2556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1070 TTKIPRTTELAKTTqaphrlhtapvrPRIPGRPHGrPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVD 1149
Cdd:PHA03247  2557 PAAPPAAPDRSVPP------------PRPAPRPSE-PAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPD 2620
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1150 SHATRKPGSVSGTRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKE 1229
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD 2700
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907118335 1230 PTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1305
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
384-757 3.43e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.61  E-value: 3.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154  251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154  327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSEPP 678
Cdd:pfam03154  407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118335  679 LASTTKKVRRPRPKPQTTPHPEVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 757
Cdd:pfam03154  484 STSSAMPGIQPPSSASVSSSGPVPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-702 6.68e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.46  E-value: 6.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839   286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839   365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907118335  648 THRQRTKYKTTQSPKIPHSKPDLGPitsEPPLASTTKKVRRPRPKPQTTPHPEVP 702
Cdd:NF033839   431 VKPQPEKPKPEVKPQPEKPKPEVKP---QPETPKPEVKPQPEKPKPEVKPQPEKP 482
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1287-1398 1.17e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1287 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1366
Cdd:COG3401    220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907118335 1367 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1398
Cdd:COG3401    295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
fn3 pfam00041
Fibronectin type III domain;
116-195 1.76e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118335  193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.53e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.53e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335   114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 1907118335   190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 2.64e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 2.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                   ....
gi 1907118335  192 FGVK 195
Cdd:cd00063     72 FRVR 75
PHA03247 PHA03247
large tegument protein UL36; Provisional
307-575 4.55e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 4.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  464 RTAEqPRATLAPiealfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTK 543
Cdd:PHA03247  2893 RSTE-SFALPPD--------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 1907118335  544 KPGHHRLRRPKTTRSP----EVPKSKPALE-PATVTP 575
Cdd:PHA03247  2958 AVPQPWLGALVPGRVAvprfRVPQPAPSREaPASSTP 2994
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-769 8.64e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 40.80  E-value: 8.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665    208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665    288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  565 KPALEPATVTPEILV-PKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaiVTTTDIEPVITRTKASVTTLAPKP 643
Cdd:COG5665    364 DLATPVSPTPPETSVdKKVSPDSATSSTKSEKEGGTASSPMPPNIAIGAKDD----VDATDPSQEAKEYTKNAPMTPEAD 439
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  644 PRPRThrqrtkykTTQSPKIPHSKPDLGPItsepplaSTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT 723
Cdd:COG5665    440 SAPES--------SVRTEASPSAGSDLEPE-------NTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDS 504
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907118335  724 TLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 769
Cdd:COG5665    505 TPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
507-1031 3.56e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 78.44  E-value: 3.56e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247  2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247  2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  654 KYKTTQSPKIPHSK--------PDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 725
Cdd:PHA03247  2713 HALVSATPLPPGPAaarqaspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  726 VPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTTIVPITDLERVTDLETPvafrtEAPGTTLVPAvv 805
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTSAQPTAPPPPPGPPPPS-----LPLGGSVAPG-- 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  806 lEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREPvtlrtetwvtTKAPKTPKRTRRPRPKPQT 885
Cdd:PHA03247  2860 -GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER----------PPQPQAPPPPQPQPQPPPP 2926
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  886 TPTPETPLTKPVAATDLEPSALSTEV--PATVVLATALTP-VTLRTKAPKTTTLAPnvqRTRRPHPRPKTTASTGVSESK 962
Cdd:PHA03247  2927 PQPQPPPPPPPRPQPPLAPTTDPAGAgePSGAVPQPWLGAlVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSR 3003
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  963 sVSDDLELVAFSTESPQKTIAPRQTTSMPPKLK-----------TPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTS 1031
Cdd:PHA03247  3004 -VSSWASSLALHEETDPPPVSLKQTLWPPDDTEdsdadslfdsdSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARES 3082
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
509-775 9.30e-13

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 73.57  E-value: 9.30e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449   542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  580 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 659
Cdd:PTZ00449   620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  660 SPKIPHS-KPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 738
Cdd:PTZ00449   656 SPKPPKSpKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 1907118335  739 KPKTTRSPAASPTELVPTPVfEPVTPLKEDPVTTIVP 775
Cdd:PTZ00449   735 PIGDPDAEQPDDIEFFTPPE-EERTFFHETPADTPLP 770
PHA03247 PHA03247
large tegument protein UL36; Provisional
545-1052 8.89e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 8.89e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 611
Cdd:PHA03247  2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  612 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRP----RTHRQRTKYKTTqSPKIPHSKPDlGPITSEPPLASTTKKVR 687
Cdd:PHA03247  2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPavtsRARRPDAPPQSA-RPRAPVDDRG-DPRGPAPPSPLPPDTHA 2623
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  688 RPRPKPQTTPHP-EVPHTILVPATSLEPFIITEAPGTTLVPK---LPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVT 763
Cdd:PHA03247  2624 PDPPPPSPSPAAnEPDPHPPPTVPPPERPRDDPAPGRVSRPRrarRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  764 PLK--EDPVTTIVPITDLERVtdletPVAFRTEAPGTTLVPAVvlepvtlrPEVQVTTLAPQKTQKKHRPSPKPkpvpsp 841
Cdd:PHA03247  2704 PPPtpEPAPHALVSATPLPPG-----PAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTA------ 2764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  842 evTESKPVLPRVREPVTLRTETwVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPSALSTEVPATVVLATAL 921
Cdd:PHA03247  2765 --GPPAPAPPAAPAAGPPRRLT-RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  922 TPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGVSESKSVSDDLELVAFSTES---PQKTIAPRQTTSMPPKLKTPH 998
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalPPDQPERPPQPQAPPPPQPQP 2921
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118335  999 SRMPAKEPVPKEPlhTTSKPKMPPSPEvADTTSAPLETRGIP------LIP---------VISPRPSQE 1052
Cdd:PHA03247  2922 QPPPPPQPQPPPP--PPPRPQPPLAPT-TDPAGAGEPSGAVPqpwlgaLVPgrvavprfrVPQPAPSRE 2987
PHA03247 PHA03247
large tegument protein UL36; Provisional
448-775 2.40e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 65.73  E-value: 2.40e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  448 TATRDPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 527
Cdd:PHA03247  2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  528 ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKP-ALEPATVTPEILVPkivpkPPQKPKATRRPEVPQVKPAHE 606
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPaAALPPAASPAGPLP-----PPTSAQPTAPPPPPGPPPPSL 2850
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  607 PvTFGSEAPAlaivtttdiEPVITRTKASVTTLAP-KPPRPRTHRqrtkykttqspkiphskpdlgpiTSEPPLASTTKK 685
Cdd:PHA03247  2851 P-LGGSVAPG---------GDVRRRPPSRSPAAKPaAPARPPVRR-----------------------LARPAVSRSTES 2897
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  686 VRRPRPKPQTTPHPEVPHTILVPATSLEPfiiteaPGTTLVPKLPQQPDYPhPKPKTTRSPAASPTELVPTPVFEPVTPL 765
Cdd:PHA03247  2898 FALPPDQPERPPQPQAPPPPQPQPQPPPP------PQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQPWLGALVPG 2970
                          330
                   ....*....|
gi 1907118335  766 KEDPVTTIVP 775
Cdd:PHA03247  2971 RVAVPRFRVP 2980
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1302-1393 2.67e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.66  E-value: 2.67e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1302 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1379
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118335 1380 LGEGPASNTVAFST 1393
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
659-1248 3.36e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.26  E-value: 3.36e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  659 QSPKIPHSKPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTlvPKLPQQPdyphp 738
Cdd:PHA03247  2487 RFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPP--PPLPPAA----- 2559
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  739 kpkttrsPAASPTELVPTPVFEPVTPlkEDPVTTIVPITDL-ERVTDLETPVAFRTEAPGTTlvPAVVLEPVTLRPEVQV 817
Cdd:PHA03247  2560 -------PPAAPDRSVPPPRPAPRPS--EPAVTSRARRPDApPQSARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPP 2628
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  818 TTLAPQKTQKkhrPSPKPKPVPSPEVTESKPVLPRVREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPV 897
Cdd:PHA03247  2629 PSPSPAANEP---DPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP 2705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  898 AATDLEPSALSTEVPATVVLATA--LTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGVSESKSVSddlelvafST 975
Cdd:PHA03247  2706 PTPEPAPHALVSATPLPPGPAAArqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP--------AA 2777
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  976 ESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTSApletrgIPLIPVISPRPSQEELQ 1055
Cdd:PHA03247  2778 GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA------QPTAPPPPPGPPPPSLP 2851
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1056 TameETDQSTQELFTTKIPrttelaktTQAPHRLHTAPVRPRIpgRPHGRPALNKTTTrpdktkprgtshkngvgtgtKQ 1135
Cdd:PHA03247  2852 L---GGSVAPGGDVRRRPP--------SRSPAAKPAAPARPPV--RRLARPAVSRSTE--------------------SF 2898
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1136 APKPPSPGRnasvdshaTRKPGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHP 1215
Cdd:PHA03247  2899 ALPPDQPER--------PPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVP 2969
                          570       580       590
                   ....*....|....*....|....*....|...
gi 1907118335 1216 IGTATARPGAEQKEPTAPASEEEFGTTTDFSSS 1248
Cdd:PHA03247  2970 GRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1303-1383 8.63e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 8.63e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  1303 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1380
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1907118335  1381 GEG 1383
Cdd:smart00060   81 GEG 83
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
381-766 1.08e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 57.01  E-value: 1.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  381 KRFPEFPEAKTAFPLEKPRGSWASSEEPwVVPGAKTSEdSRVVQPQTATYDVISSSTTSDETEIEI---------HTATR 451
Cdd:PTZ00449   494 KKLAPIEEEDSDKHDEPPEGPEASGLPP-KAPGDKEGE-EGEHEDSKESDEPKEGGKPGETKEGEVgkkpgpakeHKPSK 571
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  452 DPILDSVP-----PKTSRTAEQPRATLAPIEAlfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpAPE 526
Cdd:PTZ00449   572 IPTLSKKPefpkdPKHPKDPEEPKKPKRPRSA-----------QRPTRPKSPKLPELLDIPKSPKRPESPKSPK---RPP 637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  527 PETRPSAQttkaprktkkpghhrlRRPKTTRSPEVPKSKPAlepatvtpeilvpkivPKPPQKPKATRRPEVPQVKPAHE 606
Cdd:PTZ00449   638 PPQRPSSP----------------ERPEGPKIIKSPKPPKS----------------PKPPFDPKFKEKFYDDYLDAAAK 685
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  607 PVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRThrqrtkykttqsPKIPHSKPdlgpitSEPPLASTTKK 685
Cdd:PTZ00449   686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRD------------EEFPFEPI------GDPDAEQPDDI 747
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  686 VRRPRPKPQTTPHPEvphtilVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVTP 764
Cdd:PTZ00449   748 EFFTPPEEERTFFHE------TPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSLP 814

                   ..
gi 1907118335  765 LK 766
Cdd:PTZ00449   815 KK 816
fn3 pfam00041
Fibronectin type III domain;
1303-1386 1.07e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.18  E-value: 1.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1303 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1379
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1907118335 1380 LGEGPAS 1386
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
991-1305 1.94e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 1.94e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  991 PPKLKTPHSRMPAKEPVPKEPlhttSKPKMPPSPEVADTTSAPLETRGIPLIP-VISPRPSQEELqtAMEETDQSTQELF 1069
Cdd:PHA03247  2483 PAEARFPFAAGAAPDPGGGGP----PDPDAPPAPSRLAPAILPDEPVGEPVHPrMLTWIRGLEEL--ASDDAGDPPPPLP 2556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1070 TTKIPRTTELAKTTqaphrlhtapvrPRIPGRPHGrPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVD 1149
Cdd:PHA03247  2557 PAAPPAAPDRSVPP------------PRPAPRPSE-PAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPD 2620
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1150 SHATRKPGSVSGTRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKE 1229
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD 2700
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907118335 1230 PTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1305
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
384-757 3.43e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.61  E-value: 3.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154  251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154  327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSEPP 678
Cdd:pfam03154  407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118335  679 LASTTKKVRRPRPKPQTTPHPEVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 757
Cdd:pfam03154  484 STSSAMPGIQPPSSASVSSSGPVPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-702 6.68e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.46  E-value: 6.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839   286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839   365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907118335  648 THRQRTKYKTTQSPKIPHSKPDLGPitsEPPLASTTKKVRRPRPKPQTTPHPEVP 702
Cdd:NF033839   431 VKPQPEKPKPEVKPQPEKPKPEVKP---QPETPKPEVKPQPEKPKPEVKPQPEKP 482
PHA03377 PHA03377
EBNA-3C; Provisional
517-711 9.26e-05

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 47.35  E-value: 9.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377   414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRT------KYKTTQSPKIPHSK 667
Cdd:PHA03377   489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGfqrsgrRQKRATPPKVSPSD 559
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907118335  668 --------PDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATS 711
Cdd:PHA03377   560 rgppkaspPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASG 611
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1287-1398 1.17e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1287 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1366
Cdd:COG3401    220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907118335 1367 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1398
Cdd:COG3401    295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1287-1441 1.57e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.15  E-value: 1.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1287 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1364
Cdd:COG3401    314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118335 1365 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1441
Cdd:COG3401    388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
fn3 pfam00041
Fibronectin type III domain;
116-195 1.76e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118335  193 GVK 195
Cdd:pfam00041   72 RVQ 74
PRK10263 PRK10263
DNA translocase FtsK; Provisional
420-746 1.96e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 1.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  420 SRVVQPQTATYDVISSSTTSDE---------TEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 489
Cdd:PRK10263   297 NRATQPEYDEYDPLLNGAPITEpvavaaaatTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  490 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:PRK10263   376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  568 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 637
Cdd:PRK10263   456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  638 tlAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSEPPLASTTKkvrrprpkpQTTPHPEVPHTILVPATSLepfii 717
Cdd:PRK10263   525 --AWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK---------KATLATGAAATVAAPVFSL----- 588
                          330       340
                   ....*....|....*....|....*....
gi 1907118335  718 teAPGTTLVPKLPQQPDYPHPKPKTTRSP 746
Cdd:PRK10263   589 --ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.53e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.53e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335   114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 1907118335   190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 2.64e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 2.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                   ....
gi 1907118335  192 FGVK 195
Cdd:cd00063     72 FRVR 75
PHA03247 PHA03247
large tegument protein UL36; Provisional
943-1304 4.11e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 4.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  943 TRRPHPRPKTTASTGVSESKSVSDDLELVAFSTESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEPlhTTSKPKMPP 1022
Cdd:PHA03247  2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEP--DPHPPPTVP 2647
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1023 SPEVADTTSAPLETR-----GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPrttelaKTTQAPHRLHTAPVRPR 1097
Cdd:PHA03247  2648 PPERPRDDPAPGRVSrprraRRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP------PPTPEPAPHALVSATPL 2721
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1098 IPGRPHGRPALNKTTTRPdktKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRHSSTRPV 1177
Cdd:PHA03247  2722 PPGPAAARQASPALPAAP---APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL 2798
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1178 SPERRPLPPNNVTgkPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGttTDFSSSPTKETDPLG 1257
Cdd:PHA03247  2799 PSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG--GDVRRRPPSRSPAAK 2874
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907118335 1258 KPRFIGPHVRYIPKPENKPCSIT-----DSVRRFPTEEATEGNATSPPQNPP 1304
Cdd:PHA03247  2875 PAAPARPPVRRLARPAVSRSTESfalppDQPERPPQPQAPPPPQPQPQPPPP 2926
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
515-626 4.54e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 4.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950   361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1907118335  595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950   434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
PRK10263 PRK10263
DNA translocase FtsK; Provisional
559-821 1.15e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.92  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  559 PEVPKSKPALEPATVTPEILVPKIVPKPPQKPKA-----TRRPEVPQVKPA-HEPVTFGSEAPAlaiVTTTdiEPVITrt 632
Cdd:PRK10263   302 PEYDEYDPLLNGAPITEPVAVAAAATTATQSWAApvepvTQTPPVASVDVPpAQPTVAWQPVPG---PQTG--EPVIA-- 374
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  633 kasvttlapkpPRPRTHRQRTKYKttqSPKIPHSKPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATS- 711
Cdd:PRK10263   375 -----------PAPEGYPQQSQYA---QPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEq 440
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  712 --LEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVtPLKEDPVTTIVPITDLERVtdlETPV 789
Cdd:PRK10263   441 pvAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE-PVVEETKPARPPLYYFEEV---EEKR 516
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1907118335  790 AFRTE--APGTTLVPAVVLEPVTLRPEVQVTTLA 821
Cdd:PRK10263   517 AREREqlAAWYQPIPEPVKEPEPIKSSLKAPSVA 550
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
490-618 1.57e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.93  E-value: 1.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994   370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118335  569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994   450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
491-765 1.61e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 1.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPrt 648
Cdd:pfam03154  252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQP-- 297
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  649 hrqrtkykttqSPKIPHSKPDLGPITSEPPLASTTKKvRRPRPKPQTTPHPEVP--HTILVPATSLEPFIitEAPGTTLV 726
Cdd:pfam03154  298 -----------FPLTPQSSQSQVPPGPSPAAPGQSQQ-RIHTPPSQSQLQSQQPprEQPLPPAPLSMPHI--KPPPTTPI 363
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1907118335  727 PKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 765
Cdd:pfam03154  364 PQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
458-770 1.74e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.91  E-value: 1.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003   372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003   451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTKAsvttlAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPIT 674
Cdd:PRK07003   531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAR-----AAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAAT 605
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  675 SEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRSPAAsPTELV 754
Cdd:PRK07003   606 GDAPPNGAARAEQAAESRGAPPPWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADAPAP-PVDTR 681
                          330
                   ....*....|....*.
gi 1907118335  755 PTPvfePVTPLkeDPV 770
Cdd:PRK07003   682 PLP---PAIPL--DAI 692
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
480-628 1.89e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.87  E-value: 1.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950   351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118335  560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950   427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
PHA03247 PHA03247
large tegument protein UL36; Provisional
531-775 2.46e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 2.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  531 PSAQTTKAPRKTKKPGHHRlrrpKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPqvkPAHEPVTF 610
Cdd:PHA03247   255 PAPPPVVGEGADRAPETAR----GATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA---PAGDAEEE 327
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  611 GSEAPALAIVTTtdiepvITRTKASVTTLAPKPPRPrthrqrtkyktTQSPkiPHSKPDLGPITSEPPLASTTKKVRRpr 690
Cdd:PHA03247   328 DDEDGAMEVVSP------LPRPRQHYPLGFPKRRRP-----------TWTP--PSSLEDLSAGRHHPKRASLPTRKRR-- 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  691 pkpqTTPHPEVPHTiLVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPV 770
Cdd:PHA03247   387 ----SARHAATPFA-RGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEP 461

                   ....*
gi 1907118335  771 TTIVP 775
Cdd:PHA03247   462 APDDP 466
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
469-600 3.32e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 3.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764   371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118335  545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764   449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
553-774 3.45e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 41.84  E-value: 3.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  553 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 627
Cdd:PLN03209   330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  628 VITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHskPDLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTILV 707
Cdd:PLN03209   407 PAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARY--EDLKPPTS-------------PSPTAPTGVSPSVSSTSSV 471
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907118335  708 PATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIV 774
Cdd:PLN03209   472 PAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
515-612 3.59e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.85  E-value: 3.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954   385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
                           90
                   ....*....|....*...
gi 1907118335  595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954   451 PRNVASGKPG---VDLGS 465
PHA03247 PHA03247
large tegument protein UL36; Provisional
307-575 4.55e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 4.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  464 RTAEqPRATLAPiealfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTK 543
Cdd:PHA03247  2893 RSTE-SFALPPD--------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 1907118335  544 KPGHHRLRRPKTTRSP----EVPKSKPALE-PATVTP 575
Cdd:PHA03247  2958 AVPQPWLGALVPGRVAvprfRVPQPAPSREaPASSTP 2994
dnaA PRK14086
chromosomal replication initiator protein DnaA;
515-717 4.61e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 41.35  E-value: 4.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPE--ILVPKIVPKP 586
Cdd:PRK14086    87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPEpgAWPRAADDYG 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  587 PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHS 666
Cdd:PRK14086   167 WQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRG 241
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907118335  667 KPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 717
Cdd:PRK14086   242 GPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
448-750 5.91e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 5.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  448 TATRDPILDSVPPKTSRTAEQPRATLAPiealfesrnveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 527
Cdd:PHA03307    60 AACDRFEPPTGPPPGPGTEAPANESRST-------------PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  528 ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPE------VPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQV 601
Cdd:PHA03307   127 PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPaavasdAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP 206
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  602 KPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSEPPLAS 681
Cdd:PHA03307   207 PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPAS 286
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118335  682 TTKKVRRPRPKPQ---------TTPHPEVPHTILVPATSLE-PFIITEAPGTTLVPklPQQPDYPHPKPKTTRSPAASP 750
Cdd:PHA03307   287 SSSSPRERSPSPSpsspgsgpaPSSPRASSSSSSSRESSSSsTSSSSESSRGAAVS--PGPSPSRSPSPSRPPPPADPS 363
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
491-682 6.45e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 6.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323   383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323   463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907118335  641 PKPPRPRTHRQRTKYKTTQSPKIPHSKPDLGPITSE---PPLAST 682
Cdd:PRK12323   543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDgdwPALAAR 587
COG3979 COG3979
Chitodextrinase [Carbohydrate transport and metabolism];
1299-1398 7.48e-03

Chitodextrinase [Carbohydrate transport and metabolism];


Pssm-ID: 443178 [Multi-domain]  Cd Length: 369  Bit Score: 40.53  E-value: 7.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1299 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1377
Cdd:COG3979      2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
                           90       100
                   ....*....|....*....|.
gi 1907118335 1378 nplgeGPASNTVAFSTESADP 1398
Cdd:COG3979     72 -----DAAGNVSAASGTSTAM 87
PRK11633 PRK11633
cell division protein DedD; Provisional
451-539 8.40e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 39.60  E-value: 8.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  451 RDPIlDSVPPKTSRTAEQP--------RATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVK 522
Cdd:PRK11633    50 RDEP-DMMPAATQALPTQPpegaaeavRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAP 128
                           90
                   ....*....|....*..
gi 1907118335  523 PAPEPETRPSAQTTKAP 539
Cdd:PRK11633   129 PAPKPEPKPVVEEKAAP 145
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-769 8.64e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 40.80  E-value: 8.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665    208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665    288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  565 KPALEPATVTPEILV-PKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaiVTTTDIEPVITRTKASVTTLAPKP 643
Cdd:COG5665    364 DLATPVSPTPPETSVdKKVSPDSATSSTKSEKEGGTASSPMPPNIAIGAKDD----VDATDPSQEAKEYTKNAPMTPEAD 439
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  644 PRPRThrqrtkykTTQSPKIPHSKPDLGPItsepplaSTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT 723
Cdd:COG5665    440 SAPES--------SVRTEASPSAGSDLEPE-------NTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDS 504
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907118335  724 TLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 769
Cdd:COG5665    505 TPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PHA03378 PHA03378
EBNA-3B; Provisional
467-870 9.34e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.82  E-value: 9.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  467 EQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPstpkrQSTPKPPRVKPAPEP-ETRPSAQTTKAPR----- 540
Cdd:PHA03378   345 EAVRLPDDPIIVEDDDESEEIESECDPDEDKSGAEALASIP-----QTLPDPPTVYGRPKVfARKADLKSTKKCRaivtd 419
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  541 ---------KTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKP--PQKPKATrrpevPQVKPA--HEP 607
Cdd:PHA03378   420 psvikaieeEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQPLEGPTGPLSVQAPlePWQPLPH-----PQVTPVilHQP 494
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  608 VTFGSEAP-ALAIVTTTDIEPVITRTKAsvTTLAPKPPRPRTHRQ-----------RTKYKTTQSPKIPH--SKPDLGPI 673
Cdd:PHA03378   495 PAQGVQAHgSMLDLLEKDDEDMEQRVMA--TLLPPSPPQPRAGRRapcvytedldiESDEPASTEPVHDQllPAPGLGPL 572
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  674 TSEPPLASTTKKVRRPRPKPQTTPHPeVPHtilvPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTEL 753
Cdd:PHA03378   573 QIQPLTSPTTSQLASSAPSYAQTPWP-VPH----PSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLV 647
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  754 VPTPVFEP-VTPLKEDPVTTIVPITDLERVTDLETPVAFRTEAPGTTLVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPS 832
Cdd:PHA03378   648 FPTPHQPPqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPP 727
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 1907118335  833 PKPKPVPSPEVTESKPVLPRVREPVTLRTETWVTTKAP 870
Cdd:PHA03378   728 AAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 765
PHA03369 PHA03369
capsid maturational protease; Provisional
491-778 9.39e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 40.37  E-value: 9.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PHA03369   362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPT 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  571 ATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaivtTTDIEPVITRTKASVTTLAPKPPRPRTHR 650
Cdd:PHA03369   442 NPYVMPISMANMVYPGHPQEHGHERKRKRGGELKEELIETLKLVK------KLKEEQESLAKELEATAHKSEIKKIAESE 515
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335  651 QRTKYKTTQSPKI-PHSKPDLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT------ 723
Cdd:PHA03369   516 FKNAGAKTAAANIePNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTaealag 595
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118335  724 ---TLVPKLPQQPDYPHpkpktTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPITD 778
Cdd:PHA03369   596 aieTLLTQASAQPAGLS-----LPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPSLE 648
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1016-1303 9.69e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.44  E-value: 9.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1016 SKPKMPP-SPEVADTTSAPLETRGIPLIPVISPRPSQEElqTAMEETDQSTQELFTTKIPRTTELAKTTQAPhrlhtAPV 1094
Cdd:PTZ00449   492 SKKKLAPiEEEDSDKHDEPPEGPEASGLPPKAPGDKEGE--EGEHEDSKESDEPKEGGKPGETKEGEVGKKP-----GPA 564
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1095 RPRipgRPHGRPALNKTTTRPDKTKprgtshkngvgtgtkqAPKPPSPGRnasvdshATRKPGSVSGTRRPPIPHRHSST 1174
Cdd:PTZ00449   565 KEH---KPSKIPTLSKKPEFPKDPK----------------HPKDPEEPK-------KPKRPRSAQRPTRPKSPKLPELL 618
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118335 1175 RPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLkatlhpigTATARPGAEQKEPTAPASEEEFGTTTDFSSSPTKETD 1254
Cdd:PTZ00449   619 DIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKI--------IKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETK 690
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1907118335 1255 PLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEGNATSPPQNP 1303
Cdd:PTZ00449   691 TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDP 739
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH