NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217391871|ref|XP_047298007|]
View 

host cell factor 1 isoform X1 [Homo sapiens]

Protein Classification

fibronectin type III domain-containing protein( domain architecture ID 13287381)

uncharacterized fibronectin type III (FN3) domain-containing protein; also contains kelch repeats

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.04e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


:

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.15  E-value: 1.04e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2217391871  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
466-771 8.80e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.60  E-value: 8.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  466 TVPGSSISVPTAARTQGVPAVLKVTgPQATTGTPLVTMRPASQAGKAPVTVTSLPAgvrMVVPTQSAQGTVIGSSPQMSG 545
Cdd:pfam05109  507 TSPTSAVTTPTPNATSPTPAVTTPT-PNATSPTLGKTSPTSAVTTPTPNATSPTPA---VTTPTPNATIPTLGKTSPTSA 582
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  546 MaalaaaaaaTQKIPPSSAPTV-LSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVM-----VSNPATRMLKTAAAQVGT 619
Cdd:pfam05109  583 V---------TTPTPNATSPTVgETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTtgqhnITSSSTSSMSLRPSSISE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  620 SVSSAT---NTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISNLGKvmSVVQTKPVQTS 695
Cdd:pfam05109  654 TLSPSTsdnSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASGPGN--SSTSTKPGEVN 725
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217391871  696 AVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPSTTKPGTTTIIKT 771
Cdd:pfam05109  726 VTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPTTDYGGDSTTPRT 791
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1128-1620 1.30e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1128 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1203
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1204 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1275
Cdd:PHA03247  2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1276 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1351
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1352 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1431
Cdd:PHA03247  2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1432 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPKISSMTETAPRA 1511
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1512 LTTEVPIPAkiTVTIANTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1591
Cdd:PHA03247  3000 SLSRVSSWA--SSLALHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
                          490       500
                   ....*....|....*....|....*....
gi 2217391871 1592 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1620
Cdd:PHA03247  3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1904-1933 2.91e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.91e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2217391871 1904 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1933
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1939-2044 7.80e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.86  E-value: 7.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1939 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 2017
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2217391871 2018 HIDYTtkpaiiFRIAARNEKGYGPATQ 2044
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.04e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.15  E-value: 1.04e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2217391871  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
PLN02193 PLN02193
nitrile-specifier protein
27-322 5.13e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.08  E-value: 5.13e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   27 GPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCDGTRLLVFGGMVE 102
Cdd:PLN02193   162 GPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGVRMVSIGSTLYVFGGRDA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  103 YGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpknniPRYLNDLYILELRpg 181
Cdd:PLN02193   240 SRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR------LKTLDSYNIVDKK-- 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  182 sgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSAT 261
Cdd:PLN02193   306 -----WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASA 374
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217391871  262 TIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   375 AVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 8.08e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 8.08e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2217391871   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
466-771 8.80e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.60  E-value: 8.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  466 TVPGSSISVPTAARTQGVPAVLKVTgPQATTGTPLVTMRPASQAGKAPVTVTSLPAgvrMVVPTQSAQGTVIGSSPQMSG 545
Cdd:pfam05109  507 TSPTSAVTTPTPNATSPTPAVTTPT-PNATSPTLGKTSPTSAVTTPTPNATSPTPA---VTTPTPNATIPTLGKTSPTSA 582
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  546 MaalaaaaaaTQKIPPSSAPTV-LSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVM-----VSNPATRMLKTAAAQVGT 619
Cdd:pfam05109  583 V---------TTPTPNATSPTVgETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTtgqhnITSSSTSSMSLRPSSISE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  620 SVSSAT---NTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISNLGKvmSVVQTKPVQTS 695
Cdd:pfam05109  654 TLSPSTsdnSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASGPGN--SSTSTKPGEVN 725
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217391871  696 AVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPSTTKPGTTTIIKT 771
Cdd:pfam05109  726 VTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPTTDYGGDSTTPRT 791
PHA03247 PHA03247
large tegument protein UL36; Provisional
1128-1620 1.30e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1128 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1203
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1204 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1275
Cdd:PHA03247  2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1276 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1351
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1352 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1431
Cdd:PHA03247  2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1432 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPKISSMTETAPRA 1511
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1512 LTTEVPIPAkiTVTIANTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1591
Cdd:PHA03247  3000 SLSRVSSWA--SSLALHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
                          490       500
                   ....*....|....*....|....*....
gi 2217391871 1592 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1620
Cdd:PHA03247  3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1904-1933 2.91e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.91e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2217391871 1904 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1933
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1939-2044 7.80e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.86  E-value: 7.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1939 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 2017
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2217391871 2018 HIDYTtkpaiiFRIAARNEKGYGPATQ 2044
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.04e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.15  E-value: 1.04e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2217391871  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
76-345 3.83e-17

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 84.05  E-value: 3.83e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   76 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLV-GNKCYLFG 153
Cdd:COG3055      7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  154 GLandseDPKNNIPRYLNDLYILELRPGSgvvaWdipITYGVLPPPRESHTAVVYtekDNKKskLVIYGGMSGCRLGDLW 233
Cdd:COG3055     78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLL---DGKI--YVVGGWDDGGNVAWVE 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  234 TLDIDTLTWNKPslsGVAPLPRSLHSATTIGN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldtmawetilmd 312
Cdd:COG3055    141 VYDPATGTWTQL---APLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
                          250       260       270
                   ....*....|....*....|....*....|...
gi 2217391871  313 TLEDNIPRARAGHCAVAINTRLYIWSGRDGYRK 345
Cdd:COG3055    188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
PLN02193 PLN02193
nitrile-specifier protein
27-322 5.13e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.08  E-value: 5.13e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   27 GPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCDGTRLLVFGGMVE 102
Cdd:PLN02193   162 GPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGVRMVSIGSTLYVFGGRDA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  103 YGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpknniPRYLNDLYILELRpg 181
Cdd:PLN02193   240 SRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR------LKTLDSYNIVDKK-- 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  182 sgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSAT 261
Cdd:PLN02193   306 -----WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASA 374
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217391871  262 TIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   375 AVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
PLN02153 PLN02153
epithiospecifier protein
12-330 3.42e-15

epithiospecifier protein


Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 79.26  E-value: 3.42e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   12 AVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGF 87
Cdd:PLN02153     2 APTLQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRM 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   88 VCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLKAKTPKNGPPPcpRLGHSFSLVGNKCYLFGGLANDS-------- 159
Cdd:PLN02153    82 VAVGTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPEA--RTFHSMASDENHVYVFGGVSKGGlmktperf 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  160 ----------------EDPKNNiprylndlyiLELRPGSG--VVAWDIPITYGVLpppreshTAVVYTEKDNKKSKLVIY 221
Cdd:PLN02153   159 rtieayniadgkwvqlPDPGEN----------FEKRGGAGfaVVQGKIWVVYGFA-------TSILPGGKSDYESNAVQF 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  222 ggmsgcrlgdlwtLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWvplVMDDVKvaTHEKEWKCTNTLACLNL 301
Cdd:PLN02153   222 -------------FDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGGE---VWPDLK--GHLGPGTLSNEGYALDT 283
                          330       340
                   ....*....|....*....|....*....
gi 2217391871  302 DTMAWETiLMDTLEDNIPRARAGHCAVAI 330
Cdd:PLN02153   284 ETLVWEK-LGECGEPAMPRGWTAYTTATV 311
PLN02193 PLN02193
nitrile-specifier protein
126-272 3.39e-08

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 58.43  E-value: 3.39e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  126 KTPKNGPPPCPRLGHSFSLVGNKCYLFGGlandSEDPKNNIPRYlndLYILELRPGSgvvaWDIPITYGVLPppresHTA 205
Cdd:PLN02193   155 KVEQKGEGPGLRCSHGIAQVGNKIYSFGG----EFTPNQPIDKH---LYVFDLETRT----WSISPATGDVP-----HLS 218
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217391871  206 VVYTEKDNKKSKLVIYGGMSGCR-LGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGG 272
Cdd:PLN02193   219 CLGVRMVSIGSTLYVFGGRDASRqYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGG 286
PRK14131 PRK14131
N-acetylneuraminate epimerase;
18-99 1.82e-05

N-acetylneuraminate epimerase;


Pssm-ID: 237617 [Multi-domain]  Cd Length: 376  Bit Score: 49.24  E-value: 1.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   18 RWKRVVGWSGPvprPRHGHRAVAIKELIVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 89
Cdd:PRK14131    63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
                           90
                   ....*....|
gi 2217391871   90 DGTRLLVFGG 99
Cdd:PRK14131   138 HNGKAYITGG 147
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
27-114 4.68e-05

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 47.46  E-value: 4.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871   27 GPVPRPRHGHRAVAIKELIVVFGGGNeGIVDELHVYNTATNQWFipaVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKY 106
Cdd:COG3055    191 APLPTARAGHAAAVLGGKILVFGGES-GFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVIGGETKPGVR 266

                   ....*...
gi 2217391871  107 SNDLYELQ 114
Cdd:COG3055    267 TPLVTSAE 274
PLN02193 PLN02193
nitrile-specifier protein
242-393 7.74e-05

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 47.64  E-value: 7.74e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  242 WNKPSLSGVAPLPRSLHSATTIGNKMYVFGG-WVPlvmdDVKVATHekewkctntLACLNLDTMAWEtilMDTLEDNIPR 320
Cdd:PLN02193   153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGeFTP----NQPIDKH---------LYVFDLETRTWS---ISPATGDVPH 216
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  321 ARA-GHCAVAINTRLYIWSGRDGYRK--AWNNQVCCKDLWYLET--EKPPPPARVQLVRANTNSLEVSWGAVATA----- 390
Cdd:PLN02193   217 LSClGVRMVSIGSTLYVFGGRDASRQynGFYSFDTTTNEWKLLTpvEEGPTPRSFHSMAADEENVYVFGGVSATArlktl 296

                   ...
gi 2217391871  391 DSY 393
Cdd:PLN02193   297 DSY 299
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 8.08e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 8.08e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2217391871   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
466-771 8.80e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.60  E-value: 8.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  466 TVPGSSISVPTAARTQGVPAVLKVTgPQATTGTPLVTMRPASQAGKAPVTVTSLPAgvrMVVPTQSAQGTVIGSSPQMSG 545
Cdd:pfam05109  507 TSPTSAVTTPTPNATSPTPAVTTPT-PNATSPTLGKTSPTSAVTTPTPNATSPTPA---VTTPTPNATIPTLGKTSPTSA 582
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  546 MaalaaaaaaTQKIPPSSAPTV-LSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVM-----VSNPATRMLKTAAAQVGT 619
Cdd:pfam05109  583 V---------TTPTPNATSPTVgETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTtgqhnITSSSTSSMSLRPSSISE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  620 SVSSAT---NTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISNLGKvmSVVQTKPVQTS 695
Cdd:pfam05109  654 TLSPSTsdnSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASGPGN--SSTSTKPGEVN 725
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217391871  696 AVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPSTTKPGTTTIIKT 771
Cdd:pfam05109  726 VTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPTTDYGGDSTTPRT 791
Kelch_3 pfam13415
Galactose oxidase, central domain;
264-330 1.16e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 41.51  E-value: 1.16e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217391871  264 GNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAI 330
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
241-348 1.68e-04

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 45.92  E-value: 1.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871  241 TWNK-PSLsgvaPLPRSLHSATTIGNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDTMAWETIlmdtleDNIP 319
Cdd:COG3055      2 TWSSlPDL----PTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLP 57
                           90       100       110
                   ....*....|....*....|....*....|
gi 2217391871  320 RARAGH-CAVAINTRLYIWSGRDGYRKAWN 348
Cdd:COG3055     58 GPPRHHaAAVAQDGKLYVFGGFTGANPSST 87
Kelch_3 pfam13415
Galactose oxidase, central domain;
215-263 3.86e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 39.97  E-value: 3.86e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217391871  215 KSKLVIYGG---MSGCRLGDLWTLDIDTLTWNKPslsGVAPLPRSLHSATTI 263
Cdd:pfam13415    1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
PHA03247 PHA03247
large tegument protein UL36; Provisional
1128-1620 1.30e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1128 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1203
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1204 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1275
Cdd:PHA03247  2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1276 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1351
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1352 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1431
Cdd:PHA03247  2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1432 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPKISSMTETAPRA 1511
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1512 LTTEVPIPAkiTVTIANTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1591
Cdd:PHA03247  3000 SLSRVSSWA--SSLALHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
                          490       500
                   ....*....|....*....|....*....
gi 2217391871 1592 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1620
Cdd:PHA03247  3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
Kelch_4 pfam13418
Galactose oxidase, central domain;
32-80 1.37e-03

Galactose oxidase, central domain;


Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 38.36  E-value: 1.37e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217391871   32 PRHGHRAVAIKE-LIVVFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 80
Cdd:pfam13418    1 PRAYHTSTSIPDdTIYLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
30-67 2.66e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.54  E-value: 2.66e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2217391871   30 PRPRHGHRAVAIKELIVVFGG---GNEGIVDELHVYNTATN 67
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
319-351 2.85e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.16  E-value: 2.85e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2217391871  319 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1904-1933 2.91e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.91e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2217391871 1904 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1933
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1939-2044 7.80e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.86  E-value: 7.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391871 1939 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 2017
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2217391871 2018 HIDYTtkpaiiFRIAARNEKGYGPATQ 2044
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
Kelch_3 pfam13415
Galactose oxidase, central domain;
146-208 9.58e-03

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 36.11  E-value: 9.58e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217391871  146 GNKCYLFGGLANDSEDpknniprYLNDLYilELRPGSGVVAwdipiTYGVLPPPRESHTAVVY 208
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT-------RLNDLY--VYDLDTNTWT-----QIGDLPPPRSGHSATYI 49
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
254-273 9.86e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 36.05  E-value: 9.86e-03
                           10        20
                   ....*....|....*....|
gi 2217391871  254 PRSLHSATTIGNKMYVFGGW 273
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGF 20
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH