NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2009554257|ref|XP_040121385|]
View 

host cell factor 1 isoform X1 [Oryx dammah]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.84e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


:

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 96.76  E-value: 1.84e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2009554257  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1870-1899 2.62e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.62e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2009554257 1870 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1899
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
PRK13914 super family cl36314
invasion associated endopeptidase;
467-809 6.58e-03

invasion associated endopeptidase;


The actual alignment was detected with superfamily member PRK13914:

Pssm-ID: 237555 [Multi-domain]  Cd Length: 481  Bit Score: 41.33  E-value: 6.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  467 VPGSSISVPAAARTQGVPAVLKVTGPQATTGtplvtmrpasqAGKAPVTVTSLPAGVRMVVPTQSAQG-TVIGSSPQMSG 545
Cdd:PRK13914    64 VPGQKLQVNEVAAAEKTEKSVSATWLNVRSG-----------AGVDNSIITSIKGGTKVTVETTESNGwHKITYNDGKTG 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  546 MAALAAAAAATQKIPPSSAPTV-------LSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVSNPATRMLKTAAAQVG 618
Cdd:PRK13914   133 FVNGKYLTDKVTSTPVAPTQEVkketttqQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWA 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  619 TSVSSAAntSTRPIITVH--KSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALIsnlgKVMSVVQTKPVQTSA 696
Cdd:PRK13914   213 LSVKYGV--SVQDIMSWNnlSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVV----KENTNTNTATTEKKE 286
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  697 VTGQASTGPVTQIIQTK-GPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPST-TKPGTTTIIKTIPM 774
Cdd:PRK13914   287 TTTQQQTAPKAPTEAAKpAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTnANQGSSNNNSNSSA 366
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 2009554257  775 SAIITQA----------GATGRGLPRCAGEKAGVTSSAGIKSPIT 809
Cdd:PRK13914   367 SAIIAEAqkhlgkayswGGNGPTTFDCSGYTKYVFAKAGISLPRT 411
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1905-2010 9.51e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.48  E-value: 9.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257 1905 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqsaqaggetKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1983
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2009554257 1984 HIDYTtkpaiiFRIAARNEKGYGPATQ 2010
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.84e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 96.76  E-value: 1.84e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2009554257  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
PLN02193 PLN02193
nitrile-specifier protein
10-322 3.30e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.85  E-value: 3.30e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   10 SPAVLLQPRWKRV-VGWSGPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAA 84
Cdd:PLN02193   144 PSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLG 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   85 YGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpk 163
Cdd:PLN02193   222 VRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR--- 292
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  164 nniPRYLNDLYILELRpgsgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIETLTWN 243
Cdd:PLN02193   293 ---LKTLDSYNIVDKK-------WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWT 356
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2009554257  244 KPSLSGVAPLPRSLHSATTIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   357 QVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 7.72e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 7.72e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2009554257   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1870-1899 2.62e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.62e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2009554257 1870 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1899
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
PRK13914 PRK13914
invasion associated endopeptidase;
467-809 6.58e-03

invasion associated endopeptidase;


Pssm-ID: 237555 [Multi-domain]  Cd Length: 481  Bit Score: 41.33  E-value: 6.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  467 VPGSSISVPAAARTQGVPAVLKVTGPQATTGtplvtmrpasqAGKAPVTVTSLPAGVRMVVPTQSAQG-TVIGSSPQMSG 545
Cdd:PRK13914    64 VPGQKLQVNEVAAAEKTEKSVSATWLNVRSG-----------AGVDNSIITSIKGGTKVTVETTESNGwHKITYNDGKTG 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  546 MAALAAAAAATQKIPPSSAPTV-------LSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVSNPATRMLKTAAAQVG 618
Cdd:PRK13914   133 FVNGKYLTDKVTSTPVAPTQEVkketttqQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWA 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  619 TSVSSAAntSTRPIITVH--KSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALIsnlgKVMSVVQTKPVQTSA 696
Cdd:PRK13914   213 LSVKYGV--SVQDIMSWNnlSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVV----KENTNTNTATTEKKE 286
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  697 VTGQASTGPVTQIIQTK-GPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPST-TKPGTTTIIKTIPM 774
Cdd:PRK13914   287 TTTQQQTAPKAPTEAAKpAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTnANQGSSNNNSNSSA 366
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 2009554257  775 SAIITQA----------GATGRGLPRCAGEKAGVTSSAGIKSPIT 809
Cdd:PRK13914   367 SAIIAEAqkhlgkayswGGNGPTTFDCSGYTKYVFAKAGISLPRT 411
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
464-767 7.47e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 7.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  464 LPTVPGSSisVPAAARTQGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQM 543
Cdd:pfam17823  114 ALAAAASS--SPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAA 191
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  544 SGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVmvsNPATRMLKTAAAQVGTSVSS 623
Cdd:pfam17823  192 SSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTV---TPAALATLAAAAGTVASAAG 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  624 AANTST-----------RPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALISNLGKVMSVVQTKPV 692
Cdd:pfam17823  269 TINMGDpharrlspakhMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVV 348
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2009554257  693 QTSAV-TGQASTGPVtqiiqtkgPLPAGTILKLV--TSADGKPTTIITTTQASGAGTKPTILGISS-VSPSTTKPGTTT 767
Cdd:pfam17823  349 TTTKAqAKEPSASPV--------PVLHTSMIPEVeaTSPTTQPSPLLPTQGAAGPGILLAPEQVATeATAGTASAGPTP 419
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1905-2010 9.51e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.48  E-value: 9.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257 1905 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqsaqaggetKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1983
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2009554257 1984 HIDYTtkpaiiFRIAARNEKGYGPATQ 2010
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.84e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 96.76  E-value: 1.84e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2009554257  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
76-345 8.49e-17

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 82.90  E-value: 8.49e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   76 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLV-GNKCYLFG 153
Cdd:COG3055      7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  154 GLandseDPKNNIPRYLNDLYILELRPGSgvvaWdipITYGVLPPPRESHTAVVYtekDNKKskLVIYGGMSGCRLGDLW 233
Cdd:COG3055     78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLL---DGKI--YVVGGWDDGGNVAWVE 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  234 TLDIETLTWNKPslsGVAPLPRSLHSATTIGN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldtmawetilmd 312
Cdd:COG3055    141 VYDPATGTWTQL---APLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
                          250       260       270
                   ....*....|....*....|....*....|...
gi 2009554257  313 TLEDNIPRARAGHCAVAINTRLYIWSGRDGYRK 345
Cdd:COG3055    188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
PLN02193 PLN02193
nitrile-specifier protein
10-322 3.30e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.85  E-value: 3.30e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   10 SPAVLLQPRWKRV-VGWSGPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAA 84
Cdd:PLN02193   144 PSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLG 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   85 YGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpk 163
Cdd:PLN02193   222 VRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR--- 292
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  164 nniPRYLNDLYILELRpgsgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIETLTWN 243
Cdd:PLN02193   293 ---LKTLDSYNIVDKK-------WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWT 356
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2009554257  244 KPSLSGVAPLPRSLHSATTIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   357 QVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
PLN02153 PLN02153
epithiospecifier protein
12-330 3.81e-15

epithiospecifier protein


Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 79.26  E-value: 3.81e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   12 AVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGF 87
Cdd:PLN02153     2 APTLQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRM 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   88 VCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLKAKTPKNGPPpcPRLGHSFSLVGNKCYLFGGLANDS-------- 159
Cdd:PLN02153    82 VAVGTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPE--ARTFHSMASDENHVYVFGGVSKGGlmktperf 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  160 ----------------EDPKNNiprylndlyiLELRPGSG--VVAWDIPITYGVLpppreshTAVVYTEKDNKKSKLVIY 221
Cdd:PLN02153   159 rtieayniadgkwvqlPDPGEN----------FEKRGGAGfaVVQGKIWVVYGFA-------TSILPGGKSDYESNAVQF 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  222 ggmsgcrlgdlwtLDIETLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGwvpLVMDDVKvaTHEKEWKCTNTLACLNL 301
Cdd:PLN02153   222 -------------FDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGG---EVWPDLK--GHLGPGTLSNEGYALDT 283
                          330       340
                   ....*....|....*....|....*....
gi 2009554257  302 DTMAWETiLMDTLEDNIPRARAGHCAVAI 330
Cdd:PLN02153   284 ETLVWEK-LGECGEPAMPRGWTAYTTATV 311
PRK14131 PRK14131
N-acetylneuraminate epimerase;
18-99 2.32e-05

N-acetylneuraminate epimerase;


Pssm-ID: 237617 [Multi-domain]  Cd Length: 376  Bit Score: 48.86  E-value: 2.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   18 RWKRVVGWSGPvprPRHGHRAVAIKELIVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 89
Cdd:PRK14131    63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
                           90
                   ....*....|
gi 2009554257   90 DGTRLLVFGG 99
Cdd:PRK14131   138 HNGKAYITGG 147
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
27-114 5.50e-05

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 47.07  E-value: 5.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   27 GPVPRPRHGHRAVAIKELIVVFGGGNeGIVDELHVYNTATNQWFipaVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKY 106
Cdd:COG3055    191 APLPTARAGHAAAVLGGKILVFGGES-GFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVIGGETKPGVR 266

                   ....*...
gi 2009554257  107 SNDLYELQ 114
Cdd:COG3055    267 TPLVTSAE 274
PLN02193 PLN02193
nitrile-specifier protein
242-393 7.60e-05

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 47.64  E-value: 7.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  242 WNKPSLSGVAPLPRSLHSATTIGNKMYVFGG-WVPlvmdDVKVATHekewkctntLACLNLDTMAWEtilMDTLEDNIPR 320
Cdd:PLN02193   153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGeFTP----NQPIDKH---------LYVFDLETRTWS---ISPATGDVPH 216
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  321 ARA-GHCAVAINTRLYIWSGRDGYRK--AWNNQVCCKDLWYLET--EKPPPPARVQLVRANTNSLEVSWGAVATA----- 390
Cdd:PLN02193   217 LSClGVRMVSIGSTLYVFGGRDASRQynGFYSFDTTTNEWKLLTpvEEGPTPRSFHSMAADEENVYVFGGVSATArlktl 296

                   ...
gi 2009554257  391 DSY 393
Cdd:PLN02193   297 DSY 299
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 7.72e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 7.72e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2009554257   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
Kelch_3 pfam13415
Galactose oxidase, central domain;
264-330 1.11e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 41.51  E-value: 1.11e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2009554257  264 GNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAI 330
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
241-348 2.10e-04

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 45.53  E-value: 2.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  241 TWNK-PSLsgvaPLPRSLHSATTIGNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDTMAWETIlmdtleDNIP 319
Cdd:COG3055      2 TWSSlPDL----PTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLP 57
                           90       100       110
                   ....*....|....*....|....*....|
gi 2009554257  320 RARAGH-CAVAINTRLYIWSGRDGYRKAWN 348
Cdd:COG3055     58 GPPRHHaAAVAQDGKLYVFGGFTGANPSST 87
Kelch_3 pfam13415
Galactose oxidase, central domain;
215-263 7.72e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 39.20  E-value: 7.72e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2009554257  215 KSKLVIYGG---MSGCRLGDLWTLDIETLTWNKPslsGVAPLPRSLHSATTI 263
Cdd:pfam13415    1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
Kelch_4 pfam13418
Galactose oxidase, central domain;
32-80 1.35e-03

Galactose oxidase, central domain;


Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 38.36  E-value: 1.35e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2009554257   32 PRHGHRAVAIKE-LIVVFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 80
Cdd:pfam13418    1 PRAYHTSTSIPDdTIYLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
30-67 2.62e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.54  E-value: 2.62e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2009554257   30 PRPRHGHRAVAIKELIVVFGG---GNEGIVDELHVYNTATN 67
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1870-1899 2.62e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.62e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2009554257 1870 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1899
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
319-351 2.80e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.16  E-value: 2.80e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2009554257  319 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
PHA03098 PHA03098
kelch-like protein; Provisional
38-295 6.28e-03

kelch-like protein; Provisional


Pssm-ID: 222983 [Multi-domain]  Cd Length: 534  Bit Score: 41.29  E-value: 6.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257   38 AVAIKELIVVFGGGNEG--IVDELHVYNTATNQWF-IPavrgDIPPGCAAYGFVCDGTRLLVFGGmVEYGKYSNDLYELQ 114
Cdd:PHA03098   290 SVVLNNVIYFIGGMNKNnlSVNSVVSYDTKTKSWNkVP----ELIYPRKNPGVTVFNNRIYVIGG-IYNSISLNTVESWK 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  115 ASRWEWKRLKaktpkngPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNnIPRYlnDLYILELRPGSGvvawdipityg 194
Cdd:PHA03098   365 PGESKWREEP-------PLIFPRYNPCVVNVNNLIYVIGGISKNDELLKT-VECF--SLNTNKWSKGSP----------- 423
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  195 vLPPPRESHTAVVYtekdnkKSKLVIYGGMS----GCRLGDLWTLDIETLTWNKPSLSGVaplPRSLHSATTIGNKMYVF 270
Cdd:PHA03098   424 -LPISHYGGCAIYH------DGKIYVIGGISyidnIKVYNIVESYNPVTNKWTELSSLNF---PRINASLCIFNNKIYVV 493
                          250       260
                   ....*....|....*....|....*
gi 2009554257  271 GGWvplvMDDVKVATHEKEWKCTNT 295
Cdd:PHA03098   494 GGD----KYEYYINEIEVYDDKTNT 514
PRK13914 PRK13914
invasion associated endopeptidase;
467-809 6.58e-03

invasion associated endopeptidase;


Pssm-ID: 237555 [Multi-domain]  Cd Length: 481  Bit Score: 41.33  E-value: 6.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  467 VPGSSISVPAAARTQGVPAVLKVTGPQATTGtplvtmrpasqAGKAPVTVTSLPAGVRMVVPTQSAQG-TVIGSSPQMSG 545
Cdd:PRK13914    64 VPGQKLQVNEVAAAEKTEKSVSATWLNVRSG-----------AGVDNSIITSIKGGTKVTVETTESNGwHKITYNDGKTG 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  546 MAALAAAAAATQKIPPSSAPTV-------LSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVSNPATRMLKTAAAQVG 618
Cdd:PRK13914   133 FVNGKYLTDKVTSTPVAPTQEVkketttqQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWA 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  619 TSVSSAAntSTRPIITVH--KSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALIsnlgKVMSVVQTKPVQTSA 696
Cdd:PRK13914   213 LSVKYGV--SVQDIMSWNnlSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVV----KENTNTNTATTEKKE 286
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  697 VTGQASTGPVTQIIQTK-GPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPST-TKPGTTTIIKTIPM 774
Cdd:PRK13914   287 TTTQQQTAPKAPTEAAKpAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTnANQGSSNNNSNSSA 366
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 2009554257  775 SAIITQA----------GATGRGLPRCAGEKAGVTSSAGIKSPIT 809
Cdd:PRK13914   367 SAIIAEAqkhlgkayswGGNGPTTFDCSGYTKYVFAKAGISLPRT 411
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
464-767 7.47e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 7.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  464 LPTVPGSSisVPAAARTQGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQM 543
Cdd:pfam17823  114 ALAAAASS--SPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAA 191
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  544 SGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVmvsNPATRMLKTAAAQVGTSVSS 623
Cdd:pfam17823  192 SSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTV---TPAALATLAAAAGTVASAAG 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257  624 AANTST-----------RPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALISNLGKVMSVVQTKPV 692
Cdd:pfam17823  269 TINMGDpharrlspakhMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVV 348
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2009554257  693 QTSAV-TGQASTGPVtqiiqtkgPLPAGTILKLV--TSADGKPTTIITTTQASGAGTKPTILGISS-VSPSTTKPGTTT 767
Cdd:pfam17823  349 TTTKAqAKEPSASPV--------PVLHTSMIPEVeaTSPTTQPSPLLPTQGAAGPGILLAPEQVATeATAGTASAGPTP 419
Kelch_4 pfam13418
Galactose oxidase, central domain;
199-254 9.19e-03

Galactose oxidase, central domain;


Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 36.05  E-value: 9.19e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2009554257  199 PRESHTAVVytekdNKKSKLVIYGGMS--GCRLGDLWTLDIETLTWNKpslsgVAPLP 254
Cdd:pfam13418    1 PRAYHTSTS-----IPDDTIYLFGGEGedGTLLSDLWVFDLSTNEWTR-----LGSLP 48
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
254-273 9.23e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 36.05  E-value: 9.23e-03
                           10        20
                   ....*....|....*....|
gi 2009554257  254 PRSLHSATTIGNKMYVFGGW 273
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGF 20
Kelch_3 pfam13415
Galactose oxidase, central domain;
146-208 9.24e-03

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 36.11  E-value: 9.24e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2009554257  146 GNKCYLFGGLANDSEDpknniprYLNDLYilELRPGSGVVAwdipiTYGVLPPPRESHTAVVY 208
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT-------RLNDLY--VYDLDTNTWT-----QIGDLPPPRSGHSATYI 49
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1905-2010 9.51e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.48  E-value: 9.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009554257 1905 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqsaqaggetKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1983
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2009554257 1984 HIDYTtkpaiiFRIAARNEKGYGPATQ 2010
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH