NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1812311596|ref|XP_032404739|]
View 

host cell factor 1 [Xiphophorus hellerii]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
26-345 5.38e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


:

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 94.84  E-value: 5.38e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   26 WS--GPVPRPRHGHRAVAIKELMVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 100
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  101 MVEY---GKYSNDLYELQASRWEWKKLkaknpknGPPPCPRLGHSFSLVGNKCYLFGGlaNDSEDPKNNIPRYlnDLYTL 177
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG--WDDGGNVAWVEVY--DPATG 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  178 ELRPGSSvvgwdipitygvLPPPRESHTAVVYTEkmtrkSRLIIYGGMSGcrlgdlwtldidTLTWNKPSVGGTAPLPRS 257
Cdd:COG3055    148 TWTQLAP------------LPTPRDHLAAAVLPD-----GKILVIGGRNG------------SGFSNTWTTLAPLPTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  258 LHSATTITNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDSMCWETVlmdtleDNIPRARAGHCAVAINSRLYV 337
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 1812311596  338 WSG--RDGYR 345
Cdd:COG3055    257 IGGetKPGVR 266
FN3 super family cl21522
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1569-1671 5.07e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


The actual alignment was detected with superfamily member pfam00041:

Pssm-ID: 473895 [Multi-domain]  Cd Length: 85  Bit Score: 37.78  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596 1569 GAPCAIKIS-KSPDGAHLTWEPPSVTSGKIIEYSVYLAIQSSQTAEAKASTPAQLAFMRVycgpnpsclvqsSSLsNAHI 1647
Cdd:pfam00041    1 SAPSNLTVTdVTSTSLTVSWTPPPDGNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTL------------TGL-KPGT 67
                           90       100
                   ....*....|....*....|....
gi 1812311596 1648 DYTtkpaiiFRIAARNEKGYGPAT 1671
Cdd:pfam00041   68 EYE------VRVQAVNGGGEGPPS 85
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1533-1562 7.99e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.48  E-value: 7.99e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1812311596 1533 LQPGTAYKFRVAGINACGRGTFSEISAFKT 1562
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
26-345 5.38e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 94.84  E-value: 5.38e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   26 WS--GPVPRPRHGHRAVAIKELMVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 100
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  101 MVEY---GKYSNDLYELQASRWEWKKLkaknpknGPPPCPRLGHSFSLVGNKCYLFGGlaNDSEDPKNNIPRYlnDLYTL 177
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG--WDDGGNVAWVEVY--DPATG 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  178 ELRPGSSvvgwdipitygvLPPPRESHTAVVYTEkmtrkSRLIIYGGMSGcrlgdlwtldidTLTWNKPSVGGTAPLPRS 257
Cdd:COG3055    148 TWTQLAP------------LPTPRDHLAAAVLPD-----GKILVIGGRNG------------SGFSNTWTTLAPLPTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  258 LHSATTITNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDSMCWETVlmdtleDNIPRARAGHCAVAINSRLYV 337
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 1812311596  338 WSG--RDGYR 345
Cdd:COG3055    257 IGGetKPGVR 266
PLN02193 PLN02193
nitrile-specifier protein
5-323 2.41e-13

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 74.61  E-value: 2.41e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596    5 GSAVSGTTPSVLQPRWKRV-LGWSGPVPRPRHGHRAVAIKelMVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIP 80
Cdd:PLN02193   138 GAYISLPSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVP 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   81 P-GCAAYGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKKLKAKNPKngppPCPRLGHSFSLVGNKCYLFGGLAND 159
Cdd:PLN02193   216 HlSCLGVRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLLTPVEEG----PTPRSFHSMAADEENVYVFGGVSAT 290
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  160 SE----DPKNNIPRYLNDLYTlelrPGSSVV---GWDIPITYGvlpppresHTAVVYtekmtrksrliiygGMSGCRLGD 232
Cdd:PLN02193   291 ARlktlDSYNIVDKKWFHCST----PGDSFSirgGAGLEVVQG--------KVWVVY--------------GFNGCEVDD 344
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  233 LWTLDIDTLTWNKPSVGGTAPLPRSLHSATTITNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDSMCWETVLM 312
Cdd:PLN02193   345 VHYYDPVQDKWTQVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDK 419
                          330
                   ....*....|.
gi 1812311596  313 DTLEDNIPRAR 323
Cdd:PLN02193   420 FGEEEETPSSR 430
Kelch_3 pfam13415
Galactose oxidase, central domain;
216-264 2.25e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 40.35  E-value: 2.25e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1812311596  216 KSRLIIYGG---MSGCRLGDLWTLDIDTLTWNKPsvgGTAPLPRSLHSATTI 264
Cdd:pfam13415    1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
fn3 pfam00041
Fibronectin type III domain;
1569-1671 5.07e-03

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 37.78  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596 1569 GAPCAIKIS-KSPDGAHLTWEPPSVTSGKIIEYSVYLAIQSSQTAEAKASTPAQLAFMRVycgpnpsclvqsSSLsNAHI 1647
Cdd:pfam00041    1 SAPSNLTVTdVTSTSLTVSWTPPPDGNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTL------------TGL-KPGT 67
                           90       100
                   ....*....|....*....|....
gi 1812311596 1648 DYTtkpaiiFRIAARNEKGYGPAT 1671
Cdd:pfam00041   68 EYE------VRVQAVNGGGEGPPS 85
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1533-1562 7.99e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.48  E-value: 7.99e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1812311596 1533 LQPGTAYKFRVAGINACGRGTFSEISAFKT 1562
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
26-345 5.38e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 94.84  E-value: 5.38e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   26 WS--GPVPRPRHGHRAVAIKELMVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 100
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  101 MVEY---GKYSNDLYELQASRWEWKKLkaknpknGPPPCPRLGHSFSLVGNKCYLFGGlaNDSEDPKNNIPRYlnDLYTL 177
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG--WDDGGNVAWVEVY--DPATG 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  178 ELRPGSSvvgwdipitygvLPPPRESHTAVVYTEkmtrkSRLIIYGGMSGcrlgdlwtldidTLTWNKPSVGGTAPLPRS 257
Cdd:COG3055    148 TWTQLAP------------LPTPRDHLAAAVLPD-----GKILVIGGRNG------------SGFSNTWTTLAPLPTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  258 LHSATTITNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDSMCWETVlmdtleDNIPRARAGHCAVAINSRLYV 337
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 1812311596  338 WSG--RDGYR 345
Cdd:COG3055    257 IGGetKPGVR 266
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
77-346 6.12e-18

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 85.98  E-value: 6.12e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   77 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKKLkaknpknGPPPCPRLGHSFSLV-GNKCYLFG 154
Cdd:COG3055      7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  155 GLandseDPKNNIPRYLNDLYTLELRPGSsvvgWdipITYGVLPPPRESHTAVVYTEKMtrksrLIIYGGMSGCRLGDLW 234
Cdd:COG3055     78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLLDGKI-----YVVGGWDDGGNVAWVE 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  235 TLDIDTLTWnkpSVGGTAPLPRSLHSATTITN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldsmcwetvlmd 313
Cdd:COG3055    141 VYDPATGTW---TQLAPLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1812311596  314 TLEDNIPRARAGHCAVAINSRLYVWSGRDGYRK 346
Cdd:COG3055    188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
PLN02193 PLN02193
nitrile-specifier protein
5-323 2.41e-13

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 74.61  E-value: 2.41e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596    5 GSAVSGTTPSVLQPRWKRV-LGWSGPVPRPRHGHRAVAIKelMVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIP 80
Cdd:PLN02193   138 GAYISLPSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVP 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   81 P-GCAAYGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKKLKAKNPKngppPCPRLGHSFSLVGNKCYLFGGLAND 159
Cdd:PLN02193   216 HlSCLGVRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLLTPVEEG----PTPRSFHSMAADEENVYVFGGVSAT 290
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  160 SE----DPKNNIPRYLNDLYTlelrPGSSVV---GWDIPITYGvlpppresHTAVVYtekmtrksrliiygGMSGCRLGD 232
Cdd:PLN02193   291 ARlktlDSYNIVDKKWFHCST----PGDSFSirgGAGLEVVQG--------KVWVVY--------------GFNGCEVDD 344
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  233 LWTLDIDTLTWNKPSVGGTAPLPRSLHSATTITNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDSMCWETVLM 312
Cdd:PLN02193   345 VHYYDPVQDKWTQVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDK 419
                          330
                   ....*....|.
gi 1812311596  313 DTLEDNIPRAR 323
Cdd:PLN02193   420 FGEEEETPSSR 430
PLN02153 PLN02153
epithiospecifier protein
16-276 5.86e-13

epithiospecifier protein


Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 72.33  E-value: 5.86e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   16 LQPRWKRVLGWSGPVPRPRHGHRAVAIKELMVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCD 91
Cdd:PLN02153     5 LQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRMVAV 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   92 GTRLLVFGGMVEYGKYsNDLYELQASRWEWKKLKAKNPKNGPPpcPRLGHSFSLVGNKCYLFGGLandSEDPKNNIPRYL 171
Cdd:PLN02153    85 GTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPE--ARTFHSMASDENHVYVFGGV---SKGGLMKTPERF 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  172 NDLY-----------------TLELRPGS--SVVGWDIPITYG----VLPPPRESH--TAVVYTEKMTRK---------- 216
Cdd:PLN02153   159 RTIEayniadgkwvqlpdpgeNFEKRGGAgfAVVQGKIWVVYGfatsILPGGKSDYesNAVQFFDPASGKwtevettgak 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  217 -------------SRLIIYGG----------MSGCRLGDLWTLDIDTLTWNKPSVGGTAPLPRSLHSATTIT----NKMY 269
Cdd:PLN02153   239 psarsvfahavvgKYIIIFGGevwpdlkghlGPGTLSNEGYALDTETLVWEKLGECGEPAMPRGWTAYTTATvygkNGLL 318

                   ....*..
gi 1812311596  270 VFGGWVP 276
Cdd:PLN02153   319 MHGGKLP 325
PLN02193 PLN02193
nitrile-specifier protein
92-273 2.91e-09

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 61.51  E-value: 2.91e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   92 GTRLLVFGGMVE--YGKYSNDLYELQA--SRWEWKKLKAKNPK---NGPPPCPRLGHSFSLVGNKCYLFGGlandSEDPK 164
Cdd:PLN02193   113 GVKFVLQGGKIVgfHGRSTDVLHSLGAyiSLPSTPKLLGKWIKveqKGEGPGLRCSHGIAQVGNKIYSFGG----EFTPN 188
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  165 NNIPRYLNdLYTLELRPgssvvgWDIPITYGVLPppresHTAVVYTEKMTRKSRLIIYGGMSGCR-LGDLWTLDIDTLTW 243
Cdd:PLN02193   189 QPIDKHLY-VFDLETRT------WSISPATGDVP-----HLSCLGVRMVSIGSTLYVFGGRDASRqYNGFYSFDTTTNEW 256
                          170       180       190
                   ....*....|....*....|....*....|
gi 1812311596  244 NKPSVGGTAPLPRSLHSATTITNKMYVFGG 273
Cdd:PLN02193   257 KLLTPVEEGPTPRSFHSMAADEENVYVFGG 286
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
26-115 8.02e-05

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 46.30  E-value: 8.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   26 WS--GPVPRPRHGHRAVAIKELMVVFGGGNeGIVDELHVYNTATNQWFipaVRGDIPPGCAAYGFVCDGTRLLVFGGMVE 103
Cdd:COG3055    187 WTtlAPLPTARAGHAAAVLGGKILVFGGES-GFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVIGGETK 262
                           90
                   ....*....|..
gi 1812311596  104 YGKYSNDLYELQ 115
Cdd:COG3055    263 PGVRTPLVTSAE 274
PRK14131 PRK14131
N-acetylneuraminate epimerase;
19-100 8.63e-05

N-acetylneuraminate epimerase;


Pssm-ID: 237617 [Multi-domain]  Cd Length: 376  Bit Score: 46.93  E-value: 8.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596   19 RWKRVLGWSGPvprPRHGHRAVAIKELMVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 90
Cdd:PRK14131    63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
                           90
                   ....*....|
gi 1812311596   91 DGTRLLVFGG 100
Cdd:PRK14131   138 HNGKAYITGG 147
Kelch_3 pfam13415
Galactose oxidase, central domain;
216-264 2.25e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 40.35  E-value: 2.25e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1812311596  216 KSRLIIYGG---MSGCRLGDLWTLDIDTLTWNKPsvgGTAPLPRSLHSATTI 264
Cdd:pfam13415    1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
33-70 3.30e-04

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 39.90  E-value: 3.30e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1812311596   33 PRHGHRAVAIKELMVVFGGGNEG-IVDELHVYNTATNQW 70
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
Kelch_3 pfam13415
Galactose oxidase, central domain;
265-331 1.25e-03

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 38.43  E-value: 1.25e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1812311596  265 TNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDSMCWETVlmdtleDNIPRARAGHCAVAI 331
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
31-68 3.24e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 36.77  E-value: 3.24e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1812311596   31 PRPRHGHRAVAIKELMVVFGG---GNEGIVDELHVYNTATN 68
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
fn3 pfam00041
Fibronectin type III domain;
1569-1671 5.07e-03

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 37.78  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596 1569 GAPCAIKIS-KSPDGAHLTWEPPSVTSGKIIEYSVYLAIQSSQTAEAKASTPAQLAFMRVycgpnpsclvqsSSLsNAHI 1647
Cdd:pfam00041    1 SAPSNLTVTdVTSTSLTVSWTPPPDGNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTL------------TGL-KPGT 67
                           90       100
                   ....*....|....*....|....
gi 1812311596 1648 DYTtkpaiiFRIAARNEKGYGPAT 1671
Cdd:pfam00041   68 EYE------VRVQAVNGGGEGPPS 85
Kelch_4 pfam13418
Galactose oxidase, central domain;
33-81 6.11e-03

Galactose oxidase, central domain;


Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 36.44  E-value: 6.11e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1812311596   33 PRHGHRAVAIKELMV-VFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 81
Cdd:pfam13418    1 PRAYHTSTSIPDDTIyLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
242-349 6.18e-03

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 40.52  E-value: 6.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1812311596  242 TWnkpSVGGTAPLPRSLHSATTITNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDSMCWETVlmdtleDNIPR 321
Cdd:COG3055      2 TW---SSLPDLPTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLPG 58
                           90       100
                   ....*....|....*....|....*....
gi 1812311596  322 ARAGH-CAVAINSRLYVWSGRDGYRKAWN 349
Cdd:COG3055     59 PPRHHaAAVAQDGKLYVFGGFTGANPSST 87
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1533-1562 7.99e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.48  E-value: 7.99e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1812311596 1533 LQPGTAYKFRVAGINACGRGTFSEISAFKT 1562
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH