Conserved domains on  [gi|2112827423|ref|XP_044091620|]

host cell factor 1 isoform X7 [Neogale vison]

Protein Classification

fibronectin type III domain-containing protein (domain architecture ID 13287381)

uncharacterized fibronectin type III (FN3) domain-containing protein; also contains kelch repeats


List of domain hits

Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.79e-21

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 96.76  E-value: 1.79e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2112827423  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
377-763 2.89e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 2.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  377 TNSLEAASAPPTTTTIQVLPTV----PGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPA 452
Cdd:pfam05109  416 THKVIFSKAPESTTTSPTLNTTgfaaPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPR 495
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  453 --GVRMVVPTQSAQGTVI------GSSPQMSGMAALAAAAAAT-QKIPPSSAPTVLSVPAGTTIVKTVAVTPGTT--TLP 521
Cdd:pfam05109  496 dnGTESKAPDMTSPTSAVttptpnATSPTPAVTTPTPNATSPTlGKTSPTSAVTTPTPNATSPTPAVTTPTPNATipTLG 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  522 ATVKVASSPVMVSNPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVtkTITLVKSPISV 601
Cdd:pfam05109  576 KTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTS--SMSLRPSSISE 653
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  602 PGGSALISNLGKVMSVVQT-KPVQTSAVTGQASTGPVTQIIQTKGPLPagtilklvtsadgKPTtiiTTTQASGAGTkpt 680
Cdd:pfam05109  654 TLSPSTSDNSTSHMPLLTSaHPTGGENITQVTPASTSTHHVSTSSPAP-------------RPG---TTSQASGPGN--- 714
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  681 ilgissvSPSTTKPGTTTIIKTIPmsaiitQAGATGVTSSPGIKSPITIITTkvmtsgTGAPAKIITAvPKIATGHGQQG 760
Cdd:pfam05109  715 -------SSTSTKPGEVNVTKGTP------PKNATSPQAPSGQKTAVPTVTS------TGGKANSTTG-GKHTTGHGART 774

                   ...
gi 2112827423  761 VTQ 763
Cdd:pfam05109  775 STE 777
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1833-1862 2.40e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.



Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.40e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2112827423 1833 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1862
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1868-1973 8.06e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.



Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.48  E-value: 8.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 1868 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqssqaggepKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1946
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2112827423 1947 HIDYTtkpaiiFRIAARNEKGYGPATQ 1973
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
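
Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch, assuming that layout holds for every hit in this listing, the line can be parsed into typed fields for filtering or sorting. The names SUMMARY_RE and parse_summary are hypothetical helpers written for illustration and are not part of any NCBI tool.

```python
import re

# Regular expression for the per-hit summary line as it appears in this listing.
# The bracketed tag (e.g. "[Multi-domain]") is treated as optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "kind": match.group("kind"),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Summary line copied from the NanM (COG3055) hit above.
hit = parse_summary(
    "Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 96.76  E-value: 1.79e-21"
)
print(hit)
```

Parsing the E-value as a float makes it straightforward to keep only hits below a chosen threshold; any such threshold is a choice of the reader and is not stated on this page.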
 
PLN02193 PLN02193
nitrile-specifier protein
10-322 4.17e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.47  E-value: 4.17e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423   10 SPAVLLQPRWKRV-VGWSGPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAA 84
Cdd:PLN02193   144 PSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLG 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423   85 YGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpk 163
Cdd:PLN02193   222 VRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR--- 292
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  164 nniPRYLNDLYILELRpgsgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWN 243
Cdd:PLN02193   293 ---LKTLDSYNIVDKK-------WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWT 356
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2112827423  244 KPSLSGVAPLPRSLHSATTIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   357 QVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 7.58e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 7.58e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2112827423   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
375-874 6.95e-04

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 44.77  E-value: 6.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  375 ANTNSLEAASAPPTTTTIQVLPTVPGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGV 454
Cdd:COG4625      1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  455 RMVVPTQSAQGTVIGSSPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVS 534
Cdd:COG4625     81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  535 NPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKTIT-LVKSPISVPGGSALISNLGK 613
Cdd:COG4625    161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGgGGGGGGGGGGGGGGGGGGGG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  614 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTK 693
Cdd:COG4625    241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  694 PGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPGQP 773
Cdd:COG4625    321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  774 GT-ILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGGHSTSASLATPITTLGTIATLSS 852
Cdd:COG4625    401 GGgGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
                          490       500
                   ....*....|....*....|..
gi 2112827423  853 QVINPTAITVSAAQTTLTAAGG 874
Cdd:COG4625    481 NNTYTGTTTVNGGGNYTQSAGS 502
PHA03255 PHA03255
BDLF3; Provisional
614-754 7.62e-03

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 40.27  E-value: 7.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  614 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTI-ITTTQASGAGTKPTILGISSVSPSTT 692
Cdd:PHA03255    14 MILICETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSApITTTAILSTNTTTVTSTGTTVTPVPT 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2112827423  693 KPGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVmTSGTGAPAKIITAVPKIAT 754
Cdd:PHA03255    94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST-TSATTRITNATTLAPTLSS 154
 
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
76-345 6.05e-17

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 83.28  E-value: 6.05e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423   76 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLV-GNKCYLFG 153
Cdd:COG3055      7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  154 GLandseDPKNNIPRYLNDLYILELRPGSgvvaWdipITYGVLPPPRESHTAVVYtekDNKKskLVIYGGMSGCRLGDLW 233
Cdd:COG3055     78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLL---DGKI--YVVGGWDDGGNVAWVE 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  234 TLDIDTLTWNKPslsGVAPLPRSLHSATTIGN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldtmawetilmd 312
Cdd:COG3055    141 VYDPATGTWTQL---APLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
                          250       260       270
                   ....*....|....*....|....*....|...
gi 2112827423  313 TLEDNIPRARAGHCAVAINTRLYIWSGRDGYRK 345
Cdd:COG3055    188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
PLN02153 PLN02153
epithiospecifier protein
12-330 6.00e-15

epithiospecifier protein


Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 78.49  E-value: 6.00e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423   12 AVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGF 87
Cdd:PLN02153     2 APTLQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRM 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423   88 VCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLKAKTPKNGPPPcpRLGHSFSLVGNKCYLFGGLANDS-------- 159
Cdd:PLN02153    82 VAVGTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPEA--RTFHSMASDENHVYVFGGVSKGGlmktperf 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  160 ----------------EDPKNNiprylndlyiLELRPGSG--VVAWDIPITYGVLpppreshTAVVYTEKDNKKSKLVIY 221
Cdd:PLN02153   159 rtieayniadgkwvqlPDPGEN----------FEKRGGAGfaVVQGKIWVVYGFA-------TSILPGGKSDYESNAVQF 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  222 ggmsgcrlgdlwtLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGwvpLVMDDVKvaTHEKEWKCTNTLACLNL 301
Cdd:PLN02153   222 -------------FDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGG---EVWPDLK--GHLGPGTLSNEGYALDT 283
                          330       340
                   ....*....|....*....|....*....
gi 2112827423  302 DTMAWETiLMDTLEDNIPRARAGHCAVAI 330
Cdd:PLN02153   284 ETLVWEK-LGECGEPAMPRGWTAYTTATV 311
PRK14131 PRK14131
N-acetylneuraminate epimerase;
18-99 2.36e-05

N-acetylneuraminate epimerase;


Pssm-ID: 237617 [Multi-domain]  Cd Length: 376  Bit Score: 48.86  E-value: 2.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423   18 RWKRVVGWSGPvprPRHGHRAVAIKELIVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 89
Cdd:PRK14131    63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
                           90
                   ....*....|
gi 2112827423   90 DGTRLLVFGG 99
Cdd:PRK14131   138 HNGKAYITGG 147
Kelch_3 pfam13415
Galactose oxidase, central domain;
264-330 9.70e-05

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 41.51  E-value: 9.70e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2112827423  264 GNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAI 330
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
241-348 2.04e-04

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 45.53  E-value: 2.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  241 TWNK-PSLsgvaPLPRSLHSATTIGNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDTMAWETIlmdtleDNIP 319
Cdd:COG3055      2 TWSSlPDL----PTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLP 57
                           90       100       110
                   ....*....|....*....|....*....|
gi 2112827423  320 RARAGH-CAVAINTRLYIWSGRDGYRKAWN 348
Cdd:COG3055     58 GPPRHHaAAVAQDGKLYVFGGFTGANPSST 87
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
377-763 2.89e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 2.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  377 TNSLEAASAPPTTTTIQVLPTV----PGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPA 452
Cdd:pfam05109  416 THKVIFSKAPESTTTSPTLNTTgfaaPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPR 495
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  453 --GVRMVVPTQSAQGTVI------GSSPQMSGMAALAAAAAAT-QKIPPSSAPTVLSVPAGTTIVKTVAVTPGTT--TLP 521
Cdd:pfam05109  496 dnGTESKAPDMTSPTSAVttptpnATSPTPAVTTPTPNATSPTlGKTSPTSAVTTPTPNATSPTPAVTTPTPNATipTLG 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  522 ATVKVASSPVMVSNPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVtkTITLVKSPISV 601
Cdd:pfam05109  576 KTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTS--SMSLRPSSISE 653
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  602 PGGSALISNLGKVMSVVQT-KPVQTSAVTGQASTGPVTQIIQTKGPLPagtilklvtsadgKPTtiiTTTQASGAGTkpt 680
Cdd:pfam05109  654 TLSPSTSDNSTSHMPLLTSaHPTGGENITQVTPASTSTHHVSTSSPAP-------------RPG---TTSQASGPGN--- 714
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  681 ilgissvSPSTTKPGTTTIIKTIPmsaiitQAGATGVTSSPGIKSPITIITTkvmtsgTGAPAKIITAvPKIATGHGQQG 760
Cdd:pfam05109  715 -------SSTSTKPGEVNVTKGTP------PKNATSPQAPSGQKTAVPTVTS------TGGKANSTTG-GKHTTGHGART 774

                   ...
gi 2112827423  761 VTQ 763
Cdd:pfam05109  775 STE 777
Kelch_3 pfam13415
Galactose oxidase, central domain;
215-263 3.16e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 40.35  E-value: 3.16e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2112827423  215 KSKLVIYGG---MSGCRLGDLWTLDIDTLTWNKPslsGVAPLPRSLHSATTI 263
Cdd:pfam13415    1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
375-874 6.95e-04

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 44.77  E-value: 6.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  375 ANTNSLEAASAPPTTTTIQVLPTVPGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGV 454
Cdd:COG4625      1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  455 RMVVPTQSAQGTVIGSSPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVS 534
Cdd:COG4625     81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  535 NPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKTIT-LVKSPISVPGGSALISNLGK 613
Cdd:COG4625    161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGgGGGGGGGGGGGGGGGGGGGG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  614 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTK 693
Cdd:COG4625    241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  694 PGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPGQP 773
Cdd:COG4625    321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  774 GT-ILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGGHSTSASLATPITTLGTIATLSS 852
Cdd:COG4625    401 GGgGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
                          490       500
                   ....*....|....*....|..
gi 2112827423  853 QVINPTAITVSAAQTTLTAAGG 874
Cdd:COG4625    481 NNTYTGTTTVNGGGNYTQSAGS 502
Kelch_4 pfam13418
Galactose oxidase, central domain;
32-80 1.32e-03

Galactose oxidase, central domain;


Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 38.36  E-value: 1.32e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2112827423   32 PRHGHRAVAIKE-LIVVFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 80
Cdd:pfam13418    1 PRAYHTSTSIPDdTIYLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1833-1862 2.40e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.40e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2112827423 1833 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1862
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
30-67 2.57e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.54  E-value: 2.57e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2112827423   30 PRPRHGHRAVAIKELIVVFGG---GNEGIVDELHVYNTATN 67
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
319-351 2.75e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.16  E-value: 2.75e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2112827423  319 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
361-560 4.02e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 4.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  361 TEKPPPPARVQLVRANTNSLEAASAPPTTTTI---QVLPTVPGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPA 437
Cdd:COG3469      3 SVSTAASPTAGGASATAVTLLGAAATAASVTLtaaTATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  438 SQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGT 517
Cdd:COG3469     83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGG 162
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2112827423  518 TTLPATVKVASSPVmVSNPATRMLKTAAAQVGTSVSSAANTST 560
Cdd:COG3469    163 TTTTSTTTTTTSAS-TTPSATTTATATTASGATTPSATTTATT 204
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
395-677 6.02e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.48  E-value: 6.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  395 LPTVPGSSISVPAAARTPGVPAVLkvTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQM 474
Cdd:pfam17823  114 ALAAAASSSPSSAAQSLPAAIAAL--PSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAA 191
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  475 SGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVmvsNPATRMLKTAAAQVGTSVSS 554
Cdd:pfam17823  192 SSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTV---TPAALATLAAAAGTVASAAG 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  555 AANTST-----------RPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALISNLGKVMSVVQTKPV 623
Cdd:pfam17823  269 TINMGDpharrlspakhMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVV 348
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2112827423  624 QTSAV-TGQASTGPVtqiiqtkgPLPAGTILKLV--TSADGKPTTIITTTQASGAGT 677
Cdd:pfam17823  349 TTTKAqAKEPSASPV--------PVLHTSMIPEVeaTSPTTQPSPLLPTQGAAGPGI 397
PHA03255 PHA03255
BDLF3; Provisional
614-754 7.62e-03

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 40.27  E-value: 7.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423  614 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTI-ITTTQASGAGTKPTILGISSVSPSTT 692
Cdd:PHA03255    14 MILICETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSApITTTAILSTNTTTVTSTGTTVTPVPT 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2112827423  693 KPGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVmTSGTGAPAKIITAVPKIAT 754
Cdd:PHA03255    94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST-TSATTRITNATTLAPTLSS 154
Kelch_3 pfam13415
Galactose oxidase, central domain;
146-208 7.83e-03

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 36.11  E-value: 7.83e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2112827423  146 GNKCYLFGGLANDSEDpknniprYLNDLYilELRPGSGVVAwdipiTYGVLPPPRESHTAVVY 208
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT-------RLNDLY--VYDLDTNTWT-----QIGDLPPPRSGHSATYI 49
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1868-1973 8.06e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.48  E-value: 8.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 1868 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqssqaggepKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1946
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2112827423 1947 HIDYTtkpaiiFRIAARNEKGYGPATQ 1973
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
254-273 9.06e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 36.05  E-value: 9.06e-03
                           10        20
                   ....*....|....*....|
gi 2112827423  254 PRSLHSATTIGNKMYVFGGW 273
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGF 20
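
The interval reported for a hit can be cross-checked against its alignment rows. Each query row gives a start coordinate, the aligned sequence, and an end coordinate. Every letter, upper- or lowercase, consumes one query position, while a `-` gap consumes none. The following is a minimal Python sketch under that assumption about the row layout; the name `check_query_row` is illustrative and is not part of any NCBI tool.

```python
import re

# Assumed row layout: "gi <id>  <start> <aligned sequence> <end>".
ROW = re.compile(r"^gi\s+\d+\s+(\d+)\s+(\S+)\s+(\d+)\s*$")

def check_query_row(row):
    """Return (start, end, consistent) for one query alignment row."""
    m = ROW.match(row)
    if m is None:
        raise ValueError("not a query alignment row: %r" % row)
    start, seq, end = int(m.group(1)), m.group(2), int(m.group(3))
    # Count residues only; '-' marks a gap and uses no query position.
    residues = sum(1 for ch in seq if ch.isalpha())
    return start, end, start + residues - 1 == end

# Query row of the last Kelch_1 hit (interval 254-273).
print(check_query_row("gi 2112827423  254 PRSLHSATTIGNKMYVFGGW 273"))
# → (254, 273, True)
```

For alignments that span several row blocks, the check applies to each query row separately, since every block carries its own start and end coordinates.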
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
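
Every hit listed above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, and all reported E-values fall below the 0.01 threshold in the preset options. The following is a minimal Python sketch for extracting those fields and comparing against the threshold. The function name `parse_summary`, the dictionary keys, and the inclusive comparison are assumptions made for illustration; the page does not state whether the cutoff is strict.

```python
import re

# Pattern for the per-hit summary line as it appears on this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+)\s*\[([^\]]*)\]\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*(\S+)"
)

def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group(1)),
        "hit_type": m.group(2),
        "cd_length": int(m.group(3)),
        "bit_score": float(m.group(4)),
        "e_value": float(m.group(5)),
    }

E_VALUE_THRESHOLD = 0.01  # taken from the preset options on this page

line = "Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 36.05  E-value: 9.06e-03"
hit = parse_summary(line)
# Inclusive comparison is an assumption, not documented behavior.
print(hit["pssm_id"], hit["e_value"] <= E_VALUE_THRESHOLD)
# → 396078 True
```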
