NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1949710151|ref|XP_038172414|]
View 

host cell factor 1 isoform X5 [Arvicola amphibius]

Protein Classification

fibronectin type III domain-containing protein( domain architecture ID 13287381)

uncharacterized fibronectin type III (FN3) domain-containing protein; also contains kelch repeats

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.04e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


:

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.15  E-value: 1.04e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 1949710151  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
441-771 7.45e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.52  E-value: 7.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  441 TQVGITLVPQAATAPPSTTTIQVLPTVPGSSISVPTAARTQGVPTVLKVTgPQATTGTPLVTMRPASQTGKAPVTVTSLP 520
Cdd:pfam05109  510 TSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT-PAVTTPTPNATIPTLGKTSPTSAVTTPTP 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  521 ASVRMVVPTQSAQGTVigSNPQMSGMaalaaaaaatqkippSSAPTVLSVPAGTtivkTVAVTPGTTTLPATvkvasspv 600
Cdd:pfam05109  589 NATSPTVGETSPQANT--TNHTLGGT---------------SSTPVVTSPPKNA----TSAVTTGQHNITSS-------- 639
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  601 mvSNPATRMLKTAAAQVGTSVSSAANTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISN 679
Cdd:pfam05109  640 --STSSMSLRPSSISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASG 711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  680 LGKvmSVVQTKPVQTSAVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPS 759
Cdd:pfam05109  712 PGN--SSTSTKPGEVNVTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPT 779
                          330
                   ....*....|..
gi 1949710151  760 TTKPGTTTIIKT 771
Cdd:pfam05109  780 TDYGGDSTTPRT 791
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1865-1894 2.72e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.72e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1949710151 1865 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1894
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.04e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.15  E-value: 1.04e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 1949710151  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
PLN02193 PLN02193
nitrile-specifier protein
27-322 3.89e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.47  E-value: 3.89e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   27 GPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCDGTRLLVFGGMVE 102
Cdd:PLN02193   162 GPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGVRMVSIGSTLYVFGGRDA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  103 YGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpknniPRYLNDLYILELRpg 181
Cdd:PLN02193   240 SRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR------LKTLDSYNIVDKK-- 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  182 sgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIETLTWNKPSLSGVAPLPRSLHSAT 261
Cdd:PLN02193   306 -----WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASA 374
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1949710151  262 TIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   375 AVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 7.77e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 7.77e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1949710151   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
441-771 7.45e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.52  E-value: 7.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  441 TQVGITLVPQAATAPPSTTTIQVLPTVPGSSISVPTAARTQGVPTVLKVTgPQATTGTPLVTMRPASQTGKAPVTVTSLP 520
Cdd:pfam05109  510 TSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT-PAVTTPTPNATIPTLGKTSPTSAVTTPTP 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  521 ASVRMVVPTQSAQGTVigSNPQMSGMaalaaaaaatqkippSSAPTVLSVPAGTtivkTVAVTPGTTTLPATvkvasspv 600
Cdd:pfam05109  589 NATSPTVGETSPQANT--TNHTLGGT---------------SSTPVVTSPPKNA----TSAVTTGQHNITSS-------- 639
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  601 mvSNPATRMLKTAAAQVGTSVSSAANTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISN 679
Cdd:pfam05109  640 --STSSMSLRPSSISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASG 711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  680 LGKvmSVVQTKPVQTSAVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPS 759
Cdd:pfam05109  712 PGN--SSTSTKPGEVNVTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPT 779
                          330
                   ....*....|..
gi 1949710151  760 TTKPGTTTIIKT 771
Cdd:pfam05109  780 TDYGGDSTTPRT 791
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1865-1894 2.72e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.72e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1949710151 1865 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1894
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
444-950 3.65e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 42.46  E-value: 3.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  444 GITLVPQAATAPPSTTTIQVLPTVPGSSISVPTAARTQGVPTVLKVTGPQATTGTPLVTMRPASQTGKAPVTVTSLPASV 523
Cdd:COG4625      1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  524 RMVVPTQSAQGTVIGSNPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVS 603
Cdd:COG4625     81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  604 NPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKTIT-LVKSPISVPGGSALISNLGK 682
Cdd:COG4625    161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGgGGGGGGGGGGGGGGGGGGGG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  683 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTK 762
Cdd:COG4625    241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  763 PGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPGQP 842
Cdd:COG4625    321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  843 GTILRTVPMGGVRLVN-PVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGAHSTSASLATPITTLGTIATLSS 921
Cdd:COG4625    401 GGGGAGGTGGGGAGGGgGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
                          490       500
                   ....*....|....*....|....*....
gi 1949710151  922 QVINPTAITVSAAQTTLTAAGGLTTPTIT 950
Cdd:COG4625    481 NNTYTGTTTVNGGGNYTQSAGSTLAVEVD 509
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 1.04e-21

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.15  E-value: 1.04e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 1949710151  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
76-345 5.25e-17

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 83.28  E-value: 5.25e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   76 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLV-GNKCYLFG 153
Cdd:COG3055      7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  154 GLandseDPKNNIPRYLNDLYILELRPGSgvvaWdipITYGVLPPPRESHTAVVYtekDNKKskLVIYGGMSGCRLGDLW 233
Cdd:COG3055     78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLL---DGKI--YVVGGWDDGGNVAWVE 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  234 TLDIETLTWNKPslsGVAPLPRSLHSATTIGN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldtmawetilmd 312
Cdd:COG3055    141 VYDPATGTWTQL---APLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1949710151  313 TLEDNIPRARAGHCAVAINTRLYIWSGRDGYRK 345
Cdd:COG3055    188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
PLN02193 PLN02193
nitrile-specifier protein
27-322 3.89e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.47  E-value: 3.89e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   27 GPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCDGTRLLVFGGMVE 102
Cdd:PLN02193   162 GPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGVRMVSIGSTLYVFGGRDA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  103 YGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpknniPRYLNDLYILELRpg 181
Cdd:PLN02193   240 SRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR------LKTLDSYNIVDKK-- 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  182 sgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIETLTWNKPSLSGVAPLPRSLHSAT 261
Cdd:PLN02193   306 -----WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASA 374
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1949710151  262 TIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   375 AVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
PLN02153 PLN02153
epithiospecifier protein
12-330 2.64e-15

epithiospecifier protein


Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 79.65  E-value: 2.64e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   12 AVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGF 87
Cdd:PLN02153     2 APTLQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRM 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   88 VCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLKAKTPKNGPPpcPRLGHSFSLVGNKCYLFGGLANDS-------- 159
Cdd:PLN02153    82 VAVGTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPE--ARTFHSMASDENHVYVFGGVSKGGlmktperf 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  160 ----------------EDPKNNiprylndlyiLELRPGSG--VVAWDIPITYGVLpppreshTAVVYTEKDNKKSKLVIY 221
Cdd:PLN02153   159 rtieayniadgkwvqlPDPGEN----------FEKRGGAGfaVVQGKIWVVYGFA-------TSILPGGKSDYESNAVQF 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  222 ggmsgcrlgdlwtLDIETLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWvplVMDDVKvaTHEKEWKCTNTLACLNL 301
Cdd:PLN02153   222 -------------FDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGGE---VWPDLK--GHLGPGTLSNEGYALDT 283
                          330       340
                   ....*....|....*....|....*....
gi 1949710151  302 DTMAWETiLMDTLEDNIPRARAGHCAVAI 330
Cdd:PLN02153   284 ETLVWEK-LGECGEPAMPRGWTAYTTATV 311
PLN02193 PLN02193
nitrile-specifier protein
126-272 3.12e-08

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 58.43  E-value: 3.12e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  126 KTPKNGPPPCPRLGHSFSLVGNKCYLFGGlandSEDPKNNIPRYlndLYILELRPGSgvvaWDIPITYGVLPppresHTA 205
Cdd:PLN02193   155 KVEQKGEGPGLRCSHGIAQVGNKIYSFGG----EFTPNQPIDKH---LYVFDLETRT----WSISPATGDVP-----HLS 218
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1949710151  206 VVYTEKDNKKSKLVIYGGMSGCR-LGDLWTLDIETLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGG 272
Cdd:PLN02193   219 CLGVRMVSIGSTLYVFGGRDASRqYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGG 286
PRK14131 PRK14131
N-acetylneuraminate epimerase;
18-99 2.14e-05

N-acetylneuraminate epimerase;


Pssm-ID: 237617 [Multi-domain]  Cd Length: 376  Bit Score: 49.24  E-value: 2.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   18 RWKRVVGWSGPvprPRHGHRAVAIKELIVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 89
Cdd:PRK14131    63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
                           90
                   ....*....|
gi 1949710151   90 DGTRLLVFGG 99
Cdd:PRK14131   138 HNGKAYITGG 147
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
27-114 4.63e-05

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 47.46  E-value: 4.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   27 GPVPRPRHGHRAVAIKELIVVFGGGNeGIVDELHVYNTATNQWFipaVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKY 106
Cdd:COG3055    191 APLPTARAGHAAAVLGGKILVFGGES-GFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVIGGETKPGVR 266

                   ....*...
gi 1949710151  107 SNDLYELQ 114
Cdd:COG3055    267 TPLVTSAE 274
PLN02193 PLN02193
nitrile-specifier protein
242-393 7.58e-05

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 47.64  E-value: 7.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  242 WNKPSLSGVAPLPRSLHSATTIGNKMYVFGG-WVPlvmdDVKVATHekewkctntLACLNLDTMAWEtilMDTLEDNIPR 320
Cdd:PLN02193   153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGeFTP----NQPIDKH---------LYVFDLETRTWS---ISPATGDVPH 216
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  321 ARA-GHCAVAINTRLYIWSGRDGYRK--AWNNQVCCKDLWYLET--EKPPPPARVQLVRANTNSLEVSWGAVATA----- 390
Cdd:PLN02193   217 LSClGVRMVSIGSTLYVFGGRDASRQynGFYSFDTTTNEWKLLTpvEEGPTPRSFHSMAADEENVYVFGGVSATArlktl 296

                   ...
gi 1949710151  391 DSY 393
Cdd:PLN02193   297 DSY 299
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 7.77e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 7.77e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1949710151   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
Kelch_3 pfam13415
Galactose oxidase, central domain;
264-330 1.29e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 41.12  E-value: 1.29e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1949710151  264 GNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAI 330
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
241-348 1.66e-04

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 45.53  E-value: 1.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  241 TWNK-PSLsgvaPLPRSLHSATTIGNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDTMAWETIlmdtleDNIP 319
Cdd:COG3055      2 TWSSlPDL----PTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLP 57
                           90       100       110
                   ....*....|....*....|....*....|
gi 1949710151  320 RARAGH-CAVAINTRLYIWSGRDGYRKAWN 348
Cdd:COG3055     58 GPPRHHaAAVAQDGKLYVFGGFTGANPSST 87
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
441-771 7.45e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.52  E-value: 7.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  441 TQVGITLVPQAATAPPSTTTIQVLPTVPGSSISVPTAARTQGVPTVLKVTgPQATTGTPLVTMRPASQTGKAPVTVTSLP 520
Cdd:pfam05109  510 TSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT-PAVTTPTPNATIPTLGKTSPTSAVTTPTP 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  521 ASVRMVVPTQSAQGTVigSNPQMSGMaalaaaaaatqkippSSAPTVLSVPAGTtivkTVAVTPGTTTLPATvkvasspv 600
Cdd:pfam05109  589 NATSPTVGETSPQANT--TNHTLGGT---------------SSTPVVTSPPKNA----TSAVTTGQHNITSS-------- 639
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  601 mvSNPATRMLKTAAAQVGTSVSSAANTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISN 679
Cdd:pfam05109  640 --STSSMSLRPSSISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASG 711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  680 LGKvmSVVQTKPVQTSAVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPS 759
Cdd:pfam05109  712 PGN--SSTSTKPGEVNVTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPT 779
                          330
                   ....*....|..
gi 1949710151  760 TTKPGTTTIIKT 771
Cdd:pfam05109  780 TDYGGDSTTPRT 791
Kelch_3 pfam13415
Galactose oxidase, central domain;
215-263 8.75e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 38.81  E-value: 8.75e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1949710151  215 KSKLVIYGG---MSGCRLGDLWTLDIETLTWNKPslsGVAPLPRSLHSATTI 263
Cdd:pfam13415    1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
452-832 9.13e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.52  E-value: 9.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  452 ATAPPSTTTIQVLPTV----PGSSISVPTAARTQGVPTVLKVTGPQATTGTPLVTMRPASQTGKAPVTVTSLP------- 520
Cdd:pfam05109  422 SKAPESTTTSPTLNTTgfaaPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPrdngtes 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  521 ASVRMVVPTQSAQG-TVIGSNPQMSGMAALAAAAAAT-QKIPPSSAPTVLSVPAGTTIVKTVAVTPGTT--TLPATVKVA 596
Cdd:pfam05109  502 KAPDMTSPTSAVTTpTPNATSPTPAVTTPTPNATSPTlGKTSPTSAVTTPTPNATSPTPAVTTPTPNATipTLGKTSPTS 581
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  597 SSPVMVSNPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVtkTITLVKSPISVPGGSAL 676
Cdd:pfam05109  582 AVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTS--SMSLRPSSISETLSPST 659
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  677 ISNLGKVMSVVQT-KPVQTSAVTGQASTGPVTQIIQTKGPLPagtilklvtsadgKPTtiiTTTQASGAGTkptilgiss 755
Cdd:pfam05109  660 SDNSTSHMPLLTSaHPTGGENITQVTPASTSTHHVSTSSPAP-------------RPG---TTSQASGPGN--------- 714
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1949710151  756 vSPSTTKPGTTTIIKTIPmsaiitQAGATGVTSSPGIKSPITIITTkvmtsgTGAPAKIITAvPKIATGHGQQGVTQ 832
Cdd:pfam05109  715 -SSTSTKPGEVNVTKGTP------PKNATSPQAPSGQKTAVPTVTS------TGGKANSTTG-GKHTTGHGARTSTE 777
Kelch_4 pfam13418
Galactose oxidase, central domain;
32-80 1.34e-03

Galactose oxidase, central domain;


Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 38.36  E-value: 1.34e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1949710151   32 PRHGHRAVAIKE-LIVVFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 80
Cdd:pfam13418    1 PRAYHTSTSIPDdTIYLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
30-67 2.61e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.54  E-value: 2.61e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1949710151   30 PRPRHGHRAVAIKELIVVFGG---GNEGIVDELHVYNTATN 67
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1865-1894 2.72e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.72e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1949710151 1865 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1894
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
319-351 2.80e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.16  E-value: 2.80e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1949710151  319 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
444-950 3.65e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 42.46  E-value: 3.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  444 GITLVPQAATAPPSTTTIQVLPTVPGSSISVPTAARTQGVPTVLKVTGPQATTGTPLVTMRPASQTGKAPVTVTSLPASV 523
Cdd:COG4625      1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  524 RMVVPTQSAQGTVIGSNPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVS 603
Cdd:COG4625     81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  604 NPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKTIT-LVKSPISVPGGSALISNLGK 682
Cdd:COG4625    161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGgGGGGGGGGGGGGGGGGGGGG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  683 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTK 762
Cdd:COG4625    241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  763 PGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPGQP 842
Cdd:COG4625    321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  843 GTILRTVPMGGVRLVN-PVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGAHSTSASLATPITTLGTIATLSS 921
Cdd:COG4625    401 GGGGAGGTGGGGAGGGgGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
                          490       500
                   ....*....|....*....|....*....
gi 1949710151  922 QVINPTAITVSAAQTTLTAAGGLTTPTIT 950
Cdd:COG4625    481 NNTYTGTTTVNGGGNYTQSAGSTLAVEVD 509
PHA03098 PHA03098
kelch-like protein; Provisional
38-295 6.26e-03

kelch-like protein; Provisional


Pssm-ID: 222983 [Multi-domain]  Cd Length: 534  Bit Score: 41.29  E-value: 6.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151   38 AVAIKELIVVFGGGNEG--IVDELHVYNTATNQWF-IPavrgDIPPGCAAYGFVCDGTRLLVFGGmVEYGKYSNDLYELQ 114
Cdd:PHA03098   290 SVVLNNVIYFIGGMNKNnlSVNSVVSYDTKTKSWNkVP----ELIYPRKNPGVTVFNNRIYVIGG-IYNSISLNTVESWK 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  115 ASRWEWKRLKaktpkngPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNnIPRYlnDLYILELRPGSGvvawdipityg 194
Cdd:PHA03098   365 PGESKWREEP-------PLIFPRYNPCVVNVNNLIYVIGGISKNDELLKT-VECF--SLNTNKWSKGSP----------- 423
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1949710151  195 vLPPPRESHTAVVYtekdnkKSKLVIYGGMS----GCRLGDLWTLDIETLTWNKPSLSGVaplPRSLHSATTIGNKMYVF 270
Cdd:PHA03098   424 -LPISHYGGCAIYH------DGKIYVIGGISyidnIKVYNIVESYNPVTNKWTELSSLNF---PRINASLCIFNNKIYVV 493
                          250       260
                   ....*....|....*....|....*
gi 1949710151  271 GGWvplvMDDVKVATHEKEWKCTNT 295
Cdd:PHA03098   494 GGD----KYEYYINEIEVYDDKTNT 514
Kelch_4 pfam13418
Galactose oxidase, central domain;
199-254 9.17e-03

Galactose oxidase, central domain;


Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 36.05  E-value: 9.17e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1949710151  199 PRESHTAVVytekdNKKSKLVIYGGMS--GCRLGDLWTLDIETLTWNKpslsgVAPLP 254
Cdd:pfam13418    1 PRAYHTSTS-----IPDDTIYLFGGEGedGTLLSDLWVFDLSTNEWTR-----LGSLP 48
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
254-273 9.58e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 36.05  E-value: 9.58e-03
                           10        20
                   ....*....|....*....|
gi 1949710151  254 PRSLHSATTIGNKMYVFGGW 273
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGF 20
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH