|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
25-344 |
1.79e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 96.76 E-value: 1.79e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2112827423 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
377-763 |
2.89e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 2.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 377 TNSLEAASAPPTTTTIQVLPTV----PGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPA 452
Cdd:pfam05109 416 THKVIFSKAPESTTTSPTLNTTgfaaPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPR 495
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 453 --GVRMVVPTQSAQGTVI------GSSPQMSGMAALAAAAAAT-QKIPPSSAPTVLSVPAGTTIVKTVAVTPGTT--TLP 521
Cdd:pfam05109 496 dnGTESKAPDMTSPTSAVttptpnATSPTPAVTTPTPNATSPTlGKTSPTSAVTTPTPNATSPTPAVTTPTPNATipTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 522 ATVKVASSPVMVSNPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVtkTITLVKSPISV 601
Cdd:pfam05109 576 KTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTS--SMSLRPSSISE 653
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 602 PGGSALISNLGKVMSVVQT-KPVQTSAVTGQASTGPVTQIIQTKGPLPagtilklvtsadgKPTtiiTTTQASGAGTkpt 680
Cdd:pfam05109 654 TLSPSTSDNSTSHMPLLTSaHPTGGENITQVTPASTSTHHVSTSSPAP-------------RPG---TTSQASGPGN--- 714
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 681 ilgissvSPSTTKPGTTTIIKTIPmsaiitQAGATGVTSSPGIKSPITIITTkvmtsgTGAPAKIITAvPKIATGHGQQG 760
Cdd:pfam05109 715 -------SSTSTKPGEVNVTKGTP------PKNATSPQAPSGQKTAVPTVTS------TGGKANSTTG-GKHTTGHGART 774
|
...
gi 2112827423 761 VTQ 763
Cdd:pfam05109 775 STE 777
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1833-1862 |
2.40e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.40e-03
10 20 30
....*....|....*....|....*....|
gi 2112827423 1833 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1862
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1868-1973 |
8.06e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.48 E-value: 8.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 1868 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqssqaggepKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1946
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2112827423 1947 HIDYTtkpaiiFRIAARNEKGYGPATQ 1973
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
25-344 |
1.79e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 96.76 E-value: 1.79e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2112827423 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
10-322 |
4.17e-16 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 83.47 E-value: 4.17e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 10 SPAVLLQPRWKRV-VGWSGPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAA 84
Cdd:PLN02193 144 PSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLG 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 85 YGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpk 163
Cdd:PLN02193 222 VRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR--- 292
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 164 nniPRYLNDLYILELRpgsgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWN 243
Cdd:PLN02193 293 ---LKTLDSYNIVDKK-------WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWT 356
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2112827423 244 KPSLSGVAPLPRSLHSATTIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193 357 QVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
|
|
| Kelch_1 |
pfam01344 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
32-69 |
7.58e-05 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 41.83 E-value: 7.58e-05
10 20 30
....*....|....*....|....*....|....*....
gi 2112827423 32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
377-763 |
2.89e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 2.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 377 TNSLEAASAPPTTTTIQVLPTV----PGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPA 452
Cdd:pfam05109 416 THKVIFSKAPESTTTSPTLNTTgfaaPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPR 495
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 453 --GVRMVVPTQSAQGTVI------GSSPQMSGMAALAAAAAAT-QKIPPSSAPTVLSVPAGTTIVKTVAVTPGTT--TLP 521
Cdd:pfam05109 496 dnGTESKAPDMTSPTSAVttptpnATSPTPAVTTPTPNATSPTlGKTSPTSAVTTPTPNATSPTPAVTTPTPNATipTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 522 ATVKVASSPVMVSNPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVtkTITLVKSPISV 601
Cdd:pfam05109 576 KTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTS--SMSLRPSSISE 653
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 602 PGGSALISNLGKVMSVVQT-KPVQTSAVTGQASTGPVTQIIQTKGPLPagtilklvtsadgKPTtiiTTTQASGAGTkpt 680
Cdd:pfam05109 654 TLSPSTSDNSTSHMPLLTSaHPTGGENITQVTPASTSTHHVSTSSPAP-------------RPG---TTSQASGPGN--- 714
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 681 ilgissvSPSTTKPGTTTIIKTIPmsaiitQAGATGVTSSPGIKSPITIITTkvmtsgTGAPAKIITAvPKIATGHGQQG 760
Cdd:pfam05109 715 -------SSTSTKPGEVNVTKGTP------PKNATSPQAPSGQKTAVPTVTS------TGGKANSTTG-GKHTTGHGART 774
|
...
gi 2112827423 761 VTQ 763
Cdd:pfam05109 775 STE 777
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
375-874 |
6.95e-04 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 44.77 E-value: 6.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 375 ANTNSLEAASAPPTTTTIQVLPTVPGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGV 454
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 455 RMVVPTQSAQGTVIGSSPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVS 534
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 535 NPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKTIT-LVKSPISVPGGSALISNLGK 613
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGgGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 614 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTK 693
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 694 PGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPGQP 773
Cdd:COG4625 321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 774 GT-ILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGGHSTSASLATPITTLGTIATLSS 852
Cdd:COG4625 401 GGgGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
|
490 500
....*....|....*....|..
gi 2112827423 853 QVINPTAITVSAAQTTLTAAGG 874
Cdd:COG4625 481 NNTYTGTTTVNGGGNYTQSAGS 502
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1833-1862 |
2.40e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.40e-03
10 20 30
....*....|....*....|....*....|
gi 2112827423 1833 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1862
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
614-754 |
7.62e-03 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 40.27 E-value: 7.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 614 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTI-ITTTQASGAGTKPTILGISSVSPSTT 692
Cdd:PHA03255 14 MILICETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSApITTTAILSTNTTTVTSTGTTVTPVPT 93
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2112827423 693 KPGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVmTSGTGAPAKIITAVPKIAT 754
Cdd:PHA03255 94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST-TSATTRITNATTLAPTLSS 154
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1868-1973 |
8.06e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.48 E-value: 8.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 1868 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqssqaggepKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1946
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2112827423 1947 HIDYTtkpaiiFRIAARNEKGYGPATQ 1973
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
25-344 |
1.79e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 96.76 E-value: 1.79e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2112827423 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
76-345 |
6.05e-17 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 83.28 E-value: 6.05e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 76 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLV-GNKCYLFG 153
Cdd:COG3055 7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 154 GLandseDPKNNIPRYLNDLYILELRPGSgvvaWdipITYGVLPPPRESHTAVVYtekDNKKskLVIYGGMSGCRLGDLW 233
Cdd:COG3055 78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLL---DGKI--YVVGGWDDGGNVAWVE 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 234 TLDIDTLTWNKPslsGVAPLPRSLHSATTIGN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldtmawetilmd 312
Cdd:COG3055 141 VYDPATGTWTQL---APLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
|
250 260 270
....*....|....*....|....*....|...
gi 2112827423 313 TLEDNIPRARAGHCAVAINTRLYIWSGRDGYRK 345
Cdd:COG3055 188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
10-322 |
4.17e-16 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 83.47 E-value: 4.17e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 10 SPAVLLQPRWKRV-VGWSGPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAA 84
Cdd:PLN02193 144 PSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLG 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 85 YGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpk 163
Cdd:PLN02193 222 VRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR--- 292
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 164 nniPRYLNDLYILELRpgsgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWN 243
Cdd:PLN02193 293 ---LKTLDSYNIVDKK-------WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWT 356
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2112827423 244 KPSLSGVAPLPRSLHSATTIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193 357 QVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
|
|
| PLN02153 |
PLN02153 |
epithiospecifier protein |
12-330 |
6.00e-15 |
|
epithiospecifier protein
Pssm-ID: 177814 [Multi-domain] Cd Length: 341 Bit Score: 78.49 E-value: 6.00e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 12 AVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGF 87
Cdd:PLN02153 2 APTLQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRM 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 88 VCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLKAKTPKNGPPPcpRLGHSFSLVGNKCYLFGGLANDS-------- 159
Cdd:PLN02153 82 VAVGTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPEA--RTFHSMASDENHVYVFGGVSKGGlmktperf 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 160 ----------------EDPKNNiprylndlyiLELRPGSG--VVAWDIPITYGVLpppreshTAVVYTEKDNKKSKLVIY 221
Cdd:PLN02153 159 rtieayniadgkwvqlPDPGEN----------FEKRGGAGfaVVQGKIWVVYGFA-------TSILPGGKSDYESNAVQF 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 222 ggmsgcrlgdlwtLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGwvpLVMDDVKvaTHEKEWKCTNTLACLNL 301
Cdd:PLN02153 222 -------------FDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGG---EVWPDLK--GHLGPGTLSNEGYALDT 283
|
330 340
....*....|....*....|....*....
gi 2112827423 302 DTMAWETiLMDTLEDNIPRARAGHCAVAI 330
Cdd:PLN02153 284 ETLVWEK-LGECGEPAMPRGWTAYTTATV 311
|
|
| PRK14131 |
PRK14131 |
N-acetylneuraminate epimerase; |
18-99 |
2.36e-05 |
|
N-acetylneuraminate epimerase;
Pssm-ID: 237617 [Multi-domain] Cd Length: 376 Bit Score: 48.86 E-value: 2.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 18 RWKRVVGWSGPvprPRHGHRAVAIKELIVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 89
Cdd:PRK14131 63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
|
90
....*....|
gi 2112827423 90 DGTRLLVFGG 99
Cdd:PRK14131 138 HNGKAYITGG 147
|
|
| Kelch_1 |
pfam01344 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
32-69 |
7.58e-05 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 41.83 E-value: 7.58e-05
10 20 30
....*....|....*....|....*....|....*....
gi 2112827423 32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
264-330 |
9.70e-05 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 41.51 E-value: 9.70e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2112827423 264 GNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAI 330
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
241-348 |
2.04e-04 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 45.53 E-value: 2.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 241 TWNK-PSLsgvaPLPRSLHSATTIGNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDTMAWETIlmdtleDNIP 319
Cdd:COG3055 2 TWSSlPDL----PTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLP 57
|
90 100 110
....*....|....*....|....*....|
gi 2112827423 320 RARAGH-CAVAINTRLYIWSGRDGYRKAWN 348
Cdd:COG3055 58 GPPRHHaAAVAQDGKLYVFGGFTGANPSST 87
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
377-763 |
2.89e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 2.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 377 TNSLEAASAPPTTTTIQVLPTV----PGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPA 452
Cdd:pfam05109 416 THKVIFSKAPESTTTSPTLNTTgfaaPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPR 495
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 453 --GVRMVVPTQSAQGTVI------GSSPQMSGMAALAAAAAAT-QKIPPSSAPTVLSVPAGTTIVKTVAVTPGTT--TLP 521
Cdd:pfam05109 496 dnGTESKAPDMTSPTSAVttptpnATSPTPAVTTPTPNATSPTlGKTSPTSAVTTPTPNATSPTPAVTTPTPNATipTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 522 ATVKVASSPVMVSNPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVtkTITLVKSPISV 601
Cdd:pfam05109 576 KTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTS--SMSLRPSSISE 653
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 602 PGGSALISNLGKVMSVVQT-KPVQTSAVTGQASTGPVTQIIQTKGPLPagtilklvtsadgKPTtiiTTTQASGAGTkpt 680
Cdd:pfam05109 654 TLSPSTSDNSTSHMPLLTSaHPTGGENITQVTPASTSTHHVSTSSPAP-------------RPG---TTSQASGPGN--- 714
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 681 ilgissvSPSTTKPGTTTIIKTIPmsaiitQAGATGVTSSPGIKSPITIITTkvmtsgTGAPAKIITAvPKIATGHGQQG 760
Cdd:pfam05109 715 -------SSTSTKPGEVNVTKGTP------PKNATSPQAPSGQKTAVPTVTS------TGGKANSTTG-GKHTTGHGART 774
|
...
gi 2112827423 761 VTQ 763
Cdd:pfam05109 775 STE 777
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
215-263 |
3.16e-04 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 40.35 E-value: 3.16e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 2112827423 215 KSKLVIYGG---MSGCRLGDLWTLDIDTLTWNKPslsGVAPLPRSLHSATTI 263
Cdd:pfam13415 1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
375-874 |
6.95e-04 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 44.77 E-value: 6.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 375 ANTNSLEAASAPPTTTTIQVLPTVPGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGV 454
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 455 RMVVPTQSAQGTVIGSSPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVMVS 534
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 535 NPATRMLKTAAAQVGTSVSSAANTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKTIT-LVKSPISVPGGSALISNLGK 613
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGgGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 614 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTK 693
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 694 PGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPGQP 773
Cdd:COG4625 321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 774 GT-ILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGGHSTSASLATPITTLGTIATLSS 852
Cdd:COG4625 401 GGgGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
|
490 500
....*....|....*....|..
gi 2112827423 853 QVINPTAITVSAAQTTLTAAGG 874
Cdd:COG4625 481 NNTYTGTTTVNGGGNYTQSAGS 502
|
|
| Kelch_4 |
pfam13418 |
Galactose oxidase, central domain; |
32-80 |
1.32e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433191 [Multi-domain] Cd Length: 49 Bit Score: 38.36 E-value: 1.32e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 2112827423 32 PRHGHRAVAIKE-LIVVFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 80
Cdd:pfam13418 1 PRAYHTSTSIPDdTIYLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1833-1862 |
2.40e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.40e-03
10 20 30
....*....|....*....|....*....|
gi 2112827423 1833 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1862
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| Kelch_5 | pfam13854 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 30-67 | 2.57e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.54 E-value: 2.57e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2112827423 30 PRPRHGHRAVAIKELIVVFGG---GNEGIVDELHVYNTATN 67
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
|
|
| Kelch_5 | pfam13854 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 319-351 | 2.75e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.16 E-value: 2.75e-03
10 20 30
....*....|....*....|....*....|...
gi 2112827423 319 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
|
|
| Chi1 | COG3469 | Chitinase [Carbohydrate transport and metabolism]; | 361-560 | 4.02e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.05 E-value: 4.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 361 TEKPPPPARVQLVRANTNSLEAASAPPTTTTI---QVLPTVPGSSISVPAAARTPGVPAVLKVTGPQATTGTPLVTMRPA 437
Cdd:COG3469 3 SVSTAASPTAGGASATAVTLLGAAATAASVTLtaaTATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATA 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 438 SQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGT 517
Cdd:COG3469 83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGG 162
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 2112827423 518 TTLPATVKVASSPVmVSNPATRMLKTAAAQVGTSVSSAANTST 560
Cdd:COG3469 163 TTTTSTTTTTTSAS-TTPSATTTATATTASGATTPSATTTATT 204
|
|
| DUF5585 | pfam17823 | Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. | 395-677 | 6.02e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.48 E-value: 6.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 395 LPTVPGSSISVPAAARTPGVPAVLkvTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQM 474
Cdd:pfam17823 114 ALAAAASSSPSSAAQSLPAAIAAL--PSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAA 191
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 475 SGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVmvsNPATRMLKTAAAQVGTSVSS 554
Cdd:pfam17823 192 SSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTV---TPAALATLAAAAGTVASAAG 268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 555 AANTST-----------RPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALISNLGKVMSVVQTKPV 623
Cdd:pfam17823 269 TINMGDpharrlspakhMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVV 348
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 2112827423 624 QTSAV-TGQASTGPVtqiiqtkgPLPAGTILKLV--TSADGKPTTIITTTQASGAGT 677
Cdd:pfam17823 349 TTTKAqAKEPSASPV--------PVLHTSMIPEVeaTSPTTQPSPLLPTQGAAGPGI 397
|
|
| PHA03255 | PHA03255 | BDLF3; Provisional | 614-754 | 7.62e-03 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 40.27 E-value: 7.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 614 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTI-ITTTQASGAGTKPTILGISSVSPSTT 692
Cdd:PHA03255 14 MILICETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSApITTTAILSTNTTTVTSTGTTVTPVPT 93
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2112827423 693 KPGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVmTSGTGAPAKIITAVPKIAT 754
Cdd:PHA03255 94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST-TSATTRITNATTLAPTLSS 154
|
|
| Kelch_3 | pfam13415 | Galactose oxidase, central domain; | 146-208 | 7.83e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 36.11 E-value: 7.83e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2112827423 146 GNKCYLFGGLANDSEDpknniprYLNDLYilELRPGSGVVAwdipiTYGVLPPPRESHTAVVY 208
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT-------RLNDLY--VYDLDTNTWT-----QIGDLPPPRSGHSATYI 49
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1868-1973 | 8.06e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.48 E-value: 8.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2112827423 1868 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqssqaggepKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1946
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2112827423 1947 HIDYTtkpaiiFRIAARNEKGYGPATQ 1973
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
| Kelch_1 | pfam01344 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 254-273 | 9.06e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 36.05 E-value: 9.06e-03
|
|