|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
25-344 |
1.82e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; :
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 96.38 E-value: 1.82e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2090188793 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1813-1842 |
2.55e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases. :
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.55e-03
10 20 30
....*....|....*....|....*....|
gi 2090188793 1813 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1842
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| DUF5585 super family |
cl39316 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
464-767 |
6.43e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. The actual alignment was detected with superfamily member pfam17823:
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.48 E-value: 6.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 464 LPTVPGSSisVPAAARTQGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQM 543
Cdd:pfam17823 114 ALAAAASS--SPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAA 191
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 544 SGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVmvsNPATRMLKTAAAQVGTSVSS 623
Cdd:pfam17823 192 SSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTV---TPAALATLAAAAGTVASAAG 268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 624 AANTST-----------RPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALISNLGKVMSVVQTKPV 692
Cdd:pfam17823 269 TINMGDpharrlspakhMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVV 348
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2090188793 693 QTSAV-TGQASTGPVtqiiqtkgPLPAGTILKLV--TSADGKPTTIITTTQASGAGTKPTILGISS-VSPSTTKPGTTT 767
Cdd:pfam17823 349 TTTKAqAKEPSASPV--------PVLHTSMIPEVeaTSPTTQPSPLLPTQGAAGPGILLAPEQVATeATAGTASAGPTP 419
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1848-1953 |
9.15e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases. :
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.48 E-value: 9.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 1848 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqsaqaggetKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1926
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2090188793 1927 HIDYTtkpaiiFRIAARNEKGYGPATQ 1953
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
25-344 |
1.82e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 96.38 E-value: 1.82e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2090188793 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
10-322 |
3.19e-16 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 83.85 E-value: 3.19e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 10 SPAVLLQPRWKRV-VGWSGPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAA 84
Cdd:PLN02193 144 PSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLG 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 85 YGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpk 163
Cdd:PLN02193 222 VRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR--- 292
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 164 nniPRYLNDLYILELRpgsgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIETLTWN 243
Cdd:PLN02193 293 ---LKTLDSYNIVDKK-------WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWT 356
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2090188793 244 KPSLSGVAPLPRSLHSATTIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193 357 QVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
|
|
| Kelch_1 |
pfam01344 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
32-69 |
7.43e-05 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 41.83 E-value: 7.43e-05
10 20 30
....*....|....*....|....*....|....*....
gi 2090188793 32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1813-1842 |
2.55e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.55e-03
10 20 30
....*....|....*....|....*....|
gi 2090188793 1813 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1842
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
464-767 |
6.43e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.48 E-value: 6.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 464 LPTVPGSSisVPAAARTQGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQM 543
Cdd:pfam17823 114 ALAAAASS--SPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAA 191
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 544 SGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVmvsNPATRMLKTAAAQVGTSVSS 623
Cdd:pfam17823 192 SSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTV---TPAALATLAAAAGTVASAAG 268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 624 AANTST-----------RPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALISNLGKVMSVVQTKPV 692
Cdd:pfam17823 269 TINMGDpharrlspakhMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVV 348
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2090188793 693 QTSAV-TGQASTGPVtqiiqtkgPLPAGTILKLV--TSADGKPTTIITTTQASGAGTKPTILGISS-VSPSTTKPGTTT 767
Cdd:pfam17823 349 TTTKAqAKEPSASPV--------PVLHTSMIPEVeaTSPTTQPSPLLPTQGAAGPGILLAPEQVATeATAGTASAGPTP 419
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1848-1953 |
9.15e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.48 E-value: 9.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 1848 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqsaqaggetKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1926
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2090188793 1927 HIDYTtkpaiiFRIAARNEKGYGPATQ 1953
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
25-344 |
1.82e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 96.38 E-value: 1.82e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldietltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2090188793 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
76-345 |
8.39e-17 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 82.90 E-value: 8.39e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 76 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLV-GNKCYLFG 153
Cdd:COG3055 7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 154 GLandseDPKNNIPRYLNDLYILELRPGSgvvaWdipITYGVLPPPRESHTAVVYtekDNKKskLVIYGGMSGCRLGDLW 233
Cdd:COG3055 78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLL---DGKI--YVVGGWDDGGNVAWVE 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 234 TLDIETLTWNKPslsGVAPLPRSLHSATTIGN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldtmawetilmd 312
Cdd:COG3055 141 VYDPATGTWTQL---APLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
|
250 260 270
....*....|....*....|....*....|...
gi 2090188793 313 TLEDNIPRARAGHCAVAINTRLYIWSGRDGYRK 345
Cdd:COG3055 188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
10-322 |
3.19e-16 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 83.85 E-value: 3.19e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 10 SPAVLLQPRWKRV-VGWSGPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAA 84
Cdd:PLN02193 144 PSTPKLLGKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLG 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 85 YGFVCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpk 163
Cdd:PLN02193 222 VRMVSIGSTLYVFGGRDASRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR--- 292
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 164 nniPRYLNDLYILELRpgsgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIETLTWN 243
Cdd:PLN02193 293 ---LKTLDSYNIVDKK-------WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWT 356
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2090188793 244 KPSLSGVAPLPRSLHSATTIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193 357 QVETFGVRPSERSVFASAAVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
|
|
| PLN02153 |
PLN02153 |
epithiospecifier protein |
12-330 |
3.34e-15 |
|
epithiospecifier protein
Pssm-ID: 177814 [Multi-domain] Cd Length: 341 Bit Score: 79.26 E-value: 3.34e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 12 AVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGF 87
Cdd:PLN02153 2 APTLQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRM 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 88 VCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLKAKTPKNGPPpcPRLGHSFSLVGNKCYLFGGLANDS-------- 159
Cdd:PLN02153 82 VAVGTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPE--ARTFHSMASDENHVYVFGGVSKGGlmktperf 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 160 ----------------EDPKNNiprylndlyiLELRPGSG--VVAWDIPITYGVLpppreshTAVVYTEKDNKKSKLVIY 221
Cdd:PLN02153 159 rtieayniadgkwvqlPDPGEN----------FEKRGGAGfaVVQGKIWVVYGFA-------TSILPGGKSDYESNAVQF 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 222 ggmsgcrlgdlwtLDIETLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGwvpLVMDDVKvaTHEKEWKCTNTLACLNL 301
Cdd:PLN02153 222 -------------FDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGG---EVWPDLK--GHLGPGTLSNEGYALDT 283
|
330 340
....*....|....*....|....*....
gi 2090188793 302 DTMAWETiLMDTLEDNIPRARAGHCAVAI 330
Cdd:PLN02153 284 ETLVWEK-LGECGEPAMPRGWTAYTTATV 311
|
|
| PRK14131 |
PRK14131 |
N-acetylneuraminate epimerase; |
18-99 |
2.35e-05 |
|
N-acetylneuraminate epimerase;
Pssm-ID: 237617 [Multi-domain] Cd Length: 376 Bit Score: 48.86 E-value: 2.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 18 RWKRVVGWSGPvprPRHGHRAVAIKELIVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 89
Cdd:PRK14131 63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
|
90
....*....|
gi 2090188793 90 DGTRLLVFGG 99
Cdd:PRK14131 138 HNGKAYITGG 147
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
27-114 |
5.34e-05 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 47.07 E-value: 5.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 27 GPVPRPRHGHRAVAIKELIVVFGGGNeGIVDELHVYNTATNQWFipaVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKY 106
Cdd:COG3055 191 APLPTARAGHAAAVLGGKILVFGGES-GFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVIGGETKPGVR 266
|
....*...
gi 2090188793 107 SNDLYELQ 114
Cdd:COG3055 267 TPLVTSAE 274
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
242-393 |
7.37e-05 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 47.64 E-value: 7.37e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 242 WNKPSLSGVAPLPRSLHSATTIGNKMYVFGG-WVPlvmdDVKVATHekewkctntLACLNLDTMAWEtilMDTLEDNIPR 320
Cdd:PLN02193 153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGeFTP----NQPIDKH---------LYVFDLETRTWS---ISPATGDVPH 216
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 321 ARA-GHCAVAINTRLYIWSGRDGYRK--AWNNQVCCKDLWYLET--EKPPPPARVQLVRANTNSLEVSWGAVATA----- 390
Cdd:PLN02193 217 LSClGVRMVSIGSTLYVFGGRDASRQynGFYSFDTTTNEWKLLTpvEEGPTPRSFHSMAADEENVYVFGGVSATArlktl 296
|
...
gi 2090188793 391 DSY 393
Cdd:PLN02193 297 DSY 299
|
|
| Kelch_1 |
pfam01344 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
32-69 |
7.43e-05 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 41.83 E-value: 7.43e-05
10 20 30
....*....|....*....|....*....|....*....
gi 2090188793 32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
264-330 |
1.12e-04 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 41.51 E-value: 1.12e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2090188793 264 GNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAI 330
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
241-348 |
2.05e-04 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 45.53 E-value: 2.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 241 TWNK-PSLsgvaPLPRSLHSATTIGNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDTMAWETIlmdtleDNIP 319
Cdd:COG3055 2 TWSSlPDL----PTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLP 57
|
90 100 110
....*....|....*....|....*....|
gi 2090188793 320 RARAGH-CAVAINTRLYIWSGRDGYRKAWN 348
Cdd:COG3055 58 GPPRHHaAAVAQDGKLYVFGGFTGANPSST 87
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
215-263 |
7.65e-04 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 39.20 E-value: 7.65e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 2090188793 215 KSKLVIYGG---MSGCRLGDLWTLDIETLTWNKPslsGVAPLPRSLHSATTI 263
Cdd:pfam13415 1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
|
|
| Kelch_4 |
pfam13418 |
Galactose oxidase, central domain; |
32-80 |
1.31e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433191 [Multi-domain] Cd Length: 49 Bit Score: 38.36 E-value: 1.31e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 2090188793 32 PRHGHRAVAIKE-LIVVFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 80
Cdd:pfam13418 1 PRAYHTSTSIPDdTIYLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
|
|
| Kelch_5 |
pfam13854 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
30-67 |
2.54e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.54 E-value: 2.54e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2090188793 30 PRPRHGHRAVAIKELIVVFGG---GNEGIVDELHVYNTATN 67
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1813-1842 |
2.55e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.55e-03
10 20 30
....*....|....*....|....*....|
gi 2090188793 1813 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1842
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| Kelch_5 |
pfam13854 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
319-351 |
2.73e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.16 E-value: 2.73e-03
10 20 30
....*....|....*....|....*....|...
gi 2090188793 319 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
|
|
| PHA03098 |
PHA03098 |
kelch-like protein; Provisional |
38-295 |
6.09e-03 |
|
kelch-like protein; Provisional
Pssm-ID: 222983 [Multi-domain] Cd Length: 534 Bit Score: 41.29 E-value: 6.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 38 AVAIKELIVVFGGGNEG--IVDELHVYNTATNQWF-IPavrgDIPPGCAAYGFVCDGTRLLVFGGmVEYGKYSNDLYELQ 114
Cdd:PHA03098 290 SVVLNNVIYFIGGMNKNnlSVNSVVSYDTKTKSWNkVP----ELIYPRKNPGVTVFNNRIYVIGG-IYNSISLNTVESWK 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 115 ASRWEWKRLKaktpkngPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNnIPRYlnDLYILELRPGSGvvawdipityg 194
Cdd:PHA03098 365 PGESKWREEP-------PLIFPRYNPCVVNVNNLIYVIGGISKNDELLKT-VECF--SLNTNKWSKGSP----------- 423
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 195 vLPPPRESHTAVVYtekdnkKSKLVIYGGMS----GCRLGDLWTLDIETLTWNKPSLSGVaplPRSLHSATTIGNKMYVF 270
Cdd:PHA03098 424 -LPISHYGGCAIYH------DGKIYVIGGISyidnIKVYNIVESYNPVTNKWTELSSLNF---PRINASLCIFNNKIYVV 493
|
250 260
....*....|....*....|....*
gi 2090188793 271 GGWvplvMDDVKVATHEKEWKCTNT 295
Cdd:PHA03098 494 GGD----KYEYYINEIEVYDDKTNT 514
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
464-767 |
6.43e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.48 E-value: 6.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 464 LPTVPGSSisVPAAARTQGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQM 543
Cdd:pfam17823 114 ALAAAASS--SPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAA 191
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 544 SGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTVAVTPGTTTLPATVKVASSPVmvsNPATRMLKTAAAQVGTSVSS 623
Cdd:pfam17823 192 SSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTV---TPAALATLAAAAGTVASAAG 268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 624 AANTST-----------RPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALISNLGKVMSVVQTKPV 692
Cdd:pfam17823 269 TINMGDpharrlspakhMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVV 348
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2090188793 693 QTSAV-TGQASTGPVtqiiqtkgPLPAGTILKLV--TSADGKPTTIITTTQASGAGTKPTILGISS-VSPSTTKPGTTT 767
Cdd:pfam17823 349 TTTKAqAKEPSASPV--------PVLHTSMIPEVeaTSPTTQPSPLLPTQGAAGPGILLAPEQVATeATAGTASAGPTP 419
|
|
| Kelch_4 |
pfam13418 |
Galactose oxidase, central domain; |
199-254 |
8.94e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433191 [Multi-domain] Cd Length: 49 Bit Score: 36.05 E-value: 8.94e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 2090188793 199 PRESHTAVVytekdNKKSKLVIYGGMS--GCRLGDLWTLDIETLTWNKpslsgVAPLP 254
Cdd:pfam13418 1 PRAYHTSTS-----IPDDTIYLFGGEGedGTLLSDLWVFDLSTNEWTR-----LGSLP 48
|
|
| Kelch_1 |
pfam01344 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
254-273 |
8.97e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 36.05 E-value: 8.97e-03
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1848-1953 |
9.15e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.48 E-value: 9.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2090188793 1848 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVYLaiqsaqaggetKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1926
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVVEY-----------REKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2090188793 1927 HIDYTtkpaiiFRIAARNEKGYGPATQ 1953
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
146-208 |
9.16e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 36.11 E-value: 9.16e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2090188793 146 GNKCYLFGGLANDSEDpknniprYLNDLYilELRPGSGVVAwdipiTYGVLPPPRESHTAVVY 208
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT-------RLNDLY--VYDLDTNTWT-----QIGDLPPPRSGHSATYI 49
|
|
|