| Name | Accession | Description | Interval | E-value |
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 14-332 | 3.78e-19 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 88.29 E-value: 3.78e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 14 SFTGPVPRARHGHRAVAIRELMIIFGGGNEG-IADELHVYNTVTNQWflpaVRGDIPPGCAAHGFVC--DGTRILVFGGM 90
Cdd:COG3055 4 SSLPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGGF 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 91 VEY---GRYSNELYELQASRWLWKKVkpqpppsGLPPCPRLGHSFSLYGNKCYLFAGlanesedsnNNVPRYLNDFYELE 167
Cdd:COG3055 80 TGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVYD 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 168 LQHGSgvvgWSvpaTKGTVPSPRESHTAVIyckrdSGSPKMYVFGGMCGARLDDLWqldletmswskpETKGTVPLPRSL 247
Cdd:COG3055 144 PATGT----WT---QLAPLPTPRDHLAAAV-----LPDGKILVIGGRNGSGFSNTW------------TTLAPLPTARAG 199
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 248 HTASVIGNKMYIFGGwvphkgentENSPHDCEWRctssfsyLNLDTAEWTTLvsdsqedkkNSRPRPRAGHCAVAIGTRL 327
Cdd:COG3055 200 HAAAVLGGKILVFGG---------ESGFSDEVEA-------YDPATNTWTAL---------GELPTPRHGHAAVLTDGKV 254
|
....*
gi 56605790 328 YFWSG 332
Cdd:COG3055 255 YVIGG 259
|
|
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 9-264 | 2.41e-17 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 82.90 E-value: 2.41e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 9 WRRVSSFTGPvprARHGHRAVAIRELMIIFGG-----GNEGIADELHVYNTVTNQWFlpaVRGDIPPGCAAHGFVCDGTR 83
Cdd:COG3055 50 WSELAPLPGP---PRHHAAAVAQDGKLYVFGGftganPSSTPLNDVYVYDPATNTWT---KLAPMPTPRGGATALLLDGK 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 84 ILVFGGMVEYGRYSN-ELYELQASRWlwKKVKPQPPPsglppcpRLGHS-FSLYGNKCYLFAGlANESEDSNNnvpryln 161
Cdd:COG3055 124 IYVVGGWDDGGNVAWvEVYDPATGTW--TQLAPLPTP-------RDHLAaAVLPDGKILVIGG-RNGSGFSNT------- 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 162 dfyelelqhgsgvvgWSvpaTKGTVPSPRESHTAVIYckrdsgSPKMYVFGGMCGArLDDLWQLDLETMSWSkpeTKGTV 241
Cdd:COG3055 187 ---------------WT---TLAPLPTARAGHAAAVL------GGKILVFGGESGF-SDEVEAYDPATNTWT---ALGEL 238
|
250 260
....*....|....*....|...
gi 56605790 242 PLPRSLHTASVIGNKMYIFGGWV 264
Cdd:COG3055 239 PTPRHGHAAVLTDGKVYVIGGET 261
|
|
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 134-335 | 1.13e-15 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 77.89 E-value: 1.13e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 134 LYGNKCYLFAGLANESedsnnnvprYLNDFYELELQHGSgvvgWSvpaTKGTVPSPRESHTAVIYckrDSGspKMYVFGG 213
Cdd:COG3055 20 LLDGKVYVAGGLSGGS---------ASNSFEVYDPATNT----WS---ELAPLPGPPRHHAAAVA---QDG--KLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 214 MCGAR-----LDDLWQLDLETMSWSKpetKGTVPLPRSLHTASVIGNKMYIFGGWVPHKGentensphdcewrcTSSFSY 288
Cdd:COG3055 79 FTGANpsstpLNDVYVYDPATNTWTK---LAPMPTPRGGATALLLDGKIYVVGGWDDGGN--------------VAWVEV 141
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 56605790 289 LNLDTAEWTTLVSDsqedkknsrPRPRAGHCA-VAIGTRLYFWSGRDG 335
Cdd:COG3055 142 YDPATGTWTQLAPL---------PTPRDHLAAaVLPDGKILVIGGRNG 180
|
|
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 66-336 | 1.52e-15 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 77.50 E-value: 1.52e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 66 GDIP-PGCAAHGFVCDGtRILVFGGMvEYGRYSN--ELYELQASRWlwkkvkpqpPPSGLPPCPRLGHSFS-LYGNKCYL 141
Cdd:COG3055 7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNsfEVYDPATNTW---------SELAPLPGPPRHHAAAvAQDGKLYV 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 142 FAGLaneseDSNNNVPRYLNDFYELELQHGSgvvgWSvpaTKGTVPSPRESHTAVIYckrdsgSPKMYVFGGMCGA-RLD 220
Cdd:COG3055 76 FGGF-----TGANPSSTPLNDVYVYDPATNT----WT---KLAPMPTPRGGATALLL------DGKIYVVGGWDDGgNVA 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 221 DLWQLDLETMSWSkpeTKGTVPLPRSLHTASVIGN-KMYIFGGwvphkgentensphdcewrcTSSFSYLNldtaEWTTL 299
Cdd:COG3055 138 WVEVYDPATGTWT---QLAPLPTPRDHLAAAVLPDgKILVIGG--------------------RNGSGFSN----TWTTL 190
|
250 260 270
....*....|....*....|....*....|....*..
gi 56605790 300 vsdsqedkkNSRPRPRAGHCAVAIGTRLYFWSGRDGY 336
Cdd:COG3055 191 ---------APLPTARAGHAAAVLGGKILVFGGESGF 218
|
|
| PLN02193 | PLN02193 | nitrile-specifier protein | 2-262 | 4.82e-11 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 65.75 E-value: 4.82e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 2 AAPSLL-NWRRV-SSFTGPVPRARHGHRAVAIRelMIIFGGG---NEGIADELHVYNTVTNQWFLPAVRGDIPP-GCAAH 75
Cdd:PLN02193 145 STPKLLgKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGV 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 76 GFVCDGTRILVFGGMvEYGRYSNELYELQASRWLWKKVkpqpPPSGLPPCPRLGHSFSLYGNKCYLFAGLANESE----D 151
Cdd:PLN02193 223 RMVSIGSTLYVFGGR-DASRQYNGFYSFDTTTNEWKLL----TPVEEGPTPRSFHSMAADEENVYVFGGVSATARlktlD 297
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 152 SNNNVPRylndfyelelqhgsgvvGWSVPATkgtvpsPRESHTAVIYCKRDSGSPKMYVFGGMCGARLDDLWQLDLETMS 231
Cdd:PLN02193 298 SYNIVDK-----------------KWFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDK 354
|
250 260 270
....*....|....*....|....*....|.
gi 56605790 232 WSKPETKGTVPLPRSLHTASVIGNKMYIFGG 262
Cdd:PLN02193 355 WTQVETFGVRPSERSVFASAAVGKHIVIFGG 385
|
|
| PLN02153 | PLN02153 | epithiospecifier protein | 16-264 | 2.27e-09 |
|
epithiospecifier protein
Pssm-ID: 177814 [Multi-domain] Cd Length: 341 Bit Score: 59.61 E-value: 2.27e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 16 TGPVPRARHGHRAVAirELMIIFGGG---NEGIADELHVYNTVTNQWFLPAVRGDIPP-GCAAHGFVCDGTRILVFGGMV 91
Cdd:PLN02153 18 KGPGPRCSHGIAVVG--DKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRMVAVGTKLYIFGGRD 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 92 EYGRYSN-ELYELQASRWLWkkvkPQPPPSGLPPCPRLGHSFSLYGNKCYLFAGLaneSEDSNNNVPRYLNDFYELELQH 170
Cdd:PLN02153 96 EKREFSDfYSYDTVKNEWTF----LTKLDEEGGPEARTFHSMASDENHVYVFGGV---SKGGLMKTPERFRTIEAYNIAD 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 171 GsgvvGWSvpatkgTVPSPRESHTA--------------VIYCKRDSGSPkmyvfGGMCGARLDDLWQLDLETMSWSKPE 236
Cdd:PLN02153 169 G----KWV------QLPDPGENFEKrggagfavvqgkiwVVYGFATSILP-----GGKSDYESNAVQFFDPASGKWTEVE 233
|
250 260
....*....|....*....|....*...
gi 56605790 237 TKGTVPLPRSLHTASVIGNKMYIFGGWV 264
Cdd:PLN02153 234 TTGAKPSARSVFAHAVVGKYIIIFGGEV 261
|
|
| PLN02153 | PLN02153 | epithiospecifier protein | 17-268 | 4.83e-08 |
|
epithiospecifier protein
Pssm-ID: 177814 [Multi-domain] Cd Length: 341 Bit Score: 55.38 E-value: 4.83e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 17 GPVPR-ARHGHRAVAIRELMIIFGGGNEGIA-DELHVYNTVTNQW-FLPAVrgDIPPGCAA---HGFVCDGTRILVFGGM 90
Cdd:PLN02153 69 GDVPRiSCLGVRMVAVGTKLYIFGGRDEKREfSDFYSYDTVKNEWtFLTKL--DEEGGPEArtfHSMASDENHVYVFGGV 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 91 VEYG------RYSN-ELYELQASRWLwkkvkpQPPPSGLPPCPRLGHSFSLYGNKCYLFAGLANESedsnnnVPRYLNDF 163
Cdd:PLN02153 147 SKGGlmktpeRFRTiEAYNIADGKWV------QLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSI------LPGGKSDY 214
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 164 YELELQHGSGVVG-WSVPATKGTVPSPRE--SHTAViyckrdsgSPKMYVFGGMC----------GARLDDLWQLDLETM 230
Cdd:PLN02153 215 ESNAVQFFDPASGkWTEVETTGAKPSARSvfAHAVV--------GKYIIIFGGEVwpdlkghlgpGTLSNEGYALDTETL 286
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 56605790 231 SWSKPETKGTVPLPR---SLHTASVIG-NKMYIFGGWVPHKG 268
Cdd:PLN02153 287 VWEKLGECGEPAMPRgwtAYTTATVYGkNGLLMHGGKLPTNE 328
|
|
| PLN02193 | PLN02193 | nitrile-specifier protein | 232-375 | 5.92e-07 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 52.65 E-value: 5.92e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 232 WSKPETKGTVPLPRSLHTASVIGNKMYIFGgwvphkGENTENSPHDcewrctSSFSYLNLDTAEWTtlVSDSQEDKKNSR 311
Cdd:PLN02193 153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFG------GEFTPNQPID------KHLYVFDLETRTWS--ISPATGDVPHLS 218
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 56605790 312 PrprAGHCAVAIGTRLYFWSGRDGYKK--ALNSQVCCKDLWYLDT--EKPPAPSQVQLIKATTNSFHV 375
Cdd:PLN02193 219 C---LGVRMVSIGSTLYVFGGRDASRQynGFYSFDTTTNEWKLLTpvEEGPTPRSFHSMAADEENVYV 283
|
|
| Kelch_3 | pfam13415 | Galactose oxidase, central domain; | 207-253 | 1.38e-06 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 45.74 E-value: 1.38e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 56605790 207 KMYVFGGMC---GARLDDLWQLDLETMSWSKPetkGTVPLPRSLHTASVI 253
Cdd:pfam13415 3 KLYIFGGLGfdgQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
|
|
| PLN02193 | PLN02193 | nitrile-specifier protein | 77-262 | 2.15e-06 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 50.72 E-value: 2.15e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 77 FVCDGTRILVFggmveYGRYSNELYELQASRWL---------WKKVKPQPPPSGLppcpRLGHSFSLYGNKCYLFAGlan 147
Cdd:PLN02193 116 FVLQGGKIVGF-----HGRSTDVLHSLGAYISLpstpkllgkWIKVEQKGEGPGL----RCSHGIAQVGNKIYSFGG--- 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 148 eSEDSNNNVPRYLNDFyELELQhgsgvvGWSVPATKGTVPspresHTAVIYCKRDSGSPKMYVFGGMCGAR-LDDLWQLD 226
Cdd:PLN02193 184 -EFTPNQPIDKHLYVF-DLETR------TWSISPATGDVP-----HLSCLGVRMVSIGSTLYVFGGRDASRqYNGFYSFD 250
|
170 180 190
....*....|....*....|....*....|....*...
gi 56605790 227 LETMSWS--KPETKGtvPLPRSLHTASVIGNKMYIFGG 262
Cdd:PLN02193 251 TTTNEWKllTPVEEG--PTPRSFHSMAADEENVYVFGG 286
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 549-603 | 6.56e-06 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 45.18 E-value: 6.56e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 56605790 549 PKGKQSMSKVGNADVPDYSLlKKQDLVPGTVYKFRVAAINGCGIGPFSKLSEFKT 603
Cdd:cd00063 40 EKGSGDWKEVEVTPGSETSY-TLTGLKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 8-104 | 1.18e-05 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 47.84 E-value: 1.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 8 NWRRVSSFtgPVPRARHGHRAVAIRelmIIFGGGNEGIADELHVYNTVTNQWFlpaVRGDIPPGCAAHGFVCDGTRILVF 87
Cdd:COG3055 186 TWTTLAPL--PTARAGHAAAVLGGK---ILVFGGESGFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVI 257
|
90
....*....|....*..
gi 56605790 88 GGMVEYGRYSNELYELQ 104
Cdd:COG3055 258 GGETKPGVRTPLVTSAE 274
|
|
| PRK14131 | PRK14131 | N-acetylneuraminate epimerase; | 9-89 | 1.56e-05 |
|
N-acetylneuraminate epimerase;
Pssm-ID: 237617 [Multi-domain] Cd Length: 376 Bit Score: 47.70 E-value: 1.56e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 9 WRRVSSFTGPvprARHGHRAVAIRELMIIFGG----GNEG---IADELHVYNTVTNQWFLPAVRGdiPPGCAAH-GFVCD 80
Cdd:PRK14131 64 WTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAGHvAVSLH 138
|
....*....
gi 56605790 81 GTRILVFGG 89
Cdd:PRK14131 139 NGKAYITGG 147
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 609-709 | 7.38e-05 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 42.10 E-value: 7.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 609 PGAPSTVRISKNVEG-IHLSWEPPTSPSGNILEYSaylaIRTAQVQDNPSQLVfmRIYCGLKTSCIVTagQLaNAHIDYT 687
Cdd:cd00063 1 PSPPTNLRVTDVTSTsVTLSWTPPEDDGGPITGYV----VEYREKGSGDWKEV--EVTPGSETSYTLT--GL-KPGTEYE 71
|
90 100
....*....|....*....|..
gi 56605790 688 srpaivFRISAKNEKGYGPATQ 709
Cdd:cd00063 72 ------FRVRAVNGGGESPPSE 87
|
|
| FN3 | COG3401 | Fibronectin type 3 domain [General function prediction only]; | 573-634 | 8.95e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 45.76 E-value: 8.95e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 56605790 573 DLVPGTVYKFRVAAINGCGI-GPFSKLSEFKTCIPGfPGAPSTVRISKNVE-GIHLSWEPPTSP 634
Cdd:COG3401 291 GLTNGTTYYYRVTAVDAAGNeSAPSNVVSVTTDLTP-PAAPSGLTATAVGSsSITLSWTASSDA 353
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 572-596 | 2.94e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 40.09 E-value: 2.94e-04
10 20
....*....|....*....|....*
gi 56605790 572 QDLVPGTVYKFRVAAINGCGIGPFS 596
Cdd:pfam00041 61 TGLKPGTEYEVRVQAVNGGGEGPPS 85
|
|
| Kelch_3 | pfam13415 | Galactose oxidase, central domain; | 254-323 | 3.46e-04 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 38.81 E-value: 3.46e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 254 GNKMYIFGGWVPHKGEntensphdcewrCTSSFSYLNLDTAEWTTLvsdsqedkkNSRPRPRAGHCAVAI 323
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI---------GDLPPPRSGHSATYI 49
|
|
| PLN02153 | PLN02153 | epithiospecifier protein | 232-334 | 5.73e-04 |
|
epithiospecifier protein
Pssm-ID: 177814 [Multi-domain] Cd Length: 341 Bit Score: 42.67 E-value: 5.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 232 WSKPETKG-TVPLPRSLHTASVIGNKMYIFGGwvphkgENTENSPHDcewrctSSFSYLNLDTAEWttlvsdSQEDKKNS 310
Cdd:PLN02153 9 WIKVEQKGgKGPGPRCSHGIAVVGDKLYSFGG------ELKPNEHID------KDLYVFDFNTHTW------SIAPANGD 70
|
90 100
....*....|....*....|....*
gi 56605790 311 RPRPRA-GHCAVAIGTRLYFWSGRD 334
Cdd:PLN02153 71 VPRISClGVRMVAVGTKLYIFGGRD 95
|
|
| Kelch_1 | pfam01344 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 244-301 | 6.13e-04 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 37.98 E-value: 6.13e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 56605790 244 PRSLHTASVIGNKMYIFGGWvphkgentensphdCEWRCTSSFSYLNLDTAEWTTLVS 301
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGF--------------DGNQSLNSVEVYDPETNTWSKLPS 44
|
|
| Kelch_5 | pfam13854 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 312-355 | 1.18e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.16 E-value: 1.18e-03
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 56605790 312 PRPRAGHCAVAIGTRLYFWSGRDGYkkalNSQVcCKDLWYLDTE 355
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGYTGG----EGQP-SDDVYVLSLP 39
|
|
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 231-337 | 1.18e-03 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 41.29 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 231 SWSkpeTKGTVPLPRSLHTASVIGNKMYIFGGWVphkgentensphdcEWRCTSSFSYLNLDTAEWTTLVSDSQEdkkns 310
Cdd:COG3055 2 TWS---SLPDLPTPRSEAAAALLDGKVYVAGGLS--------------GGSASNSFEVYDPATNTWSELAPLPGP----- 59
|
90 100
....*....|....*....|....*..
gi 56605790 311 rprPRAGHCAVAIGTRLYFWSGRDGYK 337
Cdd:COG3055 60 ---PRHHAAAVAQDGKLYVFGGFTGAN 83
|
|
| Kelch_5 | pfam13854 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 187-229 | 1.62e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 36.77 E-value: 1.62e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 56605790 187 PSPRESHTAVIYckrdsgSPKMYVFGGMCGAR---LDDLWQLDLET 229
Cdd:pfam13854 1 PVPRYGHCAVTV------GDYIYLYGGYTGGEgqpSDDVYVLSLPT 40
|
|
| Kelch_4 | pfam13418 | Galactose oxidase, central domain; | 189-234 | 2.49e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433191 [Multi-domain] Cd Length: 49 Bit Score: 36.44 E-value: 2.49e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 56605790 189 PRESHTAViYCKRDSGspkmYVFGGMC--GARLDDLWQLDLETMSWSK 234
Cdd:pfam13418 1 PRAYHTST-SIPDDTI----YLFGGEGedGTLLSDLWVFDLSTNEWTR 43
|
|
| Kelch_1 | pfam01344 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 22-59 | 4.11e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 35.67 E-value: 4.11e-03
10 20 30
....*....|....*....|....*....|....*....
gi 56605790 22 ARHGHRAVAIRELMIIFGGGNEGIA-DELHVYNTVTNQW 59
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGFDGNQSlNSVEVYDPETNTW 39
|
|
| FN3 | smart00060 | Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... | 609-705 | 5.18e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 36.44 E-value: 5.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 609 PGAPSTVRISKNVEG-IHLSWEPPTSPSGN--ILEYsaylairtaQVQDNPSQLVFMRIYC-GLKTSCIVTagQLaNAHI 684
Cdd:smart00060 1 PSPPSNLRVTDVTSTsVTLSWEPPPDDGITgyIVGY---------RVEYREEGSEWKEVNVtPSSTSYTLT--GL-KPGT 68
|
90 100
....*....|....*....|.
gi 56605790 685 DYTsrpaivFRISAKNEKGYG 705
Cdd:smart00060 69 EYE------FRVRAVNGAGEG 83
|
|
| fn3 | pfam00041 | Fibronectin type III domain; | 610-708 | 5.38e-03 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 36.62 E-value: 5.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 610 GAPSTVRIS-KNVEGIHLSWEPPTSPSGNILEYSaylaIRTAQVQDNPSQLVFMRiyCGLKTSCIVTagQLaNAHIDYTs 688
Cdd:pfam00041 1 SAPSNLTVTdVTSTSLTVSWTPPPDGNGPITGYE----VEYRPKNSGEPWNEITV--PGTTTSVTLT--GL-KPGTEYE- 70
|
90 100
....*....|....*....|
gi 56605790 689 rpaivFRISAKNEKGYGPAT 708
Cdd:pfam00041 71 -----VRVQAVNGGGEGPPS 85
|
|
| FN3 | COG3401 | Fibronectin type 3 domain [General function prediction only]; | 556-635 | 7.04e-03 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 39.60 E-value: 7.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 56605790 556 SKVGNADVPDYSLLKKQDLVPGTVYKFRVAAINGCGIGPFSKLSEFKTcIPGFPGAPSTVRISKNVEG-IHLSWEPPTSP 634
Cdd:COG3401 181 ATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTT-PTTPPSAPTGLTATADTPGsVTLSWDPVTES 259
|
.
gi 56605790 635 S 635
Cdd:COG3401 260 D 260
|
|
|
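Each hit above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and each alignment block gives the first and last model (Cdd) coordinates. The following is a minimal Python sketch, not part of the original output, showing how such a line might be parsed and how the fraction of the domain model covered by an alignment could be estimated. The names `HitStats`, `parse_stats`, and `model_coverage` are hypothetical, and the pattern assumes the statistics line looks exactly as it does in this listing.

```python
import re
from typing import NamedTuple


class HitStats(NamedTuple):
    pssm_id: int
    cd_length: int      # length of the domain model, in residues
    bit_score: float
    e_value: float


# Pattern for the per-hit statistics line as it appears in this listing
# (an assumption about the format, not an official specification).
STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*\[[^\]]*\]\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*([\d.eE+-]+)"
)


def parse_stats(line: str) -> HitStats:
    """Parse one 'Pssm-ID: ... E-value: ...' line into typed fields."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit statistics line: {line!r}")
    pssm_id, cd_length, bit_score, e_value = match.groups()
    return HitStats(int(pssm_id), int(cd_length), float(bit_score), float(e_value))


def model_coverage(cdd_start: int, cdd_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (inclusive coordinates)."""
    return (cdd_end - cdd_start + 1) / cd_length


if __name__ == "__main__":
    stats = parse_stats(
        "Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 88.29 E-value: 3.78e-19"
    )
    # The top COG3055 hit aligns model positions 4 through 259.
    print(stats, round(model_coverage(4, 259, stats.cd_length), 3))
```

Coverage computed this way counts the span between the first and last aligned model positions, so gaps inside the alignment are not subtracted; it is a rough indicator of how much of the model a hit touches, not a measure of alignment quality.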