Conserved domains on [gi|1226657552|ref|WP_093941491|]

cellulase family glycosylhydrolase [Actinoalloteichus hoggarensis]

Protein Classification

cellulase family glycosylhydrolase; fibronectin type III domain-containing protein (domain architecture ID 10445025)

cellulase family glycosylhydrolase containing a cellulose binding domain; similar to Streptomyces lividans endoglucanase CelA that catalyzes endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans; fibronectin type III (FN3) domain-containing protein similar to human Target of Nesh-SH3 (Tarsh), also called ABI gene family member 3-binding protein (ABI3BP), with 2 repeats
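
The architecture described above can be roughly reproduced from the intervals and E-values in the list of domain hits on this page. The sketch below keeps the lowest-E-value hit for each region of the query and discards any hit that overlaps one already kept. It is an illustrative greedy selection only, not the procedure CDD or SPARCLE actually uses; the `Hit` type and the `overlaps` and `pick_architecture` functions are hypothetical names introduced for this example.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int      # first query residue covered (1-based, inclusive)
    end: int        # last query residue covered (1-based, inclusive)
    evalue: float


# Intervals and E-values copied from the domain hit list on this page.
HITS = [
    Hit("Cellulase", "pfam00150", 40, 287, 3.61e-37),
    Hit("BglC", "COG2730", 43, 294, 5.74e-36),
    Hit("CBM_2", "pfam00553", 434, 534, 1.24e-34),
    Hit("CelA1", "COG5297", 280, 537, 2.80e-29),
    Hit("CBD_II", "smart00637", 441, 533, 2.04e-26),
    Hit("FN3", "cd00063", 354, 425, 3.67e-09),
    Hit("FN3", "COG3401", 354, 535, 4.27e-08),
    Hit("FN3", "smart00060", 354, 412, 5.43e-07),
    Hit("fn3", "pfam00041", 354, 413, 2.65e-06),
]


def overlaps(a: Hit, b: Hit) -> bool:
    """True if the two query intervals share at least one residue."""
    return a.start <= b.end and b.start <= a.end


def pick_architecture(hits):
    """Greedy selection: best E-value first, skip hits overlapping a kept hit."""
    chosen = []
    for hit in sorted(hits, key=lambda h: h.evalue):
        if not any(overlaps(hit, kept) for kept in chosen):
            chosen.append(hit)
    # Report the kept hits in N- to C-terminal order.
    return sorted(chosen, key=lambda h: h.start)


for hit in pick_architecture(HITS):
    print(f"{hit.start:>4}-{hit.end:<4} {hit.name} ({hit.accession})")
```

Under these assumptions the selection keeps one catalytic domain, one FN3 domain and one cellulose binding domain, in that order along the sequence, which agrees with the classification text.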

List of domain hits

Name Accession Description Interval E-value
Cellulase pfam00150
Cellulase (glycosyl hydrolase family 5);
40-287 3.61e-37

Pssm-ID: 395098 [Multi-domain]  Cd Length: 272  Bit Score: 138.28  E-value: 3.61e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552  40 YDANGQEFVIRGVSHPHTWFTSE-TTRSLAD-IKSLGANTVRIVLSSGD--------QWTENSTADVANVVSQCKANRLI 109
Cdd:pfam00150   1 VDANGKPVQLRGVTHGGQWGNPYvTTKAMIDlVKDWGFNVVRLPVSWGGyvpnnpdyLIDENWLNRVDEVVDYAIDNGMY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 110 CMLEVHDTTGYGEQAAAvSLDRAVDY----WLRIQDAViGEEDYVLVNIGNEPYGNtDYAGWGADT----SAAINRLRDA 181
Cdd:pfam00150  81 VIIDWHHDGGWPGDPNG-NIDTAKAFfkkiWTQIATRY-GNNPNVIFELMNEPHGN-DQATWADDVkdyaQEAIDAIRAA 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 182 GFDHTLVVDGPNWGQDWSGTMrsqapevfASDPHA--NTVFSIHMYG-------------VYDTAAEIEDYLHHFVDQGL 246
Cdd:pfam00150 158 GPNNLIIVGGNSWSQNPDGAA--------LNDPNDddNLIYSVHFYApsdfsgtwfdcedPTNLAQRLRAAANWALDNGI 229
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|...
gi 1226657552 247 PIIVGEFGHNHSDG--DPDEDTIMATTEDLGLGYIGWSWSGNG 287
Cdd:pfam00150 230 PVFIGEFGGGNADGpcRDEAEKWLDYLKENGISWTGWSNGNKS 272
CBM_2 pfam00553
Cellulose binding domain; Two tryptophan residues are involved in cellulose binding. Cellulose binding domain found in bacteria.
434-534 1.24e-34

Pssm-ID: 425748 [Multi-domain]  Cd Length: 101  Bit Score: 125.64  E-value: 1.24e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 434 CAVTYRVSGEWNGGFQAEVTIRNTGTSPVDGWTLGWSYLDGQTVTNVWGGVATQTGDTVTVRSVDYTAAIPVGGSVSFGF 513
Cdd:pfam00553   1 CTATYAVTNQWGTGFTANVTITNTGSSPINGWTVSWTYPAGQRVTQSWNATVSQSGNPVTATNVSWNGTIAPGGSASFGF 80
                          90       100
                  ....*....|....*....|.
gi 1226657552 514 LGSWTGANSAPSAFTLNDGSC 534
Cdd:pfam00553  81 QGSGTGSNSAPTSFTVNGAAC 101
CelA1 COG5297
Cellulase/cellobiase CelA1 [Carbohydrate transport and metabolism];
280-537 2.80e-29

Pssm-ID: 444099 [Multi-domain]  Cd Length: 301  Bit Score: 117.23  E-value: 2.80e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 280 GWSWSGNGSPVEYLDMVHGFDGDSLTPWGERIFHGADGIAQTSREASVFGGGSDTTAPTTPGTPSVSAVSASSVTLTWAP 359
Cdd:COG5297    43 GSASGSSASGAGAGAGAAASGADTGSGGGAGAGAGVGVGAGGAAGGAGGSAGVAGAASAAGAAGAAATSSGATSSSSSGT 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 360 STDDVGVTGYQVVRVAGGQETAAISTPTNSAIVGGLSPDTDYTFAVYARDAAGNRSARSAT----TTVTTESGPAAGSCA 435
Cdd:COG5297   123 SSGGGGGSSSSTSDTTTSSATSGSSTSTSSPTTGSTSTTTGGGIANQPTAVWDRIAAITATvrahLDAALAAPLPGRDCG 202
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 436 VTYRVSGEWNGGFQAEVTIRNTGTSPVDGWTLGWSYLDGQTVTNVWGGVATQTGDTVTVRSVDYTAAIPVGGSVSFGFLG 515
Cdd:COG5297   203 VTYSVYSDWGSGYTAAVTVTNPGSLPLNGWTLAWDLPGGAGVTNAWGFTLNQSGYTVTATNVSWNGTLAPGGSVSFGFDT 282
                         250       260
                  ....*....|....*....|..
gi 1226657552 516 SWTGANsaPSAFTLnDGSCTQG 537
Cdd:COG5297   283 SRNGWN--PSGRGL-DGACTGG 301
 
BglC COG2730
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism];
43-294 5.74e-36

Pssm-ID: 442036 [Multi-domain]  Cd Length: 295  Bit Score: 135.94  E-value: 5.74e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552  43 NGQEFVIrgvsHPHTWFTSETTRSLADIKSLGANTVRIVLS--------SGDQWTENSTADVANVVSQCKANRLICMLEV 114
Cdd:COG2730    12 LGNWLEL----WFETLWGNITEEDIDAIADWGFNTVRLPVSwerlqdpdNPYTLDEAYLERVDEVVDWAKARGLYVILDL 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 115 HDTTGYGEQAAAVSLDRAVDYWLRIQDAVIGEEDYVLVNIGNEPYGNTdYAGWGADTSAAINRLRDAGFDHTLVVDGPNW 194
Cdd:COG2730    88 HHAPGYQGWYDAATQERFIAFWRQLAERYKDYPNVLGFELLNEPHGAT-WADWNALAQRAIDAIRATNPDRLIIVEGNNW 166
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 195 GQDWsgTMRSQAPEvfaSDPhaNTVFSIHMYGVYD------------TAAEIEDYLHHF----VDQGLPIIVGEFGHNHS 258
Cdd:COG2730   167 GGAH--NLRALDPL---DDD--NLVYSVHFYGPFVfthqgawfagptYPANLEARLDNWgdwaADNGVPVFVGEFGAYND 239
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 1226657552 259 DGDPDE----DTIMATTEDLGLGYIGWSWSGNGSPVEYLD 294
Cdd:COG2730   240 DPDASRlawlRDLLDYLEENGIGWTYWSFNPSGDTGGLLD 279
CBD_II smart00637
CBD_II domain;
441-533 2.04e-26

Pssm-ID: 214754 [Multi-domain]  Cd Length: 92  Bit Score: 102.87  E-value: 2.04e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552  441 SGEWNGGFQAEVTIRNTGTSPVDGWTLGWSYLDGQTVTNVWGGVATQTGDTVTVRSVDYTAAIPVGGSVSFGFLGsWTGA 520
Cdd:smart00637   1 TSDWGSGFTANVTVTNTGSSAINGWTLTFDLPGGQTVTNSWNATVSQSGGHVTATNASWNGTIAPGGSVSFGFQG-KTGS 79
                           90
                   ....*....|...
gi 1226657552  521 NSAPSAFTLNDGS 533
Cdd:smart00637  80 SAAPTGFTLNGAA 92
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
354-425 3.67e-09

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 54.04  E-value: 3.67e-09
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1226657552 354 TLTWAPSTDDVG-VTGYQV-VRVAGGQETAAISTP---TNSAIVGGLSPDTDYTFAVYARDAAGnRSARSATTTVTT 425
Cdd:cd00063    18 TLSWTPPEDDGGpITGYVVeYREKGSGDWKEVEVTpgsETSYTLTGLKPGTEYEFRVRAVNGGG-ESPPSESVTVTT 93
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
354-535 4.27e-08

Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 55.78  E-value: 4.27e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 354 TLTWAPSTDdVGVTGYQVVRVA---GGQETAAISTPTNSAIVGGLSPDTDYTFAVYARDAAGNRSARSATTTVTTESGPA 430
Cdd:COG3401   344 TLSWTASSD-ADVTGYNVYRSTsggGTYTKIAETVTTTSYTDTGLTPGTTYYYKVTAVDAAGNESAPSEEVSATTASAAS 422
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 431 AGSC-AVTYRVSGEWNGGFQAEVTIRNTGTSPVDGWTLGWSYLDGQTVTNVWGGVATQTGDTVTVRSVDYTAAIPVGGSV 509
Cdd:COG3401   423 GESLtASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVPFTTTSSTVTATTTDTTTANLSVTTGSLVGGSGA 502
                         170       180
                  ....*....|....*....|....*.
gi 1226657552 510 SFGFLGSWTGANSAPSAFTLNDGSCT 535
Cdd:COG3401   503 SSVTNSVSVIGASAAAAVGGAPDGTP 528
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
354-412 5.43e-07

Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 47.61  E-value: 5.43e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1226657552  354 TLTWAPSTDDVG---VTGYQVVRVAGG--QETAAISTPTNSAIVGGLSPDTDYTFAVYARDAAG 412
Cdd:smart00060  18 TLSWEPPPDDGItgyIVGYRVEYREEGseWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81
fn3 pfam00041
Fibronectin type III domain;
354-413 2.65e-06

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 45.48  E-value: 2.65e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1226657552 354 TLTWAPSTDDVG-VTGYQV-VRVAGGQETAA---ISTPTNSAIVGGLSPDTDYTFAVYARDAAGN 413
Cdd:pfam00041  17 TVSWTPPPDGNGpITGYEVeYRPKNSGEPWNeitVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81
 
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
354-443 2.25e-06

Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 50.39  E-value: 2.25e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 354 TLTWAPSTDDvGVTGYQVVRVAGGQETAAI--STPTNSAIVGGLSPDTDYTFAVYARDAAGNRSARSATTTVTTESGPAA 431
Cdd:COG3401   250 TLSWDPVTES-DATGYRVYRSNSGDGPFTKvaTVTTTSYTDTGLTNGTTYYYRVTAVDAAGNESAPSNVVSVTTDLTPPA 328
                          90
                  ....*....|..
gi 1226657552 432 GSCAVTYRVSGE 443
Cdd:COG3401   329 APSGLTATAVGS 340
COG3979 COG3979
Chitodextrinase [Carbohydrate transport and metabolism];
354-533 8.71e-06

Pssm-ID: 443178 [Multi-domain]  Cd Length: 369  Bit Score: 48.23  E-value: 8.71e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 354 TLTWAPSTDDVGVTGYQVVRvagGQETAAISTPTNSAIVGGLSPDTDYTFAVYARDAAGNRSARSATTTVTTESGPAAGS 433
Cdd:COG3979    20 SLSWDASTDNVGVTGYDVYR---GGDQVATVTGLTAWTVTGLTPGTEYTFTVGACDAAGNVSAASGTSTAMFGGSSTTLG 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 434 CAVTYRVSGEWNGGFQAEVTIRNTGTSPVDGWTLGWSYLDGQTVTNVWGGVATQTGDTVTVRSVDYTAAIPVGGSVSFGF 513
Cdd:COG3979    97 SAEGVADTSGNLAASGAFFGVTTPPTPSSTLVVDGTTTVNAAATANGGTGGSGGTTTIITTGVEGGGGSKTAQSLNAITA 176
                         170       180
                  ....*....|....*....|
gi 1226657552 514 LGSWTGANSAPSAFTLNDGS 533
Cdd:COG3979   177 AGTAALNGGVVGGADEVLTC 196
COG4733 COG4733
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
354-533 8.96e-05

Pssm-ID: 443767 [Multi-domain]  Cd Length: 978  Bit Score: 45.32  E-value: 8.96e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 354 TLTWAPStDDVGVTGYQVVRVAGGQETAA----ISTPTNSAIVGGLSPDTDYTFAVYARDAAGNRSARSATTTVTTESGP 429
Cdd:COG4733   646 TLSWSFP-VDADTLRTEIRYSTTGDWASAtvaqALYPGNTYTLAGLKAGQTYYYRARAVDRSGNVSAWWVSGQASADAAG 724
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1226657552 430 AAGSCAVTYRVSGEWNGGFQ----AEVTIRNTGTSPVDGWTLGWSYLDGQTVTNVWGGVATQTGDTVTVRSVDYTAAIPV 505
Cdd:COG4733   725 ILDAITGQILETELGQELDAiiqnATVAEVVAATVTDVTAQIDTAVLFAGVATAAAIGAEARVAATVAESATAAAATGTA 804
                         170       180
                  ....*....|....*....|....*...
gi 1226657552 506 GGSVSFGFLGSWTGANSAPSAFTLNDGS 533
Cdd:COG4733   805 ADAAGDASGGVTAGTSGTTGAGDTAAST 832
CBM49 pfam09478
Carbohydrate binding domain CBM49; This domain is found at the C terminus of cellulases, and in vitro binding studies have shown it to bind to crystalline cellulose.
449-513 1.61e-04

Pssm-ID: 286553  Cd Length: 80  Bit Score: 40.38  E-value: 1.61e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1226657552 449 QAEVTIRNTGTSPVDGWTLGWSYLDGQtvtnVWgGVATQTGDTVTVRSvdYTAAIPVGGSVSFGF 513
Cdd:pfam09478  20 QYSVTITNNGSKTIKSLTISIDNLYGP----IW-GVEKVSGNTYSFPS--WLPSLPPGASFSFGY 77
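
Each alignment block above pairs a query row (`gi ...`) with a model row (`Cdd:...`), with `-` marking gaps; in these rows the lowercase query residues sit opposite gaps in the model. A minimal sketch of how identity could be counted over such a pair follows, using the two CBM49 rows as input. The function name `alignment_stats` is hypothetical, and skipping every column that contains a gap is an assumption of this example rather than a documented CDD convention.

```python
def alignment_stats(query_row: str, model_row: str):
    """Return (aligned_columns, identical_columns) for two gapped rows.

    Columns where either row has a gap ('-') are skipped. Comparison is
    case-insensitive because the display mixes upper- and lowercase letters.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    aligned = identical = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue
        aligned += 1
        if q.upper() == m.upper():
            identical += 1
    return aligned, identical


# Rows copied from the CBM49 (pfam09478) alignment above.
QUERY = "QAEVTIRNTGTSPVDGWTLGWSYLDGQtvtnVWgGVATQTGDTVTVRSvdYTAAIPVGGSVSFGF"
MODEL = "QYSVTITNNGSKTIKSLTISIDNLYGP----IW-GVEKVSGNTYSFPS--WLPSLPPGASFSFGY"

aligned, identical = alignment_stats(QUERY, MODEL)
print(f"{identical} of {aligned} aligned columns are identical "
      f"({100 * identical / aligned:.1f}%)")
```

For an alignment that spans several blocks, the rows of each block would be concatenated before calling the function.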
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01
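
The per-hit summary lines in this report (`Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`) can be turned into records and checked against the E-value threshold listed above. The following is a sketch that assumes the line layout seen on this page, with the bracketed `[Multi-domain]` flag being optional; `SUMMARY_RE`, `parse_summary` and `passes_threshold` are hypothetical names, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for the summary line as it appears in this report.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one 'Pssm-ID: ...' summary line into a dictionary."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "evalue": float(m["evalue"]),
    }


def passes_threshold(hit: dict, threshold: float = 0.01) -> bool:
    """Assumed rule: keep hits whose E-value is at or below the threshold."""
    return hit["evalue"] <= threshold


# Two summary lines copied from the hit list on this page.
lines = [
    "Pssm-ID: 395098 [Multi-domain]  Cd Length: 272  Bit Score: 138.28  E-value: 3.61e-37",
    "Pssm-ID: 286553  Cd Length: 80  Bit Score: 40.38  E-value: 1.61e-04",
]
parsed = [parse_summary(line) for line in lines]
for hit in parsed:
    print(hit["pssm_id"], hit["evalue"], passes_threshold(hit))
```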
