NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034670753|ref|XP_016870426|]
View 

non-lysosomal glucosylceramidase isoform X3 [Homo sapiens]

Protein Classification

Glyco_hydr_116N and DUF608 domain-containing protein( domain architecture ID 13407181)

Glyco_hydr_116N and DUF608 domain-containing protein

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF608 pfam04685
Glycosyl-hydrolase family 116, catalytic region; This represents a family of archaeal, ...
521-862 0e+00

Glycosyl-hydrolase family 116, catalytic region; This represents a family of archaeal, bacterial and eukaryotic glycosyl hydrolases, that belong to superfamily GH116. The primary catabolic pathway for glucosylceramide is catalysis by the lysosomal enzyme glucocerebrosidase. In higher eukaryotes, glucosylceramide is the precursor of glycosphingolipids, a complex group of ubiquitous membrane lipids. Mutations in the human protein cause motor-neurone defects in hereditary spastic paraplegia. The catalytic nucleophile, identified in UniProtKB:Q97YG8_SULSO, is a glutamine-335, with the likely acid/base at Asp-442 and the aspartates at Asp-406 and Asp-458 residues also playing a role in the catalysis of glucosides and xylosides that are beta-bound to hydrophobic groups. The family is defined as GH116, which presently includes enzymes with beta-glucosidase, EC:3.2.1.21, beta-xylosidase, EC:3.2.1.37, and glucocerebrosidase EC:3.2.1.45 activity.


:

Pssm-ID: 461391  Cd Length: 362  Bit Score: 608.87  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 521 GRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDD 600
Cdd:pfam04685   1 GRFAYLEGQEYRMYNTYDVHFYASFALLMLWPKLELSLQRDFADAVLREDPTKRKILMDGKTAPRKVKGAVPHDLGDPLE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 601 EPWLRVNAYLIHDTADWKDLNLKFVLQ----------------------AVMESEMKFDKDHDGLIENGGYADQTYDGWV 658
Cdd:pfam04685  81 DPWIRVNAYNIHDTSRWKDLNPKFVLQvyrdyvftgdkeflkamwpsvkAVMDYLLQFDKDGDGLIENEGFPDQTYDAWS 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 659 TTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFL 738
Cdd:pfam04685 161 MTGPSAYCGGLWLAALQAAIEMAELLGDKADVAKYKEKLEKAKKVFEKKLWNGKYFNYDSSSSSYSNSIMADQLAGQWYL 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 739 KACGLGegdtEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWE 818
Cdd:pfam04685 241 RACGLE----PIVPEDKVRSALETIYELNVMKFKGGKMGAVNGMRPDGGVDTSSVQSEEVWTGVTYALAATMIQEGMVEE 316
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 1034670753 819 GFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 862
Cdd:pfam04685 317 GFKTAEGIYDTVWERLGMQFQTPEAWTANGEYRALGYMRPLSIW 360
COG4354 super family cl34754
Uncharacterized conserved protein, contains GBA2_N and DUF608 domains [Function unknown];
167-862 3.67e-123

Uncharacterized conserved protein, contains GBA2_N and DUF608 domains [Function unknown];


The actual alignment was detected with superfamily member COG4354:

Pssm-ID: 443491 [Multi-domain]  Cd Length: 744  Bit Score: 390.43  E-value: 3.67e-123
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 167 RGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSVLRSWNWGLCGYFAF--------YHALYPRAWTV 238
Cdd:COG4354    44 RGDLNDWEIDNGPHKFGFLPACQFAIFEEDEGGKAQARALEGPLPPYDGSGTEGSPAPNHGlprfekgtFKALYPRSWVE 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 239 YQLPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLE----- 313
Cdd:COG4354   124 YEDPVFPVQVTCEAFSPIIPGNYQESSYPVAVFEWTVHNPTDKPITLSIALSWQNFVGWFTNAIKSPKVKVRWGGsdgnf 203
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 314 ---RSGETVRGLLLHHPTLPNP---YTMAVAARVTAATTVTHiTAFDPDS---TGQQVWQDLLQDGQLDSPTGqSTPTQK 384
Cdd:COG4354   204 newREDNGLVGILMTSEGVDPDsegEGQMALATITNPGVSYR-TRWNPGAwggDGLDFWDDFSADGSLPDRED-ETPAEA 281
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 385 GVGIAGAVCVSSKLRPrGQCR-LEFSLAWDMPrIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQ 463
Cdd:COG4354   282 GEQPAGALAVRFTLAP-GETReIPFVLAWDFP-NREFWWGVNYYRRYTNFYATQGKDAWAVARTALKHLDELEEQTLAFQ 359
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 464 SPVLDDrSLPAWYKSALFNELYFLADGGTVWLEvledslpeelgrnmchlrptlrdYGRFGYLEG---QEYRMYNTYDVH 540
Cdd:COG4354   360 EPLLRS-DLPDWVKEALLNNLYTLRSGTCLRTE-----------------------DGQFAVWEGlddDEGSCYGSCTHV 415
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 541 FYASFALIMLWPKLELS-LQYDMALATLREDLTrrrylmsgvmapvkrrnviPHDIGDPDDEPWLRVNAylihdtadWKD 619
Cdd:COG4354   416 WNYSFALAFLFPELEKSmREVEFARAIDDEGAM-------------------PFRLGLPLEHPWEDCNL--------AAD 468
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 620 LNLKFVLQ-------------------AVMES----EMKFDKDHDGLIEngGYADQTYDgWVTTGPSAYCGGLWLAAVAV 676
Cdd:COG4354   469 GQMGFVLQvyrdwllsgddeflrecwpAVKKAleyaWIGWDADQDGVPE--GAQHNTFD-WELYGPNPYCGGLYLAALEA 545
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 677 MVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDS----SSRPQ-SRSVMSDQCAGQWFLKACGLGegdtEVF 751
Cdd:COG4354   546 AAEMAEYLGDEEFAKTYRQWLEQGSKWYDENLWNGEYYIQDIdsgaSEEYQlGEGCLADQLCGQFYAHLLGLG----DLV 621
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 752 PTQHVVRALQTIFELNVQAFAGGAM------------GAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEG 819
Cdd:COG4354   622 PPENIRSALKSIYKYNFRKFFREHFnngrsyalgdefGLANGTWPDGRPENPFPYPLEVWTGIEYGAAAFMIQEGMKEEG 701
                         730       740       750       760
                  ....*....|....*....|....*....|....*....|...
gi 1034670753 820 FQTAEGCYRTVWERLGLAFQTPEAYCQqrvfrslaYMRPLSIW 862
Cdd:COG4354   702 LKLIEAVRDRYDGNKRNPFNEPECGSH--------YARAMASW 736
 
Name Accession Description Interval E-value
DUF608 pfam04685
Glycosyl-hydrolase family 116, catalytic region; This represents a family of archaeal, ...
521-862 0e+00

Glycosyl-hydrolase family 116, catalytic region; This represents a family of archaeal, bacterial and eukaryotic glycosyl hydrolases, that belong to superfamily GH116. The primary catabolic pathway for glucosylceramide is catalysis by the lysosomal enzyme glucocerebrosidase. In higher eukaryotes, glucosylceramide is the precursor of glycosphingolipids, a complex group of ubiquitous membrane lipids. Mutations in the human protein cause motor-neurone defects in hereditary spastic paraplegia. The catalytic nucleophile, identified in UniProtKB:Q97YG8_SULSO, is a glutamine-335, with the likely acid/base at Asp-442 and the aspartates at Asp-406 and Asp-458 residues also playing a role in the catalysis of glucosides and xylosides that are beta-bound to hydrophobic groups. The family is defined as GH116, which presently includes enzymes with beta-glucosidase, EC:3.2.1.21, beta-xylosidase, EC:3.2.1.37, and glucocerebrosidase EC:3.2.1.45 activity.


Pssm-ID: 461391  Cd Length: 362  Bit Score: 608.87  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 521 GRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDD 600
Cdd:pfam04685   1 GRFAYLEGQEYRMYNTYDVHFYASFALLMLWPKLELSLQRDFADAVLREDPTKRKILMDGKTAPRKVKGAVPHDLGDPLE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 601 EPWLRVNAYLIHDTADWKDLNLKFVLQ----------------------AVMESEMKFDKDHDGLIENGGYADQTYDGWV 658
Cdd:pfam04685  81 DPWIRVNAYNIHDTSRWKDLNPKFVLQvyrdyvftgdkeflkamwpsvkAVMDYLLQFDKDGDGLIENEGFPDQTYDAWS 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 659 TTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFL 738
Cdd:pfam04685 161 MTGPSAYCGGLWLAALQAAIEMAELLGDKADVAKYKEKLEKAKKVFEKKLWNGKYFNYDSSSSSYSNSIMADQLAGQWYL 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 739 KACGLGegdtEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWE 818
Cdd:pfam04685 241 RACGLE----PIVPEDKVRSALETIYELNVMKFKGGKMGAVNGMRPDGGVDTSSVQSEEVWTGVTYALAATMIQEGMVEE 316
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 1034670753 819 GFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 862
Cdd:pfam04685 317 GFKTAEGIYDTVWERLGMQFQTPEAWTANGEYRALGYMRPLSIW 360
COG4354 COG4354
Uncharacterized conserved protein, contains GBA2_N and DUF608 domains [Function unknown];
167-862 3.67e-123

Uncharacterized conserved protein, contains GBA2_N and DUF608 domains [Function unknown];


Pssm-ID: 443491 [Multi-domain]  Cd Length: 744  Bit Score: 390.43  E-value: 3.67e-123
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 167 RGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSVLRSWNWGLCGYFAF--------YHALYPRAWTV 238
Cdd:COG4354    44 RGDLNDWEIDNGPHKFGFLPACQFAIFEEDEGGKAQARALEGPLPPYDGSGTEGSPAPNHGlprfekgtFKALYPRSWVE 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 239 YQLPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLE----- 313
Cdd:COG4354   124 YEDPVFPVQVTCEAFSPIIPGNYQESSYPVAVFEWTVHNPTDKPITLSIALSWQNFVGWFTNAIKSPKVKVRWGGsdgnf 203
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 314 ---RSGETVRGLLLHHPTLPNP---YTMAVAARVTAATTVTHiTAFDPDS---TGQQVWQDLLQDGQLDSPTGqSTPTQK 384
Cdd:COG4354   204 newREDNGLVGILMTSEGVDPDsegEGQMALATITNPGVSYR-TRWNPGAwggDGLDFWDDFSADGSLPDRED-ETPAEA 281
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 385 GVGIAGAVCVSSKLRPrGQCR-LEFSLAWDMPrIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQ 463
Cdd:COG4354   282 GEQPAGALAVRFTLAP-GETReIPFVLAWDFP-NREFWWGVNYYRRYTNFYATQGKDAWAVARTALKHLDELEEQTLAFQ 359
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 464 SPVLDDrSLPAWYKSALFNELYFLADGGTVWLEvledslpeelgrnmchlrptlrdYGRFGYLEG---QEYRMYNTYDVH 540
Cdd:COG4354   360 EPLLRS-DLPDWVKEALLNNLYTLRSGTCLRTE-----------------------DGQFAVWEGlddDEGSCYGSCTHV 415
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 541 FYASFALIMLWPKLELS-LQYDMALATLREDLTrrrylmsgvmapvkrrnviPHDIGDPDDEPWLRVNAylihdtadWKD 619
Cdd:COG4354   416 WNYSFALAFLFPELEKSmREVEFARAIDDEGAM-------------------PFRLGLPLEHPWEDCNL--------AAD 468
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 620 LNLKFVLQ-------------------AVMES----EMKFDKDHDGLIEngGYADQTYDgWVTTGPSAYCGGLWLAAVAV 676
Cdd:COG4354   469 GQMGFVLQvyrdwllsgddeflrecwpAVKKAleyaWIGWDADQDGVPE--GAQHNTFD-WELYGPNPYCGGLYLAALEA 545
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 677 MVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDS----SSRPQ-SRSVMSDQCAGQWFLKACGLGegdtEVF 751
Cdd:COG4354   546 AAEMAEYLGDEEFAKTYRQWLEQGSKWYDENLWNGEYYIQDIdsgaSEEYQlGEGCLADQLCGQFYAHLLGLG----DLV 621
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 752 PTQHVVRALQTIFELNVQAFAGGAM------------GAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEG 819
Cdd:COG4354   622 PPENIRSALKSIYKYNFRKFFREHFnngrsyalgdefGLANGTWPDGRPENPFPYPLEVWTGIEYGAAAFMIQEGMKEEG 701
                         730       740       750       760
                  ....*....|....*....|....*....|....*....|...
gi 1034670753 820 FQTAEGCYRTVWERLGLAFQTPEAYCQqrvfrslaYMRPLSIW 862
Cdd:COG4354   702 LKLIEAVRDRYDGNKRNPFNEPECGSH--------YARAMASW 736
Glyco_hydr_116N pfam12215
beta-glucosidase 2, glycosyl-hydrolase family 116 N-term; This domain is found in bacteria, ...
166-455 2.57e-118

beta-glucosidase 2, glycosyl-hydrolase family 116 N-term; This domain is found in bacteria, archaea and eukaryotes. This domain is typically between 320 to 354 amino acids in length. This domain is found associated with pfam04685. It is found just after the extreme N terminus. The N-terminal is thought to be the luminal domain while the C terminal is the cytosolic domain. The catalytic domain of GBA-2 is unknown. The primary catabolic pathway for glucosylceramide is catalysis by the lysosomal enzyme glucocerebrosidase. In higher eukaryotes, glucosylceramide is the precursor of glycosphingolipids, a complex group of ubiquitous membrane lipids. Mutations in the human protein cause motor-neurone defects in hereditary spastic paraplegia. The catalytic nucleophile, identified in UniProtKB:Q97YG8_SULSO, is a glutamine-335 in the downstream family pfam04685.


Pssm-ID: 463496  Cd Length: 309  Bit Score: 362.36  E-value: 2.57e-118
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 166 WRGQFCRWQLNPGMYQHRTVIADQFTVCLRRE-GQTVYQQVLSLERP--SVLRSWNWGLCGYFAFYHALYPRAWTVYQLP 242
Cdd:pfam12215  16 GRGDFRRWQIFPGPHEFKSVPANQFAVFVSKGgGLKVQARVLSTEPPdgSLLSSWDWNYPGSKGTYHALYPRAWTVYEDP 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 243 GQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGG--DDAPGGLWNEPFcleRSGETVR 320
Cdd:pfam12215  96 DFPVKVTCEQFSPIIPHNYKESSLPVAVFEWTVENPTDEPVDVSIMFTWQNGVGWFevSDRDGGHPNEPF---KERDGVV 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 321 GLLLHHPTLPN----PYTMAVAARVTAATTVTHITAFDPDSTGQQVWQDLLQDGQLdSPTGQSTPTQKGVGIAGAVCVSS 396
Cdd:pfam12215 173 GVLLHHVTLDEegegPGTFAIATEKNPGVEVTYCTRFDPSGDGRELWDDFAKDGSL-PNSGDSTPSSPGEQIAAAVAVRF 251
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034670753 397 KLRPRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGqDGDAAPALSHYALCRYAEW 455
Cdd:pfam12215 252 TLAPGESKTIPFLLAWDFPVREFAWGRKYYGRRYTKFFG-NGDNAWAVAHYALKNYKRW 309
GDB1 COG3408
Glycogen debranching enzyme (alpha-1,6-glucosidase) [Carbohydrate transport and metabolism];
545-734 4.25e-03

Glycogen debranching enzyme (alpha-1,6-glucosidase) [Carbohydrate transport and metabolism];


Pssm-ID: 442634 [Multi-domain]  Cd Length: 353  Bit Score: 40.25  E-value: 4.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 545 FALIMLwpkleLSLQYDMALATLREdLTRRrylmsgvmapVKRRNVIPH--------DIGDPDDEPWLrvnAYLIHD--- 613
Cdd:COG3408    39 IALPGL-----LLLDPELARGILRT-LARY----------QEEPGKIPHevrdgeepYYGTVDATPWF---IIALGEyyr 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 614 -TADWKDLNLKF-VLQAVMESEMKFDKDHDGLIE-------NGGYADQTYDGWVT-TGPSAYCGGLWLAAVAVMVQMAAL 683
Cdd:COG3408   100 wTGDLAFLRELLpALEAALDWILRGDRDGDGLLEygrsgldNQTWMDSKVDSVTPrSGALVEVQALWYNALRALAELARA 179
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1034670753 684 CGAQDIQDKFSSILSRGQEAYERLLWN---GRYYNYDSSSRPQSRSVMSDQCAG 734
Cdd:COG3408   180 LGDPELAARWRELAERLKESFNERFWNeelGYLADALDGDGRPDDSIRPNQLFA 233
 
Name Accession Description Interval E-value
DUF608 pfam04685
Glycosyl-hydrolase family 116, catalytic region; This represents a family of archaeal, ...
521-862 0e+00

Glycosyl-hydrolase family 116, catalytic region; This represents a family of archaeal, bacterial and eukaryotic glycosyl hydrolases, that belong to superfamily GH116. The primary catabolic pathway for glucosylceramide is catalysis by the lysosomal enzyme glucocerebrosidase. In higher eukaryotes, glucosylceramide is the precursor of glycosphingolipids, a complex group of ubiquitous membrane lipids. Mutations in the human protein cause motor-neurone defects in hereditary spastic paraplegia. The catalytic nucleophile, identified in UniProtKB:Q97YG8_SULSO, is a glutamine-335, with the likely acid/base at Asp-442 and the aspartates at Asp-406 and Asp-458 residues also playing a role in the catalysis of glucosides and xylosides that are beta-bound to hydrophobic groups. The family is defined as GH116, which presently includes enzymes with beta-glucosidase, EC:3.2.1.21, beta-xylosidase, EC:3.2.1.37, and glucocerebrosidase EC:3.2.1.45 activity.


Pssm-ID: 461391  Cd Length: 362  Bit Score: 608.87  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 521 GRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDD 600
Cdd:pfam04685   1 GRFAYLEGQEYRMYNTYDVHFYASFALLMLWPKLELSLQRDFADAVLREDPTKRKILMDGKTAPRKVKGAVPHDLGDPLE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 601 EPWLRVNAYLIHDTADWKDLNLKFVLQ----------------------AVMESEMKFDKDHDGLIENGGYADQTYDGWV 658
Cdd:pfam04685  81 DPWIRVNAYNIHDTSRWKDLNPKFVLQvyrdyvftgdkeflkamwpsvkAVMDYLLQFDKDGDGLIENEGFPDQTYDAWS 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 659 TTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFL 738
Cdd:pfam04685 161 MTGPSAYCGGLWLAALQAAIEMAELLGDKADVAKYKEKLEKAKKVFEKKLWNGKYFNYDSSSSSYSNSIMADQLAGQWYL 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 739 KACGLGegdtEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWE 818
Cdd:pfam04685 241 RACGLE----PIVPEDKVRSALETIYELNVMKFKGGKMGAVNGMRPDGGVDTSSVQSEEVWTGVTYALAATMIQEGMVEE 316
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 1034670753 819 GFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 862
Cdd:pfam04685 317 GFKTAEGIYDTVWERLGMQFQTPEAWTANGEYRALGYMRPLSIW 360
COG4354 COG4354
Uncharacterized conserved protein, contains GBA2_N and DUF608 domains [Function unknown];
167-862 3.67e-123

Uncharacterized conserved protein, contains GBA2_N and DUF608 domains [Function unknown];


Pssm-ID: 443491 [Multi-domain]  Cd Length: 744  Bit Score: 390.43  E-value: 3.67e-123
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 167 RGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSVLRSWNWGLCGYFAF--------YHALYPRAWTV 238
Cdd:COG4354    44 RGDLNDWEIDNGPHKFGFLPACQFAIFEEDEGGKAQARALEGPLPPYDGSGTEGSPAPNHGlprfekgtFKALYPRSWVE 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 239 YQLPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLE----- 313
Cdd:COG4354   124 YEDPVFPVQVTCEAFSPIIPGNYQESSYPVAVFEWTVHNPTDKPITLSIALSWQNFVGWFTNAIKSPKVKVRWGGsdgnf 203
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 314 ---RSGETVRGLLLHHPTLPNP---YTMAVAARVTAATTVTHiTAFDPDS---TGQQVWQDLLQDGQLDSPTGqSTPTQK 384
Cdd:COG4354   204 newREDNGLVGILMTSEGVDPDsegEGQMALATITNPGVSYR-TRWNPGAwggDGLDFWDDFSADGSLPDRED-ETPAEA 281
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 385 GVGIAGAVCVSSKLRPrGQCR-LEFSLAWDMPrIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQ 463
Cdd:COG4354   282 GEQPAGALAVRFTLAP-GETReIPFVLAWDFP-NREFWWGVNYYRRYTNFYATQGKDAWAVARTALKHLDELEEQTLAFQ 359
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 464 SPVLDDrSLPAWYKSALFNELYFLADGGTVWLEvledslpeelgrnmchlrptlrdYGRFGYLEG---QEYRMYNTYDVH 540
Cdd:COG4354   360 EPLLRS-DLPDWVKEALLNNLYTLRSGTCLRTE-----------------------DGQFAVWEGlddDEGSCYGSCTHV 415
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 541 FYASFALIMLWPKLELS-LQYDMALATLREDLTrrrylmsgvmapvkrrnviPHDIGDPDDEPWLRVNAylihdtadWKD 619
Cdd:COG4354   416 WNYSFALAFLFPELEKSmREVEFARAIDDEGAM-------------------PFRLGLPLEHPWEDCNL--------AAD 468
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 620 LNLKFVLQ-------------------AVMES----EMKFDKDHDGLIEngGYADQTYDgWVTTGPSAYCGGLWLAAVAV 676
Cdd:COG4354   469 GQMGFVLQvyrdwllsgddeflrecwpAVKKAleyaWIGWDADQDGVPE--GAQHNTFD-WELYGPNPYCGGLYLAALEA 545
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 677 MVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDS----SSRPQ-SRSVMSDQCAGQWFLKACGLGegdtEVF 751
Cdd:COG4354   546 AAEMAEYLGDEEFAKTYRQWLEQGSKWYDENLWNGEYYIQDIdsgaSEEYQlGEGCLADQLCGQFYAHLLGLG----DLV 621
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 752 PTQHVVRALQTIFELNVQAFAGGAM------------GAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEG 819
Cdd:COG4354   622 PPENIRSALKSIYKYNFRKFFREHFnngrsyalgdefGLANGTWPDGRPENPFPYPLEVWTGIEYGAAAFMIQEGMKEEG 701
                         730       740       750       760
                  ....*....|....*....|....*....|....*....|...
gi 1034670753 820 FQTAEGCYRTVWERLGLAFQTPEAYCQqrvfrslaYMRPLSIW 862
Cdd:COG4354   702 LKLIEAVRDRYDGNKRNPFNEPECGSH--------YARAMASW 736
Glyco_hydr_116N pfam12215
beta-glucosidase 2, glycosyl-hydrolase family 116 N-term; This domain is found in bacteria, ...
166-455 2.57e-118

beta-glucosidase 2, glycosyl-hydrolase family 116 N-term; This domain is found in bacteria, archaea and eukaryotes. This domain is typically between 320 to 354 amino acids in length. This domain is found associated with pfam04685. It is found just after the extreme N terminus. The N-terminal is thought to be the luminal domain while the C terminal is the cytosolic domain. The catalytic domain of GBA-2 is unknown. The primary catabolic pathway for glucosylceramide is catalysis by the lysosomal enzyme glucocerebrosidase. In higher eukaryotes, glucosylceramide is the precursor of glycosphingolipids, a complex group of ubiquitous membrane lipids. Mutations in the human protein cause motor-neurone defects in hereditary spastic paraplegia. The catalytic nucleophile, identified in UniProtKB:Q97YG8_SULSO, is a glutamine-335 in the downstream family pfam04685.


Pssm-ID: 463496  Cd Length: 309  Bit Score: 362.36  E-value: 2.57e-118
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 166 WRGQFCRWQLNPGMYQHRTVIADQFTVCLRRE-GQTVYQQVLSLERP--SVLRSWNWGLCGYFAFYHALYPRAWTVYQLP 242
Cdd:pfam12215  16 GRGDFRRWQIFPGPHEFKSVPANQFAVFVSKGgGLKVQARVLSTEPPdgSLLSSWDWNYPGSKGTYHALYPRAWTVYEDP 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 243 GQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGG--DDAPGGLWNEPFcleRSGETVR 320
Cdd:pfam12215  96 DFPVKVTCEQFSPIIPHNYKESSLPVAVFEWTVENPTDEPVDVSIMFTWQNGVGWFevSDRDGGHPNEPF---KERDGVV 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 321 GLLLHHPTLPN----PYTMAVAARVTAATTVTHITAFDPDSTGQQVWQDLLQDGQLdSPTGQSTPTQKGVGIAGAVCVSS 396
Cdd:pfam12215 173 GVLLHHVTLDEegegPGTFAIATEKNPGVEVTYCTRFDPSGDGRELWDDFAKDGSL-PNSGDSTPSSPGEQIAAAVAVRF 251
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034670753 397 KLRPRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGqDGDAAPALSHYALCRYAEW 455
Cdd:pfam12215 252 TLAPGESKTIPFLLAWDFPVREFAWGRKYYGRRYTKFFG-NGDNAWAVAHYALKNYKRW 309
GDB1 COG3408
Glycogen debranching enzyme (alpha-1,6-glucosidase) [Carbohydrate transport and metabolism];
545-734 4.25e-03

Glycogen debranching enzyme (alpha-1,6-glucosidase) [Carbohydrate transport and metabolism];


Pssm-ID: 442634 [Multi-domain]  Cd Length: 353  Bit Score: 40.25  E-value: 4.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 545 FALIMLwpkleLSLQYDMALATLREdLTRRrylmsgvmapVKRRNVIPH--------DIGDPDDEPWLrvnAYLIHD--- 613
Cdd:COG3408    39 IALPGL-----LLLDPELARGILRT-LARY----------QEEPGKIPHevrdgeepYYGTVDATPWF---IIALGEyyr 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034670753 614 -TADWKDLNLKF-VLQAVMESEMKFDKDHDGLIE-------NGGYADQTYDGWVT-TGPSAYCGGLWLAAVAVMVQMAAL 683
Cdd:COG3408   100 wTGDLAFLRELLpALEAALDWILRGDRDGDGLLEygrsgldNQTWMDSKVDSVTPrSGALVEVQALWYNALRALAELARA 179
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1034670753 684 CGAQDIQDKFSSILSRGQEAYERLLWN---GRYYNYDSSSRPQSRSVMSDQCAG 734
Cdd:COG3408   180 LGDPELAARWRELAERLKESFNERFWNeelGYLADALDGDGRPDDSIRPNQLFA 233
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH