Conserved domains on  [gi|701462432|dbj|BAP82243|]

terpene synthase [synthetic construct]

Protein Classification

terpene synthase family protein (domain architecture ID 10090936)

terpene synthase family protein similar to pentalenene synthase and aristolochene synthase, which use an all-trans pathway to catalyze the ionization of farnesyl diphosphate, followed by formation of a macrocyclic intermediate through bond formation between C1 and either C10 (aristolochene synthase) or C11 (pentalenene synthase), yielding the bicyclic hydrocarbon aristolochene or the tricyclic hydrocarbon pentalenene, respectively


List of domain hits

Name Accession Description Interval E-value
Terpene_cyclase_nonplant_C1 cd00687
Non-plant Terpene Cyclases, Class 1; This CD includes terpenoid cyclases such as pentalenene ...
10-309 3.35e-110

Non-plant Terpene Cyclases, Class 1; This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase, which use an all-trans pathway to catalyze the ionization of farnesyl diphosphate (FPP), followed by formation of a macrocyclic intermediate through bond formation between C1 and either C10 (aristolochene synthase) or C11 (pentalenene synthase), yielding the bicyclic hydrocarbon aristolochene or the tricyclic hydrocarbon pentalenene, respectively. Like other enzymes with the 'terpenoid synthase fold', they have two conserved metal-binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzyme. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template that channels and stabilizes the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function as monomers and are found in fungi, bacteria and Dictyostelium.



Pssm-ID: 173835 [Multi-domain]  Cd Length: 303  Bit Score: 322.39  E-value: 3.35e-110
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432  10 YCPFPERKNPYSEFLQDYALQWVIRFKLIDSESLYQRFSKAKFYLLTAGAYPHCQLEELKIANDVISWLFIWDDQCDISD 89
Cdd:cd00687    1 PSPFPYRLNPYVKEAQDEYLEWVLEEMLIPSEKAEKRFLSADFGDLAALFYPDADDERLMLAADLMAWLFVFDDLLDRDQ 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432  90 lgKKPELLKTWCNRFLEILNGAELT--PDDLPLGFALRDIRNRIINRGGITFFHHFVRNFEDYFHGCIEEAHNRVNVSVP 167
Cdd:cd00687   81 --KSPEDGEAGVTRLLDILRGDGLDspDDATPLEFGLADLWRRTLARMSAEWFNRFAHYTEDYFDAYIWEGKNRLNGHVP 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432 168 DVEAYIKIRSANAAAALCLNLIEFCDRVMIPYSLRNHETLKKLTQMTINILAWSNDIFSAPREI-ANGEVHNLVFVIHHH 246
Cdd:cd00687  159 DVAEYLEMRRFNIGADPCLGLSEFIGGPEVPAAVRLDPVMRALEALASDAIALVNDIYSYEKEIkANGEVHNLVKVLAEE 238
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 701462432 247 QKIPLEKAMLAAAAMHNHEVQKLVNLESKIASFSAET--DAEITKYISGLHAWIRGNLDWYAHSG 309
Cdd:cd00687  239 HGLSLEEAISVVRDMHNERITQFEELEASLIKSGDLEeeSPAVRAYVEGLHNWISGNLDWHRTSP 303
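
Each alignment row above carries a start coordinate, a sequence segment, and an end coordinate, so the coordinates can be checked against the number of residues in the segment. The following is a minimal Python sketch of that check, using the first two row pairs of the cd00687 alignment copied from this page. The regular expression, the function names, and the reading of lowercase letters as unaligned residues are assumptions made for illustration, not part of any NCBI tool.

```python
import re

# First two row pairs of the cd00687 alignment, copied from the listing above.
ROWS = [
    "gi 701462432  10 YCPFPERKNPYSEFLQDYALQWVIRFKLIDSESLYQRFSKAKFYLLTAGAYPHCQLEELKIANDVISWLFIWDDQCDISD 89",
    "Cdd:cd00687    1 PSPFPYRLNPYVKEAQDEYLEWVLEEMLIPSEKAEKRFLSADFGDLAALFYPDADDERLMLAADLMAWLFVFDDLLDRDQ 80",
    "gi 701462432  90 lgKKPELLKTWCNRFLEILNGAELT--PDDLPLGFALRDIRNRIINRGGITFFHHFVRNFEDYFHGCIEEAHNRVNVSVP 167",
    "Cdd:cd00687   81 --KSPEDGEAGVTRLLDILRGDGLDspDDATPLEFGLADLWRRTLARMSAEWFNRFAHYTEDYFDAYIWEGKNRLNGHVP 158",
]

# Assumed row layout: label, start coordinate, aligned segment, end coordinate.
ROW_RE = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")


def parse_row(line):
    """Split one alignment row into (label, start, segment, end)."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, segment, end = match.groups()
    return label, int(start), segment, int(end)


def residue_count(segment):
    """Count residues (letters of either case); '-' marks a gap."""
    return sum(1 for char in segment if char.isalpha())


def is_consistent(line):
    """True if end - start + 1 equals the number of residues in the row."""
    _, start, segment, end = parse_row(line)
    return end - start + 1 == residue_count(segment)


def identity(query_segment, model_segment):
    """Return (identical, compared) over columns where both rows are uppercase.

    Assumption: lowercase letters and '-' mark unaligned or gapped columns,
    so those columns are skipped.
    """
    pairs = [
        (q, m)
        for q, m in zip(query_segment, model_segment)
        if q.isupper() and m.isupper()
    ]
    identical = sum(1 for q, m in pairs if q == m)
    return identical, len(pairs)


if __name__ == "__main__":
    for row in ROWS:
        print(parse_row(row)[0], is_consistent(row))
    query = parse_row(ROWS[0])[2]
    model = parse_row(ROWS[1])[2]
    same, total = identity(query, model)
    print(f"identical columns in first block: {same} of {total}")
```

The check only uses information printed in the rows themselves, so it can flag a row that was truncated or corrupted when the page was copied.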
 
Terpene_syn_C_2 pfam19086
Terpene synthase family 2, C-terminal metal binding;
73-268 1.58e-40


Pssm-ID: 465972 [Multi-domain]  Cd Length: 199  Bit Score: 140.43  E-value: 1.58e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432   73 DVISWLFIWDDQCDiSDLGKKPEL-LKTWCNRFLEI---LNGAELTPDDLPLGFALRDIRNRIINRGGITFFHHFVRNFE 148
Cdd:pfam19086   1 KWLAWLFILDDIYD-EVYGTLEELeLFTEAIERWDAllpLDGPELPEYMKPLYRALADLWERLAKEASPDWRRRFKEAWK 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432  149 DYFHGCIEEAHNRVNVSVPDVEAYIKIRSANAAAALCLNLIEFCDRVMIPYSLRNHETLKKLTQMTINILAWSNDIFSAP 228
Cdd:pfam19086  80 DYLDAYLWEAKWRASGYVPTLEEYLELRRVTSGVPPLLALIEFGLGIELPDEVFEHPVVRRLVRAASDIVRLVNDLFSYK 159
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 701462432  229 REIANGEVHNLVFVIHHHQKIPLEKAMLAAAAMHNHEVQK 268
Cdd:pfam19086 160 KEQARGDVHNLVLVLMKEYGVSLQEAVDEVGELIEEAWKD 199
f2_encap_cargo3 NF041168
family 2 encapsulin nanocompartment cargo protein terpene cyclase;
4-311 3.16e-12


Pssm-ID: 469079 [Multi-domain]  Cd Length: 733  Bit Score: 67.35  E-value: 3.16e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432   4 FTFPNLYCPFPERKNPYSEFLQDYALQWVIRFKLIDSESLY-----QRFSKAKFYLLTAGAYPHCQLEELKIANDVISWL 78
Cdd:NF041168 382 VPLPDFYMPYPLRLNPHLDAARRNSKEWARRMGMLDVVPGVgvwdeRKFDGADFALCAAGIHPDAPAAELDLSADWLTWG 461
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432  79 FIWDDQCDI-----SDL-GKKPellktwCNRFLeilngAELTPDDL--------PLGFALRDIRNRII------NRGGit 138
Cdd:NF041168 462 TYGDDYFPVvfgrtRDLaGAKA------FNARL-----SAFMPLDAgplpvptnPLERGLADLWSRTAgpmspeARRA-- 528
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432 139 ffhhFVRNFEDYFHGCIEEAHNRVNVSVPDVEAYIKIRSANaaaalclnlieF-CDRVM----------IPYSLRNHETL 207
Cdd:NF041168 529 ----FRRAVEDMLESWLWELANQIQNRVPDPVDYIEMRRKT-----------FgSDLTMslsrlahgdsLPPEVFRTRPM 593
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432 208 KKLTQMTINILAWSNDIFSAPREIA-NGEVHNLVFVIHHHQKIPLEKAM-----LAAAAMHNHEVQKLVNLESKIASF-- 279
Cdd:NF041168 594 RALENAAADYACLTNDIFSYQKEIEfEGELHNGVLVVQRFLDCDRQQAVavvndLMTARMRQFEHIVATELPALFEEFgl 673
                        330       340       350
                 ....*....|....*....|....*....|..
gi 701462432 280 SAETDAEITKYISGLHAWIRGNLDWYAHSGRY 311
Cdd:NF041168 674 DAEAREALRGYVEELQDWMAGILHWHRGTSRY 705
f2_encap_cargo3 NF041168
family 2 encapsulin nanocompartment cargo protein terpene cyclase;
4-311 1.61e-10


Pssm-ID: 469079 [Multi-domain]  Cd Length: 733  Bit Score: 61.95  E-value: 1.61e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432   4 FTFPNLYCPFPERKNPYSEFLQDYALQWVIRFKLIDSESLY--------QRFSKAKFYLLTAGAYPHCQLEELKIANDVI 75
Cdd:NF041168   3 FELPDFYVPYPARLNPHLEGARAHSKAWAREMGMLDSPGDGdgepvwdeADFDAHDYALLCAYTHPDAPAPELDLITDWY 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432  76 SWLFIWDDqcdisdlgkkpellktwcnRFLEI------LNGA--------ELTPDDL---------PLGFALRDIRNRII 132
Cdd:NF041168  83 VWVFFFDD-------------------HFLEAfkrtrdLAGArayldrlpAFMPVDPgtappeptnPVERGLADLWPRTV 143
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432 133 NRGGITFFHHFVRNFEDYFHGCIEEAHNRVNVSVPDVEAYIKIRSANAAAALCLNLIEFCDRVMIPYSLRNHETLKKLTQ 212
Cdd:NF041168 144 PTMSADWRRRFAESTRNLLEESLWELANISEGRVANPIEYIEMRRKVGGAPWSANLVEHAVGAEVPARVAASRPMRVLRD 223
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432 213 MTINILAWSNDIFSAPREIAN-GEVHNLVFVIHHHQKIPLEKAM-----LAAAAMHNHEVQKLVNLESKIASFSAETD-- 284
Cdd:NF041168 224 TFADAVHLRNDLFSYQREVEEeGELSNGVLVVERFLGCDTQRAAdlvndLLTSRLQQFEHTALTELPALFDEHGLDPAer 303
                        330       340
                 ....*....|....*....|....*..
gi 701462432 285 AEITKYISGLHAWIRGNLDWYAHSGRY 311
Cdd:NF041168 304 ADVLAYVKGLQDWQSGGHEWHMRSSRY 330
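
The two NF041168 hits above cover the same query interval (4-311) but align to different parts of the 733-residue model: positions 382-705 for the first hit and 3-330 for the second. A short Python sketch, with hypothetical helper names, shows how one might confirm that the two model regions do not overlap and how much of the model they span together.

```python
# Model length and model-side coordinates read from the two NF041168
# alignments above (first and last Cdd: coordinates of each hit).
MODEL_LENGTH = 733
MODEL_INTERVALS = [(382, 705), (3, 330)]


def span(interval):
    """Number of positions in an inclusive (start, end) interval."""
    start, end = interval
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by two inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def covered(intervals):
    """Total positions covered by the union of inclusive intervals."""
    total = 0
    current_start, current_end = None, None
    for start, end in sorted(intervals):
        if current_end is None or start > current_end + 1:
            # Start a new merged block and bank the previous one.
            if current_end is not None:
                total += current_end - current_start + 1
            current_start, current_end = start, end
        else:
            current_end = max(current_end, end)
    if current_end is not None:
        total += current_end - current_start + 1
    return total


if __name__ == "__main__":
    first, second = MODEL_INTERVALS
    print("overlap on model:", overlap(first, second))
    print("model positions covered:", covered(MODEL_INTERVALS), "of", MODEL_LENGTH)
```

A zero overlap would be consistent with the query matching two separate regions of the longer model, although the sketch itself says nothing about the biological interpretation.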
 
Terpene_cyclase_C1 cd00868
Terpene cyclases, Class 1; Terpene cyclases, Class 1 (C1) of the class 1 family of isoprenoid ...
25-305 8.03e-48

Terpene cyclases, Class 1; Terpene cyclases of class 1 (C1) belong to the class 1 family of isoprenoid biosynthesis enzymes, which share the 'isoprenoid synthase fold' and convert the linear, all-trans isoprenoids geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate into numerous cyclic forms of monoterpenes, sesquiterpenes, and diterpenes, respectively. Also included in this CD are the cis-trans terpene cyclases such as trichodiene synthase. The class 1 terpene cyclization reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive, electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices, with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl diphosphates via bridging Mg2+ ions, inducing proposed conformational changes that close the active site to solvent and stabilize reactive carbocation intermediates. The mechanistically and structurally distinct class 2 terpene cyclases and cis-IPPS are not included in this CD. Taxonomic distribution includes bacteria, fungi and plants.


Pssm-ID: 173837 [Multi-domain]  Cd Length: 284  Bit Score: 162.15  E-value: 8.03e-48
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432  25 QDYALQWVIRFKLIDSESlYQRFSKAKFYLLTAGAYPHCQL-EELKIANDVISWLFIWDDQCDISDLGKKPELLKTWCNR 103
Cdd:cd00868    6 LKELSRWWKELGLQEKLP-FARDRLVECYFWAAGSYFEPQYsEARIALAKTIALLTVIDDTYDDYGTLEELELFTEAVER 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432 104 fLEILNGAELTPDDLPLGFALRDIRNRIIN----RGGITFFHHFVRNFEDYFHGCIEEAHNRVNVSVPDVEAYIKIRSAN 179
Cdd:cd00868   85 -WDISAIDELPEYMKPVFKALYDLVNEIEEelakEGGSESLPYLKEAWKDLLRAYLVEAKWANEGYVPSFEEYLENRRVS 163
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462432 180 AAAALCLNLIEFCDRVMIP-YSLRNHETLKKLTQMTINILAWSNDIFSAPREIANGEVHNLVFVIHHHQKIPLEKAMLAA 258
Cdd:cd00868  164 IGYPPLLALSFLGMGDILPeEAFEWLPSYPKLVRASSTIGRLLNDIASYEKEIARGEVANSVECYMKEYGVSEEEALEEL 243
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*..
gi 701462432 259 AAMHNHEVQKLVNLESKiasfsaETDAEITKYISGLHAWIRGNLDWY 305
Cdd:cd00868  244 RKMIEEAWKELNEEVLK------LSSDVPRAVLETLLNLARGIYVWY 284
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
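
The E-value threshold above determines which domain hits are reported. As a rough illustration, the hits listed on this page can be collected into a small table, filtered against the threshold, and ranked. The record layout and function names below are hypothetical, and whether the real cutoff is inclusive or strict is not stated on this page, so the comparison used here is an assumption.

```python
# Hits transcribed from the "List of domain hits" on this page:
# (name, accession, query_start, query_end, e_value)
HITS = [
    ("Terpene_cyclase_nonplant_C1", "cd00687", 10, 309, 3.35e-110),
    ("Terpene_cyclase_C1", "cd00868", 25, 305, 8.03e-48),
    ("Terpene_syn_C_2", "pfam19086", 73, 268, 1.58e-40),
    ("f2_encap_cargo3", "NF041168", 4, 311, 3.16e-12),
    ("f2_encap_cargo3", "NF041168", 4, 311, 1.61e-10),
]

E_VALUE_THRESHOLD = 0.01  # from the preset options above


def passes(hit, threshold=E_VALUE_THRESHOLD):
    """Assumed rule: keep a hit whose E-value does not exceed the threshold."""
    return hit[4] <= threshold


def query_span(hit):
    """Length of the query interval covered by the hit (inclusive ends)."""
    return hit[3] - hit[2] + 1


def ranked(hits):
    """Hits that pass the threshold, most significant (smallest E-value) first."""
    return sorted((h for h in hits if passes(h)), key=lambda h: h[4])


if __name__ == "__main__":
    for name, accession, start, end, e_value in ranked(HITS):
        print(f"{accession:10s} {start:>3d}-{end:<3d} "
              f"span={query_span((name, accession, start, end, e_value)):3d} "
              f"E={e_value:.2e}  {name}")
```

Ranking by E-value puts the most specific model, cd00687, first, which matches the order in which the page lists its top hit.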
