NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|701462440|dbj|BAP82247|]
View 

terpene synthase [synthetic construct]

Protein Classification

terpene synthase family protein( domain architecture ID 10090936)

terpene synthase family protein similar to pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 with either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Terpene_cyclase_nonplant_C1 cd00687
Non-plant Terpene Cyclases, Class 1; This CD includes terpenoid cyclases such as pentalenene ...
10-309 1.78e-124

Non-plant Terpene Cyclases, Class 1; This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 with either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene. As with other enzymes with the 'terpenoid synthase fold', they have two conserved metal binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzymes. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function in the monomeric form and are found in fungi, bacteria and Dictyostelium.


:

Pssm-ID: 173835 [Multi-domain]  Cd Length: 303  Bit Score: 358.60  E-value: 1.78e-124
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  10 YCPFPSQTNKYVDVLEEYSLEWVLRFNLLANESAYKRFCKSKFFFLAASAYPDSKFEELKITHDWLSWVFIWDDQCDLSE 89
Cdd:cd00687    1 PSPFPYRLNPYVKEAQDEYLEWVLEEMLIPSEKAEKRFLSADFGDLAALFYPDADDERLMLAADLMAWLFVFDDLLDRDQ 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  90 lkKQPEVLNNFHQRYLEILNGAELTS--QDTLFSHALIDLRKRTLQRASIKWFNYFISYLEDYFYGCVQEATNRAKGIVP 167
Cdd:cd00687   81 --KSPEDGEAGVTRLLDILRGDGLDSpdDATPLEFGLADLWRRTLARMSAEWFNRFAHYTEDYFDAYIWEGKNRLNGHVP 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 168 DLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSNDIFSASREI-ASGDVHNLIFVLHYH 246
Cdd:cd00687  159 DVAEYLEMRRFNIGADPCLGLSEFIGGPEVPAAVRLDPVMRALEALASDAIALVNDIYSYEKEIkANGEVHNLVKVLAEE 238
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 701462440 247 KKISLEKAIEQVVKIHNEEVHSLIKVESSLSFFS--EELDVEITKYISGMHSWIRGNLDWCYESY 309
Cdd:cd00687  239 HGLSLEEAISVVRDMHNERITQFEELEASLIKSGdlEEESPAVRAYVEGLHNWISGNLDWHRTSP 303
 
Name Accession Description Interval E-value
Terpene_cyclase_nonplant_C1 cd00687
Non-plant Terpene Cyclases, Class 1; This CD includes terpenoid cyclases such as pentalenene ...
10-309 1.78e-124

Non-plant Terpene Cyclases, Class 1; This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 with either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene. As with other enzymes with the 'terpenoid synthase fold', they have two conserved metal binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzymes. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function in the monomeric form and are found in fungi, bacteria and Dictyostelium.


Pssm-ID: 173835 [Multi-domain]  Cd Length: 303  Bit Score: 358.60  E-value: 1.78e-124
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  10 YCPFPSQTNKYVDVLEEYSLEWVLRFNLLANESAYKRFCKSKFFFLAASAYPDSKFEELKITHDWLSWVFIWDDQCDLSE 89
Cdd:cd00687    1 PSPFPYRLNPYVKEAQDEYLEWVLEEMLIPSEKAEKRFLSADFGDLAALFYPDADDERLMLAADLMAWLFVFDDLLDRDQ 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  90 lkKQPEVLNNFHQRYLEILNGAELTS--QDTLFSHALIDLRKRTLQRASIKWFNYFISYLEDYFYGCVQEATNRAKGIVP 167
Cdd:cd00687   81 --KSPEDGEAGVTRLLDILRGDGLDSpdDATPLEFGLADLWRRTLARMSAEWFNRFAHYTEDYFDAYIWEGKNRLNGHVP 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 168 DLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSNDIFSASREI-ASGDVHNLIFVLHYH 246
Cdd:cd00687  159 DVAEYLEMRRFNIGADPCLGLSEFIGGPEVPAAVRLDPVMRALEALASDAIALVNDIYSYEKEIkANGEVHNLVKVLAEE 238
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 701462440 247 KKISLEKAIEQVVKIHNEEVHSLIKVESSLSFFS--EELDVEITKYISGMHSWIRGNLDWCYESY 309
Cdd:cd00687  239 HGLSLEEAISVVRDMHNERITQFEELEASLIKSGdlEEESPAVRAYVEGLHNWISGNLDWHRTSP 303
Terpene_syn_C_2 pfam19086
Terpene synthase family 2, C-terminal metal binding;
73-267 3.12e-53

Terpene synthase family 2, C-terminal metal binding;


Pssm-ID: 465972 [Multi-domain]  Cd Length: 199  Bit Score: 173.17  E-value: 3.12e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440   73 DWLSWVFIWDDQCDLSELKKQ-PEVLNNFHQRYLEILN--GAELTSQDTLFSHALIDLRKRTLQRASIKWFNYFISYLED 149
Cdd:pfam19086   1 KWLAWLFILDDIYDEVYGTLEeLELFTEAIERWDALLPldGPELPEYMKPLYRALADLWERLAKEASPDWRRRFKEAWKD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  150 YFYGCVQEATNRAKGIVPDLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSNDIFSASR 229
Cdd:pfam19086  81 YLDAYLWEAKWRASGYVPTLEEYLELRRVTSGVPPLLALIEFGLGIELPDEVFEHPVVRRLVRAASDIVRLVNDLFSYKK 160
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 701462440  230 EIASGDVHNLIFVLHYHKKISLEKAIEQVVKIHNEEVH 267
Cdd:pfam19086 161 EQARGDVHNLVLVLMKEYGVSLQEAVDEVGELIEEAWK 198
f2_encap_cargo3 NF041168
family 2 encapsulin nanocompartment cargo protein terpene cyclase;
7-311 7.69e-21

family 2 encapsulin nanocompartment cargo protein terpene cyclase;


Pssm-ID: 469079 [Multi-domain]  Cd Length: 733  Bit Score: 93.15  E-value: 7.69e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440   7 PGLYCPFPSQTNKYVDVLEEYSLEWVLRFNLL-ANESAY----KRFCKSKFFFLAASAYPDSKFEELKITHDWLSWVFIW 81
Cdd:NF041168 385 PDFYMPYPLRLNPHLDAARRNSKEWARRMGMLdVVPGVGvwdeRKFDGADFALCAAGIHPDAPAAELDLSADWLTWGTYG 464
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  82 DD--------QCDLSELKKQPEVLNNFHQryleiLNGAELTSQDTLFSHALIDLRKRTLQRASIKWFNYFISYLEDYFYG 153
Cdd:NF041168 465 DDyfpvvfgrTRDLAGAKAFNARLSAFMP-----LDAGPLPVPTNPLERGLADLWSRTAGPMSPEARRAFRRAVEDMLES 539
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 154 CVQEATNRAKGIVPDLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSNDIFSASREIA- 232
Cdd:NF041168 540 WLWELANQIQNRVPDPVDYIEMRRKTFGSDLTMSLSRLAHGDSLPPEVFRTRPMRALENAAADYACLTNDIFSYQKEIEf 619
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 233 SGDVHNLIFVLHYHKKISLEKAIEQVVKIHNEEVHSLIK-VESSLSFFSEELDV------EITKYISGMHSWIRGNLDWC 305
Cdd:NF041168 620 EGELHNGVLVVQRFLDCDRQQAVAVVNDLMTARMRQFEHiVATELPALFEEFGLdaeareALRGYVEELQDWMAGILHWH 699

                 ....*.
gi 701462440 306 YESYRY 311
Cdd:NF041168 700 RGTSRY 705
f2_encap_cargo3 NF041168
family 2 encapsulin nanocompartment cargo protein terpene cyclase;
7-313 4.01e-12

family 2 encapsulin nanocompartment cargo protein terpene cyclase;


Pssm-ID: 469079 [Multi-domain]  Cd Length: 733  Bit Score: 66.96  E-value: 4.01e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440   7 PGLYCPFPSQTNKYVDVLEEYSLEWVLRFNLLANESAY--------KRFCKSKFFFLAASAYPDSKFEELKITHDWLSWV 78
Cdd:NF041168   6 PDFYVPYPARLNPHLEGARAHSKAWAREMGMLDSPGDGdgepvwdeADFDAHDYALLCAYTHPDAPAPELDLITDWYVWV 85
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  79 FIWDDQcdLSELKKQPEVLNNfHQRYLEIL----------------NGAELtsqdtlfshALIDLRKRTLQRASIKWFNY 142
Cdd:NF041168  86 FFFDDH--FLEAFKRTRDLAG-ARAYLDRLpafmpvdpgtappeptNPVER---------GLADLWPRTVPTMSADWRRR 153
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 143 FISYLEDYFYGCVQEATNRAKGIVPDLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSN 222
Cdd:NF041168 154 FAESTRNLLEESLWELANISEGRVANPIEYIEMRRKVGGAPWSANLVEHAVGAEVPARVAASRPMRVLRDTFADAVHLRN 233
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 223 DIFSASREIA-SGDVHNLIFVLHYHKKISLEKAIEQVvkihNEEVHSLIK-----VESSLSFFSEELDVE------ITKY 290
Cdd:NF041168 234 DLFSYQREVEeEGELSNGVLVVERFLGCDTQRAADLV----NDLLTSRLQqfehtALTELPALFDEHGLDpaeradVLAY 309
                        330       340
                 ....*....|....*....|...
gi 701462440 291 ISGMHSWIRGNLDWCYESYRYHN 313
Cdd:NF041168 310 VKGLQDWQSGGHEWHMRSSRYMN 332
 
Name Accession Description Interval E-value
Terpene_cyclase_nonplant_C1 cd00687
Non-plant Terpene Cyclases, Class 1; This CD includes terpenoid cyclases such as pentalenene ...
10-309 1.78e-124

Non-plant Terpene Cyclases, Class 1; This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 with either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene. As with other enzymes with the 'terpenoid synthase fold', they have two conserved metal binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzymes. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function in the monomeric form and are found in fungi, bacteria and Dictyostelium.


Pssm-ID: 173835 [Multi-domain]  Cd Length: 303  Bit Score: 358.60  E-value: 1.78e-124
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  10 YCPFPSQTNKYVDVLEEYSLEWVLRFNLLANESAYKRFCKSKFFFLAASAYPDSKFEELKITHDWLSWVFIWDDQCDLSE 89
Cdd:cd00687    1 PSPFPYRLNPYVKEAQDEYLEWVLEEMLIPSEKAEKRFLSADFGDLAALFYPDADDERLMLAADLMAWLFVFDDLLDRDQ 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  90 lkKQPEVLNNFHQRYLEILNGAELTS--QDTLFSHALIDLRKRTLQRASIKWFNYFISYLEDYFYGCVQEATNRAKGIVP 167
Cdd:cd00687   81 --KSPEDGEAGVTRLLDILRGDGLDSpdDATPLEFGLADLWRRTLARMSAEWFNRFAHYTEDYFDAYIWEGKNRLNGHVP 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 168 DLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSNDIFSASREI-ASGDVHNLIFVLHYH 246
Cdd:cd00687  159 DVAEYLEMRRFNIGADPCLGLSEFIGGPEVPAAVRLDPVMRALEALASDAIALVNDIYSYEKEIkANGEVHNLVKVLAEE 238
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 701462440 247 KKISLEKAIEQVVKIHNEEVHSLIKVESSLSFFS--EELDVEITKYISGMHSWIRGNLDWCYESY 309
Cdd:cd00687  239 HGLSLEEAISVVRDMHNERITQFEELEASLIKSGdlEEESPAVRAYVEGLHNWISGNLDWHRTSP 303
Terpene_syn_C_2 pfam19086
Terpene synthase family 2, C-terminal metal binding;
73-267 3.12e-53

Terpene synthase family 2, C-terminal metal binding;


Pssm-ID: 465972 [Multi-domain]  Cd Length: 199  Bit Score: 173.17  E-value: 3.12e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440   73 DWLSWVFIWDDQCDLSELKKQ-PEVLNNFHQRYLEILN--GAELTSQDTLFSHALIDLRKRTLQRASIKWFNYFISYLED 149
Cdd:pfam19086   1 KWLAWLFILDDIYDEVYGTLEeLELFTEAIERWDALLPldGPELPEYMKPLYRALADLWERLAKEASPDWRRRFKEAWKD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  150 YFYGCVQEATNRAKGIVPDLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSNDIFSASR 229
Cdd:pfam19086  81 YLDAYLWEAKWRASGYVPTLEEYLELRRVTSGVPPLLALIEFGLGIELPDEVFEHPVVRRLVRAASDIVRLVNDLFSYKK 160
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 701462440  230 EIASGDVHNLIFVLHYHKKISLEKAIEQVVKIHNEEVH 267
Cdd:pfam19086 161 EQARGDVHNLVLVLMKEYGVSLQEAVDEVGELIEEAWK 198
Terpene_cyclase_C1 cd00868
Terpene cyclases, Class 1; Terpene cyclases, Class 1 (C1) of the class 1 family of isoprenoid ...
23-305 1.53e-51

Terpene cyclases, Class 1; Terpene cyclases, Class 1 (C1) of the class 1 family of isoprenoid biosynthesis enzymes, which share the 'isoprenoid synthase fold' and convert linear, all-trans, isoprenoids, geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate into numerous cyclic forms of monoterpenes, diterpenes, and sesquiterpenes. Also included in this CD are the cis-trans terpene cyclases such as trichodiene synthase. The class I terpene cyclization reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl phosphates via bridging Mg2+ ions, inducing proposed conformational changes that close the active site to solvent, stabilizing reactive carbocation intermediates. Mechanistically and structurally distinct, class II terpene cyclases and cis-IPPS are not included in this CD. Taxonomic distribution includes bacteria, fungi and plants.


Pssm-ID: 173837 [Multi-domain]  Cd Length: 284  Bit Score: 171.78  E-value: 1.53e-51
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  23 VLEEYSlEWVLRFNLLANESaYKRFCKSKFFFLAASAYPDSKF-EELKITHDWLSWVFIWDDQCDLSELKKQPEVLNNFH 101
Cdd:cd00868    5 ELKELS-RWWKELGLQEKLP-FARDRLVECYFWAAGSYFEPQYsEARIALAKTIALLTVIDDTYDDYGTLEELELFTEAV 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 102 QRyLEILNGAELTSQDTLFSHALIDLRKRTLQRAS----IKWFNYFISYLEDYFYGCVQEATNRAKGIVPDLDTYIMIRR 177
Cdd:cd00868   83 ER-WDISAIDELPEYMKPVFKALYDLVNEIEEELAkeggSESLPYLKEAWKDLLRAYLVEAKWANEGYVPSFEEYLENRR 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 178 SSVGVYAVLALSEFCNQFIIP-DVLRNHHLVKKLELITTDIIAWSNDIFSASREIASGDVHNLIFVLHYHKKISLEKAIE 256
Cdd:cd00868  162 VSIGYPPLLALSFLGMGDILPeEAFEWLPSYPKLVRASSTIGRLLNDIASYEKEIARGEVANSVECYMKEYGVSEEEALE 241
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*....
gi 701462440 257 QVVKIHNEEVHSLIKVESSLSffseelDVEITKYISGMHSWIRGNLDWC 305
Cdd:cd00868  242 ELRKMIEEAWKELNEEVLKLS------SDVPRAVLETLLNLARGIYVWY 284
f2_encap_cargo3 NF041168
family 2 encapsulin nanocompartment cargo protein terpene cyclase;
7-311 7.69e-21

family 2 encapsulin nanocompartment cargo protein terpene cyclase;


Pssm-ID: 469079 [Multi-domain]  Cd Length: 733  Bit Score: 93.15  E-value: 7.69e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440   7 PGLYCPFPSQTNKYVDVLEEYSLEWVLRFNLL-ANESAY----KRFCKSKFFFLAASAYPDSKFEELKITHDWLSWVFIW 81
Cdd:NF041168 385 PDFYMPYPLRLNPHLDAARRNSKEWARRMGMLdVVPGVGvwdeRKFDGADFALCAAGIHPDAPAAELDLSADWLTWGTYG 464
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  82 DD--------QCDLSELKKQPEVLNNFHQryleiLNGAELTSQDTLFSHALIDLRKRTLQRASIKWFNYFISYLEDYFYG 153
Cdd:NF041168 465 DDyfpvvfgrTRDLAGAKAFNARLSAFMP-----LDAGPLPVPTNPLERGLADLWSRTAGPMSPEARRAFRRAVEDMLES 539
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 154 CVQEATNRAKGIVPDLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSNDIFSASREIA- 232
Cdd:NF041168 540 WLWELANQIQNRVPDPVDYIEMRRKTFGSDLTMSLSRLAHGDSLPPEVFRTRPMRALENAAADYACLTNDIFSYQKEIEf 619
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 233 SGDVHNLIFVLHYHKKISLEKAIEQVVKIHNEEVHSLIK-VESSLSFFSEELDV------EITKYISGMHSWIRGNLDWC 305
Cdd:NF041168 620 EGELHNGVLVVQRFLDCDRQQAVAVVNDLMTARMRQFEHiVATELPALFEEFGLdaeareALRGYVEELQDWMAGILHWH 699

                 ....*.
gi 701462440 306 YESYRY 311
Cdd:NF041168 700 RGTSRY 705
f2_encap_cargo3 NF041168
family 2 encapsulin nanocompartment cargo protein terpene cyclase;
7-313 4.01e-12

family 2 encapsulin nanocompartment cargo protein terpene cyclase;


Pssm-ID: 469079 [Multi-domain]  Cd Length: 733  Bit Score: 66.96  E-value: 4.01e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440   7 PGLYCPFPSQTNKYVDVLEEYSLEWVLRFNLLANESAY--------KRFCKSKFFFLAASAYPDSKFEELKITHDWLSWV 78
Cdd:NF041168   6 PDFYVPYPARLNPHLEGARAHSKAWAREMGMLDSPGDGdgepvwdeADFDAHDYALLCAYTHPDAPAPELDLITDWYVWV 85
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440  79 FIWDDQcdLSELKKQPEVLNNfHQRYLEIL----------------NGAELtsqdtlfshALIDLRKRTLQRASIKWFNY 142
Cdd:NF041168  86 FFFDDH--FLEAFKRTRDLAG-ARAYLDRLpafmpvdpgtappeptNPVER---------GLADLWPRTVPTMSADWRRR 153
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 143 FISYLEDYFYGCVQEATNRAKGIVPDLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSN 222
Cdd:NF041168 154 FAESTRNLLEESLWELANISEGRVANPIEYIEMRRKVGGAPWSANLVEHAVGAEVPARVAASRPMRVLRDTFADAVHLRN 233
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 701462440 223 DIFSASREIA-SGDVHNLIFVLHYHKKISLEKAIEQVvkihNEEVHSLIK-----VESSLSFFSEELDVE------ITKY 290
Cdd:NF041168 234 DLFSYQREVEeEGELSNGVLVVERFLGCDTQRAADLV----NDLLTSRLQqfehtALTELPALFDEHGLDpaeradVLAY 309
                        330       340
                 ....*....|....*....|...
gi 701462440 291 ISGMHSWIRGNLDWCYESYRYHN 313
Cdd:NF041168 310 VKGLQDWQSGGHEWHMRSSRYMN 332
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH