|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on May 13, 2024 |
Title |
Genetics, energetics and allostery during a billion years of hydrophobic protein core evolution |
Organism |
Saccharomyces cerevisiae |
Experiment type |
Other
|
Summary |
Protein folding is driven by the burial of hydrophobic amino acids in a tightly-packed core that excludes water. The genetics, biophysics and evolution of hydrophobic cores are not well understood, in part because of a lack of systematic experimental data on sequence combinations that do - and do not - constitute stable and functional cores. Here we randomize protein hydrophobic cores and evaluate their stability and function at scale. The data show that vast numbers of amino acid combinations can constitute stable protein cores but that these alternative cores frequently disrupt protein function because of allosteric effects. These strong allosteric effects are not due to complicated, highly epistatic fitness landscapes but rather, to the pervasive nature of allostery, with many individually small energy changes combining to disrupt function. Indeed both protein stability and ligand binding can be accurately predicted over very large evolutionary distances using additive energy models with a small contribution from pairwise energetic couplings. As a result, energy models trained on one protein can accurately predict core stability across hundreds of millions of years of protein evolution, with only rare energetic couplings that we experimentally identify limiting the transplantation of cores between highly diverged proteins. Our results reveal the simple energetic architecture of protein hydrophobic cores and suggest that allostery is a major constraint on sequence evolution.
|
|
|
Overall design |
We built combinatorial libraries in the hydrophobic cores of three small protein domains (FYN-SH3, CI-2A and CspA) using a reduced alphabet consisting of the amino acids F, L, M, V, I encoded by the DTS degenerate codon. By bottlenecking and pooling the libraries, in the sparse_DTS_core_mutagenesis experiment we sparsely measured the intracellular abundance of protein variants in yeast cells using abundancePCA, a protein complementation assay that couples cell growth rate with query protein intracellular abundance under selection by methotrexate. For the SH3 domain of the human FYN kinase, we selected a few query core amino acid combinations that are severely deleterious in abundance fitness and designed a suppressor "permissivity" library by introducing non-core mutations associated with SH3 domains naturally carrying such query core combinations that are deleterious in FYN (FYN-SH3_core_permissivity experiment). Also for FYN-SH3, we assessed the impact of core reconfiguration in function by measuring the binding to its short linear motif ligand PRD1super using bindingPCA, a protein complementation assay that couples cell growth rate with query variant intracellular binding to an interacting partner under selection by methotrexate (FYN-SH3_core_DTS_binding experiment).
|
Web link |
https://www.biorxiv.org/content/10.1101/2024.05.11.593672v1
|
|
|
Contributor(s) |
Escobedo A, Voigt G, Faure AJ, Lehner B |
Citation missing |
Has this study been published? Please login to update or notify GEO. |
|
Submission date |
Apr 30, 2024 |
Last update date |
May 13, 2024 |
Contact name |
Albert Escobedo |
Phone |
+34933160209
|
Organization name |
CRG
|
Department |
Systems and synthetic biology
|
Lab |
Ben Lehner
|
Street address |
Carrer Dr Aiguader 88
|
City |
Barcelona |
State/province |
Catalonia |
ZIP/Postal code |
08003 |
Country |
Spain |
|
|
Platforms (2) |
GPL19756 |
Illumina NextSeq 500 (Saccharomyces cerevisiae) |
GPL31112 |
NextSeq 2000 (Saccharomyces cerevisiae) |
|
Samples (16)
|
|
Relations |
BioProject |
PRJNA1106529 |
Supplementary file |
Size |
Download |
File type/resource |
GSE266299_FYN-SH3_core_DTS_binding_scores_PRD1super.csv.gz |
2.0 Mb |
(ftp)(http) |
CSV |
GSE266299_FYN-SH3_core_permissivity_scores.csv.gz |
500.7 Kb |
(ftp)(http) |
CSV |
GSE266299_sparse_DTS_core_mutagenesis_scores.csv.gz |
1.1 Mb |
(ftp)(http) |
CSV |
SRA Run Selector |
Raw data are available in SRA |
|
|
|
|
|