U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

SRX20598916: GSM7457379: PADI Library 1, Bin 7; Saccharomyces cerevisiae; OTHER
1 ILLUMINA (Illumina NovaSeq 6000) run: 3.8M spots, 1.2G bases, 402.1Mb downloads

External Id: GSM7457379_r1
Submitted by: Strader Lab, Biology, Duke University
Study: High Throughput Plant Activation Domain Identification and Mapping
show Abstracthide Abstract
Arabidopsis gene expression is regulated by more than 1,900 transcription factors (TFs), which have been identified genome-wide by the presence of well-conserved DNA binding domains. Activator TFs contain activation domains (ADs) that recruit coactivator complexes; however, for most Arabidopsis TFs, we lack knowledge about the presence, location, and transcriptional strength of their ADs. To address this gap, we experimentally identified Arabidopsis ADs on a proteome-wide scale, finding that over half of Arabidopsis TFs carry an AD. We annotated 1,553 ADs, the vast majority of which were previously unknown. We used the dataset generated to develop a neural network to accurately predict ADs and to identify sequence features necessary to recruit coactivator complexes. We uncovered six distinct sequence feature combinations that resulted in activation activity, providing a framework to interrogate activation domain sub-functionalization. Furthermore, we identified activation domains within the ancient AUXIN RESPONSE FACTOR (ARF) family of transcription factors, finding conservation of AD positioning in distinct clades. Our findings provide a deep resource for understanding transcriptional activation, a framework for examination of function within intrinsically disordered regions, and a predictive model of activation domains. Overall design: Arabidopsis TFs (PADI), or ARF regions (ARF Evolution and Pilot) were fragmented into 40 amino acid tiles with a 10 amino acid step size were made as oligo pools and cloned into plasmid libraries. Yeast transformations ensured integration of a single synthetic TF, each expressing one tile, at the URA3 locus. TF activity was induced with beta-estradiol and TF activity was measured using cell sorting based on the ratio of reporter to synthetic TF (GFP:mcherry). The distribution of tiles across GFP:mCherry ratio bins was determined with Next-generation sequencing of the integrated TF. Median GFP:mcherry values were used to calculate activation domain scores.
Sample: PADI Library 1, Bin 7
SAMN35639120 • SRS17898972 • All experiments • All runs
Library:
Name: GSM7457379
Instrument: Illumina NovaSeq 6000
Strategy: OTHER
Source: GENOMIC
Selection: other
Layout: PAIRED
Construction protocol: Sorted cells were grown overnight and then pelleted. Genomic DNA was extracted from cell pellets with YeaStar Genomic DNA Kit (Zymo Research). Sequencing libraries were generated through three PCR reactions. PCR1 amplified a 600bp fragment that contained the tested fragment from the integrated locus. PCR2 amplified the fragment itself and added phasing and Illumina sequencing adapters. PCR3 completed the Illumina sequencing adapters and added indexes to each sample that were specific for each bin and yeast library tested.
Runs: 1 run, 3.8M spots, 1.2G bases, 402.1Mb
Run# of Spots# of BasesSizePublished
SRR248344633,810,3181.2G402.1Mb2024-05-19

ID:
28027625

Supplemental Content

Search details

See more...

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...