NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE159503 Query DataSets for GSE159503
Status Public on Oct 14, 2020
Title Effects of sheared chromatin length on ChIP-seq quality and sensitivity
Organism Mus musculus
Experiment type Genome binding/occupancy profiling by high throughput sequencing
Third-party reanalysis
Summary Chromatin immunoprecipitation followed by massively parallel, high throughput sequencing (ChIP-seq) is the method of choice for identifying, on a genome-wide scale, the segments of DNA bound by specific transcription factors (TFs) or in chromatin with particular histone modifications. However, the quality of ChIP-seq datasets vary over a wide range, with a substantial fraction being of intermediate to poor quality. Such experimental variability can lead to many false positives or false negatives, impairing the ability to interpret the data. Thus, it is important to discern and control the factors that contribute to variation in ChIP-seq. In this study we focus on the sonication step to produce sheared chromatin, a variable controllable by the user and applicable to all ChIP-seq protocols. We systematically varied the amount of shearing of fixed chromatin from a mouse erythroid cell line, carefully measured the distribution of resultant fragment lengths using the Agilent Bioanalyzer 2100, and then immunoprecipitated these batches of chromatin using highly specific antibodies against either TAL1 or CTCF. We found that the level of sonication, which was affected by both the number of sonication cycles, as well as the starting cell number, had a pronounced impact on the quality of resulting ChIP-seq signals. Specifically, over-sonication led to degradation of quality (e.g. increased background and reduction in signal), while the impact of under-sonication of chromatin differed between the two transcription factors, leading to the loss of sites occupied by TAL1 but not those bound by CTCF. We leveraged these findings to produce a set of CTCF ChIPs-seq datasets in primary hematopoietic progenitor cells, including several rare cell types. Together, these results suggest that the amount of sonication is a key variable in success of ChIP-seq experiments, and that carefully monitoring the level of chromatin sonication is one way to improve ChIP-seq quality and reproducibility, which in turn facilitates low input ChIP-seq in rare cell types.
 
Overall design We systematically varied the extent of sonication of fixed chromatin from G1E-ER4+E2 cells, carefully measured the distribution of resultant fragment lengths using the Agilent Bioanalyzer 2100, and then immunoprecipitated these batches of chromatin using highly specific antibodies against either TAL1 or CTCF. Following ChIP-seq library preparation, these samples were sequenced on the Illumina NextSeq 500. The datasets were then examined by both subjective visual assessments of the signal tracks at various loci on the genome browser, as well as established, genome-wide, objective quality metrics to determine the effects of sheared chromatin length on ChIP-seq quality and sensitivity. These findings were then leveraged to produce a set of CTCF ChIP-seq datasets in 8 bone marrow-derived, hematopoietic stem and progenitor cells, including several rare cell types.
Samples GSM946520 (GSE36028, GSE49847, GSE51338) and GSM995439 (GSE36029, GSE49847, GSE51338) were used as the input samples for the G1E-ER4+E2 samples.
 
Contributor(s) Keller CA, Wixom AQ, Heuston EF, Giardine BM, Hsiung CC, Long MR, Miller A, Wixom AQ, Anderson SM, Cockburn AK, Blobel GA, Bodine DM, Hardison RC
Citation(s) 33788948
Submission date Oct 13, 2020
Last update date Aug 24, 2021
Contact name Ross Hardison
E-mail(s) rch8@psu.edu
Organization name Pennsylvania State University
Street address 303 Wartik Lab
City University Park
State/province PA
ZIP/Postal code 16802
Country USA
 
Platforms (3)
GPL13112 Illumina HiSeq 2000 (Mus musculus)
GPL16417 Illumina MiSeq (Mus musculus)
GPL19057 Illumina NextSeq 500 (Mus musculus)
Samples (65)
GSM4831211 CTCF G1E-ER4+E2 [692]
GSM4831212 CTCF G1E-ER4+E2 [693]
GSM4831213 CTCF G1E-ER4+E2 [694]
Relations
Reanalysis of GSM946520
Reanalysis of GSM995439
BioProject PRJNA669079
SRA SRP287324

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE159503_1295.mm10.broadpeak.gz 282.9 Kb (ftp)(http) BROADPEAK
GSE159503_1908.mm10.broadpeak.gz 302.8 Kb (ftp)(http) BROADPEAK
GSE159503_1909.mm10.broadpeak.gz 130.3 Kb (ftp)(http) BROADPEAK
GSE159503_1910.mm10.broadpeak.gz 333.3 Kb (ftp)(http) BROADPEAK
GSE159503_1926.mm10.broadpeak.gz 105.3 Kb (ftp)(http) BROADPEAK
GSE159503_1927.mm10.broadpeak.gz 448.7 Kb (ftp)(http) BROADPEAK
GSE159503_1976.mm10.broadpeak.gz 385.7 Kb (ftp)(http) BROADPEAK
GSE159503_1977.mm10.broadpeak.gz 505.3 Kb (ftp)(http) BROADPEAK
GSE159503_1978.mm10.broadpeak.gz 210.3 Kb (ftp)(http) BROADPEAK
GSE159503_1979.mm10.broadpeak.gz 544.1 Kb (ftp)(http) BROADPEAK
GSE159503_1980.mm10.broadpeak.gz 189.7 Kb (ftp)(http) BROADPEAK
GSE159503_335.mm10.broadpeak.gz 253.7 Kb (ftp)(http) BROADPEAK
GSE159503_366.mm10.broadpeak.gz 89.8 Kb (ftp)(http) BROADPEAK
GSE159503_369.mm10.broadpeak.gz 295.9 Kb (ftp)(http) BROADPEAK
GSE159503_485.mm10.broadpeak.gz 50.3 Kb (ftp)(http) BROADPEAK
GSE159503_505.mm10.broadpeak.gz 5.2 Kb (ftp)(http) BROADPEAK
GSE159503_580.mm10.broadpeak.gz 159.0 Kb (ftp)(http) BROADPEAK
GSE159503_581.mm10.broadpeak.gz 7.4 Kb (ftp)(http) BROADPEAK
GSE159503_582.mm10.broadpeak.gz 3.0 Kb (ftp)(http) BROADPEAK
GSE159503_583.mm10.broadpeak.gz 46.4 Kb (ftp)(http) BROADPEAK
GSE159503_584.mm10.broadpeak.gz 23.1 Kb (ftp)(http) BROADPEAK
GSE159503_585.mm10.broadpeak.gz 17.7 Kb (ftp)(http) BROADPEAK
GSE159503_670.mm10.broadpeak.gz 4.2 Kb (ftp)(http) BROADPEAK
GSE159503_671.mm10.broadpeak.gz 40.1 Kb (ftp)(http) BROADPEAK
GSE159503_672.mm10.broadpeak.gz 65.2 Kb (ftp)(http) BROADPEAK
GSE159503_673.mm10.broadpeak.gz 115.4 Kb (ftp)(http) BROADPEAK
GSE159503_674.mm10.broadpeak.gz 133.3 Kb (ftp)(http) BROADPEAK
GSE159503_692.mm10.broadpeak.gz 202.4 Kb (ftp)(http) BROADPEAK
GSE159503_693.mm10.broadpeak.gz 233.5 Kb (ftp)(http) BROADPEAK
GSE159503_694.mm10.broadpeak.gz 207.6 Kb (ftp)(http) BROADPEAK
GSE159503_695.mm10.broadpeak.gz 286.0 Kb (ftp)(http) BROADPEAK
GSE159503_696.mm10.broadpeak.gz 186.5 Kb (ftp)(http) BROADPEAK
GSE159503_702.mm10.broadpeak.gz 65.3 Kb (ftp)(http) BROADPEAK
GSE159503_703.mm10.broadpeak.gz 57.5 Kb (ftp)(http) BROADPEAK
GSE159503_704.mm10.broadpeak.gz 51.3 Kb (ftp)(http) BROADPEAK
GSE159503_705.mm10.broadpeak.gz 24.8 Kb (ftp)(http) BROADPEAK
GSE159503_706.mm10.broadpeak.gz 8.4 Kb (ftp)(http) BROADPEAK
GSE159503_707.mm10.broadpeak.gz 39.0 Kb (ftp)(http) BROADPEAK
GSE159503_732.mm10.broadpeak.gz 208.8 Kb (ftp)(http) BROADPEAK
GSE159503_733.mm10.broadpeak.gz 157.4 Kb (ftp)(http) BROADPEAK
GSE159503_734.mm10.broadpeak.gz 169.1 Kb (ftp)(http) BROADPEAK
GSE159503_735.mm10.broadpeak.gz 171.4 Kb (ftp)(http) BROADPEAK
GSE159503_736.mm10.broadpeak.gz 94.5 Kb (ftp)(http) BROADPEAK
GSE159503_785.mm10.broadpeak.gz 211.4 Kb (ftp)(http) BROADPEAK
GSE159503_786.mm10.broadpeak.gz 92.7 Kb (ftp)(http) BROADPEAK
GSE159503_787.mm10.broadpeak.gz 172.0 Kb (ftp)(http) BROADPEAK
GSE159503_788.mm10.broadpeak.gz 39.1 Kb (ftp)(http) BROADPEAK
GSE159503_789.mm10.broadpeak.gz 178.1 Kb (ftp)(http) BROADPEAK
GSE159503_Keller_etal_2021_Supplemental_Tables_GEO.xlsx 94.5 Kb (ftp)(http) XLSX
GSE159503_RAW.tar 159.5 Gb (http)(custom) TAR (of BROADPEAK, BW)
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap