NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE34399 Query DataSets for GSE34399
Status Public on Apr 27, 2012
Title Replication Timing by Repli-seq from ENCODE/University of Washington
Project ENCODE
Organism Homo sapiens
Experiment type Other
Summary This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (Richard Sandstrom mailto:sull@u.washington.edu). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu).

This track is produced as part of the ENCODE Project. This track shows genome-wide assessment of DNA replication timing in cell lines using the sequencing-based "Repli-seq" methodology (see below). Replication timing is known to be an important feature for epigenetic control of gene expression that usually operates at a higher-order level than at the level of specific genes. For each experiment (cell line, replicate), replication timing was ascertained by the isolation and sequencing of newly replicated DNA from six cell cycle fractions: G1/G1b, S1, S2, S3, S4, G2 (six fraction profile). Replication patterns are visualized as a continuous function based on sequencing tag density (Percentage-normalized Signal) and as a wavelet-smoothed transform of the six fraction profile (Wavelet-smoothed Signal). Replication peaks corresponding to replication initiation zones (Peaks) and valleys corresponding to replication termination zones (Valleys) were determined from local maxima and minima, respectively, in the wavelet-smoothed signal data. A measure of relative copy number at each genomic location (Summed Densities) was determined by summing the normalized tag density values of each cell cycle fraction at that location (equals one replicated genome equivalent).

For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf
 
Overall design Cells were grown according to the approved ENCODE cell culture protocols. Repli-seq was performed as described by Hansen et al. (2010). Briefly, newly replicated DNA was labeled in vivo with a pulse of 5-bromo-2-deoxyuridine (BrdU), cells were fractionated into six different parts of the cell cycle by flow cytometry according to DNA content, cell cycle fractionated DNA was sonicated and an anti-BrdU monoclonal antibody was used to isolate the newly replicating DNA. Fragment ends were sequenced using the Illumina Genome Analyzer II or HiSeq platforms (36 bp reads). Some experiments (BJ, K562, BG02ES, GM06990) were originally performed and mapped to an earlier version of the human reference genome NCBI36/hg18 (Hansen et al., 2010) and were remapped to the more recent reference genome GRCh37/hg19.

Uniquely mapping high-quality reads were mapped to the genome minus the Y chromosome. Replication signals within each six cell cycle fraction were derived from the density of sequence tags mapping within a 50 kb sliding window (stepped 1 kb across the genome); these densities were normalized to 4 million tags per genome. To avoid variation due to copy number or sequence bias, cell cycle-specific replication signals at each location were determined as a percentage of the sum of the six normalized tag density signals (Percentage-normalized Signal).

To transform the six fraction replication signals into one track (Wavelet-smoothed Signal), the percentage-normalized signals at each location were used to calculate a weighted average value based on the average DNA content of each fraction according to flow cytometry [higher values correspond to earlier replication; formula=(0.917*G1b)+(0.750*S1)+(0.583*S2)+(0.417*S3)+(0.250*S4)+(0*G2)]. These weighted average data were smoothed by wavelet transformation [J7 level, corresponding to a scale of 128 kb; see Thurman et al. (2007)].

Replication initiation zones were flagged by determining local maxima in the wavelet-smoothed data (Peaks) and, similarly, replication termination zones were flagged by local minima (Valleys).

The sum of the 4 million normalized replication tag densities correspond to replication of one genome and can, therefore, be used as a measure of relative genomic copy number (Summed Densities). This is useful for evaluation of unusual replication patterns, such as "biphasic" ones where replication has both early and late components [as described by Hansen et al. (2010)].
Web link http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg19&g=wgEncodeUwRepliSeq
 
Contributor(s) Sandstrom R
Citation missing Has this study been published? Please login to update or notify GEO.
BioProject PRJNA63447
Submission date Dec 13, 2011
Last update date May 15, 2019
Contact name ENCODE DCC
E-mail(s) encode-help@lists.stanford.edu
Organization name ENCODE DCC
Street address 300 Pasteur Dr
City Stanford
State/province CA
ZIP/Postal code 94305-5120
Country USA
 
Platforms (1)
GPL9115 Illumina Genome Analyzer II (Homo sapiens)
Samples (15)
GSM923439 UW_RepliSeq_GM12812
GSM923440 UW_RepliSeq_GM12801
GSM923441 UW_RepliSeq_SK-N-SH
Relations
SRA SRP012560

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE34399_RAW.tar 1.1 Gb (http)(custom) TAR (of BED, BIGWIG)
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap