Sample GSM1974103
Status Public on May 05, 2016
Title iPS_2i_Rep1_5C_library
Sample type SRA
 
Source name NPC-derived Induced Pluripotent Stem Cells
Organism Mus musculus
Characteristics cell type: NPC-derived Induced Pluripotent Stem Cells
strain: 129SvJae x C57BL/6
genotype/variation: Sox2-eGFP
Treatment protocol none
Growth protocol Induced pluripotent stem cells were derived from primary NPCs as previously described (Eminli et al., 2008). After reprogramming and initial expansion, iPS cells were cultured on Mitomycin-C inactivated MEFs in 2i serum-free media containing LIF, CHIR99021, and PD0325901 (Axon Medchem), as previously described (Rais et al., 2013). After 2i expansion, iPS cells were passaged onto 0.1% gelatin to remove contaminating feeder cells. Cells were grown to ~7e6 cells per 15 cm dish and fixed with 1% formaldehyde before the downstream assay.
Extracted molecule genomic DNA
Extraction protocol 3C templates were generated using HindIII as previously described (Dekker et al., 2002; van Berkum and Dekker, 2009). 5C libraries were generated from 3C templates using an alternating primer design across 1-2 Mb regions around Oct4, Nanog, Sox2, Nestin, Olig1-Olig2, Klf4, and a gene desert negative control, as previously described (Dostie and Dekker, 2007; Dostie et al., 2006; van Berkum and Dekker, 2009).
 
Library strategy OTHER
Library source genomic
Library selection other
Instrument model Illumina NextSeq 500
 
Description Chromosome-Conformation-Capture-Carbon-Copy (5C) Library
BED_ES-NPC-iPS-LOCI_mm9.bed
Beagan_et_al_5C_processed_data_file_2_23_2016_10samples.txt
BED_binned_mm9_20kbbin_4kbstep-10samples.bed
Data processing Paired-end reads were aligned to a pseudo-genome consisting of all 5C primers using Bowtie (http://bowtie-bio.sourceforge.net/index.shtml) (Langmead, 2010). Only reads with one unique alignment were considered for downstream analyses. Interactions were counted when both paired-end reads could be uniquely mapped to the 5C primer pseudo-genome. Only interactions between forward-reverse primer pairs were tallied as a true count. Counts were converted to contact matrices for each genomic region queried (i.e. ~1-2 Megabase regions around developmentally regulated genes Nanog, Oct4, Sox2, Nestin, Olig, Klf4) to generate the raw .counts file for each sample.
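The counting rule described above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' pipeline: the input is assumed to be already reduced to one primer ID per mate (or None when a mate had no unique alignment), and the function name `tally_pairs`, the `is_forward` lookup, and the primer IDs are hypothetical.

```python
from collections import Counter


def tally_pairs(read_pairs, is_forward):
    """Count forward-reverse primer interactions from paired-end hits.

    read_pairs: iterable of (hit1, hit2); each hit is the primer ID that a
        mate aligned to uniquely, or None if it had no unique alignment.
    is_forward: hypothetical dict mapping primer ID -> True for forward
        primers and False for reverse primers.
    """
    counts = Counter()
    for hit1, hit2 in read_pairs:
        # Both mates must map uniquely to the primer pseudo-genome.
        if hit1 is None or hit2 is None:
            continue
        # Only forward-reverse combinations are tallied as a true count.
        if is_forward[hit1] == is_forward[hit2]:
            continue
        fwd, rev = (hit1, hit2) if is_forward[hit1] else (hit2, hit1)
        counts[(fwd, rev)] += 1
    return counts


# Toy input with made-up primer IDs.
is_forward = {"F1": True, "F2": True, "R1": False}
pairs = [("F1", "R1"), ("R1", "F1"), ("F1", "F2"), ("F2", None), ("F2", "R1")]
counts = tally_pairs(pairs, is_forward)
```

In this toy input the forward-forward pair and the pair with an unmapped mate are discarded, and the two orientations of F1/R1 collapse onto one key.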
To correct for bias related to intrinsic properties of restriction fragments (i.e. G-C content; fragment size) and procedural batch effects, we converted our raw .counts data to ‘Observed’ values through a series of pre-processing steps. Briefly, raw .counts matrices were (i) trimmed of primers with fewer than 100 total counts in any replicate, (ii) quantile normalized across biological conditions, (iii) normalized for primer biases as previously described (Phillips-Cremins et al., 2013), (iv) trimmed of low-information primer-primer pairs as described below, (v) log transformed, (vi) binned into 4 kilobase (kb) bins and (vii) smoothed using a sliding 20 kb smoothing window with a 4 kb step size. A primer-primer pair was set to NaN and removed from downstream analyses if it did not cross a threshold of 10 counts in any one library. Four-kb bins were withheld from downstream processing if greater than 80% of the primer-primer pairs within that bin’s smoothing window were NaN. Next, we developed an empirical ‘Expected’ model of the distance-dependent background level of non-specific chromatin interactions. We computed our ‘Expected’ values locally (i.e. for each developmentally regulated region independently) by averaging the observed values across all bin-bin pairs representing equidistant interactions in each region. We then assigned each bin-bin pair a “log2(Observed/Expected)” value by subtracting the region-specific Expected value for a given bin-bin contact distance from the Observed value of bins at the same distance (both values are already log transformed, so the difference is a log ratio). To compute p-values for each individual bin-bin pair, we modeled our ‘Observed over Expected’ values as a Logistic distribution with location/scale parameters computed independently for each region and each biological replicate.
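A minimal sketch of the Expected model and p-value step described above, for one region and one replicate. Several details are assumptions that the record does not state: distance is taken as the difference between integer bin indices, the logistic location and scale are estimated by the method of moments (the record does not say how they were fitted), and the p-value is taken as the upper-tail probability. All function names are hypothetical.

```python
import math
from collections import defaultdict
from statistics import mean, pstdev


def expected_by_distance(obs):
    """Average the observed (log-scale) values over all equidistant bin pairs.

    obs: dict mapping (bin_i, bin_j) -> observed value; NaN pairs are skipped.
    """
    by_dist = defaultdict(list)
    for (i, j), value in obs.items():
        if not math.isnan(value):
            by_dist[abs(i - j)].append(value)
    return {dist: mean(values) for dist, values in by_dist.items()}


def obs_minus_exp(obs, expected):
    """log(Observed/Expected) as a difference of log-scale values."""
    return {
        (i, j): value - expected[abs(i - j)]
        for (i, j), value in obs.items()
        if not math.isnan(value)
    }


def logistic_upper_tail(values):
    """Upper-tail p-values under a logistic distribution.

    Assumption: location = mean and scale = sd * sqrt(3) / pi
    (method of moments), standing in for the unspecified fit.
    """
    xs = list(values.values())
    loc = mean(xs)
    scale = pstdev(xs) * math.sqrt(3) / math.pi
    return {k: 1.0 / (1.0 + math.exp((x - loc) / scale)) for k, x in values.items()}


# Toy region with four bins; one pair is NaN and therefore ignored.
obs = {
    (0, 1): 5.0, (1, 2): 5.0, (2, 3): 8.0,
    (0, 2): 3.0, (1, 3): 3.0,
    (0, 3): float("nan"),
}
expected = expected_by_distance(obs)
diff = obs_minus_exp(obs, expected)
pvals = logistic_upper_tail(diff)
```

In the toy data the pair (2, 3) sits well above the distance-1 average, so it receives the smallest p-value of the five usable pairs.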
The resulting ‘Interaction Score’ (computed as -10*log2(p-value)) was comparable within and between replicates and allowed for robust detection of fragment-to-fragment looping interactions that were significant above the expected background signal for each genomic region. The processed data files Beagan_et_al_5C_processed_data_file_2_23_2016_6samples.txt and Beagan_et_al_5C_processed_data_file_2_23_2016_10samples.txt were created for 6 sample (V6.5_Rep1, V6.5_Rep2, pNPC_Rep1, pNPC_Rep2, iPS_Rep1, iPS_Rep2) and 10 sample (V6.5_Rep1, V6.5_Rep2, pNPC_Rep1, pNPC_Rep2, iPS_Rep1, iPS_Rep2, V6.52i_Rep1, V6.52i_Rep2, iPS2i_Rep1, iPS2i_Rep2) experimental analyses.
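The Interaction Score formula given above translates directly into code. The sketch below is illustrative only; the function name `interaction_score` and the input check are assumptions, not part of the submitted pipeline.

```python
import math


def interaction_score(p_value):
    """Interaction Score as stated in the text: -10 * log2(p-value)."""
    if not 0.0 < p_value <= 1.0:
        raise ValueError("p-value must lie in (0, 1]")
    return -10.0 * math.log2(p_value)


score_half = interaction_score(0.5)
score_quarter = interaction_score(0.25)
```

Each halving of the p-value adds 10 to the score, so smaller p-values map to larger scores.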
Supplementary_files_format_and_content: The processed data files (Beagan_et_al_5C_processed_data_file_2_23_2016_6samples.txt, Beagan_et_al_5C_processed_data_file_2_23_2016_10samples.txt) contain the following information for each bin-bin pair: “Chromosome” (chromosome containing the 5C region), “Region” (5C Region), “Bin1 ID” (unique identifier of the downstream bin), “Bin2 ID” (unique identifier of the upstream bin), “Bin1 Start” (genomic coordinate of the start of the downstream bin), “Bin1 End” (genomic coordinate of the end of the downstream bin), “Bin2 Start” (genomic coordinate of the start of the upstream bin), “Bin2 End” (genomic coordinate of the end of the upstream bin), “Distance” (mid-to-mid distance between interaction bins), “_obs.counts” (normalized, logged, binned and smoothed interaction counts between Bin1 and Bin2), “_exp.counts” (expected interaction counts as determined by distance-dependent expected model), “_obs_over_exp.counts” (calculated by subtracting Expected counts from Observed counts), “_obs_over_exp_pvalues.counts” (p-value for a specific bin-bin interaction calculated by fitting Observed over Expected counts to a Logistic distribution). The processed data files Beagan_et_al_5C_processed_data_file_2_23_2016_6samples.txt and Beagan_et_al_5C_processed_data_file_2_23_2016_10samples.txt were created for 6 sample (V6.5_Rep1, V6.5_Rep2, pNPC_Rep1, pNPC_Rep2, iPS_Rep1, iPS_Rep2) and 10 sample (V6.5_Rep1, V6.5_Rep2, pNPC_Rep1, pNPC_Rep2, iPS_Rep1, iPS_Rep2, V6.52i_Rep1, V6.52i_Rep2, iPS2i_Rep1, iPS2i_Rep2) experimental analyses. NOTE: Figures comparing chromatin architecture between only ES, NPC, and iPS cells were generated using the ‘6 sample’ processed data, while figures including the ES in 2i and iPS in 2i conditions were generated from the ‘10 sample’ processed data.
Supplementary_files_format_and_content: The raw primer bed file (BED_ES-NPC-iPS-LOCI_mm9.bed) contains the following information for each individual primer used in the 5C experiment: “Chromosome” (chromosome containing the 5C region), “Start Site” (genomic coordinate of the start of the primer fragment), “End Site” (genomic coordinate of the end of the primer fragment), “Primer ID” (unique identifier within our primer set).
Supplementary_files_format_and_content: The binned primer bed file (BED_binned_mm9_20kbbin_4kbstep-6samples.bed, BED_binned_mm9_20kbbin_4kbstep-10samples.bed) contains the following information for each bin: “Chromosome” (chromosome containing the 5C region), “Start Site” (genomic coordinate of the start of the bin), “End Site” (genomic coordinate of the end of the bin), “Bin ID” (unique identifier within our primer set). BED_binned_mm9_20kbbin_4kbstep-6samples.bed and BED_binned_mm9_20kbbin_4kbstep-10samples.bed were created for 6 sample (V6.5_Rep1, V6.5_Rep2, pNPC_Rep1, pNPC_Rep2, iPS_Rep1, iPS_Rep2) and 10 sample (V6.5_Rep1, V6.5_Rep2, pNPC_Rep1, pNPC_Rep2, iPS_Rep1, iPS_Rep2, V6.52i_Rep1, V6.52i_Rep2, iPS2i_Rep1, iPS2i_Rep2) experimental analyses.
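The file names above point to 20 kb windows advanced in 4 kb steps, matching the smoothing described under Data processing. The sketch below shows one way such windows could be enumerated across a region. It is an assumption about the layout, not a description of how the deposited BED files were built; the function name and the half-open coordinate convention are hypothetical.

```python
def sliding_windows(region_start, region_end, width=20_000, step=4_000):
    """Return (start, end) windows of `width` bp advanced by `step` bp.

    Assumption: half-open coordinates, and only windows that fit entirely
    inside [region_start, region_end) are returned.
    """
    last_start = region_end - width
    return [
        (start, start + width)
        for start in range(region_start, last_start + 1, step)
    ]


# Toy 40 kb region starting at coordinate 0.
windows = sliding_windows(0, 40_000)
```

With these defaults, consecutive windows overlap by 16 kb, which is what makes the resulting signal smooth.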
Supplementary_files_format_and_content: The raw counts files (v65_Rep1.counts, v65_Rep2.counts, v65_2i_Rep1.counts, v65_2i_Rep2.counts, pNPC_Rep1.counts, pNPC_Rep2.counts, iPS_Rep1.counts, iPS_Rep2.counts, iPS_2i_Rep1.counts, iPS_2i_Rep2.counts) were generated by first aligning paired-end reads to a pseudo-genome consisting of all 5C primers using Bowtie (http://bowtie-bio.sourceforge.net/index.shtml) (Langmead, 2010). Only reads with one unique alignment were considered for downstream analyses. Interactions were counted when both paired-end reads could be uniquely mapped to the 5C primer pseudo-genome. Only interactions between forward-reverse primer pairs were tallied as a true count. Counts were converted to contact matrices for each genomic region queried (i.e. ~1-2 Megabase regions around developmentally regulated genes Nanog, Oct4, Sox2, Nestin, Olig, Klf4). The raw counts files (v65_Rep1.counts, v65_Rep2.counts, v65_2i_Rep1.counts, v65_2i_Rep2.counts, pNPC_Rep1.counts, pNPC_Rep2.counts, iPS_Rep1.counts, iPS_Rep2.counts, iPS_2i_Rep1.counts, iPS_2i_Rep2.counts) contain the following information for each primer-primer pair used for downstream analyses: “Forward primer ID” (the forward primer in this primer-primer pair), “Reverse primer ID” (the reverse primer in this primer-primer pair), “Count” (the number of mapped reads to this primer-primer pair).
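As an illustration of the three-column layout described above, the sketch below loads a counts file into a nested dictionary that acts as a sparse contact matrix. The whitespace delimiter, the absence of a header row, integer counts, and the primer IDs in the toy input are all assumptions; check them against the actual file before relying on this.

```python
import io
from collections import defaultdict


def read_counts(handle):
    """Parse 'forward_primer reverse_primer count' lines into a nested dict."""
    matrix = defaultdict(dict)
    for line in handle:
        line = line.strip()
        # Skip blank lines and comment lines, if any are present.
        if not line or line.startswith("#"):
            continue
        fwd, rev, count = line.split()
        matrix[fwd][rev] = int(count)
    return matrix


# Toy input with made-up primer IDs standing in for a decompressed file.
example = io.StringIO("F1\tR1\t12\nF1\tR2\t3\nF2\tR1\t7\n")
matrix = read_counts(example)
```

For the gzipped supplementary file, the same function can be given a handle opened with `gzip.open(path, "rt")`.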
Supplementary_files_format_and_content: The final interaction classification file (Beagan_et_al_5C_processed_classification_data_file_2_23_2016_6samples.txt) contains the following information for each bin-bin pair that falls into one of our classifications: “Chromosome” (chromosome containing the 5C region), “Region” (5C Region), “Classification”
 
Submission date Dec 17, 2015
Last update date May 15, 2019
Contact name Jennifer E Phillips-Cremins
E-mail(s) jcremins@seas.upenn.edu
Organization name University of Pennsylvania
Department Bioengineering
Street address 415 Curie Blvd
City Philadelphia
State/province Pennsylvania
ZIP/Postal code 19104
Country USA
 
Platform ID GPL19057
Series (1)
GSE68582 Local Genome Topology Can Exhibit an Incompletely Rewired 3D-Folding State During Somatic Cell Reprogramming
Relations
BioSample SAMN04348387
SRA SRX1488350

Supplementary file Size Download File type/resource
GSM1974103_ips_2i_Rep1.counts.txt.gz 2.2 Mb (ftp)(http) TXT
Raw data are available in SRA
Processed data provided as supplementary file
Processed data are available on Series record
