NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM3380974 Query DataSets for GSM3380974
Status Public on Jan 31, 2019
Title S224A Dox Hi-C^2, rep1
Sample type SRA
 
Source name mESC
Organism Mus musculus
Characteristics cell type: ES cell (F1-2.1)
strain: CAST/Ei x 129/Sv/Jae
genotype: Tet-on CTCF S224A
treatment: Doxycycline (1ug/mL), 2 days
chip antibody: none
Treatment protocol Tet-on CTCF S224E mESCs were grown for six days with or without 1ug/mL doxycycline (RNA-seq). Tet-on CTCF, CTCF S224A, and CTCF S224E mESCs were grown for two days with or without 1ug/mL doxycycline (Hi-C^2).
Growth protocol ES cells were grown on γ-irradiated MEF feeders in ES medium (1% Pen/Strep, high-glucose DMEM+GlutaMAX, LIF) with 15% FBS.
Extracted molecule genomic DNA
Extraction protocol ChIP-seq: Harvested cells were crosslinked for 5 minutes in 1% formaldehyde. Crosslinking was quenched by the addition of glycine to 125mM. Fixed cells were PBS washed and cell pellets were frozen at -80C. Approximately 2x10^7 cells were used per ChIP. Pellets were resuspended in Lysis buffer 1 (25mM HEPES pH 7.5, 25mM EDTA, 0.5% SDS). Chromatin was sheared in a Qsonica Q800R for 20 minutes at 40% power 30sec on 30sec off at 4C to ~150bp. After centrifuging away debris, 4 volumes of Lysis buffer 2 (25mM HEPES pH 7.5, 187.5mM NaCl, 1.25% Triton X-100, 0.625% sodium deoxycholate, 6.25% glycerol, 12.5mM sodium pyrophosphate, 12.5mM B-glycerophosphate, 6.25mM NaF, 1.25mM Na3VO4) were added with protease inhibitors (1mM PMSF, 1ug/mL pepstatin, and 1 ug/mL leupeptin). Sepharose 4B (Sigma-Aldrich) was washed with ChIP Wash Buffer 1 (25mM HEPES pH 7.5, 150mM NaCl, 1% Triton X-100, 0.5% sodium deoxycholate, 5% glycerol, 5mM EDTA, 10mM sodium pyrophosphate, 10mM B-glycerophosphate, 5mM NaF, 1mM Na3VO4, protease inhibitors) and used to preclear the chromatin samples. 25μL/IP Protein G Dynabeads (ThermoFisher) were blocked in ChIP Wash Buffer 1 100ug/mL Acyclovir, 100ug/mL Zalcitabine, 5% Gly-Gly (Sigma-Aldrich). Beads were then washed with ChIP Wash Buffer 1. Precleared chromatin, 2ug of anti-CTCF or anti-CTCF Ser224-P antibodies (Table 1), and blocked Protein G Dynabeads were incubated together overnight at 4C with rotation. 10% of the precleared chromatin input was also saved. IP samples were washed with ChIP Wash Buffer 1, then with ChIP Wash Buffer 2 (25mM HEPES pH 7.5, 300mM NaCl, 1% Triton X-100, 0.5% sodium deoxycholate, 5% glycerol, 5mM EDTA, 10mM sodium pyrophosphate, 10mM B-glycerophosphate, 5mM NaF, 1mM Na3VO4, protease inhibitors) and then with TENG (50mM Tris-HCl pH 8.0, 1mM EDTA, 50mM NaCl, 5% glycerol, protease inhibitors). IPs were incubated for 30minutes at 37C in TENG with 200ug/mL RNase A. 10% Input samples were also treated with 200ug/mL RNase A. IPs were washed with ChIP Wash Buffer 2 and then TENG. IPs were then eluted at 65C in Elution Buffer (50mM Tris-HCl pH7.5, 10mM EDTA, 1% SDS). An equal volume of TE (Tris-HCl pH 8.0, 1mM EDTA) was added to the eluate with 40ug Proteinase K and incubated for 2hrs at 45C. The input samples were similarly protease treated in parallel. Both IP and input cross-links were reversed at 65C overnight and then phenol/chloroform/isoamyl alcohol extracted. The aqueous phase was further chloroform extracted and then alcohol precipitated. The DNA was suspended in TE.
RNA-seq: RNA was purified by TRIzol (Thermo Fisher) extraction via manufacturer’s instructions. Polyadenylated mRNA was the selected from total cell RNA using oligo(dT) beads (New England BioLabs).
Hi-C^2: In-situ Hi-C was performed as described in Rao et al. (2014): 3-5 million cells were crosslinked with 1% formaldehyde for 10min. Crosslinked cell pellets were lysed with 300uL ice-cold HiC lysis buffer (10mM Tris HCl pH 8, 10mM NaCl, 0.2% Igepal CA630) with protease inhibitors (Roche). Cell suspension was incubated for 30min at 4C with rotation, then pelleted at 2500xg for 5min. Pelleted nuclei were then washed with 500ul ice-cold HiC lysis buffer and resuspended in 50uL 0.5% SDS solution and incubated at 62C for 10min. 145ul H2O and 25ul 10% Triton X-100 were added to quench SDS and nuclei incubated at 37C for 15min. 26.1ul of 10X NEB Buffer 2 and 100U of MboI restriction enzyme were added and nuclei digested overnight at 37C with rotation. MboI was then heat activated at 62C for 20min, then nuclei cooled to room temperature. To fill in MboI fragment overhangs and mark DNA ends with biotin, 50ul of fill-in master mix was added (37.5ul 0.4mM biotin-14-dATP, 1.5ul 10mM dCTP, 1.5ul 10mM dGTP, 1.5ul 10mM dTTP, 8ul 5U/ul DNA Polymerase I, Large (Klenow) Fragment) and mixed by pipetting. Nuclei were then incubated at 37C for 1h with rotation. To ligate biotinylated DNA ends together, 900ul ligation master mix was added (649ul H2O, 120ul 10X NEB T4 ligase buffer, 100ul 10% Triton X-100, 6ul of 20mg/ml BSA, 25ul of 400U/ul T4 DNA ligase). Reaction was mixed by inverting and incubated at room temperature for 4h with rotation. Reaction was then centrifuged at 1700xg for 5min at room temperature. Pelleted nuclei were resuspended in 750ul TES buffer (1mM EDTA, 10mM Tris-HCl pH 8, 0.5% SDS), 50ul Proteinase K (20mg/ml) added and incubated at 55C for 1h with rotation. To reverse crosslinks, 88.9ul 3M NaCl was added and crosslinks reversed by incubated overnight at 65C with shaking (Eppendorf thermomixer, 950rpm). Samples were cooled to room temperature and DNA extracted with phase-lock gel tubes (5Prime). RNA was digested from purifying DNA by adding 2ul RNAse A (20mg/ml) and incubating at 37C for 1h. DNA was then extracted with phase-lock gel tubes (5Prime) and resuspended in 130ul 1X Tris buffer (10mM Tris-HCl, pH 8). To ensure complete resuspension, DNA was then incubated at 37C for 15min. DNA was then sonicated to a size of 300-500bp using a Covaris with the following parameters: 140W, 10% duty cycle, 200 bursts/cycle, 100s. Samples were then size selected to 300-500bp using AMPure XP beads and eluted in 300ul 1X Tris buffer. To pulldown biotinylated ligation products, 30ul (per sample) of MyOne Streptavidin C1 beads were washed with 400ul of 1X Tween washing buffer (5mM Tris-HCl pH 7.5, 0.5mM EDTA, 1M NaCl, 0.05% Tween 20) and resuspended in 300ul 2X binding buffer (10mM Tris-HCl pH 7.5, 1mM EDTA, 2M NaCl). Beads were then added to purified DNA and sample incubated at room temperature for 15min with rotation. Beads were washed twice with 600ul 1X Tween Washing buffer. Subsequent library preparation steps were then done directly on beads. HYbrid Capture Hi-C (Hi-C^2) probes were designed and hybridization to in-situ Hi-C libraries carried out as described previously (Sanborn et al., 2015). Probe sets were designed to enrich interactions in the region of interest: chrX:70,370,161-71,832,975 (mm9). Briefly, 120 bp probes were designed around the MboI restriction sites in the regions of interest and custom synthesized pools of single stranded oligodeoxynucleotides ordered from CustomArray, Inc. (Bothell, WA). Single stranded DNA oligos were amplified and biotinylated in a MAXIScript T7 transcription reaction (Ambion). The resulting biotinylated RNA probes were hybridized to 250-300 ng of in situ Hi-C libraries for 24 hours at 65C. DNA hybridized to the RNA probes was pulled down by streptavidin beads (Dynabeads MyOne Streptavidin C1, Life Technologies), washed, and eluted.
Libraries were constructed using the NEBNext ChIP-seq Library Preparation Kit with no modifications to the protocol. We used 12-18 cycles of PCR amplification, depending on library concentration measured by qPCR.
 
Library strategy Hi-C
Library source genomic
Library selection other
Instrument model Illumina HiSeq 2000
 
Data processing ChIP-seq: BWA was used to align reads against the mm9 reference genome (68). Alignments were filtered for uniquely mapped reads and duplicates were removed. Input-normalized coverage tracks were generated using SPP (Kharchenko et al., 2008). To identify prospective CTCF binding sites, we resolved regions of ChIP-Seq tag enrichment. Tag counts were analyzed in a 1 Kb window over the chromosome length with a 200bp step and estimated statistical significance of enrichment of ChIP vs input using negative binomial distribution, with the estimate of the mean based on the tag counts in input, and the size parameter(s) selected based on manual inspection of resulting peak calls. Regions of significant enrichment were generated by merging adjacent significantly enriched windows separated by 1 Kb or less. Broad regions of enrichment were called using SPP (Kharchenko et al., 2008). Genomic location of the resultant 50,749 CTCF and 912 CTCF Ser224-P peaks (z=6) was categorized using CEAS according to RefGene annotation of introns, exons, and UTRs (Shin et al., 2012). CTCF and CTCF Ser224-P data was visualized using IGV with ENCODE mESC CTCF ChIP-seq (GSM723015) and RNA-seq (GSM723776) data plotted for reference. Motif search of the ChIP-seq peak datasets (z=6) and logo generation was performed using MEME-ChIP with default parameters (http://meme-suite.org/)
RNA-seq: RNA-seq reads were trimmed using cutadapt with the options -e 0.2 -q 20 -a AGATCGGAAGAGC -m 20 --overlap 12. Trimmed reads were aligned to the mm9 reference genome using TopHat2 as previously described. PCR duplicates were removed and HOMER used to tabulate unique reads mapping to exons of each gene. DESeq2 was then used to identify differentially expressed (DE) genes. DE genes were counted as those with adjusted p-value < 0.01. Only genes with at least one count in one of the four samples were included in the analysis. For visualizing RNA-seq reads overlapping codon 224 of CTCF, RNA-seq reads were aligned to the CTCF S224E transgene sequence using the same parameters as above and an index of the CTCF S224E transgene sequence generated using bowtie2 with the default parameters. The resulting bam file of aligned reads was then visualized on IGV. Overrepresentation analysis of upregulated and downregulated DE genes was done using the PANTHER Classification System (http://www.pantherdb.org). Gene symbols for downregulated or upregulated DE genes used as input lists and the gene symbols of all expressed genes (i.e., all genes used in DESeq2 analysis) were used as the reference list. Metagene plots of ChIP-seq over DE upregulated, downregulated and all genes were generated via CEAS using bigwig files of reads from CTCF or CTCF Ser224-P ChIP-seq (Kharchenko et al., 2008).
Hi-C^2: Reads were trimmed using cutadapt with the options --adapter=GATCGATC (MboI ligation junction) and --minimum-length=20. Reads of each pair were individually mapped to the mus and cas reference genomes using novoalign and merged into non-allelic Hi-C summary files and filtered using HOMER. To avoid computational complexities arising from normalization of sparse, non-enriched regions in the Hi-C contact map, only Hi-C interactions falling within the capture region were analyzed further. For each capture, a custom script was used to pull out the filtered Hi-C interactions falling within the target region from the HOMER tag directories. Hi-C contact maps of the capture regions were then generated from these HOMER tags using the ‘pre’ command of Juicer tools. The resulting Hi-C contact maps in .hic format were visualized and normalized with the ‘Coverage (Sqrt)’ option in Juicebox . Insulation scores were computed for Hi-C2 datasets as previously described (Crane et al. 2015, Giorgetti et al. 2016) with a few modifications. Briefly, 25kb resolution ‘Coverage (Sqrt)’ normalized Hi-C2 interaction matrices were extracted from Juicebox using the ‘dump’ function of Juicer tools and converted to the cworld matrix format using custom shell and R scripts. Insulation scores were then computed from the cworld matrices using the ‘matrix2insulation.pl’ cworld script and the parameters ‘-v --is 125000 --ids 75000 –im sum’ and manually normalized using the total number of reads falling in the capture region, rather than total number of reads on the chromosome. TAD scores were then calculated from the insulation scores using the ‘insulation2tads.pl’ cworld script with default parameters and the Mecp2 TAD boundaries manually defined as the bins chrX:71200000-71225000 and chrX:71650000-71675000.
Genome_build: mm9
Supplementary_files_format_and_content: ChIP-seq: Input-normalized bigWig files (bw)
Supplementary_files_format_and_content: RNA-seq: bigWig files (bw) of fpm values. Raw exonic read counts in each RNA-seq library are summarized in 'counts' .txt files
Supplementary_files_format_and_content: Hi-C^2: Compressed Hi-C matrix (.hic) files for filtered Hi-C reads in the capture region (chrX:70,370,161-71,832,975 ). File can be loaded onto Juicebox to view either raw or normalized Hi-C interaction matrices at 5kb to 2.5Mb resolutions.
 
Submission date Sep 09, 2018
Last update date May 09, 2019
Contact name Andrea June Kriz
E-mail(s) ajkriz@g.harvard.edu
Phone 6177265943
Organization name Massachusetts General Hospital, Harvard Medical School
Department Molecular Biology
Lab Jeannie Lee Lab
Street address 185 Cambridge St
City Boston
State/province MA
ZIP/Postal code 02114
Country USA
 
Platform ID GPL13112
Series (1)
GSE119697 Exploration of CTCF post-translation modifications uncovers Serine-224 phosphorylation by PLK1 at pericentric regions during the G2/M transition
Relations
BioSample SAMN10023813
SRA SRX4663289

Supplementary file Size Download File type/resource
GSM3380974_CTCF_S224A_plus_dox.hic 3.4 Mb (ftp)(http) HIC
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap