|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Feb 24, 2021 |
Title |
236_Wang_HiC_HG003 |
Sample type |
SRA |
|
|
Source name |
Staphylococcus aureus HG003
|
Organism |
Staphylococcus aureus |
Characteristics |
protocol: TSB, 37degree C genotype: S. aureus subsp. aureus NCTC8325 with rsbU and tcaR repaired restriction enzyme: HindIII strain: HG003
|
Treatment protocol |
Cells were growing in TSB medium at 37˚C, fixed by 3% formaldehyde for 30min at room temperature
|
Growth protocol |
Bacillus subtilis strains were derived from the prototrophic strain PY79 (Youngman et al 1983, PMID: 6300908). Cells were grown in defined rich medium (CH) (Harwood and Cutting 1990, Molecular Biological Methods for Bacillus) at specified temperature. Staphylococcus aureus strains were derived from HG003 (Herbert et al 2010, PMID: 20212089). Cells were grown in tryptic soy broth (TSB) at specified temperature.
|
Extracted molecule |
genomic DNA |
Extraction protocol |
Hi-C experiments were performed using HindIII according to previous publications (Le et al, 2013, PMID: 24158908; Wang et al 2015, PMID: 26253537). ChIP-seq experiments were performed using anti-SMC, or anti-His antibodies according to previous publication (Graham et al 2014, PMID: 24829297). Standard library construction protocol was used for Illumina HiSeq2500, Nextseq550, or MiSeq sequencing platforms. Illumina Truseq indexes were used.
|
|
|
Library strategy |
OTHER |
Library source |
genomic |
Library selection |
other |
Instrument model |
NextSeq 550 |
|
|
Description |
S. aureus subsp. aureus NCTC8325 with rsbU and tcaR repaired
|
Data processing |
Library strategy: HiC-Seq Reads from each end of a DNA fragment were represented in each of the two FASTQ files generated by Illumina paired-end sequencing. For Bacillus subtilis Hi-C, we mapped reads in each FASTQ file back to the genome of Bacillus subtilis PY79 (NCBI Reference Sequence: NC_022898.1) independently by Bowtie version 2.1.0 to preserve the order of reads. All the processing steps afterwards were performed using an in-house Perl scripts. For Staphylococcus aureus Hi-C, we mapped reads in each FASTQ file back to the genome of Staphylococcus aureus NCTC8325 genome (NCBI Reference Sequence: NC_007795.1) independently by Bowtie version 2.1.0 to preserve the order of reads. All the processing steps afterwards were performed using an in-house Perl scripts. For ChIP-seq, we mapped reads in each FASTQ file back to the genome of Bacillus subtilis PY79 (NCBI Reference Sequence: NC_022898.1) or Staphylococcus aureus NCTC8325 genome (NCBI Reference Sequence: NC_007795.1) independently using CLC genomics workbench 8.0, and exported .csv file. For Hi-C, the two resulting SAM files were then merged together. Unaligned reads were discarded. For Hi-C, the Bacillus subtilis (PY79) and Staphylococcus aureus (NCTC8325) genomes was divided into HindIII restriction fragments. Each aligned read was sorted into its corresponding restriction fragment. We inferred that a DNA fragment resulted from non-ligation or self-ligation if reads from both ends were from the same restriction fragment; these reads were discarded. Only DNA fragments for which the reads came from different restriction fragments were retained and used for construction of a Hi-C contact map. For Hi-C, the genomes of Bacillus subtilis (PY79) and Staphylococcus aureus (NCTC8325) was then divided into bins 10 kb and the remaining reads were allocated to their corresponding bin. We then counted the number of fragments having reads with in different bins. A raw Hi-C contact map is the matrix of read counts in which each entry, mij, indicates the number of fragments with ends mapping to bins i and j. The raw Hi-C contact map/matrix is biased due to the uneven distribution of restriction enzyme sites and, to a lesser extent, differences in GC content and the mappability of individual reads. We normalized raw contact maps using an iterative normalization procedure (Imakaev et al., 2012 PMID: 22941365). Essentially, we converted the number of interactions, or read counts, into Hi-C scores by applying the following equation and iteratively repeating it for the resulting contact map after each cycle: mij =mij * (total reads) / (total reads in bin i * total reads in bin j). The iterative procedure was repeated until the maximum relative error of the total number of Hi-C scores in a bin was less than 10-5. Resulting matrices were normalized so that Hi-C scores for each row and column sum to 1. Subsequent analysis and visualization was done using R scripts. Genome_build: NC_022898.1; NC_007795.1 Supplementary_files_format_and_content: Files ending with .matrix.txt: tab-delimited text files of the iteratively-corrected Hi-C contact maps/matrices. Files ending with .csv: comma-separated values files of ChIP-seq results. Supplementary_files_format_and_content: matrix.txt files: 10kb bins of B. subtilis genome; normalized Hi-C interaction scores Supplementary_files_format_and_content: csv files: genome position; reads; number of reads at each genome position
|
|
|
Submission date |
Feb 04, 2020 |
Last update date |
Feb 25, 2021 |
Contact name |
Xindan Wang |
E-mail(s) |
xindan@iu.edu
|
Organization name |
Indiana University at Bloomington
|
Department |
Biology
|
Lab |
Biology Building 225
|
Street address |
1001 E 3rd St
|
City |
Bloomington |
State/province |
IN |
ZIP/Postal code |
47405 |
Country |
USA |
|
|
Platform ID |
GPL28116 |
Series (1) |
GSE144742 |
XerD is required to unload bacterial SMC complexes at the replication terminus |
|
Relations |
BioSample |
SAMN13981328 |
SRA |
SRX7671104 |
Supplementary file |
Size |
Download |
File type/resource |
GSM4294722_236_Wang_HiC_HG003.matrix.txt.gz |
87.0 Kb |
(ftp)(http) |
TXT |
SRA Run Selector |
Raw data are available in SRA |
Processed data provided as supplementary file |
|
|
|
|
|