|
Status |
Public on Dec 01, 2014 |
Title |
Comparative epigenomics of human and mouse erythropoiesis |
Sample organisms |
Homo sapiens; Mus musculus; Austrofundulus limnaeus |
Experiment type |
Expression profiling by high throughput sequencing Genome binding/occupancy profiling by high throughput sequencing Third-party reanalysis
|
Summary |
Erythropoiesis is one of the best understood examples of cellular differentiation. Morphologically, erythroid differentiation proceeds in a nearly identical fashion between humans and mice, but recent evidence has shown that networks of gene expression governing this process are divergent between species. We undertook a systematic comparative analysis of six histone modifications and four transcriptional master regulators in primary pro-erythroblasts and erythroid cell lines to better understand the underlying basis of these transcriptional differences. Our analyses suggest that while chromatin structure across orthologous promoters is strongly conserved, subtle differences are associated with transcriptional divergence between species. Many transcription factor (TF) occupancy sites were poorly conserved across species (~25% for GATA1, TAL1, and NFE2), but these factors were considerably more conserved between proerythroblasts and cell lines derived from the same species. We found that certain cis-regulatory modules co-occupied by GATA1, TAL1, and KLF1 are under strict evolutionary constraint and localize to genes necessary for erythroid cell identity. More generally, we show that conserved TF occupancy sites are indicative of active regulatory regions and strong gene expression sustained during maturation. Our results suggest that evolutionary turnover of TF binding sites drives transcriptional divergence partially by mediating changes in the underlying chromatin structure. We provide examples of how this framework can be applied to understand epigenomic variation in specific regulatory regions, such as the Beta-globin gene locus. Our findings have important implications for understanding epigenomic changes that mediate variation in cellular differentiation across species, while also providing a valuable resource for studies of hematopoiesis.
|
|
|
Overall design |
Examination of 6 histone marks and 4 transcription factors across human pro-erythroblasts, mouse pro-erythroblasts, K562 cells, and G1E cells 'Extract protocol' and 'library construction protocol' were performed as detailed in: GSE53983, GSE40522, GSE52924, GSE47492, GSE50406, GSE36028, GSE48020, GSE43626, GSE36985, GSE36589, GSE31477, GSE30142, GSE18868, GSE27893, GSE26501
|
|
|
Contributor(s) |
Ulirsch JC, Sankaran VG |
Citation(s) |
25521328 |
|
Submission date |
Jul 28, 2014 |
Last update date |
Dec 22, 2014 |
Contact name |
Vijay G Sankaran |
E-mail(s) |
sankaran@broadinstitute.org
|
Organization name |
Broad Institute
|
Street address |
7 Cambridge Center
|
City |
Cambridge |
State/province |
MA |
ZIP/Postal code |
02142 |
Country |
USA |
|
|
Relations |
Reanalysis of |
GSM1067274 |
Reanalysis of |
GSM1067275 |
Reanalysis of |
GSM1067276 |
Reanalysis of |
GSM1067277 |
Reanalysis of |
GSM1067279 |
Reanalysis of |
GSM1067280 |
Reanalysis of |
GSM1151146 |
Reanalysis of |
GSM1151147 |
Reanalysis of |
GSM1165493 |
Reanalysis of |
GSM1218225 |
Reanalysis of |
GSM1278238 |
Reanalysis of |
GSM1278240 |
Reanalysis of |
GSM1278241 |
Reanalysis of |
GSM1304777 |
Reanalysis of |
GSM1304778 |
Reanalysis of |
GSM1304779 |
Reanalysis of |
GSM1304780 |
Reanalysis of |
GSM1304781 |
Reanalysis of |
GSM1304782 |
Reanalysis of |
GSM1304783 |
Reanalysis of |
GSM1304784 |
Reanalysis of |
GSM1304785 |
Reanalysis of |
GSM1304786 |
Reanalysis of |
GSM1304787 |
Reanalysis of |
GSM1304788 |
Reanalysis of |
GSM1304789 |
Reanalysis of |
GSM1304790 |
Reanalysis of |
GSM1304791 |
Reanalysis of |
GSM1304792 |
Reanalysis of |
GSM1304793 |
Reanalysis of |
GSM1304794 |
Reanalysis of |
GSM1304795 |
Reanalysis of |
GSM1304796 |
Reanalysis of |
GSM1304797 |
Reanalysis of |
GSM1304798 |
Reanalysis of |
GSM1304799 |
Reanalysis of |
GSM1304800 |
Reanalysis of |
GSM1304801 |
Reanalysis of |
GSM1304802 |
Reanalysis of |
GSM1304803 |
Reanalysis of |
GSM467647 |
Reanalysis of |
GSM651545 |
Reanalysis of |
GSM651547 |
Reanalysis of |
GSM688808 |
Reanalysis of |
GSM688809 |
Reanalysis of |
GSM688810 |
Reanalysis of |
GSM688811 |
Reanalysis of |
GSM688812 |
Reanalysis of |
GSM688816 |
Reanalysis of |
GSM746581 |
Reanalysis of |
GSM746582 |
Reanalysis of |
GSM746583 |
Reanalysis of |
GSM746584 |
Reanalysis of |
GSM897224 |
Reanalysis of |
GSM908035 |
Reanalysis of |
GSM908037 |
Reanalysis of |
GSM908039 |
Reanalysis of |
GSM90804 |
Reanalysis of |
GSM908047 |
Reanalysis of |
GSM908049 |
Reanalysis of |
GSM908073 |
Reanalysis of |
GSM922955 |
Reanalysis of |
GSM922956 |
Reanalysis of |
GSM922957 |
Reanalysis of |
GSM922958 |
Reanalysis of |
GSM935465 |
Reanalysis of |
GSM946524 |
Reanalysis of |
GSM946536 |
Reanalysis of |
GSM946543 |
Reanalysis of |
GSM946544 |
Reanalysis of |
GSM946547 |
Reanalysis of |
GSM995529 |
Reanalysis of |
GSM995530 |
BioProject |
PRJNA256270 |
Supplementary file |
Size |
Download |
File type/resource |
GSE59801_0_GATA1_conserved.bed.gz |
27.1 Kb |
(ftp)(http) |
BED |
GSE59801_0_KLF1_conserved.bed.gz |
9.7 Kb |
(ftp)(http) |
BED |
GSE59801_0_NFE2_conserved.bed.gz |
32.4 Kb |
(ftp)(http) |
BED |
GSE59801_0_TAL1_conserved.bed.gz |
11.8 Kb |
(ftp)(http) |
BED |
GSE59801_1_GATA1_compensatory.bed.gz |
13.2 Kb |
(ftp)(http) |
BED |
GSE59801_1_KLF1_compensatory.bed.gz |
9.7 Kb |
(ftp)(http) |
BED |
GSE59801_1_NFE2_compensatory.bed.gz |
17.1 Kb |
(ftp)(http) |
BED |
GSE59801_1_TAL1_compensatory.bed.gz |
3.7 Kb |
(ftp)(http) |
BED |
GSE59801_2_GATA1_gained.bed.gz |
694.5 Kb |
(ftp)(http) |
BED |
GSE59801_2_KLF1_gained.bed.gz |
632.4 Kb |
(ftp)(http) |
BED |
GSE59801_2_NFE2_gained.bed.gz |
512.5 Kb |
(ftp)(http) |
BED |
GSE59801_2_TAL1_gained.bed.gz |
249.5 Kb |
(ftp)(http) |
BED |
GSE59801_3_GATA1_lost.bed.gz |
36.2 Kb |
(ftp)(http) |
BED |
GSE59801_3_KLF1_lost.bed.gz |
19.7 Kb |
(ftp)(http) |
BED |
GSE59801_3_NFE2_lost.bed.gz |
68.5 Kb |
(ftp)(http) |
BED |
GSE59801_3_TAL1_lost.bed.gz |
25.2 Kb |
(ftp)(http) |
BED |
GSE59801_4_GATA1_Q10.bed.gz |
40.0 Kb |
(ftp)(http) |
BED |
GSE59801_4_KLF1_Q10.bed.gz |
30.9 Kb |
(ftp)(http) |
BED |
GSE59801_4_NFE2_Q10.bed.gz |
31.5 Kb |
(ftp)(http) |
BED |
GSE59801_4_TAL1_Q10.bed.gz |
14.7 Kb |
(ftp)(http) |
BED |
GSE59801_G1E.genes.attr_table.txt.gz |
1.3 Mb |
(ftp)(http) |
TXT |
GSE59801_G1E.genes.count_table.txt.gz |
489.7 Kb |
(ftp)(http) |
TXT |
GSE59801_G1E.genes.fpkm_table.txt.gz |
698.7 Kb |
(ftp)(http) |
TXT |
GSE59801_G1E.isoforms.attr_table.txt.gz |
1.7 Mb |
(ftp)(http) |
TXT |
GSE59801_G1E.isoforms.count_table.txt.gz |
827.7 Kb |
(ftp)(http) |
TXT |
GSE59801_G1E.isoforms.fpkm_table.txt.gz |
1023.1 Kb |
(ftp)(http) |
TXT |
GSE59801_GATA1.K562.ProE.bw |
821.4 Mb |
(ftp)(http) |
BW |
GSE59801_GATA1.human.ProE.bw |
1.4 Gb |
(ftp)(http) |
BW |
GSE59801_GATA1.mouse.ProE.bw |
1.2 Gb |
(ftp)(http) |
BW |
GSE59801_K562.genes.attr_table.txt.gz |
776.9 Kb |
(ftp)(http) |
TXT |
GSE59801_K562.genes.count_table.txt.gz |
417.0 Kb |
(ftp)(http) |
TXT |
GSE59801_K562.genes.fpkm_table.txt.gz |
576.7 Kb |
(ftp)(http) |
TXT |
GSE59801_K562.isoforms.attr_table.txt.gz |
1.3 Mb |
(ftp)(http) |
TXT |
GSE59801_K562.isoforms.count_table.txt.gz |
1008.6 Kb |
(ftp)(http) |
TXT |
GSE59801_K562.isoforms.fpkm_table.txt.gz |
1.1 Mb |
(ftp)(http) |
TXT |
GSE59801_K562_GATA1_peaks.bed.gz |
350.0 Kb |
(ftp)(http) |
BED |
GSE59801_K562_GATA1_peaks.txt.gz |
500.9 Kb |
(ftp)(http) |
TXT |
GSE59801_K562_NFE2_peaks.bed.gz |
88.7 Kb |
(ftp)(http) |
BED |
GSE59801_K562_NFE2_peaks.txt.gz |
125.3 Kb |
(ftp)(http) |
TXT |
GSE59801_K562_TAL1_peaks.bed.gz |
372.9 Kb |
(ftp)(http) |
BED |
GSE59801_K562_TAL1_peaks.txt.gz |
558.9 Kb |
(ftp)(http) |
TXT |
GSE59801_K562_dense.bed.gz |
3.6 Mb |
(ftp)(http) |
BED |
GSE59801_KLF1.human.ProE.bw |
370.3 Mb |
(ftp)(http) |
BW |
GSE59801_KLF1.mouse.ProE.bw |
54.7 Mb |
(ftp)(http) |
BW |
GSE59801_NFE2.K562.ProE.bw |
458.1 Mb |
(ftp)(http) |
BW |
GSE59801_NFE2.human.ProE.bw |
340.0 Mb |
(ftp)(http) |
BW |
GSE59801_NFE2.mouse.ProE.bw |
767.7 Mb |
(ftp)(http) |
BW |
GSE59801_README_dataprocessing.txt |
3.0 Kb |
(ftp)(http) |
TXT |
GSE59801_README_reanalyzedGSMs_processedfilenames.txt |
2.9 Kb |
(ftp)(http) |
TXT |
GSE59801_TAL1.K562.ProE.bw |
461.4 Mb |
(ftp)(http) |
BW |
GSE59801_TAL1.human.ProE.bw |
654.3 Mb |
(ftp)(http) |
BW |
GSE59801_TAL1.mouse.ProE.bw |
902.1 Mb |
(ftp)(http) |
BW |
GSE59801_hProE_GATA1_peaks.bed.gz |
748.0 Kb |
(ftp)(http) |
BED |
GSE59801_hProE_GATA1_peaks.txt.gz |
1.1 Mb |
(ftp)(http) |
TXT |
GSE59801_hProE_KLF1_peaks.bed.gz |
666.4 Kb |
(ftp)(http) |
BED |
GSE59801_hProE_KLF1_peaks.txt.gz |
994.8 Kb |
(ftp)(http) |
TXT |
GSE59801_hProE_NFE2_peaks.bed.gz |
584.7 Kb |
(ftp)(http) |
BED |
GSE59801_hProE_NFE2_peaks.txt.gz |
871.1 Kb |
(ftp)(http) |
TXT |
GSE59801_hProE_TAL1_peaks.bed.gz |
267.4 Kb |
(ftp)(http) |
BED |
GSE59801_hProE_TAL1_peaks.txt.gz |
401.6 Kb |
(ftp)(http) |
TXT |
GSE59801_hProE_dense.bed.gz |
3.2 Mb |
(ftp)(http) |
BED |
GSE59801_human.genes.attr_table.txt.gz |
698.8 Kb |
(ftp)(http) |
TXT |
GSE59801_human.genes.count_table.txt.gz |
1.0 Mb |
(ftp)(http) |
TXT |
GSE59801_human.genes.fpkm_table.txt.gz |
1.3 Mb |
(ftp)(http) |
TXT |
GSE59801_human.isoforms.attr_table.txt.gz |
1.3 Mb |
(ftp)(http) |
TXT |
GSE59801_human.isoforms.count_table.txt.gz |
4.2 Mb |
(ftp)(http) |
TXT |
GSE59801_human.isoforms.fpkm_table.txt.gz |
4.4 Mb |
(ftp)(http) |
TXT |
GSE59801_mProE_GATA1_peaks.bed.gz |
146.1 Kb |
(ftp)(http) |
BED |
GSE59801_mProE_GATA1_peaks.txt.gz |
221.0 Kb |
(ftp)(http) |
TXT |
GSE59801_mProE_KLF1_peaks.bed.gz |
191.4 Kb |
(ftp)(http) |
BED |
GSE59801_mProE_KLF1_peaks.txt.gz |
265.8 Kb |
(ftp)(http) |
TXT |
GSE59801_mProE_NFE2_peaks.bed.gz |
328.6 Kb |
(ftp)(http) |
BED |
GSE59801_mProE_NFE2_peaks.txt.gz |
420.1 Kb |
(ftp)(http) |
TXT |
GSE59801_mProE_TAL1_peaks.bed.gz |
77.5 Kb |
(ftp)(http) |
BED |
GSE59801_mProE_TAL1_peaks.txt.gz |
115.9 Kb |
(ftp)(http) |
TXT |
GSE59801_mProE_dense.bed.gz |
2.7 Mb |
(ftp)(http) |
BED |
GSE59801_mouse.genes.attr_table.txt.gz |
672.6 Kb |
(ftp)(http) |
TXT |
GSE59801_mouse.genes.count_table.txt.gz |
697.9 Kb |
(ftp)(http) |
TXT |
GSE59801_mouse.genes.fpkm_table.txt.gz |
868.1 Kb |
(ftp)(http) |
TXT |
GSE59801_mouse.isoforms.attr_table.txt.gz |
1.1 Mb |
(ftp)(http) |
TXT |
GSE59801_mouse.isoforms.count_table.txt.gz |
1.9 Mb |
(ftp)(http) |
TXT |
GSE59801_mouse.isoforms.fpkm_table.txt.gz |
2.0 Mb |
(ftp)(http) |
TXT |
Processed data are available on Series record |