|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Dec 05, 2020 |
Title |
ArchR: An integrative and scalable software package for single-cell chromatin accessibility analysis |
Organism |
Homo sapiens |
Experiment type |
Genome binding/occupancy profiling by high throughput sequencing
|
Summary |
The advent of large-scale single-cell chromatin accessibility profiling has accelerated our ability to map gene regulatory landscapes, but has outpaced the development of scalable software to rapidly extract biological meaning from these data. Here we present a software suite for single-cell analysis of regulatory chromatin in R (ArchR; www.ArchRProject.com) that enables fast and comprehensive analysis of single-cell chromatin accessibility data. ArchR provides an intuitive, user-focused interface for complex single-cell analyses including doublet removal, single-cell clustering and cell type identification, unified peak set generation, cellular trajectory identification, DNA element to gene linkage, transcription factor footprinting, mRNA expression level prediction from chromatin accessibility, and multi-omic integration with scRNA-seq. Enabling the analysis of over 1.2 million single cells within 8 hours on a standard Unix laptop, ArchR is a comprehensive analytical suite for end-to-end analysis of single-cell chromatin accessibility data that will accelerate the understanding of gene regulation at the resolution of individual cells.
|
|
|
Overall design |
10x Genomics single-nucleus ATAC-seq and bulk ATAC-seq data obtained from 10 different cell lines. These cell lines include K562, Jurkat, THP1, GM12878, MCF7, MCF10A, HeLa, HT1080, T24, and 293T. In the context of 10x Genomics snATAC-seq, all cell lines were mixed together to enable detection of multiplet droplets based on genotype.
|
|
|
Contributor(s) |
Granja J, Corces R |
Citation(s) |
33633365 |
|
Submission date |
Dec 04, 2020 |
Last update date |
Mar 06, 2021 |
Contact name |
Michael Ryan Corces |
E-mail(s) |
ryan.corces@gladstone.ucsf.edu
|
Organization name |
Gladstone Institute
|
Department |
Neurological Disease
|
Lab |
Corces Lab
|
Street address |
1650 Owens Street
|
City |
San Francisco |
State/province |
CA |
ZIP/Postal code |
94158 |
Country |
USA |
|
|
Platforms (2) |
GPL20301 |
Illumina HiSeq 4000 (Homo sapiens) |
GPL24676 |
Illumina NovaSeq 6000 (Homo sapiens) |
|
Samples (22)
|
|
Relations |
BioProject |
PRJNA682619 |
SRA |
SRP295963 |
Supplementary file |
Size |
Download |
File type/resource |
GSE162690_CellLine_HighLoading-MotifMatrix-SE.rds.gz |
182.5 Mb |
(ftp)(http) |
RDS |
GSE162690_CellLine_HighLoading-PeakMatrix-SE.rds.gz |
154.6 Mb |
(ftp)(http) |
RDS |
GSE162690_CellLine_HighLoading.fragments.tsv.gz |
1.4 Gb |
(ftp)(http) |
TSV |
GSE162690_CellLine_LowLoading-MotifMatrix-SE.rds.gz |
127.1 Mb |
(ftp)(http) |
RDS |
GSE162690_CellLine_LowLoading-PeakMatrix-SE.rds.gz |
125.9 Mb |
(ftp)(http) |
RDS |
GSE162690_CellLine_LowLoading.fragments.tsv.gz |
756.3 Mb |
(ftp)(http) |
TSV |
GSE162690_demuxlet_CellLineHigh_mergedBAMs.best.txt.gz |
3.4 Mb |
(ftp)(http) |
TXT |
GSE162690_demuxlet_CellLineLow_mergedBAMs.best.txt.gz |
2.3 Mb |
(ftp)(http) |
TXT |
SRA Run Selector |
Raw data are available in SRA |
Processed data are available on Series record |
|
|
|
|
|