NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE254088 Query DataSets for GSE254088
Status Public on Aug 06, 2024
Title Noncoding Mendelian epigenomics - single cell CUT&Tag mouse cMN data
Organism Mus musculus
Experiment type Genome binding/occupancy profiling by high throughput sequencing
Summary Although Mendelian disorders are overwhelmingly attributed to protein-coding pathogenic variants, a majority of unsolved cases do not harbor obvious causal pathogenic variants in the coding sequence, suggesting a potential non-coding etiology. However, classification of pathogenicity in non-coding sequence remains prohibitive due to a vastly increased search space and the lack of a standardized rubric for interpretation. Here, we present an integrated single cell multiomic framework to nominate pathogenic non-coding variants for the congenital cranial dysinnervation disorders (CCDDs). The CCDDs are Mendelian neurodevelopmental disorders that result from aberrant development of cranial motor neurons in the embryonic brainstem. We created a non-coding reference atlas of single cell chromatin accessibility profiles for 86,089 embryonic mouse cranial motor neurons (cMNs). We found that high-quality single cell ATAC-seq (scATAC) profiles alone were a strong predictor of enhancement (64% in vivo validation rate). To further aid in interpretation, we integrated single cell histone modification and gene expression information to distinguish individual enhancers and their cognate genes. Relatively subtle differences in cellular composition of input data often led to substantial differences in predicted enhancer strength, cognate gene, and tissue of activity. Next, we mapped candidate non-coding variants from 899 whole genome sequences from 270 CCDD pedigrees to the murine cMN-specific regulatory elements and trained a machine learning classifier to accurately predict the functional effects of patient variants within these elements. We then performed high coverage scATACseq and site-specific footprinting analysis on an allelic series of CRISPR-humanised mice to validate our machine learning predictions and render important clues to the mode of pathogenicity. Finally, we performed peak- and gene-centric allelic aggregation to nominate non-coding variants, including those regulating MN1 and EBF3, respectively. Altogether this work extends non-coding variant analysis to Mendelian disease and presents a generalizable framework for nominating novel non-coding variants in other rare disorders.
 
Overall design Single cell CUT&Tag of mouse developing cranial motor neurons at e10.5 and e11.5 using antibody against H3K27Ac histone modification (ab177178)
 
Contributor(s) Engle E, Lee A
Citation(s) 39333082
Submission date Jan 24, 2024
Last update date Oct 15, 2024
Contact name Elizabeth C Engle
E-mail(s) elizabeth.engle@childrens.harvard.edu
Phone 6179194030
Organization name Boston Childrens Hospital
Street address 3 Blackfan Circle
City Boston
ZIP/Postal code 02115
Country USA
 
Platforms (1)
GPL19057 Illumina NextSeq 500 (Mus musculus)
Samples (8)
GSM8033215 CN34_e11.5_R1
GSM8033216 CN34_e11.5_R2
GSM8033217 CN6_e11.5_R1
This SubSeries is part of SuperSeries:
GSE254090 A cell type-aware framework for nominating non-coding variants in Mendelian regulatory disorders
Relations
BioProject PRJNA1068537

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE254088_RAW.tar 1004.6 Mb (http)(custom) TAR (of BIGWIG)
SRA Run SelectorHelp
Raw data are available in SRA

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap