NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE254084 Query DataSets for GSE254084
Status Public on Aug 06, 2024
Title Noncoding Mendelian epigenomics - bulk RNA mouse cMN data
Organism Mus musculus
Experiment type Expression profiling by high throughput sequencing
Summary Although Mendelian disorders are overwhelmingly attributed to protein-coding pathogenic variants, a majority of unsolved cases do not harbor obvious causal pathogenic variants in the coding sequence, suggesting a potential non-coding etiology. However, classification of pathogenicity in non-coding sequence remains prohibitive due to a vastly increased search space and the lack of a standardized rubric for interpretation. Here, we present an integrated single cell multiomic framework to nominate pathogenic non-coding variants for the congenital cranial dysinnervation disorders (CCDDs). The CCDDs are Mendelian neurodevelopmental disorders that result from aberrant development of cranial motor neurons in the embryonic brainstem. We created a non-coding reference atlas of single cell chromatin accessibility profiles for 86,089 embryonic mouse cranial motor neurons (cMNs). We found that high-quality single cell ATAC-seq (scATAC) profiles alone were a strong predictor of enhancement (64% in vivo validation rate). To further aid in interpretation, we integrated single cell histone modification and gene expression information to distinguish individual enhancers and their cognate genes. Relatively subtle differences in cellular composition of input data often led to substantial differences in predicted enhancer strength, cognate gene, and tissue of activity. Next, we mapped candidate non-coding variants from 899 whole genome sequences from 270 CCDD pedigrees to the murine cMN-specific regulatory elements and trained a machine learning classifier to accurately predict the functional effects of patient variants within these elements. We then performed high coverage scATACseq and site-specific footprinting analysis on an allelic series of CRISPR-humanised mice to validate our machine learning predictions and render important clues to the mode of pathogenicity. Finally, we performed peak- and gene-centric allelic aggregation to nominate non-coding variants, including those regulating MN1 and EBF3, respectively. Altogether this work extends non-coding variant analysis to Mendelian disease and presents a generalizable framework for nominating novel non-coding variants in other rare disorders.
 
Overall design Bulk RNAseq of mouse developing cranial motor neurons at e10.5 and e11.5
 
Contributor(s) Engle E, Lee A
Citation missing Has this study been published? Please login to update or notify GEO.
Submission date Jan 24, 2024
Last update date Aug 07, 2024
Contact name Elizabeth C Engle
E-mail(s) elizabeth.engle@childrens.harvard.edu
Phone 6179194030
Organization name Boston Childrens Hospital
Street address 3 Blackfan Circle
City Boston
ZIP/Postal code 02115
Country USA
 
Platforms (1)
GPL17021 Illumina HiSeq 2500 (Mus musculus)
Samples (15)
GSM8033156 bulkRNA_cMN3_Isl1-pos_E10-5A
GSM8033157 bulkRNA_cMN3_Isl1-pos_E10-5B
GSM8033158 bulkRNA_cMN3_Isl1-pos_E11-5A
This SubSeries is part of SuperSeries:
GSE254090 A cell type-aware framework for nominating non-coding variants in Mendelian regulatory disorders
Relations
BioProject PRJNA1068533

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE254084_RAW.tar 15.2 Mb (http)(custom) TAR (of RESULTS)
SRA Run SelectorHelp
Raw data are available in SRA

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap