Whole-genome analysis of noncoding genetic variations identifies multiscale regulatory element perturbations associated with Hirschsprung disease

Genome Res. 2020 Nov;30(11):1618-1632. doi: 10.1101/gr.264473.120. Epub 2020 Sep 18.

Abstract

It is widely recognized that noncoding genetic variants play important roles in many human diseases, but there are multiple challenges that hinder the identification of functional disease-associated noncoding variants. The number of noncoding variants can be many times that of coding variants; many of them are not functional but in linkage disequilibrium with the functional ones; different variants can have epistatic effects; different variants can affect the same genes or pathways in different individuals; and some variants are related to each other not by affecting the same gene but by affecting the binding of the same upstream regulator. To overcome these difficulties, we propose a novel analysis framework that considers convergent impacts of different genetic variants on protein binding, which provides multiscale information about disease-associated perturbations of regulatory elements, genes, and pathways. Applying it to our whole-genome sequencing data of 918 short-segment Hirschsprung disease patients and matched controls, we identify various novel genes not detected by standard single-variant and region-based tests, functionally centering on neural crest migration and development. Our framework also identifies upstream regulators whose binding is influenced by the noncoding variants. Using human neural crest cells, we confirm cell stage-specific regulatory roles of three top novel regulatory elements on our list, respectively in the RET, RASGEF1A, and PIK3C2B loci. In the PIK3C2B regulatory element, we further show that a noncoding variant found only in the patients affects the binding of the gliogenesis regulator NFIA, with a corresponding up-regulation of multiple genes in the same topologically associating domain.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Class II Phosphatidylinositol 3-Kinases / genetics
  • Class II Phosphatidylinositol 3-Kinases / metabolism
  • Enhancer Elements, Genetic*
  • Genetic Variation
  • Hirschsprung Disease / genetics*
  • Humans
  • Introns
  • NFI Transcription Factors / metabolism
  • Promoter Regions, Genetic*
  • Proto-Oncogene Proteins c-ret / genetics
  • Whole Genome Sequencing
  • ras Guanine Nucleotide Exchange Factors / genetics

Substances

  • NFI Transcription Factors
  • NFIA protein, human
  • RASGEF1A protein, human
  • ras Guanine Nucleotide Exchange Factors
  • Class II Phosphatidylinositol 3-Kinases
  • PIK3C2B protein, human
  • Proto-Oncogene Proteins c-ret
  • RET protein, human