AlleleSeq: analysis of allele-specific expression and binding in a network framework

Mol Syst Biol. 2011 Aug 2;7:522. doi: 10.1038/msb.2011.54.

Abstract

To study allele-specific expression (ASE) and binding (ASB), that is, differences between the maternally and paternally derived alleles, we have developed a computational pipeline (AlleleSeq). Our pipeline first constructs a diploid personal genome sequence (and corresponding personalized gene annotation) using genomic sequence variants (SNPs, indels, and structural variants), and then identifies allele-specific events with significant differences in the number of mapped reads between maternal and paternal alleles. We address many technical challenges in constructing a personal diploid genome sequence and aligning reads to it, for example, the bias of reads toward mapping to the reference allele. We have applied AlleleSeq to variation data for NA12878 from the 1000 Genomes Project as well as matched, deeply sequenced RNA-Seq and ChIP-Seq data sets generated for this purpose. In addition to observing fairly widespread allele-specific behavior within individual functional genomic data sets (including results consistent with X-chromosome inactivation), we can study the interaction between ASE and ASB. Furthermore, we investigate the coordination between ASE and the ASB events of multiple transcription factors using a regulatory network framework. Correlation analyses and network motifs show mostly coordinated ASB and ASE.
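
The idea of calling an allele-specific event from "significant differences in the number of mapped reads between maternal and paternal alleles" can be illustrated with a small sketch. The abstract does not state which statistical test the pipeline uses, so the exact two-sided binomial test against an expected 50:50 ratio shown here is an assumption, as are the function names, the input layout, and the significance threshold. The sketch also omits multiple-testing correction, which a genome-wide analysis would need.

```python
from math import exp, lgamma, log


def binomial_two_sided_p(maternal: int, paternal: int, p: float = 0.5) -> float:
    """Exact two-sided binomial p-value for the maternal read count.

    Assumption: under the null hypothesis, each read covering a
    heterozygous site comes from the maternal allele with probability p.
    """
    n = maternal + paternal
    if n == 0:
        return 1.0

    def pmf(k: int) -> float:
        # Work in log space so large read depths do not overflow.
        log_coeff = lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
        return exp(log_coeff + k * log(p) + (n - k) * log(1 - p))

    probs = [pmf(k) for k in range(n + 1)]
    observed = probs[maternal]
    # Sum every outcome that is at most as likely as the observed one;
    # the small tolerance absorbs floating-point noise between tied outcomes.
    total = sum(q for q in probs if q <= observed * (1 + 1e-9))
    return min(1.0, total)


def allele_specific_sites(counts: dict, alpha: float = 0.05) -> list:
    """Return the site IDs whose allelic imbalance has p-value below alpha.

    `counts` maps a hypothetical site ID to (maternal_reads, paternal_reads).
    No multiple-testing correction is applied in this sketch.
    """
    return [
        site
        for site, (mat, pat) in counts.items()
        if binomial_two_sided_p(mat, pat) < alpha
    ]


if __name__ == "__main__":
    # Made-up read counts, for illustration only.
    example = {"snp_a": (30, 5), "snp_b": (12, 11)}
    for site, (mat, pat) in example.items():
        print(site, mat, pat, binomial_two_sided_p(mat, pat))
    print(allele_specific_sites(example))
```

In this toy input, a site with a strong skew toward one parental allele is flagged while a nearly balanced site is not. The test is only meaningful if the read counts are free of reference-allele mapping bias, which is the motivation the abstract gives for aligning to a personal diploid genome.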

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles*
  • Cell Line
  • Chromosome Mapping
  • Chromosomes, Human, X / genetics
  • Chromosomes, Human, Y / genetics
  • DNA-Binding Proteins / genetics*
  • DNA-Binding Proteins / metabolism
  • Databases, Genetic
  • Gene Expression Regulation
  • Gene Regulatory Networks*
  • Genome, Human
  • Humans
  • Molecular Sequence Annotation
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Single Nucleotide
  • Sequence Analysis, RNA*
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • DNA-Binding Proteins
  • Transcription Factors