Toward a universal microarray: prediction of gene expression through nearest-neighbor probe sequence identification

Nucleic Acids Res. 2007;35(15):e99. doi: 10.1093/nar/gkm549. Epub 2007 Aug 7.

Abstract

A generic DNA microarray design applicable to any species would greatly benefit comparative genomics. We have addressed the feasibility of such a design by leveraging the great feature densities and relatively unbiased nature of genomic tiling microarrays. Specifically, we first divided each Homo sapiens Refseq-derived gene's spliced nucleotide sequence into all of its possible contiguous 25 nt subsequences. For each of these 25 nt subsequences, we searched a recent human transcript mapping experiment's probe design for the 25 nt probe sequence having the fewest mismatches with the subsequence, but that did not match the subsequence exactly. Signal intensities measured with each gene's nearest-neighbor features were subsequently averaged to predict their gene expression levels in each of the experiment's thirty-three hybridizations. We examined the fidelity of this approach in terms of both sensitivity and specificity for detecting actively transcribed genes, for transcriptional consistency between exons of the same gene, and for reproducibility between tiling array designs. Taken together, our results provide proof-of-principle for probing nucleic acid targets with off-target, nearest-neighbor features.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Gene Expression Profiling / methods*
  • Genome, Human
  • Humans
  • Oligonucleotide Array Sequence Analysis / methods*
  • Oligonucleotide Probes / chemistry*
  • Sequence Analysis, DNA
  • Transcription, Genetic

Substances

  • Oligonucleotide Probes