Chromosomal regions containing high-density and ambiguously mapped putative single nucleotide polymorphisms (SNPs) correlate with segmental duplications in the human genome

Hum Mol Genet. 2002 Aug 15;11(17):1987-95. doi: 10.1093/hmg/11.17.1987.

Abstract

We have explored the National Center for Biotechnology Information (NCBI) single nucleotide polymorphisms (SNPs) database for a correlation between the density of putative SNPs, as well as SNPs that map to different chromosomal locations (ambiguously mapped SNPs), and segmental duplications of DNA in chromosome regions involved in genomic disorders. A high density of SNPs (14.4 and 12.4 SNPs per kb) was detected in the low copy repeats (LCRs) responsible for the chromosome 17p12 duplication and deletion that cause peripheral neuropathies. None of the SNPs at the PMP22 gene were ambiguously mapped, but 93% of the SNPs at LCRs mapped on both LCR copies, indicating that they are in fact variants in paralogous sequences. Similarly, a high SNP density was found in the LCR regions flanking the neurofibromatosis type 1 (NF1) gene, with 80% of SNPs mapping on both LCR copies. A high density of SNPs was found within LCR sequences involved in the deletions that mediate contiguous gene syndromes on chromosomes 7q11, 15q11-q13 and 22q11. We have analyzed the whole sequence of chromosome 22, which contains 14% of ambiguously mapped SNPs, and have found a good correlation between these SNPs and segmental duplications detected by BLAST analysis. We have identified several segments of ambiguously mapped SNPs, four corresponding to LCRs involved in the chromosome 22q11 microdeletion syndromes. Our data indicate that most SNPs in LCR segments are in fact paralogous sequence variants (PSVs), and suggest that a significant proportion of the SNPs in the NCBI database correspond to PSVs within segmental duplications of the human genome sequence.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Centromere / genetics
  • Chromosome Mapping*
  • Chromosomes, Human, Pair 22 / genetics*
  • Computational Biology
  • Contig Mapping / trends
  • Databases, Factual
  • Exons
  • Gene Deletion
  • Gene Duplication*
  • Genome, Human*
  • Humans
  • Polymorphism, Single Nucleotide*