The chip has approximately 2.5 million features covering four types of probe sets: custom SNP probe sets, 3’ expression probe sets from the Affymetrix Drosophila expression array 2.0 (all perfect-match probes), selected probes from the Affymetrix Drosophila tiling array version 2.0 (exonic perfect-match probes), and standard Affymetrix negative control probe sets. Custom SNP probe sets were designed for genotyping and ASE assays. Each SNP probe set includes 24 probes, corresponding to the four possible bases at the SNP site, the forward and reverse strands, and three positions of the SNP relative to the central base of the probe (4 × 2 × 3 = 24). For SNP identification, alignment sets were created from multiple sequence sources, including FlyBase R5.4 exons (68,536), six DPGP D. simulans strain genomes (assembled against the R4.2 D. melanogaster genome), and all GenBank D. simulans sequences (343,420) that were not annotated as “whole genome”. For each R5.4 exon, the genome location in R4.2 was determined by BLAST alignment to the D. melanogaster R4.2 genome. For exons with multiple hits, the location was taken to be that of the longest alignment. Exons that mapped to more than one location or to chromosomes 4 or U were excluded. The remaining unique exons were aligned to the DPGP genomes and GenBank sequences using BLAST (ref). All sequences were then aligned using ClustalW (cite), creating a multiple sequence alignment for each exon at its genome position, from which SNPs were identified. SNPs were selected by quality criteria based on their supporting sequences. A SNP was discarded if fewer than 5 sequences supported it, or if more than one SNP occurred in the design window, which made the alignment suspect. Genome position was not considered in SNP selection; the selected SNPs could be located at any position in an exon. There were 566 exons for which SNP data were identified from GenBank alone, and 52,188 exons for which SNP data were identified from DPGP alone.
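The two quality criteria above can be illustrated with a minimal Python sketch. It is not the pipeline used for the chip design. The function names, the alignment representation (a list of equal-length strings with the D. melanogaster reference first and "-" for gaps), and the reading of "supporting sequences" as the number of non-reference sequences with a called base at the site are all assumptions made for illustration. The 17-base flank follows the 35 bp design window described in this section.

```python
BASES = set("ACGT")

def candidate_snps(alignment):
    """Return (column, support) for every column where a sequence differs from the reference.

    Assumption: alignment[0] is the reference; support is the number of other
    sequences with a called base (A, C, G or T) at that column.
    """
    ref, others = alignment[0], alignment[1:]
    hits = []
    for pos, ref_base in enumerate(ref):
        if ref_base not in BASES:
            continue  # skip gaps and ambiguous reference bases
        called = [seq[pos] for seq in others if seq[pos] in BASES]
        if any(base != ref_base for base in called):
            hits.append((pos, len(called)))
    return hits

def filter_snps(alignment, min_support=5, flank=17):
    """Keep SNPs with enough support and no other SNP inside their design window."""
    hits = candidate_snps(alignment)
    positions = [pos for pos, _ in hits]
    kept = []
    for pos, support in hits:
        crowded = any(other != pos and abs(other - pos) <= flank for other in positions)
        if support >= min_support and not crowded:
            kept.append(pos)
    return kept
```

Under these assumptions, a site covered by five or more sequences and isolated within its 35-column window is retained, while a site with four supporting sequences, or two variable sites within 17 columns of each other, is dropped.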
Only 357 exons had no identified SNPs in either GenBank or DPGP. At each nucleotide where a D. simulans SNP was present, a 35 bp design window was created, extending 17 bp upstream and 17 bp downstream from the SNP. There were 589,915 design windows created, corresponding to 13,637 genes. Design windows were compared to the D. melanogaster genome (v 4.7, as this is the assembly used for the DPGP data) using BLAST. There were 563,558 design windows found to be unique in the genome. SNPs were discarded if the design window contained multiple SNPs or if the SNP was not biallelic. After this initial selection, there were 196,345 biallelic SNPs in unique design windows with no other SNPs. For each design window, 24 probes were constructed: probes for all 4 bases were designed, 12 each for the forward and reverse strands, with the SNP at the 0, +4, and -4 positions. Probe sets were eliminated if any probe in the probe set contained a homopolymer run or could not be synthesized. The remaining 189,946 probe sets were examined for hybridization quality, predicted using an Affymetrix internal scoring algorithm that takes into account Tm, secondary structure, and previous empirical observations. Probe sets were eliminated if one third or more of the probes had poor predicted hybridization. For genes with 7 or fewer SNPs, all SNPs were selected. Finally, for genes with more than 7 SNPs, many of which had high coverage (more than 4 lines supporting the SNP), the probe sets with the best predicted hybridization were selected. This yielded 61,142 probe sets. To fill the chip, an additional 610 probe sets were selected at random from the genes with more than 7 SNPs per gene, for a total of 61,752 probe sets. Overall, SNP probe sets were designed for 11,946 genes in R5.26 (12,385 genes in R5.4, the annotation at the time of chip design), covering 87% of the known transcriptome.
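The layout of the 24 probes per design window can be sketched in Python as follows. This is an illustration, not the design software used for the chip. The function and field names are hypothetical, and the 25-base probe length is an assumption (the probe length is not stated in this section). The 35 bp window, the four bases, the two strands, and the 0, +4, and -4 SNP positions are taken from the text above.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def build_probe_set(window, probe_len=25, offsets=(-4, 0, 4)):
    """Enumerate 4 bases x 3 SNP positions x 2 strands = 24 probes for one window.

    Assumptions: `window` is 35 bases with the SNP at index 17, and probes
    are `probe_len` bases long (25 here, not stated in the source).
    """
    if len(window) != 35:
        raise ValueError("expected a 35 bp design window")
    snp = len(window) // 2   # index 17, the SNP site
    half = probe_len // 2    # index of the central base within a probe
    probes = []
    for base, offset, strand in product("ACGT", offsets, "+-"):
        # Substitute the interrogated base at the SNP site.
        allele_window = window[:snp] + base + window[snp + 1:]
        # Place the probe so the SNP sits `offset` bases from its central base.
        start = snp - offset - half
        seq = allele_window[start:start + probe_len]
        if strand == "-":
            seq = reverse_complement(seq)
        probes.append({"base": base, "offset": offset, "strand": strand, "seq": seq})
    return probes
```

With a 25-base probe, the most extreme placements start at window positions 1 and 9, so every probe fits within the 17-base flanks on either side of the SNP.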
Also printed on the chip were all the perfect-match (PM) probes from the 3’ IVT Affymetrix Drosophila 2.0 array (n=262,766); all tiling probes in exons (FlyBase 5.11, n=699,856) from the Affymetrix Drosophila 2.0 tiling array; GC band controls (n=16,943); and hybridization and labeling controls (n=1,644). In total, 2,463,630 probes are on the chip. The probes from the 3’ IVT array allow for clear gene-level detection calls and are referred to as “3’ expression probes” in the following text. The tiling probes provide controls for measuring signal fluctuation caused by 5’ bias in expression assays, as well as for measuring alternative exon usage, and are referred to as “exon probes” hereafter.