NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE16544 Query DataSets for GSE16544
Status Public on Jan 24, 2010
Title Systematic planning of genome-scale experiments in poorly studied species.
Platform organisms Saccharomyces bayanus; Saccharomyces cerevisiae
Sample organism Saccharomyces bayanus
Experiment type Expression profiling by array
Summary Genome-scale datasets have been used extensively in model organisms to screen for specific candidates or to predict functions for uncharacterized genes. However, despite the availability of extensive knowledge in model organisms, the planning of genome-scale experiments in poorly studied species is still based on the intuition of experts or heuristic trials. We propose that computational and systematic approaches can be applied to drive the experiment planning process in poorly studied species based on available data and knowledge in closely related model organisms. In this paper, we suggest a computational strategy for recommending genome-scale experiments based on their capability to interrogate diverse biological processes to enable protein function assignment. To this end, we use the data-rich functional genomics compendium of the model organism to quantify the accuracy of each dataset in predicting each specific biological process and the overlap in such coverage between different datasets. Our approach uses an optimized combination of these quantifications to recommend an ordered list of experiments for accurately annotating most proteins in the poorly studied related organisms to most biological processes, as well as a set of experiments that target each specific biological process. The effectiveness of this experiment- planning system is demonstrated for two related yeast species: the model organism Saccharomyces cerevisiae and the comparatively poorly studied Saccharomyces bayanus. Our system recommended a set of S. bayanus experiments based on an S. cerevisiae microarray data compendium. In silico evaluations estimate that less than 10% of the experiments could achieve similar functional coverage to the whole microarray compendium. This estimation was confirmed by performing the recommended experiments in S. bayanus, therefore significantly reducing the labor devoted to characterize the poorly studied genome. This experiment-planning framework could readily be adapted to the design of other types of large-scale experiments as well as other groups of organisms.
 
Overall design This dataset contains 53 experiments as follows:
(experiment: number of datasets (number of arrays))

growth at different temperatures: 1 (3)
heat shock: 4 (18)
ammonium: 1 (6)
cadmium: 1 (4)
copper: 1 (6)
lead: 1 (6)
nickel: 1 (5)
sulfite toxicity: 1 (6)
zinc: 1 (6)
ethanol toxicity: 1 (6)
sorbitol: 1 (6)
bleach: 1 (6)
hydrogen peroxide: 3 (17)
2-deoxyglucose: 1 (6)
hydroxyurea: 1 (6)
lovastatin: 1 (4)
MG-132: 1 (5)
MMS: 1 (6)
rapamycin: 1 (6)
tunicamycin: 1 (6)
zeocin: 1 (6)
chronological aging: 3 (13)
diauxic shift: 1 (6)
galactose: 5 (38)
glycerol: 1 (4)
sucrose: 1 (4)
auxotroph starvation: 1 (11)
nutrient limited chemostat growth: 3 (7)
mating type and ploidy: 1 (3)
alpha factor: 1 (8)
cell cycle: 1 (30)
sporulation: 3 (18)
strain backgrounds: 1 (4)
cross progeny: 1 (22)
Tn7 insertions: 1 (27)
555.11 and 670.20 knockout tetrads: 2 (8)

Timecourse datasets:

heat shock, ammonium, cadmium, copper, lead, nickel, sulfite toxicity, zinc, ethanol toxicity, sorbitol, bleach, hydrogen peroxide, 2-deoxyglucose, hydroxyurea, lovastatin, MG-132, MMS, rapamycin, tunicamycin, zeocin, chronological aging, diauxic shift, galactose, glycerol, sucrose, auxotroph starvation, alpha factor, cell cycle, sporulation

Single timepoint datasets:

growth at different temperatures, strain backgrounds, cross progeny, Tn7 insertions, nutrient limited chemostat growth, mating type and ploidy, 555.11 and 670.20 knockout tetrads

Almost all samples were hybridized versus a common reference prepared from a mixture of RNA from Mat a, Matx, and Mata/x cells sampled in both exponential and stationary phase. Additionally, RNA from stress conditions was included: hydrogen peroxide treatment sampled at 10, 30 and 45 minutes, and heat shock from 25 to 37 degrees sampled at 10 and 30 minutes.

Samples from the following datasets were not hybridized versus the common reference (reference used in parenthesis and in array annotations): cell cycle (asynchronous culture), constant temperatures (log phase culture), mating type and ploidy (log phase culture), diauxic shift (log phase culture), and strain backgrounds (log phase culture).

Replicates and dye swaps were not used.
 
Citation(s) 20221257, 21942931, 23324262
Submission date Jun 10, 2009
Last update date Aug 14, 2019
Contact name Maitreya J. Dunham
E-mail(s) maitreya@uw.edu
Phone 206-543-2338
Organization name University of Washington
Department Genome Sciences
Lab Dunham Lab
Street address Foege Building, S403B, Box 355065
City Seattle
State/province WA
ZIP/Postal code 98195-5065
Country USA
 
Platforms (7)
GPL8493 Lewis-Sigler Institute Saccharomyces 13K [20071114MD_PMC]
GPL8665 Lewis-Sigler Institute Saccharomyces 12K [20060831_PMC]
GPL8666 Lewis-Sigler Institute Saccharomyces 12K [20061110_PMC]
Samples (343)
GSM415438 B20 bayanus grown at 20 C
GSM415439 B25 bayanus grown at 25 C
GSM415440 B30 bayanus grown at 30 C
Relations
BioProject PRJNA116169

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE16544_RAW.tar 348.2 Mb (http)(custom) TAR (of GPR)
Processed data included within Sample table

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap