Series GSE42871
Status Public on Mar 18, 2013
Title Using RNA-Seq to Profile Soybean Seed Development from Fertilization to Maturity
Organism Glycine max
Experiment type Expression profiling by high throughput sequencing
Summary To understand gene expression networks leading to functional properties and compositional traits of the soybean seed, we have undertaken a detailed examination of soybean seed development from a few days post-fertilization to the mature seed using Illumina high-throughput transcriptome sequencing (RNA-Seq). RNA was sequenced from seven different stages of seed development, yielding between 12 million and 78 million sequenced transcripts. These have been aligned to the 79,000 gene models predicted from the soybean genome recently sequenced by the Department of Energy Joint Genome Institute. Over one hundred gene models were identified with high expression exclusively in young seed stages, starting at just four days after fertilization. These were annotated as being related to many basic components and processes such as histones and proline-rich proteins. Genes involved with some storage proteins such as glycinin and beta-conglycinin had their highest expression levels at the stages of largest fresh weight, confirming previous knowledge that these storage products are being rapidly accumulated before the seed begins the desiccation process. Other gene models showed high expression in the dry, mature seeds, perhaps indicating the preparation of pathways needed later, in the early stages of imbibition. Many highly-expressed gene models at the dry seed stage are, as expected, annotated as hydrophilic proteins associated with low water conditions, such as late embryogenesis abundant (LEA) proteins and dehydrins, which help preserve the cellular structures and nutrients within the seed during desiccation. Hundreds of transcription factors with notable expression in at least one stage of seed development were also identified and examined. Results from a second biological replicate demonstrate high reproducibility of these data.
 
Overall design High-throughput sequencing using Illumina Genome Analyzer II and Illumina HiSeq 2000 (RNA-Seq) was performed on seven stages of soybean seeds, with two biological replicates per stage.
 
Contributor(s) Jones SI, Vodkin LO
Citation(s) 23555009, 25635113
Submission date Dec 12, 2012
Last update date May 15, 2019
Contact name Lila O. Vodkin
E-mail(s) l-vodkin@illinois.edu
Phone 217-244-6147
Organization name University of Illinois
Department Crop Sciences
Lab Lila Vodkin
Street address 1201 W. Gregory Dr.
City Urbana
State/province IL
ZIP/Postal code 61801
Country USA
 
Platforms (2)
GPL11192 Illumina Genome Analyzer II (Glycine max)
GPL15008 Illumina HiSeq 2000 (Glycine max)
Samples (16)
GSM1056094 R08_4DAF_BR1
GSM1056095 R83_4DAF_BR2
GSM1056096 R09_12-14DAF_BR1
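The sample titles listed above appear to follow a run_stage_replicate pattern. The sketch below splits a title into those parts. The field names, the reading of "BR" as biological replicate, and the assumption that every title in the series follows this pattern are inferred from the three titles shown here, not stated in the record.

```python
import re
from typing import NamedTuple


class SampleLabel(NamedTuple):
    """Hypothetical container for the parts of a sample title."""
    run_id: str      # e.g. "R08" (assumed to be a run or library identifier)
    stage: str       # e.g. "4DAF" or "12-14DAF" (stage label, kept verbatim)
    replicate: int   # e.g. 1 for "BR1" (assumed: biological replicate number)


# Assumed layout: <run>_<stage>_BR<n>, inferred from the titles shown above.
_TITLE_PATTERN = re.compile(r"^(?P<run>R\d+)_(?P<stage>.+)_BR(?P<rep>\d+)$")


def parse_sample_title(title: str) -> SampleLabel:
    """Split a title such as 'R09_12-14DAF_BR1' into run, stage, replicate."""
    match = _TITLE_PATTERN.match(title.strip())
    if match is None:
        raise ValueError(f"Title does not follow the assumed pattern: {title!r}")
    return SampleLabel(match["run"], match["stage"], int(match["rep"]))


if __name__ == "__main__":
    for title in ("R08_4DAF_BR1", "R83_4DAF_BR2", "R09_12-14DAF_BR1"):
        print(parse_sample_title(title))
```

Titles that do not match raise a ValueError rather than being guessed at, since the remaining samples in the series are not shown here and may be named differently.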
Relations
BioProject PRJNA184365
SRA SRP017638

Download family Format
SOFT formatted family file(s) SOFT
MINiML formatted family file(s) MINiML
Series Matrix File(s) TXT

Supplementary file Size Download File type/resource
GSE42871_RAW.tar 10.6 Mb (http)(custom) TAR (of TXT)
SRA Run Selector
Raw data are available in SRA
Processed data provided as supplementary file
