NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE76705 Query DataSets for GSE76705
Status Public on Jan 12, 2016
Title Complex Disease Subtypes Identified by Network-Based Clustering of Gene Expression Data: Application to COPD
Organism Homo sapiens
Experiment type Expression profiling by array
Third-party reanalysis
Summary One of the most common smoking-related diseases, chronic obstructive pulmonary disease (COPD), results from a dysregulated, multi-tissue inflammatory response to cigarette smoke. We hypothesized that systemic inflammatory signals in genome-wide blood gene expression can identify clinically important COPD-related disease subtypes, and we leveraged pre-existing gene interaction networks to guide unsupervised clustering of blood microarray expression data. Using network-informed non-negative matrix factorization, we analyzed genome-wide blood gene expression from 229 former smokers in the ECLIPSE Study, and we identified novel, clinically relevant molecular subtypes of COPD. These network-informed clusters were more stable and more strongly associated with measures of lung structure and function than clusters derived from a network-naïve approach, and they were associated with subtype-specific enrichment for inflammatory and protein catabolic pathways. These clusters were successfully reproduced in an independent sample of 135 smokers from the COPDGene Study.
Briefly, gene expression was derived from whole blood samples in ECLIPSE subjects and peripheral blood mononuclear cells (PBMCs) for the COPDGene subjects. Gene expression profiling was performed using the Affymetrix Human U133 Plus2 array. Gene expression data were log-transformed, and background correction and normalization were performed for the merged ECLIPSE and COPDGene samples using robust multi-array averaging and quantile normalization as implemented in the affy Bioconductor package[27]. Of the 136 COPDGene subjects reported in a previous publication[13], one self-reported African-American subject was removed from analysis, which was conducted on the remaining 135 non-Hispanic white subjects. To identify a set of genes associated with COPD, we performed differential expression analysis for 38,519 probesets in ECLIPSE that passed quality control measures. Normalized probeset intensities were related to measures indicative of two primary dimensions of pulmonary impairment in COPD airway obstruction as indicated by two measures of spirometric lung function (FEV1 (% of predicted) and FEV1/FVC) and lung parenchymal destruction, i.e., emphysema (as quantified by the percentage of low attenuation area less than -950 Hounsfield units on lung computed tomography, %LAA-950). The analysis was conducted using the limma Bioconductor package, and the false discovery rate was controlled at 5%. The following covariates were included in the differential expression analysis age, pack-years of cigarette smoke exposure, and gender.
 
Overall design After standardizing gene expression data from 229 ECLIPSE subjects by the variance of each probe set, we applied NMF[29] and NBS[6] to identify meta-patients (i.e. subtypes or subject clusters) and meta-genes (i.e. representative subtype expression profiles).

Cross-sectional study of smokers. 229 subjects from the ECLIPSE study were analyzed in the model discovery phase. 135 subjects from the COPDGene Study (GSE42057) were used for replication.

Please note that the entire data set for total 364 samples including the re-analyzed samples is provided in the *364samples.txt files.
 
Contributor(s) Castaldi P
Citation(s) 26773458, 35715807
Submission date Jan 11, 2016
Last update date Jun 27, 2022
Contact name Peter Castaldi
Organization name Brigham and Women's Hospital
Street address 181 Longwood Ave
City Boston
State/province Massachusetts
ZIP/Postal code 02115
Country USA
 
Platforms (1)
GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array
Samples (229)
GSM2036035 142975hp133a11
GSM2036036 142976HP133A11
GSM2036037 142977HP133A11
Relations
Reanalysis of GSM1031549
Reanalysis of GSM1031550
Reanalysis of GSM1031551
Reanalysis of GSM1031552
Reanalysis of GSM1031553
Reanalysis of GSM1031554
Reanalysis of GSM1031555
Reanalysis of GSM1031556
Reanalysis of GSM1031557
Reanalysis of GSM1031558
Reanalysis of GSM1031559
Reanalysis of GSM1031560
Reanalysis of GSM1031561
Reanalysis of GSM1031562
Reanalysis of GSM1031563
Reanalysis of GSM1031564
Reanalysis of GSM1031565
Reanalysis of GSM1031566
Reanalysis of GSM1031567
Reanalysis of GSM1031568
Reanalysis of GSM1031569
Reanalysis of GSM1031570
Reanalysis of GSM1031571
Reanalysis of GSM1031572
Reanalysis of GSM1031573
Reanalysis of GSM1031574
Reanalysis of GSM1031575
Reanalysis of GSM1031576
Reanalysis of GSM1031577
Reanalysis of GSM1031578
Reanalysis of GSM1031579
Reanalysis of GSM1031580
Reanalysis of GSM1031581
Reanalysis of GSM1031582
Reanalysis of GSM1031583
Reanalysis of GSM1031584
Reanalysis of GSM1031585
Reanalysis of GSM1031586
Reanalysis of GSM1031587
Reanalysis of GSM1031588
Reanalysis of GSM1031589
Reanalysis of GSM1031590
Reanalysis of GSM1031591
Reanalysis of GSM1031592
Reanalysis of GSM1031593
Reanalysis of GSM1031594
Reanalysis of GSM1031595
Reanalysis of GSM1031596
Reanalysis of GSM1031597
Reanalysis of GSM1031598
Reanalysis of GSM1031599
Reanalysis of GSM1031600
Reanalysis of GSM1031601
Reanalysis of GSM1031602
Reanalysis of GSM1031603
Reanalysis of GSM1031604
Reanalysis of GSM1031605
Reanalysis of GSM1031606
Reanalysis of GSM1031607
Reanalysis of GSM1031608
Reanalysis of GSM1031609
Reanalysis of GSM1031610
Reanalysis of GSM1031611
Reanalysis of GSM1031612
Reanalysis of GSM1031613
Reanalysis of GSM1031614
Reanalysis of GSM1031615
Reanalysis of GSM1031616
Reanalysis of GSM1031617
Reanalysis of GSM1031618
Reanalysis of GSM1031619
Reanalysis of GSM1031620
Reanalysis of GSM1031621
Reanalysis of GSM1031622
Reanalysis of GSM1031623
Reanalysis of GSM1031624
Reanalysis of GSM1031625
Reanalysis of GSM1031626
Reanalysis of GSM1031627
Reanalysis of GSM1031628
Reanalysis of GSM1031629
Reanalysis of GSM1031630
Reanalysis of GSM1031631
Reanalysis of GSM1031632
Reanalysis of GSM1031633
Reanalysis of GSM1031634
Reanalysis of GSM1031635
Reanalysis of GSM1031636
Reanalysis of GSM1031637
Reanalysis of GSM1031638
Reanalysis of GSM1031639
Reanalysis of GSM1031640
Reanalysis of GSM1031641
Reanalysis of GSM1031642
Reanalysis of GSM1031643
Reanalysis of GSM1031644
Reanalysis of GSM1031645
Reanalysis of GSM1031646
Reanalysis of GSM1031647
Reanalysis of GSM1031648
Reanalysis of GSM1031649
Reanalysis of GSM1031650
Reanalysis of GSM1031651
Reanalysis of GSM1031652
Reanalysis of GSM1031653
Reanalysis of GSM1031654
Reanalysis of GSM1031655
Reanalysis of GSM1031656
Reanalysis of GSM1031657
Reanalysis of GSM1031658
Reanalysis of GSM1031659
Reanalysis of GSM1031660
Reanalysis of GSM1031661
Reanalysis of GSM1031662
Reanalysis of GSM1031663
Reanalysis of GSM1031664
Reanalysis of GSM1031665
Reanalysis of GSM1031666
Reanalysis of GSM1031667
Reanalysis of GSM1031668
Reanalysis of GSM1031669
Reanalysis of GSM1031670
Reanalysis of GSM1031671
Reanalysis of GSM1031672
Reanalysis of GSM1031673
Reanalysis of GSM1031674
Reanalysis of GSM1031675
Reanalysis of GSM1031676
Reanalysis of GSM1031677
Reanalysis of GSM1031679
Reanalysis of GSM1031680
Reanalysis of GSM1031681
Reanalysis of GSM1031682
Reanalysis of GSM1031683
Reanalysis of GSM1031684
BioProject PRJNA308432

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE76705_RAW.tar 1.1 Gb (http)(custom) TAR (of CEL)
GSE76705_metadata_364samples.txt.gz 9.6 Kb (ftp)(http) TXT
GSE76705_normalize_data_364samples.txt.gz 102.9 Mb (ftp)(http) TXT
Processed data included within Sample table
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap