Protein interaction network underpins concordant prognosis among heterogeneous breast cancer signatures

James Chen, Lee Sam, Yong Huang, Younghee Lee, Jianrong Li, Yang Liu, H. Rosie Xing, Yves A Lussier

Research output: Contribution to journalArticle

42 Citations (Scopus)

Abstract

Characterizing the biomolecular systems' properties underpinning prognosis signatures derived from gene expression profiles remains a key clinical and biological challenge. In breast cancer, while different "poor-prognosis" sets of genes have predicted patient survival outcome equally well in independent cohorts, these prognostic signatures have surprisingly little genetic overlap. We examine 10 such published expression-based signatures that are predictors or distinct breast cancer phenotypes, uncover their mechanistic interconnectivity through a protein-protein interaction network, and introduce a novel cross-"gene expression signature" analysis method using (i) domain knowledge to constrain multiple comparisons in a mechanistically relevant single-gene network interactions and (ii) scale-free permutation re-sampling to statistically control for hubness (SPAN - Single Protein Analysis of Network with constant node degree per protein). At adjusted p-values < 5%, 54-genes thus identified have a significantly greater connectivity than those through meticulous permutation re-sampling of the context-constrained network. More importantly, eight of 10 genetically non-overlapping signatures are connected through well-established mechanisms of breast cancer oncogenesis and progression. Gene Ontology enrichment studies demonstrate common markers of cell cycle regulation. Kaplan-Meier analysis of three independent historical gene expression sets confirms this network-signature's inherent ability to identify "poor outcome" in ER(+) patients without the requirement of machine learning. We provide a novel demonstration that genetically distinct prognosis signatures, developed from independent clinical datasets, occupy overlapping prognostic space of breast cancer via shared mechanisms that are mediated by genetically different yet mechanistically comparable interactions among proteins of differentially expressed genes in the signatures. This is the first study employing a networks' approach to aggregate established gene expression signatures in order to develop a phenotype/pathway-based cancer roadmap with the potential for (i) novel drug development applications and for (ii) facilitating the clinical deployment of prognostic gene signatures with improved mechanistic understanding of biological processes and functions associated with gene expression changes. http://www.lussierlab.org/publication/networksignature/.

Original languageEnglish (US)
Pages (from-to)385-396
Number of pages12
JournalJournal of Biomedical Informatics
Volume43
Issue number3
DOIs
StatePublished - Jun 2010
Externally publishedYes

Fingerprint

Protein Interaction Maps
Gene expression
Genes
Transcriptome
Breast Neoplasms
Proteins
Phenotype
Biological Phenomena
Gene Expression
Gene Ontology
Gene Regulatory Networks
Kaplan-Meier Estimate
Sampling
Publications
Cell Cycle
Carcinogenesis
Survival
Ontology
Learning systems
Demonstrations

Keywords

  • Breast cancer
  • Context-constrained networks
  • Gene signatures
  • Protein interaction networks
  • Systems biology

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics
  • Medicine(all)

Cite this

Protein interaction network underpins concordant prognosis among heterogeneous breast cancer signatures. / Chen, James; Sam, Lee; Huang, Yong; Lee, Younghee; Li, Jianrong; Liu, Yang; Xing, H. Rosie; Lussier, Yves A.

In: Journal of Biomedical Informatics, Vol. 43, No. 3, 06.2010, p. 385-396.

Research output: Contribution to journalArticle

Chen, James ; Sam, Lee ; Huang, Yong ; Lee, Younghee ; Li, Jianrong ; Liu, Yang ; Xing, H. Rosie ; Lussier, Yves A. / Protein interaction network underpins concordant prognosis among heterogeneous breast cancer signatures. In: Journal of Biomedical Informatics. 2010 ; Vol. 43, No. 3. pp. 385-396.
@article{57fb2ff33b8a4e258dfba6d59b6d2e2e,
title = "Protein interaction network underpins concordant prognosis among heterogeneous breast cancer signatures",
abstract = "Characterizing the biomolecular systems' properties underpinning prognosis signatures derived from gene expression profiles remains a key clinical and biological challenge. In breast cancer, while different {"}poor-prognosis{"} sets of genes have predicted patient survival outcome equally well in independent cohorts, these prognostic signatures have surprisingly little genetic overlap. We examine 10 such published expression-based signatures that are predictors or distinct breast cancer phenotypes, uncover their mechanistic interconnectivity through a protein-protein interaction network, and introduce a novel cross-{"}gene expression signature{"} analysis method using (i) domain knowledge to constrain multiple comparisons in a mechanistically relevant single-gene network interactions and (ii) scale-free permutation re-sampling to statistically control for hubness (SPAN - Single Protein Analysis of Network with constant node degree per protein). At adjusted p-values < 5{\%}, 54-genes thus identified have a significantly greater connectivity than those through meticulous permutation re-sampling of the context-constrained network. More importantly, eight of 10 genetically non-overlapping signatures are connected through well-established mechanisms of breast cancer oncogenesis and progression. Gene Ontology enrichment studies demonstrate common markers of cell cycle regulation. Kaplan-Meier analysis of three independent historical gene expression sets confirms this network-signature's inherent ability to identify {"}poor outcome{"} in ER(+) patients without the requirement of machine learning. We provide a novel demonstration that genetically distinct prognosis signatures, developed from independent clinical datasets, occupy overlapping prognostic space of breast cancer via shared mechanisms that are mediated by genetically different yet mechanistically comparable interactions among proteins of differentially expressed genes in the signatures. This is the first study employing a networks' approach to aggregate established gene expression signatures in order to develop a phenotype/pathway-based cancer roadmap with the potential for (i) novel drug development applications and for (ii) facilitating the clinical deployment of prognostic gene signatures with improved mechanistic understanding of biological processes and functions associated with gene expression changes. http://www.lussierlab.org/publication/networksignature/.",
keywords = "Breast cancer, Context-constrained networks, Gene signatures, Protein interaction networks, Systems biology",
author = "James Chen and Lee Sam and Yong Huang and Younghee Lee and Jianrong Li and Yang Liu and Xing, {H. Rosie} and Lussier, {Yves A}",
year = "2010",
month = "6",
doi = "10.1016/j.jbi.2010.03.009",
language = "English (US)",
volume = "43",
pages = "385--396",
journal = "Journal of Biomedical Informatics",
issn = "1532-0464",
publisher = "Academic Press Inc.",
number = "3",

}

TY - JOUR

T1 - Protein interaction network underpins concordant prognosis among heterogeneous breast cancer signatures

AU - Chen, James

AU - Sam, Lee

AU - Huang, Yong

AU - Lee, Younghee

AU - Li, Jianrong

AU - Liu, Yang

AU - Xing, H. Rosie

AU - Lussier, Yves A

PY - 2010/6

Y1 - 2010/6

N2 - Characterizing the biomolecular systems' properties underpinning prognosis signatures derived from gene expression profiles remains a key clinical and biological challenge. In breast cancer, while different "poor-prognosis" sets of genes have predicted patient survival outcome equally well in independent cohorts, these prognostic signatures have surprisingly little genetic overlap. We examine 10 such published expression-based signatures that are predictors or distinct breast cancer phenotypes, uncover their mechanistic interconnectivity through a protein-protein interaction network, and introduce a novel cross-"gene expression signature" analysis method using (i) domain knowledge to constrain multiple comparisons in a mechanistically relevant single-gene network interactions and (ii) scale-free permutation re-sampling to statistically control for hubness (SPAN - Single Protein Analysis of Network with constant node degree per protein). At adjusted p-values < 5%, 54-genes thus identified have a significantly greater connectivity than those through meticulous permutation re-sampling of the context-constrained network. More importantly, eight of 10 genetically non-overlapping signatures are connected through well-established mechanisms of breast cancer oncogenesis and progression. Gene Ontology enrichment studies demonstrate common markers of cell cycle regulation. Kaplan-Meier analysis of three independent historical gene expression sets confirms this network-signature's inherent ability to identify "poor outcome" in ER(+) patients without the requirement of machine learning. We provide a novel demonstration that genetically distinct prognosis signatures, developed from independent clinical datasets, occupy overlapping prognostic space of breast cancer via shared mechanisms that are mediated by genetically different yet mechanistically comparable interactions among proteins of differentially expressed genes in the signatures. This is the first study employing a networks' approach to aggregate established gene expression signatures in order to develop a phenotype/pathway-based cancer roadmap with the potential for (i) novel drug development applications and for (ii) facilitating the clinical deployment of prognostic gene signatures with improved mechanistic understanding of biological processes and functions associated with gene expression changes. http://www.lussierlab.org/publication/networksignature/.

AB - Characterizing the biomolecular systems' properties underpinning prognosis signatures derived from gene expression profiles remains a key clinical and biological challenge. In breast cancer, while different "poor-prognosis" sets of genes have predicted patient survival outcome equally well in independent cohorts, these prognostic signatures have surprisingly little genetic overlap. We examine 10 such published expression-based signatures that are predictors or distinct breast cancer phenotypes, uncover their mechanistic interconnectivity through a protein-protein interaction network, and introduce a novel cross-"gene expression signature" analysis method using (i) domain knowledge to constrain multiple comparisons in a mechanistically relevant single-gene network interactions and (ii) scale-free permutation re-sampling to statistically control for hubness (SPAN - Single Protein Analysis of Network with constant node degree per protein). At adjusted p-values < 5%, 54-genes thus identified have a significantly greater connectivity than those through meticulous permutation re-sampling of the context-constrained network. More importantly, eight of 10 genetically non-overlapping signatures are connected through well-established mechanisms of breast cancer oncogenesis and progression. Gene Ontology enrichment studies demonstrate common markers of cell cycle regulation. Kaplan-Meier analysis of three independent historical gene expression sets confirms this network-signature's inherent ability to identify "poor outcome" in ER(+) patients without the requirement of machine learning. We provide a novel demonstration that genetically distinct prognosis signatures, developed from independent clinical datasets, occupy overlapping prognostic space of breast cancer via shared mechanisms that are mediated by genetically different yet mechanistically comparable interactions among proteins of differentially expressed genes in the signatures. This is the first study employing a networks' approach to aggregate established gene expression signatures in order to develop a phenotype/pathway-based cancer roadmap with the potential for (i) novel drug development applications and for (ii) facilitating the clinical deployment of prognostic gene signatures with improved mechanistic understanding of biological processes and functions associated with gene expression changes. http://www.lussierlab.org/publication/networksignature/.

KW - Breast cancer

KW - Context-constrained networks

KW - Gene signatures

KW - Protein interaction networks

KW - Systems biology

UR - http://www.scopus.com/inward/record.url?scp=77952745604&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77952745604&partnerID=8YFLogxK

U2 - 10.1016/j.jbi.2010.03.009

DO - 10.1016/j.jbi.2010.03.009

M3 - Article

VL - 43

SP - 385

EP - 396

JO - Journal of Biomedical Informatics

JF - Journal of Biomedical Informatics

SN - 1532-0464

IS - 3

ER -