The ordering of expression among a few genes can provide simple cancer biomarkers and signal BRCA1 mutations. Academic Article uri icon

Overview

abstract

  • BACKGROUND: A major challenge in computational biology is to extract knowledge about the genetic nature of disease from high-throughput data. However, an important obstacle to both biological understanding and clinical applications is the "black box" nature of the decision rules provided by most machine learning approaches, which usually involve many genes combined in a highly complex fashion. Achieving biologically relevant results argues for a different strategy. A promising alternative is to base prediction entirely upon the relative expression ordering of a small number of genes. RESULTS: We present a three-gene version of "relative expression analysis" (RXA), a rigorous and systematic comparison with earlier approaches in a variety of cancer studies, a clinically relevant application to predicting germline BRCA1 mutations in breast cancer and a cross-study validation for predicting ER status. In the BRCA1 study, RXA yields high accuracy with a simple decision rule: in tumors carrying mutations, the expression of a "reference gene" falls between the expression of two differentially expressed genes, PPP1CB and RNF14. An analysis of the protein-protein interactions among the triplet of genes and BRCA1 suggests that the classifier has a biological foundation. CONCLUSION: RXA has the potential to identify genomic "marker interactions" with plausible biological interpretation and direct clinical applicability. It provides a general framework for understanding the roles of the genes involved in decision rules, as illustrated for the difficult and clinically relevant problem of identifying BRCA1 mutation carriers.

publication date

  • August 20, 2009

Research

keywords

  • Biomarkers, Tumor
  • Breast Neoplasms
  • Computational Biology
  • Gene Expression Regulation, Neoplastic
  • Genes, BRCA1

Identity

PubMed Central ID

  • PMC2745389

Scopus Document Identifier

  • 70349731768

Digital Object Identifier (DOI)

  • 10.1073/pnas.211566398

PubMed ID

  • 19695104

Additional Document Info

volume

  • 10