CellBox: Interpretable Machine Learning for Perturbation Biology with Application to the Design of Cancer Combination Therapy

Bo Yuan, Ciyue Shen, Augustin Luna, Anil Korkut, Debora S. Marks, John Ingraham, Chris Sander

Research output: Contribution to journalArticlepeer-review

50 Scopus citations

Abstract

Systematic perturbation of cells followed by comprehensive measurements of molecular and phenotypic responses provides informative data resources for constructing computational models of cell biology. Models that generalize well beyond training data can be used to identify combinatorial perturbations of potential therapeutic interest. Major challenges for machine learning on large biological datasets are to find global optima in a complex multidimensional space and mechanistically interpret the solutions. To address these challenges, we introduce a hybrid approach that combines explicit mathematical models of cell dynamics with a machine-learning framework, implemented in TensorFlow. We tested the modeling framework on a perturbation-response dataset of a melanoma cell line after drug treatments. The models can be efficiently trained to describe cellular behavior accurately. Even though completely data driven and independent of prior knowledge, the resulting de novo network models recapitulate some known interactions. The approach is readily applicable to various kinetic models of cell biology. A record of this paper's Transparent Peer Review process is included in the Supplemental Information. The ability to accurately predict cell behavior to previously untested perturbations would benefit the discovery of combination therapies in cancer. To overcome the lack of interpretability of black-box machine-learning models, we developed a hybrid approach called CellBox that combines explicit mathematical models of molecular interactions with efficient parameter inference algorithms adapted from deep learning. The models are data driven and do not require prior knowledge, and their predictive scope scales well with the availability of high-throughput data.

Original languageEnglish (US)
Pages (from-to)128-140.e4
JournalCell Systems
Volume12
Issue number2
DOIs
StatePublished - Feb 17 2021

Keywords

  • cancer
  • cell dynamics
  • combinatorial therapy
  • dynamical systems
  • interpretability
  • machine learning
  • network pharmacology
  • perturbation biology
  • systems biology

ASJC Scopus subject areas

  • Pathology and Forensic Medicine
  • Histology
  • Cell Biology

MD Anderson CCSG core facilities

  • Functional Proteomics Reverse Phase Protein Array Core
  • Bioinformatics Shared Resource

Fingerprint

Dive into the research topics of 'CellBox: Interpretable Machine Learning for Perturbation Biology with Application to the Design of Cancer Combination Therapy'. Together they form a unique fingerprint.

Cite this