TY - JOUR
T1 - Bayesian Structure Learning in Multilayered Genomic Networks
AU - Ha, Min Jin
AU - Stingo, Francesco
AU - Baladandayuthapani, Veerabhadran
N1 - Funding Information:
Min Jin Ha’s research is supported by by NIH/NCI grants P30CA01 and R21CA22029. Veerabhadran Baladandayuthapani’s research is supported by NIH R01-CA160736, R01-CA194391, P30-CA46592, R21CA22029, National Science Foundation Grant No. DMS 1922567, and UM Rogel Cancer Center and the School of Public Health.
Publisher Copyright:
© 2020 American Statistical Association.
PY - 2021
Y1 - 2021
N2 - Integrative network modeling of data arising from multiple genomic platforms provides insight into the holistic picture of the interactive system, as well as the flow of information across many disease domains including cancer. The basic data structure consists of a sequence of hierarchically ordered datasets for each individual subject, which facilitates integration of diverse inputs, such as genomic, transcriptomic, and proteomic data. A primary analytical task in such contexts is to model the layered architecture of networks where the vertices can be naturally partitioned into ordered layers, dictated by multiple platforms, and exhibit both undirected and directed relationships. We propose a multilayered Gaussian graphical model (mlGGM) to investigate conditional independence structures in such multilevel genomic networks in human cancers. We implement a Bayesian node-wise selection (BANS) approach based on variable selection techniques that coherently accounts for the multiple types of dependencies in mlGGM; this flexible strategy exploits edge-specific prior knowledge and selects sparse and interpretable models. Through simulated data generated under various scenarios, we demonstrate that BANS outperforms other existing multivariate regression-based methodologies. Our integrative genomic network analysis for key signaling pathways across multiple cancer types highlights commonalities and differences of p53 integrative networks and epigenetic effects of BRCA2 on p53 and its interaction with T68 phosphorylated CHK2, that may have translational utilities of finding biomarkers and therapeutic targets. Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.
AB - Integrative network modeling of data arising from multiple genomic platforms provides insight into the holistic picture of the interactive system, as well as the flow of information across many disease domains including cancer. The basic data structure consists of a sequence of hierarchically ordered datasets for each individual subject, which facilitates integration of diverse inputs, such as genomic, transcriptomic, and proteomic data. A primary analytical task in such contexts is to model the layered architecture of networks where the vertices can be naturally partitioned into ordered layers, dictated by multiple platforms, and exhibit both undirected and directed relationships. We propose a multilayered Gaussian graphical model (mlGGM) to investigate conditional independence structures in such multilevel genomic networks in human cancers. We implement a Bayesian node-wise selection (BANS) approach based on variable selection techniques that coherently accounts for the multiple types of dependencies in mlGGM; this flexible strategy exploits edge-specific prior knowledge and selects sparse and interpretable models. Through simulated data generated under various scenarios, we demonstrate that BANS outperforms other existing multivariate regression-based methodologies. Our integrative genomic network analysis for key signaling pathways across multiple cancer types highlights commonalities and differences of p53 integrative networks and epigenetic effects of BRCA2 on p53 and its interaction with T68 phosphorylated CHK2, that may have translational utilities of finding biomarkers and therapeutic targets. Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.
KW - Bayesian variable selection
KW - Multilayered Gaussian graphical models
KW - Multilevel data integration
UR - http://www.scopus.com/inward/record.url?scp=85088563649&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85088563649&partnerID=8YFLogxK
U2 - 10.1080/01621459.2020.1775611
DO - 10.1080/01621459.2020.1775611
M3 - Article
C2 - 34239216
AN - SCOPUS:85088563649
SN - 0162-1459
VL - 116
SP - 605
EP - 618
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 534
ER -