Literature-Based Discovery of Confounding in Observational Clinical Data

Scott A. Malec, Peng Wei, Hua Xu, Elmer V. Bernstam, Sahiti Myneni, Trevor Cohen

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Observational data recorded in the Electronic Health Record (EHR) can help us better understand the effects of therapeutic agents in routine clinical practice. As such data were not collected for research purposes, their reuse for research must compensate for additional information that may bias analyses and lead to faulty conclusions. Confounding is present when factors aside from the given predictor(s) affect the response of interest. However, these additional factors may not be known at the outset. In this paper, we present a scalable literature-based confounding variable discovery method for biomedical research applications with pharmacovigilance as our use case. We hypothesized that statistical models, adjusted with literature-derived confounders, will more accurately identify causative drug-adverse drug event (ADE) relationships. We evaluated our method with a curated reference standard, and found a pattern of improved performance ~ 5% in two out of three models for gastrointestinal bleeding (pre-adjusted Area Under Curve ≥ 0.6).

Original languageEnglish (US)
Pages (from-to)1920-1929
Number of pages10
JournalAMIA ... Annual Symposium proceedings. AMIA Symposium
Volume2016
StatePublished - 2016

ASJC Scopus subject areas

  • General Medicine

Fingerprint

Dive into the research topics of 'Literature-Based Discovery of Confounding in Observational Clinical Data'. Together they form a unique fingerprint.

Cite this