Inconsistent Partitioning and Unproductive Feature Associations Yield Idealized Radiomic Models

Mishka Gidwani; Ken Chang; Jay Biren Patel; Katharina Viktoria Hoebel; Syed Rakin Ahmed; Praveer Singh; Clifton David Fuller; Jayashree Kalpathy-Cramer

doi:10.1148/radiol.220715

Inconsistent Partitioning and Unproductive Feature Associations Yield Idealized Radiomic Models

Mishka Gidwani, Ken Chang, Jay Biren Patel, Katharina Viktoria Hoebel, Syed Rakin Ahmed, Praveer Singh, Clifton David Fuller, Jayashree Kalpathy-Cramer

Radiation Oncology

Research output: Contribution to journal › Article › peer-review

13 Scopus citations

Abstract

Background: Radiomics is the extraction of predefined mathematic features from medical images for the prediction of variables of clinical interest. While some studies report superlative accuracy of radiomic machine learning (ML) models, the published methodology is often incomplete, and the results are rarely validated in external testing data sets. Purpose: To characterize the type, prevalence, and statistical impact of methodologic errors present in radiomic ML studies. Materials and Methods: Radiomic ML publications were reviewed for the presence of performance-inflating methodologic flaws. Common flaws were subsequently reproduced with randomly generated features interpolated from publicly available radiomic data sets to demonstrate the precarious nature of reported findings. Results: In an assessment of radiomic ML publications, the authors uncovered two general categories of data analysis errors: inconsistent partitioning and unproductive feature associations. In simulations, the authors demonstrated that inconsistent partitioning augments radiomic ML accuracy by 1.4 times from unbiased performance and that correcting for flawed methodologic results in areas under the receiver operating characteristic curve approaching a value of 0.5 (random chance). With use of randomly generated features, the authors illustrated that unproductive associations between radiomic features and gene sets can imply false causality for biologic phenomenon. Conclusion: Radiomic machine learning studies may contain methodologic flaws that undermine their validity. This study provides a review template to avoid such flaws.

Original language	English (US)
Article number	e220715
Journal	Radiology
Volume	307
Issue number	1
DOIs	https://doi.org/10.1148/radiol.220715
State	Published - Apr 2023

ASJC Scopus subject areas

Radiology Nuclear Medicine and imaging

Access to Document

10.1148/radiol.220715

Cite this

@article{7300e962fb054b6c8c247b5774bff80e,

title = "Inconsistent Partitioning and Unproductive Feature Associations Yield Idealized Radiomic Models",

abstract = "Background: Radiomics is the extraction of predefined mathematic features from medical images for the prediction of variables of clinical interest. While some studies report superlative accuracy of radiomic machine learning (ML) models, the published methodology is often incomplete, and the results are rarely validated in external testing data sets. Purpose: To characterize the type, prevalence, and statistical impact of methodologic errors present in radiomic ML studies. Materials and Methods: Radiomic ML publications were reviewed for the presence of performance-inflating methodologic flaws. Common flaws were subsequently reproduced with randomly generated features interpolated from publicly available radiomic data sets to demonstrate the precarious nature of reported findings. Results: In an assessment of radiomic ML publications, the authors uncovered two general categories of data analysis errors: inconsistent partitioning and unproductive feature associations. In simulations, the authors demonstrated that inconsistent partitioning augments radiomic ML accuracy by 1.4 times from unbiased performance and that correcting for flawed methodologic results in areas under the receiver operating characteristic curve approaching a value of 0.5 (random chance). With use of randomly generated features, the authors illustrated that unproductive associations between radiomic features and gene sets can imply false causality for biologic phenomenon. Conclusion: Radiomic machine learning studies may contain methodologic flaws that undermine their validity. This study provides a review template to avoid such flaws.",

author = "Mishka Gidwani and Ken Chang and Patel, {Jay Biren} and Hoebel, {Katharina Viktoria} and Ahmed, {Syed Rakin} and Praveer Singh and Fuller, {Clifton David} and Jayashree Kalpathy-Cramer",

note = "Funding Information: C.D.F. received/receives funding and salary support unrelated to this project during the period of study execution from the following: the National Institutes of Health (NIH) National Institute of Biomedical Imaging and Bioengineering (NIBIB) Research Education Programs for Residents and Clinical Fellows Grant (R25EB025787-01); the National Institute for Dental and Craniofacial Research Establishing Outcome Measures Award (1R01DE025248/R56DE025248) and Academic Industrial Partnership Grant (R01DE028290); NCI Early Phase Clinical Trials in Imaging and Image-Guided Interventions Program (1R01CA218148); an NIH/NCI Cancer Center Support Grant (CCSG) Pilot Research Program Award from the UT MD Anderson CCSG Radiation Oncology and Cancer Imaging Program (P30CA016672); an NIH/NCI Head and Neck Specialized Programs of Research Excellence (SPORE) Developmental Research Program Award (P50 CA097007); NIH Big Data to Knowledge (BD2K) Program of the National Cancer Institute (NCI) Early Stage Development of Technologies in Biomedical Computing, Informatics, and Big Data Science Award (1R01CA2148250); National Science Foundation (NSF), Division of Mathematical Sciences, Joint NIH/NSF Initiative on Quantitative Approaches to Biomedical Big Data (QuBBD) Grant (NSF 1557679); NSF Division of Civil, Mechanical, and Manufacturing Innovation (CMMI) grant (NSF 1933369); and Elekta. Publisher Copyright: {\textcopyright} RSNA, 2022.",

year = "2023",

month = apr,

doi = "10.1148/radiol.220715",

language = "English (US)",

volume = "307",

journal = "Radiology",

issn = "0033-8419",

publisher = "Radiological Society of North America Inc.",

number = "1",

}

TY - JOUR

T1 - Inconsistent Partitioning and Unproductive Feature Associations Yield Idealized Radiomic Models

AU - Gidwani, Mishka

AU - Chang, Ken

AU - Patel, Jay Biren

AU - Hoebel, Katharina Viktoria

AU - Ahmed, Syed Rakin

AU - Singh, Praveer

AU - Fuller, Clifton David

AU - Kalpathy-Cramer, Jayashree

N1 - Funding Information: C.D.F. received/receives funding and salary support unrelated to this project during the period of study execution from the following: the National Institutes of Health (NIH) National Institute of Biomedical Imaging and Bioengineering (NIBIB) Research Education Programs for Residents and Clinical Fellows Grant (R25EB025787-01); the National Institute for Dental and Craniofacial Research Establishing Outcome Measures Award (1R01DE025248/R56DE025248) and Academic Industrial Partnership Grant (R01DE028290); NCI Early Phase Clinical Trials in Imaging and Image-Guided Interventions Program (1R01CA218148); an NIH/NCI Cancer Center Support Grant (CCSG) Pilot Research Program Award from the UT MD Anderson CCSG Radiation Oncology and Cancer Imaging Program (P30CA016672); an NIH/NCI Head and Neck Specialized Programs of Research Excellence (SPORE) Developmental Research Program Award (P50 CA097007); NIH Big Data to Knowledge (BD2K) Program of the National Cancer Institute (NCI) Early Stage Development of Technologies in Biomedical Computing, Informatics, and Big Data Science Award (1R01CA2148250); National Science Foundation (NSF), Division of Mathematical Sciences, Joint NIH/NSF Initiative on Quantitative Approaches to Biomedical Big Data (QuBBD) Grant (NSF 1557679); NSF Division of Civil, Mechanical, and Manufacturing Innovation (CMMI) grant (NSF 1933369); and Elekta. Publisher Copyright: © RSNA, 2022.

PY - 2023/4

Y1 - 2023/4

N2 - Background: Radiomics is the extraction of predefined mathematic features from medical images for the prediction of variables of clinical interest. While some studies report superlative accuracy of radiomic machine learning (ML) models, the published methodology is often incomplete, and the results are rarely validated in external testing data sets. Purpose: To characterize the type, prevalence, and statistical impact of methodologic errors present in radiomic ML studies. Materials and Methods: Radiomic ML publications were reviewed for the presence of performance-inflating methodologic flaws. Common flaws were subsequently reproduced with randomly generated features interpolated from publicly available radiomic data sets to demonstrate the precarious nature of reported findings. Results: In an assessment of radiomic ML publications, the authors uncovered two general categories of data analysis errors: inconsistent partitioning and unproductive feature associations. In simulations, the authors demonstrated that inconsistent partitioning augments radiomic ML accuracy by 1.4 times from unbiased performance and that correcting for flawed methodologic results in areas under the receiver operating characteristic curve approaching a value of 0.5 (random chance). With use of randomly generated features, the authors illustrated that unproductive associations between radiomic features and gene sets can imply false causality for biologic phenomenon. Conclusion: Radiomic machine learning studies may contain methodologic flaws that undermine their validity. This study provides a review template to avoid such flaws.

AB - Background: Radiomics is the extraction of predefined mathematic features from medical images for the prediction of variables of clinical interest. While some studies report superlative accuracy of radiomic machine learning (ML) models, the published methodology is often incomplete, and the results are rarely validated in external testing data sets. Purpose: To characterize the type, prevalence, and statistical impact of methodologic errors present in radiomic ML studies. Materials and Methods: Radiomic ML publications were reviewed for the presence of performance-inflating methodologic flaws. Common flaws were subsequently reproduced with randomly generated features interpolated from publicly available radiomic data sets to demonstrate the precarious nature of reported findings. Results: In an assessment of radiomic ML publications, the authors uncovered two general categories of data analysis errors: inconsistent partitioning and unproductive feature associations. In simulations, the authors demonstrated that inconsistent partitioning augments radiomic ML accuracy by 1.4 times from unbiased performance and that correcting for flawed methodologic results in areas under the receiver operating characteristic curve approaching a value of 0.5 (random chance). With use of randomly generated features, the authors illustrated that unproductive associations between radiomic features and gene sets can imply false causality for biologic phenomenon. Conclusion: Radiomic machine learning studies may contain methodologic flaws that undermine their validity. This study provides a review template to avoid such flaws.

UR - http://www.scopus.com/inward/record.url?scp=85149627292&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85149627292&partnerID=8YFLogxK

U2 - 10.1148/radiol.220715

DO - 10.1148/radiol.220715

M3 - Article

C2 - 36537895

AN - SCOPUS:85149627292

SN - 0033-8419

VL - 307

JO - Radiology

JF - Radiology

IS - 1

M1 - e220715

ER -

Inconsistent Partitioning and Unproductive Feature Associations Yield Idealized Radiomic Models

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this