Comparison and optimization of machine learning methods for automated classification of circulating tumor cells

Timothy B. Lannin, Fredrik I. Thege, Brian J. Kirby

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

Advances in rare cell capture technology have made possible the interrogation of circulating tumor cells (CTCs) captured from whole patient blood. However, locating captured cells in the device by manual counting bottlenecks data processing by being tedious (hours per sample) and compromises the results by being inconsistent and prone to user bias. Some recent work has been done to automate the cell location and classification process to address these problems, employing image processing and machine learning (ML) algorithms to locate and classify cells in fluorescent microscope images. However, the type of machine learning method used is a part of the design space that has not been thoroughly explored. Thus, we have trained four ML algorithms on three different datasets. The trained ML algorithms locate and classify thousands of possible cells in a few minutes rather than a few hours, representing an order of magnitude increase in processing speed. Furthermore, some algorithms have a significantly (P < 0.05) higher area under the receiver operating characteristic curve than do other algorithms. Additionally, significant (P < 0.05) losses to performance occur when training on cell lines and testing on CTCs (and vice versa), indicating the need to train on a system that is representative of future unlabeled data. Optimal algorithm selection depends on the peculiarities of the individual dataset, indicating the need of a careful comparison and optimization of algorithms for individual image classification tasks.

Original languageEnglish (US)
Pages (from-to)922-931
Number of pages10
JournalCytometry Part A
Volume89
Issue number10
DOIs
StatePublished - Oct 1 2016
Externally publishedYes

Keywords

  • biomedical image processing
  • circulating tumor cells
  • Gaussian mixture model
  • image cytometry
  • k-nearest neighbors
  • machine learning
  • machine vision
  • random forest classifier
  • support vector machines

ASJC Scopus subject areas

  • Pathology and Forensic Medicine
  • Histology
  • Cell Biology

Fingerprint

Dive into the research topics of 'Comparison and optimization of machine learning methods for automated classification of circulating tumor cells'. Together they form a unique fingerprint.

Cite this