Clustering High-Dimensional Landmark-Based Two-Dimensional Shape Data

Chao Huang, Martin Styner, Hongtu Zhu

Research output: Contribution to journalArticlepeer-review

17 Scopus citations

Abstract

An important goal in image analysis is to cluster and recognize objects of interest according to the shapes of their boundaries. Clustering such objects faces at least four major challenges including a curved shape space, a high-dimensional feature space, a complex spatial correlation structure, and shape variation associated with some covariates (e.g., age or gender). The aim of this article is to develop a penalized model-based clustering framework to cluster landmark-based planar shape data, while explicitly addressing these challenges. Specifically, a mixture of offset-normal shape factor analyzers (MOSFA) is proposed with mixing proportions defined through a regression model (e.g., logistic) and an offset-normal shape distribution in each component for data in the curved shape space. A latent factor analysis model is introduced to explicitly model the complex spatial correlation. A penalized likelihood approach with both adaptive pairwise fused Lasso penalty function and L2 penalty function is used to automatically realize variable selection via thresholding and deliver a sparse solution. Our real data analysis has confirmed the excellent finite-sample performance of MOSFA in revealing meaningful clusters in the corpus callosum shape data obtained from the Attention Deficit Hyperactivity Disorder-200 (ADHD-200) study. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)946-961
Number of pages16
JournalJournal of the American Statistical Association
Volume110
Issue number511
DOIs
StatePublished - Jul 3 2015

Keywords

  • Alternating direction method of multipliers
  • Attention deficit hyperactivity disorder
  • Corpus callosum
  • Offset-normal shape distribution
  • Shape clustering

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Clustering High-Dimensional Landmark-Based Two-Dimensional Shape Data'. Together they form a unique fingerprint.

Cite this