Data mining

L. Adrienne Cupples, Julia Bailey, Kevin C. Cartier, Catherine T. Falk, Kuang Yu Liu, Yuanqing Ye, Robert Yu, Heping Zhang, Hongyu Zhao

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

Group 14 used data-mining strategies to evaluate a number of issues, including appropriate diagnosis, haplotype estimation, genetic linkage and association studies, and type I error. Methods ranged from exploratory analyses, to machine learning strategies (neural networks, supervised learning, and tree-based methods), to false discovery rate control of type I errors. The general motivations were to find the "story" in the data and to summarize information from a multitude of measures. Several methods illustrated strategies for better trait definition, using summarization of related traits. In the few studies that sought to identify genes for alcoholism, there was little agreement among the different strategies, likely reflecting the complexities of the disease. Nevertheless, Group 14 found that these methods offered strategies to gain a better understanding of the complex pathways by which disease develops.

Original languageEnglish (US)
Pages (from-to)S103-S109
JournalGenetic epidemiology
Volume29
Issue numberSUPPL.
DOIs
StatePublished - 2005

Keywords

  • Association studies
  • Haplotype estimation
  • Machine learning
  • Neural networks
  • Trees

ASJC Scopus subject areas

  • Epidemiology
  • Genetics(clinical)

Fingerprint

Dive into the research topics of 'Data mining'. Together they form a unique fingerprint.

Cite this