Scaled sparse high-dimensional tests for localizing sequence variants

Shaolong Cao, Huaizhen Qin, Jian Li, Hong Wen Deng, Yu Ping Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Deep sequencing studies have been generating high-throughput data for high resolution and comprehensive detection of rare and common genetic variants. Existing association tests are powerful to identify functional variants in large samples. These tests, however, have low power to identify susceptible variants in high-dimensional SNP set, where n (the number of observations) is smaller than or close to m (the size of SNP set under testing). We propose a scaled sparse regression approach for localizing susceptible variants set in high-dimensional SNP sets which are ubiquitous in analyses of deep sequencing data. This approach applies sparse regression with scaled Lp (0<p<1) norm regularization. Under a wide range of simulated scenarios, the proposed approach appropriately controlled Type I error rate. In terms of the detection power of susceptible variants, the proposed approach appeared more powerful than several existing prominent methods. The practical utility of the proposed approach is illustrated by application to real DNA sequence data of Mexican Americans from GAW18.

Original languageEnglish (US)
Title of host publicationACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
PublisherAssociation for Computing Machinery
Pages79-87
Number of pages9
ISBN (Electronic)9781450328944
DOIs
StatePublished - Sep 20 2014
Event5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM BCB 2014 - Newport Beach, United States
Duration: Sep 20 2014Sep 23 2014

Publication series

NameACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics

Other

Other5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM BCB 2014
Country/TerritoryUnited States
CityNewport Beach
Period9/20/149/23/14

Keywords

  • GAW18 data
  • HDS-based significance tests
  • Lnorm regularization
  • Scaled sparse regression

ASJC Scopus subject areas

  • Health Informatics
  • Computer Science Applications
  • Software
  • Biomedical Engineering

Fingerprint

Dive into the research topics of 'Scaled sparse high-dimensional tests for localizing sequence variants'. Together they form a unique fingerprint.

Cite this