Identification of significant genetic variants via SLOPE, and its extension to Group SLOPE

Alexej Gossmann; Shaolong Cao; Yu Ping Wang

doi:10.1145/2808719.2808743

Bioinformatics & Computational Biology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

The method of Sorted L-One Penalized Estimation, abbreviated as SLOPE, is a novel sparse regression method for model selection introduced in a sequence of recent papers [4], [3] and [7] by Bogdan, van den Berg, Sabatti, Su and Candes. It estimates the coefficients of a linear model that may have more unknown parameters than observations. In many settings the SLOPE method is shown to successfully control the false discovery rate (the expected proportion of irrelevant predictors among all selected predictors) at a user-specified level. In this paper we evaluate its performance on genetic data and show that it outperforms LASSO, a related and popular method. Genetic data sets often come with group structures among the predictor variables that are given as prior knowledge, such as SNPs in a gene or genes in a pathway. Following this motivation we extend SLOPE in the spirit of Group LASSO to Group SLOPE, a method that can handle group structures among the predictor variables, which are ubiquitous in real genetic data. Our simulation results show that the proposed Group SLOPE method is capable of controlling the false discovery rate at a specified level. Moreover, our simulations show that, compared to Group LASSO, Group SLOPE generally achieves higher power as well as a lower false discovery rate.
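The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the sorted L1 penalty from the SLOPE literature (a non-increasing weight sequence applied to the coefficient magnitudes sorted in decreasing order) and a Benjamini-Hochberg-type weight sequence; the group variant simply applies the same penalty to per-group Euclidean norms, which is an assumption about what "in the spirit of Group LASSO" means here and ignores any group-size weighting the paper may use. All function names are hypothetical.

```python
import math
from statistics import NormalDist


def bh_lambdas(p, q):
    """Assumed BH-type weights: lambda_i = Phi^{-1}(1 - i*q/(2p)) for i = 1..p.

    The sequence is non-increasing, as the sorted L1 penalty requires.
    """
    nd = NormalDist()
    return [nd.inv_cdf(1 - i * q / (2 * p)) for i in range(1, p + 1)]


def sorted_l1_norm(beta, lambdas):
    """Sum of lambda_i times the i-th largest absolute coefficient."""
    if len(beta) != len(lambdas):
        raise ValueError("beta and lambdas must have the same length")
    magnitudes = sorted((abs(b) for b in beta), reverse=True)
    return sum(l * m for l, m in zip(lambdas, magnitudes))


def group_sorted_l1_norm(beta, groups, lambdas):
    """Sorted L1 penalty applied to per-group Euclidean norms (assumption).

    `groups` is a list of index lists, one list per group of predictors.
    """
    norms = [math.sqrt(sum(beta[j] ** 2 for j in g)) for g in groups]
    return sorted_l1_norm(norms, lambdas)


def false_discovery_proportion(selected, relevant):
    """Share of selected predictors that are irrelevant.

    The false discovery rate is the expectation of this proportion.
    """
    selected = set(selected)
    if not selected:
        return 0.0
    return len(selected - set(relevant)) / len(selected)


if __name__ == "__main__":
    lambdas = bh_lambdas(p=5, q=0.1)
    print(sorted_l1_norm([0.0, -2.0, 0.5, 0.0, 1.0], lambdas))
    print(false_discovery_proportion({0, 1, 2, 3}, {0, 1, 2}))
```

With all weights equal, `sorted_l1_norm` reduces to an ordinary LASSO penalty, which is one way to see the relationship between the two methods that the abstract compares.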

Original language	English (US)
Title of host publication	BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
Publisher	Association for Computing Machinery, Inc
Pages	232-240
Number of pages	9
ISBN (Electronic)	9781450338530
DOIs	https://doi.org/10.1145/2808719.2808743
State	Published - Sep 9 2015
Event	6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2015 - Atlanta, United States
Duration	Sep 9 2015 → Sep 12 2015

Publication series

Name	BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics

Other

Other	6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2015
Country/Territory	United States
City	Atlanta
Period	9/9/15 → 9/12/15

Keywords

False discovery rate
Group LASSO
LASSO
SLOPE
Sparse regression

ASJC Scopus subject areas

Software
Health Informatics
Computer Science Applications
Biomedical Engineering

Cite this

Gossmann, A., Cao, S., & Wang, Y. P. (2015). Identification of significant genetic variants via SLOPE, and its extension to Group SLOPE. In BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (pp. 232-240). (BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics). Association for Computing Machinery, Inc. https://doi.org/10.1145/2808719.2808743

Identification of significant genetic variants via SLOPE, and its extension to Group SLOPE. / Gossmann, Alexej; Cao, Shaolong; Wang, Yu Ping.
BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics. Association for Computing Machinery, Inc, 2015. p. 232-240 (BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics).

Gossmann, A, Cao, S & Wang, YP 2015, Identification of significant genetic variants via SLOPE, and its extension to Group SLOPE. in BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics. BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, Association for Computing Machinery, Inc, pp. 232-240, 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2015, Atlanta, United States, 9/9/15. https://doi.org/10.1145/2808719.2808743

Gossmann A, Cao S, Wang YP. Identification of significant genetic variants via SLOPE, and its extension to Group SLOPE. In BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics. Association for Computing Machinery, Inc. 2015. p. 232-240. (BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics). doi: 10.1145/2808719.2808743

Gossmann, Alexej ; Cao, Shaolong ; Wang, Yu Ping. / Identification of significant genetic variants via SLOPE, and its extension to Group SLOPE. BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics. Association for Computing Machinery, Inc, 2015. pp. 232-240 (BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics).

@inproceedings{d8dd9972beb946b6a861aff0efeaf868,

title = "Identification of significant genetic variants via {SLOPE}, and its extension to {Group SLOPE}",

abstract = "The method of Sorted L-One Penalized Estimation, abbreviated as SLOPE, is a novel sparse regression method for model selection introduced in a sequence of recent papers [4], [3] and [7] by Bogdan, van den Berg, Sabatti, Su and Candes. It estimates the coefficients of a linear model that may have more unknown parameters than observations. In many settings the SLOPE method is shown to successfully control the false discovery rate (the expected proportion of irrelevant predictors among all selected predictors) at a user-specified level. In this paper we evaluate its performance on genetic data and show that it outperforms LASSO, a related and popular method. Genetic data sets often come with group structures among the predictor variables that are given as prior knowledge, such as SNPs in a gene or genes in a pathway. Following this motivation we extend SLOPE in the spirit of Group LASSO to Group SLOPE, a method that can handle group structures among the predictor variables, which are ubiquitous in real genetic data. Our simulation results show that the proposed Group SLOPE method is capable of controlling the false discovery rate at a specified level. Moreover, our simulations show that, compared to Group LASSO, Group SLOPE generally achieves higher power as well as a lower false discovery rate.",

keywords = "False discovery rate, Group LASSO, LASSO, SLOPE, Sparse regression",

author = "Gossmann, Alexej and Cao, Shaolong and Wang, Yu Ping",

year = "2015",

month = sep,

day = "9",

doi = "10.1145/2808719.2808743",

language = "English (US)",

series = "BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics",

publisher = "Association for Computing Machinery, Inc",

pages = "232--240",

booktitle = "BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics",

}

TY - GEN

T1 - Identification of significant genetic variants via SLOPE, and its extension to Group SLOPE

AU - Gossmann, Alexej

AU - Cao, Shaolong

AU - Wang, Yu Ping

PY - 2015/9/9

Y1 - 2015/9/9

AB - The method of Sorted L-One Penalized Estimation, abbreviated as SLOPE, is a novel sparse regression method for model selection introduced in a sequence of recent papers [4], [3] and [7] by Bogdan, van den Berg, Sabatti, Su and Candes. It estimates the coefficients of a linear model that may have more unknown parameters than observations. In many settings the SLOPE method is shown to successfully control the false discovery rate (the expected proportion of irrelevant predictors among all selected predictors) at a user-specified level. In this paper we evaluate its performance on genetic data and show that it outperforms LASSO, a related and popular method. Genetic data sets often come with group structures among the predictor variables that are given as prior knowledge, such as SNPs in a gene or genes in a pathway. Following this motivation we extend SLOPE in the spirit of Group LASSO to Group SLOPE, a method that can handle group structures among the predictor variables, which are ubiquitous in real genetic data. Our simulation results show that the proposed Group SLOPE method is capable of controlling the false discovery rate at a specified level. Moreover, our simulations show that, compared to Group LASSO, Group SLOPE generally achieves higher power as well as a lower false discovery rate.

KW - False discovery rate

KW - Group LASSO

KW - LASSO

KW - SLOPE

KW - Sparse regression

UR - http://www.scopus.com/inward/record.url?scp=84963616437&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84963616437&partnerID=8YFLogxK

U2 - 10.1145/2808719.2808743

DO - 10.1145/2808719.2808743

M3 - Conference contribution

AN - SCOPUS:84963616437

T3 - BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics

SP - 232

EP - 240

BT - BCB 2015 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics

PB - Association for Computing Machinery, Inc

T2 - 6th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2015

Y2 - 9 September 2015 through 12 September 2015

ER -
