Abstract
A general modeling procedure for analyzing genetic data is reviewed. We review ANOVA type model that can handle both the continuous and discrete genetic variables in one modeling framework. Unlike the regression type models which typically set the phenotype variable as a response, this ANOVA model treats the phenotype variable as an explanatory variable. By reversely treating the phenotype variable, usual high dimensional problem is turned into low dimension. Instead, the ANOVA model always includes interaction term between the genetic locations and phenotype variable to find potential association between them. The interaction term is designed to be low rank with the multiplication of bilinear terms so that the required number of parameters is kept in a manageable degree. We compare the performance of the reviewed ANOVA model to the other popular methods via microarray and SNP data sets.
Original language | English (US) |
---|---|
Pages (from-to) | 169-178 |
Number of pages | 10 |
Journal | Journal of the Korean Statistical Society |
Volume | 48 |
Issue number | 2 |
DOIs | |
State | Published - Jun 2019 |
Externally published | Yes |
Keywords
- ANOVA
- BIC
- High dimension
- Variable selection
ASJC Scopus subject areas
- Statistics and Probability