A Conditional Autoregressive Model for Detecting Natural Selection in Protein-Coding DNA Sequences

Yu Fan, Rui Wu, Ming Hui Chen, Lynn Kuo, Paul O. Lewis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Phylogenetics, the study of evolutionary relationships among groups of organisms, has played an important role in modern biological research, such as genomic comparison, detecting orthology and paralogy, estimating divergence times, reconstructing ancient proteins, identifying mutations likely to be associated with disease, determining the identity of new pathogens, and finding the residues that are important to natural selection. Given an alignment of protein-coding DNA sequences, most methods for detecting natural selection rely on estimating the codon-specific nonsynonymous/synonymous rate ratios (dN/dS). Here, we describe an approach to modeling variation in the dN/dS by using a conditional autoregressive (CAR) model. The CAR model relaxes the assumption in most contemporary phylogenetic models, i.e., sites in molecular sequences evolve independently. By incorporating the information stored in the Protein Data Bank (PDB) file, the CAR model estimates the dN/dS based on the protein three-dimensional structure. We implement the model in a fully Bayesian approach with all parameters of the model considered as random variables and make use of the NVIDIA's parallel computing architecture (CUDA) to accelerate the calculation. Our result of analyzing an empirical abalone sperm lysine data is in accordance with the previous findings.

Original languageEnglish (US)
Title of host publicationTopics in Applied Statistics - 2012 Symposium of the International Chinese Statistical Association
Pages203-212
Number of pages10
DOIs
StatePublished - 2013
Event21st Symposium of the International Chinese Statistical Association, ICSA 2012 - Boston, MA, United States
Duration: Jun 23 2012Jun 26 2012

Publication series

NameSpringer Proceedings in Mathematics and Statistics
Volume55
ISSN (Print)2194-1009
ISSN (Electronic)2194-1017

Other

Other21st Symposium of the International Chinese Statistical Association, ICSA 2012
Country/TerritoryUnited States
CityBoston, MA
Period6/23/126/26/12

ASJC Scopus subject areas

  • General Mathematics

Fingerprint

Dive into the research topics of 'A Conditional Autoregressive Model for Detecting Natural Selection in Protein-Coding DNA Sequences'. Together they form a unique fingerprint.

Cite this