University of Tasmania
Browse

File(s) under permanent embargo

To gamma or not to gamma? Testing the fit of rates-across-sites models

conference contribution
posted on 2023-05-24, 11:31 authored by Melissa Humphries, Barbara HollandBarbara Holland, Karpievitch, YV, Jeremy SumnerJeremy Sumner
Since the introduction of explicitly model based methods of phylogenetic inference (e.g. maximum like- lihood and Bayesian approaches) the complexity and biological realism of models of sequence evolution has increased. An important advance in this regard was the introduction of models that allowed rate variation across sites (RAS), i.e. they modelled the fact that some sites in a gene may be more or less likely to accept substitutions than others. The most common way of accomplishing this is to use a discrete approximation to a gamma distribution. This has the computational advantage of allowing (usually 4 or 8) different rate categories with the addition of a single extra parameter into the model. However, overly simplistic models of RAS can cause problems for phylogenetic inference and for estimating dates of divergences. In particular, a recent study has shown that if there are a small number of sites that mutate very frequently compared to other sites (so called hot spots) this can lead to time-dependence of rate estimates (Soubrier et al 2012). In this study we used amino-acid data from a study by Grahnen et al (2011) who simulated data using a biophysical model of protein folding and binding. We extracted the number of mutations at each site and fit this data to a variety of models. In particular: • Constant RAS implies the frequency distribution of counts of mutations should follow a Poisson distribution • Gamma distributed RAS imply that the counts should follow a negative binomial distribution • Gamma distributed RAS with invariants sites imply that counts should follow a zero inflated negative binomial distribution. We will discuss the merits of these models and whether or not any of them provide an acceptable fit to data generated under biologically realistic conditions.

History

Publication title

Phylomania 2012

Editors

Dr. Barabara Holland, Dr. Jeremy Sumner

Pagination

7

Department/School

School of Natural Sciences

Publisher

School of Mathematics and Physics

Place of publication

University of Tasmania

Event title

Phylomania 2012

Event Venue

University of Tasmania, Hobart

Date of Event (Start Date)

2012-11-08

Date of Event (End Date)

2012-11-09

Repository Status

  • Restricted

Socio-economic Objectives

Expanding knowledge in the mathematical sciences

Usage metrics

    University Of Tasmania

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC