We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Identifiability of a Markovian model of molecular evolution with Gamma-distributed rates

Abstract: Inference of evolutionary trees and rates from biological sequences is commonly performed using continuous-time Markov models of character change. The Markov process evolves along an unknown tree while observations arise only from the tips of the tree. Rate heterogeneity is present in most real data sets and is accounted for by the use of flexible mixture models where each site is allowed its own rate. Very little has been rigorously established concerning the identifiability of the models currently in common use in data analysis, although non-identifiability was proven for a semi-parametric model and an incorrect proof of identifiability was published for a general parametric model (GTR+Gamma+I). Here we prove that one of the most widely used models (GTR+Gamma) is identifiable for generic parameters, and for all parameter choices in the case of 4-state (DNA) models. This is the first proof of identifiability of a phylogenetic model with a continuous distribution of rates.
Comments: 35 pages, 3 figures; Minor revisions and reformatting to reflect version to be published
Subjects: Statistics Theory (math.ST); Populations and Evolution (q-bio.PE)
MSC classes: 62P10, 92D15
Cite as: arXiv:0709.0531 [math.ST]
  (or arXiv:0709.0531v2 [math.ST] for this version)

Submission history

From: John Rhodes [view email]
[v1] Tue, 4 Sep 2007 20:18:38 GMT (191kb)
[v2] Fri, 1 Feb 2008 17:06:33 GMT (237kb)

Link back to: arXiv, form interface, contact.