We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

q-bio.PE

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Quantitative Biology > Populations and Evolution

Title: NJst and ASTRID are not statistically consistent under a random model of missing data

Abstract: Species tree estimation from multi-locus datasets is statistically challenging for multiple reasons, including gene tree heterogeneity across the genome due to incomplete lineage sorting (ILS). Species tree estimation methods have been developed that operate by estimating gene trees and then using those gene trees to estimate the species tree. Several of these methods (e.g., ASTRAL, ASTRID, and NJst) are provably statistically consistent under the multi-species coalescent (MSC) model, provided that the gene trees are estimated correctly, and there is no missing data. Recently, Nute et al. (BMC Genomics 2018) addressed the question of whether these methods remain statistically consistent under random models of taxon deletion, and asserted that they do so. Here we provide a counterexample to one of these theorems, and establish that ASTRID and NJst are not statistically consistent under an i.i.d. model of taxon deletion.
Comments: 6 pages, no figures, provides counterexample to theorem (the first and corresponding author are both co-authors on this paper)
Subjects: Populations and Evolution (q-bio.PE)
MSC classes: 92D15 (primary),
Cite as: arXiv:2001.07844 [q-bio.PE]
  (or arXiv:2001.07844v1 [q-bio.PE] for this version)

Submission history

From: Tandy Warnow [view email]
[v1] Wed, 22 Jan 2020 01:50:53 GMT (6kb)

Link back to: arXiv, form interface, contact.