We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Quantitative Biology > Populations and Evolution

Title: Species tree estimation using ASTRAL: how many genes are enough?

Abstract: Species tree reconstruction from genomic data is increasingly performed using methods that account for sources of gene tree discordance such as incomplete lineage sorting. One popular method for reconstructing species trees from unrooted gene tree topologies is ASTRAL. In this paper, we derive theoretical sample complexity results for the number of genes required by ASTRAL to guarantee reconstruction of the correct species tree with high probability. We also validate those theoretical bounds in a simulation study. Our results indicate that ASTRAL requires $\mathcal{O}(f^{-2} \log n)$ gene trees to reconstruct the species tree correctly with high probability where n is the number of species and f is the length of the shortest branch in the species tree. Our simulations, which are the first to test ASTRAL explicitly under the anomaly zone, show trends consistent with the theoretical bounds and also provide some practical insights on the conditions where ASTRAL works well.
Comments: 22 pages, 2 figures, Accepted for oral presentation at RECOMB 2017; Under review at IEEE TCBB
Subjects: Populations and Evolution (q-bio.PE); Computational Engineering, Finance, and Science (cs.CE); Probability (math.PR); Statistics Theory (math.ST)
DOI: 10.1109/TCBB.2017.2757930
Cite as: arXiv:1704.06831 [q-bio.PE]
  (or arXiv:1704.06831v2 [q-bio.PE] for this version)

Submission history

From: Shubhanshu Shekhar [view email]
[v1] Sat, 22 Apr 2017 18:25:44 GMT (151kb,D)
[v2] Sat, 16 Sep 2017 03:04:45 GMT (158kb,D)

Link back to: arXiv, form interface, contact.