Title: Squaring within the Colless index yields a better balance index

Abstract: The Colless index for bifurcating phylogenetic trees, introduced by Colless (1982), is defined as the sum, over all internal nodes $v$ of the tree, of the absolute value of the difference between the sizes of the clades defined by the children of $v$. It is one of the most popular phylogenetic balance indices because, in addition to measuring the balance of a tree in a very simple and intuitive way, it turns out to be one of the most powerful and discriminating phylogenetic shape indices. But it has some drawbacks. On the one hand, although its minimum value is reached at the so-called maximally balanced trees, it is almost always also reached at trees that are not maximally balanced. On the other hand, its definition as a sum of absolute values of differences makes it difficult to study its distribution analytically under probabilistic models of bifurcating phylogenetic trees. In this paper we show that if we replace the absolute values of the differences of clade sizes in its definition by the squares of these differences, all these drawbacks are overcome and the resulting index is still more powerful and discriminating than the original Colless index.
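The two definitions in the abstract can be illustrated with a minimal sketch. It assumes a tree is encoded as nested 2-tuples with strings as leaves; this encoding, the function names `colless` and `quadratic_colless`, and the example trees `T1` and `T2` are illustrative choices, not taken from the paper. The example compares two 6-leaf trees, of which only `T1` is maximally balanced (every internal node splits its leaves as evenly as possible).

```python
def _size_and_indices(tree):
    """Return (leaf count, sum of |differences|, sum of squared differences).

    Assumption: an internal node is a 2-tuple (left, right); anything else is a leaf.
    """
    if not isinstance(tree, tuple):
        return 1, 0, 0
    left, right = tree
    n_left, abs_left, sq_left = _size_and_indices(left)
    n_right, abs_right, sq_right = _size_and_indices(right)
    diff = n_left - n_right  # difference of the two child clade sizes
    return (
        n_left + n_right,
        abs_left + abs_right + abs(diff),
        sq_left + sq_right + diff * diff,
    )


def colless(tree):
    """Sum over internal nodes of the absolute clade-size difference."""
    return _size_and_indices(tree)[1]


def quadratic_colless(tree):
    """Same sum with each difference squared (hypothetical name for the variant)."""
    return _size_and_indices(tree)[2]


# Maximally balanced tree with 6 leaves: the root splits 3 + 3.
T1 = ((("a", "b"), "c"), (("d", "e"), "f"))
# Not maximally balanced: the root splits 4 + 2.
T2 = ((("a", "b"), ("c", "d")), ("e", "f"))

print(colless(T1), colless(T2))                      # 2 2
print(quadratic_colless(T1), quadratic_colless(T2))  # 2 4
```

In this small case the Colless index takes the same value on both trees, while the squared variant assigns the smaller value to the maximally balanced one, which is consistent with the first drawback described in the abstract.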
Comments: 31 pages
Subjects: Populations and Evolution (q-bio.PE); Discrete Mathematics (cs.DM); Combinatorics (math.CO)
Cite as: arXiv:2007.14731 [q-bio.PE]
  (or arXiv:2007.14731v1 [q-bio.PE] for this version)

Submission history

From: Francesc Rosselló
[v1] Wed, 29 Jul 2020 10:40:51 GMT (31kb,D)
