References & Citations
Mathematics > Statistics Theory
Title: Tropical Sufficient Statistics for Persistent Homology
(Submitted on 8 Sep 2017 (v1), last revised 30 Jun 2019 (this version, v6))
Abstract: We show that an embedding in Euclidean space based on tropical geometry generates stable sufficient statistics for barcodes. In topological data analysis, barcodes are multiscale summaries of algebraic topological characteristics that capture the `shape' of data; however, in practice, they have complex structures that make them difficult to use in statistical settings. The sufficiency result presented in this work allows for classical probability distributions to be assumed on the tropical geometric representation of barcodes. This makes a variety of parametric statistical inference methods amenable to barcodes, all while maintaining their initial interpretations. More specifically, we show that exponential family distributions may be assumed, and that likelihood functions for persistent homology may be constructed. We conceptually demonstrate sufficiency and illustrate its utility in persistent homology dimensions 0 and 1 with concrete parametric applications to human immunodeficiency virus and avian influenza data.
Submission history
From: Anthea Monod [view email][v1] Fri, 8 Sep 2017 11:06:16 GMT (137kb,D)
[v2] Thu, 14 Sep 2017 07:09:13 GMT (137kb,D)
[v3] Sat, 30 Sep 2017 00:58:52 GMT (140kb,D)
[v4] Fri, 6 Jul 2018 11:14:19 GMT (352kb,D)
[v5] Thu, 20 Dec 2018 17:27:51 GMT (356kb,D)
[v6] Sun, 30 Jun 2019 11:59:32 GMT (357kb,D)
Link back to: arXiv, form interface, contact.