Title: Optimal Transport vs. Fisher-Rao distance between Copulas for Clustering Multivariate Time Series

Abstract: We present a methodology for clustering N objects that are described by multivariate time series, i.e., several sequences of real-valued random variables. The methodology leverages copulas, which are distributions encoding the dependence structure between several random variables. To take the dependence information fully into account while clustering, we need a distance between copulas. In this work, we compare well-known distances between distributions, namely the Fisher-Rao geodesic distance, related divergences, and optimal transport, and discuss their advantages and disadvantages. Applications of this methodology include the clustering of financial assets. A tutorial, experiments, and an implementation for reproducible research can be found at www.datagrapple.com/Tech.
Comments: Accepted at IEEE Workshop on Statistical Signal Processing (SSP 2016)
Subjects: Machine Learning (stat.ML)
DOI: 10.1109/SSP.2016.7551770
Cite as: arXiv:1604.08634 [stat.ML]
  (or arXiv:1604.08634v2 [stat.ML] for this version)
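
The comparison named in the abstract can be illustrated with a minimal sketch. It assumes Gaussian copulas summarized by their correlation matrices and uses the standard closed forms for centered Gaussian distributions; it is not the authors' implementation, and the function names (`fisher_rao_gaussian`, `wasserstein2_gaussian`, `corr`) are hypothetical.

```python
import numpy as np


def _psd_sqrt(m):
    """Symmetric square root of a positive semi-definite matrix."""
    w, v = np.linalg.eigh(m)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.T


def fisher_rao_gaussian(c1, c2):
    """Fisher-Rao geodesic distance between N(0, c1) and N(0, c2).

    Closed form for centered Gaussians: sqrt(0.5 * sum(log(lam_i)^2)),
    where lam_i are the eigenvalues of c1^{-1} c2.
    """
    lam = np.linalg.eigvals(np.linalg.solve(c1, c2)).real
    return float(np.sqrt(0.5 * np.sum(np.log(lam) ** 2)))


def wasserstein2_gaussian(c1, c2):
    """2-Wasserstein (optimal transport) distance between N(0, c1) and N(0, c2).

    Closed form: W2^2 = tr(c1) + tr(c2) - 2 tr((c1^{1/2} c2 c1^{1/2})^{1/2}).
    """
    root = _psd_sqrt(c1)
    inner = root @ c2 @ root
    # The trace of the matrix square root is the sum of sqrt(eigenvalues).
    cross = np.sum(np.sqrt(np.clip(np.linalg.eigvalsh(inner), 0.0, None)))
    w2_sq = np.trace(c1) + np.trace(c2) - 2.0 * cross
    return float(np.sqrt(max(w2_sq, 0.0)))


def corr(rho):
    """Bivariate correlation matrix (assumed parameterization of the copula)."""
    return np.array([[1.0, rho], [rho, 1.0]])


# Illustrative correlation levels chosen for this sketch, not taken from the paper.
pairs = [(0.5, 0.99), (0.99, 0.9999)]
for r1, r2 in pairs:
    fr = fisher_rao_gaussian(corr(r1), corr(r2))
    w2 = wasserstein2_gaussian(corr(r1), corr(r2))
    print(f"rho={r1} vs rho={r2}: Fisher-Rao={fr:.3f}, W2={w2:.3f}")
```

Under these assumptions, the Fisher-Rao distance ranks the pair (0.99, 0.9999) as farther apart than (0.5, 0.99), while the 2-Wasserstein distance ranks them the other way around. This is the kind of behavioral difference that matters when choosing a distance for clustering.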

Submission history

From: Gautier Marti
[v1] Thu, 28 Apr 2016 22:10:30 GMT (1693kb,D)
[v2] Mon, 14 Nov 2016 10:50:11 GMT (3376kb,D)
