We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Distribution Regression for Sequential Data

Abstract: Distribution regression refers to the supervised learning problem where labels are only available for groups of inputs instead of individual inputs. In this paper, we develop a rigorous mathematical framework for distribution regression where inputs are complex data streams. Leveraging properties of the expected signature and a recent signature kernel trick for sequential data from stochastic analysis, we introduce two new learning techniques, one feature-based and the other kernel-based. Each is suited to a different data regime in terms of the number of data streams and the dimensionality of the individual streams. We provide theoretical results on the universality of both approaches and demonstrate empirically their robustness to irregularly sampled multivariate time-series, achieving state-of-the-art performance on both synthetic and real-world examples from thermodynamics, mathematical finance and agricultural science.
Comments: Published at AISTATS 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
MSC classes: 60L10, 60L20
Cite as: arXiv:2006.05805 [cs.LG]
  (or arXiv:2006.05805v5 [cs.LG] for this version)

Submission history

From: Cristopher Salvi [view email]
[v1] Wed, 10 Jun 2020 12:47:23 GMT (913kb,D)
[v2] Thu, 11 Jun 2020 05:55:52 GMT (913kb,D)
[v3] Mon, 22 Jun 2020 09:09:23 GMT (914kb,D)
[v4] Fri, 23 Oct 2020 08:56:02 GMT (5372kb,D)
[v5] Wed, 29 Sep 2021 17:44:28 GMT (5371kb,D)

Link back to: arXiv, form interface, contact.