Tensors, Learning, and 'Kolmogorov Extension' for Finite-alphabet Random Vectors

Kargas, Nikos; Sidiropoulos, Nicholas D.; Fu, Xiao

doi:10.1109/TSP.2018.2862383

Full-text links:

Download:

Current browse context:

math.PR

< prev | next >

new | recent | 1712

Electrical Engineering and Systems Science > Signal Processing

Title: Tensors, Learning, and 'Kolmogorov Extension' for Finite-alphabet Random Vectors

Authors: Nikos Kargas, Nicholas D. Sidiropoulos, Xiao Fu

(Submitted on 1 Dec 2017 (v1), last revised 27 Jul 2018 (this version, v2))

Abstract: Estimating the joint probability mass function (PMF) of a set of random variables lies at the heart of statistical learning and signal processing. Without structural assumptions, such as modeling the variables as a Markov chain, tree, or other graphical model, joint PMF estimation is often considered mission impossible - the number of unknowns grows exponentially with the number of variables. But who gives us the structural model? Is there a generic, `non-parametric' way to control joint PMF complexity without relying on a priori structural assumptions regarding the underlying probability model? Is it possible to discover the operational structure without biasing the analysis up front? What if we only observe random subsets of the variables, can we still reliably estimate the joint PMF of all? This paper shows, perhaps surprisingly, that if the joint PMF of any three variables can be estimated, then the joint PMF of all the variables can be provably recovered under relatively mild conditions. The result is reminiscent of Kolmogorov's extension theorem - consistent specification of lower-dimensional distributions induces a unique probability measure for the entire process. The difference is that for processes of limited complexity (rank of the high-dimensional PMF) it is possible to obtain complete characterization from only three-dimensional distributions. In fact not all three-dimensional PMFs are needed; and under more stringent conditions even two-dimensional will do. Exploiting multilinear algebra, this paper proves that such higher-dimensional PMF completion can be guaranteed - several pertinent identifiability results are derived. It also provides a practical and efficient algorithm to carry out the recovery task. Judiciously designed simulations and real-data experiments on movie recommendation and data classification are presented to showcase the effectiveness of the approach.

Subjects:	Signal Processing (eess.SP); Information Theory (cs.IT); Probability (math.PR); Machine Learning (stat.ML)
DOI:	10.1109/TSP.2018.2862383
Cite as:	arXiv:1712.00205 [eess.SP]
	(or arXiv:1712.00205v2 [eess.SP] for this version)

Submission history

From: Nikos Kargas [view email]
[v1] Fri, 1 Dec 2017 06:19:33 GMT (472kb,D)
[v2] Fri, 27 Jul 2018 16:08:09 GMT (482kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:1712.00205

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Signal Processing

Title: Tensors, Learning, and 'Kolmogorov Extension' for Finite-alphabet Random Vectors

Submission history