Title: The Partial Entropy Decomposition: Decomposing multivariate entropy and mutual information via pointwise common surprisal

Abstract: Obtaining meaningful quantitative descriptions of the statistical dependence within multivariate systems is a difficult open problem. Recently, the Partial Information Decomposition (PID) was proposed to decompose mutual information (MI) about a target variable into components which are redundant, unique and synergistic within different subsets of predictor variables. Here, we propose to apply the elegant formalism of the PID to multivariate entropy, resulting in a Partial Entropy Decomposition (PED). We implement the PED with an entropy redundancy measure based on pointwise common surprisal, a natural definition that is closely related to the definition of MI. We show how this approach can reveal the dyadic versus triadic generative structure of multivariate systems that are indistinguishable with classical Shannon measures. The entropy perspective also shows that misinformation is synergistic entropy and hence that MI itself includes both redundant and synergistic effects. We show the relationships between the PED and MI in two predictors, and derive two alternative information decompositions which we illustrate on several example systems. This reveals that in entropy terms, univariate predictor MI is not a proper subset of the joint MI, and we suggest this previously unrecognised fact explains in part why obtaining a consistent PID has proven difficult. The PED also allows separate quantification of mechanistic redundancy (related to the function of the system) versus source redundancy (arising from dependencies between inputs), an important distinction that no existing method can address. The new perspective provided by the PED helps to clarify some of the difficulties encountered with the PID approach, and the resulting decompositions provide useful tools for practical data analysis across a wide range of application areas.
Comments: Added Section 3.7 (Quantifying source vs mechanistic redundancy) and Section 3.8 (Shared entropy as a measure of dependence: pure mutual information) and updated abstract, results, and discussion accordingly
Subjects: Information Theory (cs.IT); Statistics Theory (math.ST); Neurons and Cognition (q-bio.NC); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
Cite as: arXiv:1702.01591 [cs.IT]
  (or arXiv:1702.01591v2 [cs.IT] for this version)
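
The abstract's remark that MI contains "misinformation" can be illustrated with standard pointwise (local) mutual information, i(x;y) = h(x) + h(y) - h(x,y), where h is surprisal. The sketch below is not the paper's PED redundancy measure, whose definition is not given on this page; it only shows, for an assumed toy joint distribution, that some pointwise terms are negative while their expectation (the MI) remains non-negative. The function names and the example distribution are hypothetical.

```python
import math


def pointwise_terms(joint):
    """Return {(x, y): i(x;y)} in bits for a joint distribution.

    `joint` maps (x, y) pairs to probabilities. Each term is
    h(x) + h(y) - h(x, y), with h(.) = -log2 p(.) the surprisal.
    """
    px, py = {}, {}
    for (x, y), p in joint.items():
        px[x] = px.get(x, 0.0) + p
        py[y] = py.get(y, 0.0) + p

    terms = {}
    for (x, y), p in joint.items():
        if p > 0:  # zero-probability events contribute nothing
            h_x = -math.log2(px[x])
            h_y = -math.log2(py[y])
            h_xy = -math.log2(p)
            terms[(x, y)] = h_x + h_y - h_xy
    return terms


def mutual_information(joint):
    """Expectation of the pointwise terms under the joint distribution."""
    terms = pointwise_terms(joint)
    return sum(joint[event] * value for event, value in terms.items())


# Assumed toy system: two correlated binary variables.
joint = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}

for event, value in sorted(pointwise_terms(joint).items()):
    sign = "misinformative" if value < 0 else "informative"
    print(event, round(value, 4), sign)
print("MI (bits):", round(mutual_information(joint), 4))
```

In this toy system the mismatched events (0, 1) and (1, 0) have negative pointwise terms, so the MI is a net quantity: the positive contributions of the matched events minus the misinformative ones.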

Submission history

From: Robin Ince
[v1] Mon, 6 Feb 2017 12:28:27 GMT (571kb,D)
[v2] Mon, 20 Feb 2017 16:11:20 GMT (354kb,D)