We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IT

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Learning a Tree-Structured Ising Model in Order to Make Predictions

Abstract: We study the problem of learning a tree Ising model from samples such that subsequent predictions made using the model are accurate. The prediction task considered in this paper is that of predicting the values of a subset of variables given values of some other subset of variables. Virtually all previous work on graphical model learning has focused on recovering the true underlying graph. We define a distance ("small set TV" or ssTV) between distributions $P$ and $Q$ by taking the maximum, over all subsets $\mathcal{S}$ of a given size, of the total variation between the marginals of $P$ and $Q$ on $\mathcal{S}$; this distance captures the accuracy of the prediction task of interest. We derive non-asymptotic bounds on the number of samples needed to get a distribution (from the same class) with small ssTV relative to the one generating the samples. One of the main messages of this paper is that far fewer samples are needed than for recovering the underlying tree, which means that accurate predictions are possible using the wrong tree.
Comments: 43 pages, 7 figure
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Probability (math.PR); Machine Learning (stat.ML)
Cite as: arXiv:1604.06749 [math.ST]
  (or arXiv:1604.06749v3 [math.ST] for this version)

Submission history

From: Mina Karzand [view email]
[v1] Fri, 22 Apr 2016 16:57:30 GMT (364kb,D)
[v2] Mon, 8 Aug 2016 20:11:52 GMT (897kb,D)
[v3] Thu, 14 Jun 2018 12:01:53 GMT (1045kb,D)

Link back to: arXiv, form interface, contact.