We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Actively Learning Hemimetrics with Applications to Eliciting User Preferences

Abstract: Motivated by an application of eliciting users' preferences, we investigate the problem of learning hemimetrics, i.e., pairwise distances among a set of $n$ items that satisfy triangle inequalities and non-negativity constraints. In our application, the (asymmetric) distances quantify private costs a user incurs when substituting one item by another. We aim to learn these distances (costs) by asking the users whether they are willing to switch from one item to another for a given incentive offer. Without exploiting structural constraints of the hemimetric polytope, learning the distances between each pair of items requires $\Theta(n^2)$ queries. We propose an active learning algorithm that substantially reduces this sample complexity by exploiting the structural constraints on the version space of hemimetrics. Our proposed algorithm achieves provably-optimal sample complexity for various instances of the task. For example, when the items are embedded into $K$ tight clusters, the sample complexity of our algorithm reduces to $O(n K)$. Extensive experiments on a restaurant recommendation data set support the conclusions of our theoretical analysis.
Comments: Extended version of ICML'16 paper
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1605.07144 [stat.ML]
  (or arXiv:1605.07144v2 [stat.ML] for this version)

Submission history

From: Sebastian Tschiatschek [view email]
[v1] Mon, 23 May 2016 19:21:35 GMT (1280kb,D)
[v2] Fri, 27 May 2016 17:45:26 GMT (2585kb,D)

Link back to: arXiv, form interface, contact.