Title: Identifying Relevant Document Facets for Keyword-Based Search Queries

Authors: Lanbo Zhang
Abstract: As structured documents with rich metadata (such as products and movies) become increasingly prevalent, searching them has become an important IR problem. Although advanced search interfaces are widely available, most users still prefer keyword-based queries. Query keywords often imply hidden restrictions on the desired documents, which can be represented as document facet-value pairs. To achieve high retrieval performance, it is important to identify the relevant facet-value pairs hidden in a query. In this paper, we study the problem of identifying document facet-value pairs that are relevant to a keyword-based search query. We propose a machine learning approach with a set of useful features, and evaluate it on a movie data set from INEX.
Subjects: Information Retrieval (cs.IR)
Cite as: arXiv:1501.00744 [cs.IR]
  (or arXiv:1501.00744v1 [cs.IR] for this version)

Submission history

From: Lanbo Zhang
[v1] Mon, 5 Jan 2015 01:49:11 GMT (42kb)
