We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IR

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Information Retrieval

Title: Unsupervised, Efficient and Semantic Expertise Retrieval

Abstract: We introduce an unsupervised discriminative model for the task of retrieving experts in online document collections. We exclusively employ textual evidence and avoid explicit feature engineering by learning distributed word representations in an unsupervised way. We compare our model to state-of-the-art unsupervised statistical vector space and probabilistic generative approaches. Our proposed log-linear model achieves the retrieval performance levels of state-of-the-art document-centric methods with the low inference cost of so-called profile-centric approaches. It yields a statistically significant improved ranking over vector space and generative models in most cases, matching the performance of supervised methods on various benchmarks. That is, by using solely text we can do as well as methods that work with external evidence and/or relevance feedback. A contrastive analysis of rankings produced by discriminative and generative approaches shows that they have complementary strengths due to the ability of the unsupervised discriminative model to perform semantic matching.
Comments: WWW2016, Proceedings of the 25th International Conference on World Wide Web. 2016
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
DOI: 10.1145/2872427.2882974
Cite as: arXiv:1608.06651 [cs.IR]
  (or arXiv:1608.06651v2 [cs.IR] for this version)

Submission history

From: Christophe Van Gysel [view email]
[v1] Tue, 23 Aug 2016 20:55:09 GMT (2135kb,D)
[v2] Sun, 17 Sep 2017 04:57:54 GMT (2139kb,D)

Link back to: arXiv, form interface, contact.