Computer Science > Information Retrieval
Title: Unsupervised, Efficient and Semantic Expertise Retrieval
(Submitted on 23 Aug 2016 (v1), last revised 17 Sep 2017 (this version, v2))
Abstract: We introduce an unsupervised discriminative model for the task of retrieving experts in online document collections. We exclusively employ textual evidence and avoid explicit feature engineering by learning distributed word representations in an unsupervised way. We compare our model to state-of-the-art unsupervised statistical vector space and probabilistic generative approaches. Our proposed log-linear model achieves the retrieval performance levels of state-of-the-art document-centric methods with the low inference cost of so-called profile-centric approaches. It yields a statistically significant improved ranking over vector space and generative models in most cases, matching the performance of supervised methods on various benchmarks. That is, by using solely text we can do as well as methods that work with external evidence and/or relevance feedback. A contrastive analysis of rankings produced by discriminative and generative approaches shows that they have complementary strengths due to the ability of the unsupervised discriminative model to perform semantic matching.
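The abstract does not spell out the model's architecture, so the following is only a minimal sketch of what a generic log-linear expert scorer over word embeddings could look like. It illustrates why inference can be cheap in a profile-centric setup: at query time each term costs one projection onto the candidate experts, and no documents are touched. All names (`rank_experts`, `word_vectors`, `expert_weights`, `expert_bias`) and the toy numbers are assumptions made for illustration and are not taken from the paper.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def rank_experts(query_terms, word_vectors, expert_weights, expert_bias):
    """Rank expert indices by summed log P(expert | term) over the query.

    Hypothetical log-linear scorer: each term's embedding is projected
    onto the experts and normalised with a softmax. Terms are treated
    as independent, so their log-probabilities are added.
    """
    n_experts = len(expert_weights)
    log_scores = [0.0] * n_experts
    for term in query_terms:
        vec = word_vectors.get(term)
        if vec is None:
            continue  # skip out-of-vocabulary terms
        logits = [
            sum(w * x for w, x in zip(expert_weights[j], vec)) + expert_bias[j]
            for j in range(n_experts)
        ]
        probs = softmax(logits)
        for j in range(n_experts):
            log_scores[j] += math.log(probs[j])
    # Highest summed log-probability first.
    return sorted(range(n_experts), key=lambda j: log_scores[j], reverse=True)

# Toy, hand-picked values (assumed): two embedding dimensions, two experts.
word_vectors = {
    "retrieval": [1.0, 0.0],
    "search": [0.9, 0.1],
    "protein": [0.0, 1.0],
}
expert_weights = [[2.0, 0.0], [0.0, 2.0]]  # expert 0 leans IR, expert 1 leans biology
expert_bias = [0.0, 0.0]

ranking = rank_experts(["search"], word_vectors, expert_weights, expert_bias)
```

In this toy setup "search" sits close to "retrieval" in the embedding space, so a query can favour the first expert even if that expert's documents never contain the word "search". That is the kind of semantic matching the abstract refers to, shown here with made-up vectors rather than learned ones.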
Submission history
From: Christophe Van Gysel
[v1] Tue, 23 Aug 2016 20:55:09 GMT (2135kb,D)
[v2] Sun, 17 Sep 2017 04:57:54 GMT (2139kb,D)