Computer Science > Computation and Language
Title: BERT Goes Shopping: Comparing Distributional Models for Product Representations
(Submitted on 17 Dec 2020 (v1), last revised 23 Jun 2021 (this version, v2))
Abstract: Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through prod2vec. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to transfer BERT-like architectures to eCommerce: our model, Prod2BERT, is trained to generate representations of products through masked session modeling. Through extensive experiments over multiple shops, different tasks, and a range of design choices, we systematically compare the accuracy of Prod2BERT and prod2vec embeddings: while Prod2BERT is found to be superior in several scenarios, we highlight the importance of resources and hyperparameters in the best performing models. Finally, we provide guidelines to practitioners for training embeddings under a variety of computational and data constraints.
Submission history
From: Federico Bianchi
[v1] Thu, 17 Dec 2020 18:18:03 GMT (3397kb,D)
[v2] Wed, 23 Jun 2021 13:05:44 GMT (9796kb,D)