References & Citations
Computer Science > Computation and Language
Title: What Can We Learn From Almost a Decade of Food Tweets
(Submitted on 10 Jul 2020 (v1), last revised 1 Sep 2020 (this version, v2))
Abstract: We present the Latvian Twitter Eater Corpus - a set of tweets in the narrow domain related to food, drinks, eating and drinking. The corpus has been collected over time-span of over 8 years and includes over 2 million tweets entailed with additional useful data. We also separate two sub-corpora of question and answer tweets and sentiment annotated tweets. We analyse contents of the corpus and demonstrate use-cases for the sub-corpora by training domain-specific question-answering and sentiment-analysis models using data from the corpus.
Submission history
From: Matiss Rikters [view email][v1] Fri, 10 Jul 2020 06:36:13 GMT (257kb,D)
[v2] Tue, 1 Sep 2020 07:38:09 GMT (352kb,D)
Link back to: arXiv, form interface, contact.