We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Yelp Dataset Analysis using Scalable Big Data

Abstract: Yelp has served and will continue to serve as a data-driven application. Yelp has published a dataset containing business information, reviews, user information, and check-in information. This paper will examine this dataset to provide descriptive analytics to understand business performance, geo-spatial distribution of businesses, reviewers' rating and other characteristics, and temporal distribution of check-ins in business premises. With these analysis we are able to establish that yelp reviews, tips, elite users and check ins have started to plummet over the years. Coincidentally, the paper also establishes that Canadians have a more stable star ratings as well as sentiment ratings when compared to Americans.
Comments: 4 pages, 11 figures, 4 tables
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
Cite as: arXiv:2104.08396 [cs.DC]
  (or arXiv:2104.08396v1 [cs.DC] for this version)

Submission history

From: Jongwook Woo [view email]
[v1] Fri, 16 Apr 2021 22:53:19 GMT (790kb)

Link back to: arXiv, form interface, contact.