We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IR

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Information Retrieval

Title: Developing Products Update-Alert System for e-Commerce Websites Users Using HTML Data and Web Scraping Technique

Abstract: Websites are regarded as domains of limitless information which anyone and everyone can access. The new trend of technology put us to change the way we are doing our business. The Internet now is fastly becoming a new place for business and the advancement in this technology gave rise to the number of e-commerce websites. This made the lifestyle of marketers/vendors, retailers and consumers (collectively regarded as users in this paper) easy, because it provides easy platforms to sale/order items through the internet. This also requires that the users will have to spend a lot of time and effort to search for the best product deals, products updates and offers on e-commerce websites. They have to filter and compare search results by themselves which takes a lot of time and there are chances of ambiguous results. In this paper, we applied web crawling and scraping methods on an e-commerce website to get HTML data for identifying products updates based on the current time. The HTML data is preprocessed to extract details of the products such as name, price, post date and time, etc. to serve as useful information for users.
Comments: 6 pages, 3 figures, 1 table, IJNLC Journal
Subjects: Information Retrieval (cs.IR)
Journal reference: International Journal on Natural Language Computing 2021
Cite as: arXiv:2109.00656 [cs.IR]
  (or arXiv:2109.00656v1 [cs.IR] for this version)

Submission history

From: Ikechukwu Onyenwe [view email]
[v1] Thu, 2 Sep 2021 00:35:02 GMT (353kb)

Link back to: arXiv, form interface, contact.