
Title: WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics

Abstract: Modeling user interfaces (UIs) from visual information allows systems to make inferences about the functionality and semantics needed to support use cases in accessibility, app automation, and testing. Current datasets for training machine learning models are limited in size due to the costly and time-consuming process of manually collecting and annotating UIs. We crawled the web to construct WebUI, a large dataset of 400,000 rendered web pages associated with automatically extracted metadata. We analyze the composition of WebUI and show that while automatically extracted data is noisy, most examples meet basic criteria for visual UI modeling. We applied several strategies for incorporating semantics found in web pages to improve the performance of visual UI understanding models in the mobile domain, where less labeled data is available, on three tasks: (i) element detection, (ii) screen classification, and (iii) screen similarity.
Comments: Accepted to CHI 2023. Dataset, code, and models release coming soon
Subjects: Human-Computer Interaction (cs.HC)
Cite as: arXiv:2301.13280 [cs.HC]
  (or arXiv:2301.13280v1 [cs.HC] for this version)

Submission history

From: Jason Wu
[v1] Mon, 30 Jan 2023 20:47:12 GMT (9048kb,D)
