Title: TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages

Abstract: Recently, the structural reading comprehension (SRC) task on web pages has attracted increasing research interest. Although previous SRC work has leveraged extra information such as HTML tags or XPaths, the informative topology of web pages has not been effectively exploited. In this work, we propose a Topological Information Enhanced model (TIE), which transforms the token-level task into a tag-level task by introducing a two-stage process (i.e., node locating and answer refining). Building on this, TIE integrates a Graph Attention Network (GAT) and a Pre-trained Language Model (PLM) to leverage the topological information of both logical structures and spatial structures. Experimental results demonstrate that our model outperforms strong baselines and achieves state-of-the-art performance on the web-based SRC benchmark WebSRC at the time of writing. The code of TIE will be publicly available at this https URL
Comments: Accepted to NAACL 2022
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2205.06435 [cs.CL]
  (or arXiv:2205.06435v1 [cs.CL] for this version)

Submission history

From: Zihan Zhao
[v1] Fri, 13 May 2022 03:21:09 GMT (12682kb,D)
