Title: A Hybrid Approach to Dependency Parsing: Combining Rules and Morphology with Deep Learning

Authors: Şaziye Betül Özateş (1), Arzucan Özgür (1), Tunga Güngör (1), Balkız Öztürk (2) ((1) Department of Computer Engineering, Boğaziçi University, (2) Department of Linguistics, Boğaziçi University)
Abstract: Fully data-driven, deep learning-based models are usually designed to be language-independent and have been shown to be successful for many natural language processing tasks. However, when the studied language is low-resourced and the amount of training data is insufficient, these models can benefit from the integration of natural language grammar-based information. We propose two approaches to dependency parsing, aimed especially at languages with a restricted amount of training data. Our first approach combines a state-of-the-art deep learning-based parser with a rule-based approach, and the second one incorporates morphological information into the parser. In the rule-based approach, the parsing decisions made by the rules are encoded and concatenated with the vector representations of the input words as additional information to the deep network. The morphology-based approach proposes different methods for including the morphological structure of words in the parser network. Experiments are conducted on the IMST-UD Treebank, and the results suggest that integrating explicit knowledge about the target language into a neural parser through a rule-based parsing system and morphological analysis leads to more accurate annotations and hence increases the parsing performance in terms of attachment scores. The proposed methods are developed for Turkish, but can be adapted to other languages as well.
Comments: 25 pages, 7 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
ACM classes: I.2.7
DOI: 10.1109/ACCESS.2022.3202947
Cite as: arXiv:2002.10116 [cs.CL]
  (or arXiv:2002.10116v1 [cs.CL] for this version)

Submission history

From: Şaziye Betül Özateş
[v1] Mon, 24 Feb 2020 08:34:33 GMT (675kb,D)
