We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Improving Joint Layer RNN based Keyphrase Extraction by Using Syntactical Features

Abstract: Keyphrase extraction as a task to identify important words or phrases from a text, is a crucial process to identify main topics when analyzing texts from a social media platform. In our study, we focus on text written in Indonesia language taken from Twitter. Different from the original joint layer recurrent neural network (JRNN) with output of one sequence of keywords and using only word embedding, here we propose to modify the input layer of JRNN to extract more than one sequence of keywords by additional information of syntactical features, namely part of speech, named entity types, and dependency structures. Since JRNN in general requires a large amount of data as the training examples and creating those examples is expensive, we used a data augmentation method to increase the number of training examples. Our experiment had shown that our method outperformed the baseline methods. Our method achieved .9597 in accuracy and .7691 in F1.
Comments: 6 pages
Subjects: Computation and Language (cs.CL)
ACM classes: I.2.7
Journal reference: 2019 International Conference of Advanced Informatics: Concepts, Theory and Applications (ICAICTA)
DOI: 10.1109/ICAICTA.2019.8904194
Cite as: arXiv:2009.07119 [cs.CL]
  (or arXiv:2009.07119v1 [cs.CL] for this version)

Submission history

From: Sidik Soleman [view email]
[v1] Tue, 15 Sep 2020 14:20:04 GMT (80kb)

Link back to: arXiv, form interface, contact.