Improving Joint Layer RNN based Keyphrase Extraction by Using Syntactical Features

Mahfuzh, Miftahul; Soleman, Sidik; Purwarianti, Ayu

doi:10.1109/ICAICTA.2019.8904194

(Submitted on 15 Sep 2020)

Abstract: Keyphrase extraction, the task of identifying important words or phrases in a text, is a crucial step in identifying the main topics when analyzing text from a social media platform. In our study, we focus on text written in Indonesian and taken from Twitter. Unlike the original joint layer recurrent neural network (JRNN), which outputs one sequence of keywords and uses only word embeddings, we propose modifying the input layer of the JRNN so that it extracts more than one sequence of keywords using additional syntactical features, namely part of speech, named entity types, and dependency structures. Since a JRNN generally requires a large amount of training data and creating those examples is expensive, we use a data augmentation method to increase the number of training examples. Our experiments show that our method outperforms the baseline methods, achieving 0.9597 in accuracy and 0.7691 in F1.
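The abstract describes enriching the JRNN input with part of speech, named entity type, and dependency information alongside the word embedding. One simple way to picture this is per-token concatenation of the embedding with one-hot feature vectors. The sketch below is only an illustration under that assumption: the abstract does not say how the features are encoded or combined, and the tag inventories, the function names `one_hot` and `build_input_vector`, and the toy embedding values are all hypothetical.

```python
# Hypothetical tag inventories; the abstract does not list the actual tag sets.
# The last entry of each list doubles as a catch-all for unknown labels.
POS_TAGS = ["NOUN", "VERB", "ADJ", "OTHER"]
NE_TYPES = ["PERSON", "LOCATION", "ORGANIZATION", "O"]
DEP_RELS = ["root", "nsubj", "obj", "other"]


def one_hot(label, inventory):
    """Return a one-hot list for label; unknown labels map to the last slot."""
    index = inventory.index(label) if label in inventory else len(inventory) - 1
    return [1.0 if i == index else 0.0 for i in range(len(inventory))]


def build_input_vector(word_embedding, pos, ne, dep):
    """Concatenate a word embedding with one-hot syntactic features (assumed scheme)."""
    return (
        list(word_embedding)
        + one_hot(pos, POS_TAGS)
        + one_hot(ne, NE_TYPES)
        + one_hot(dep, DEP_RELS)
    )


# Toy 3-dimensional embedding for a single token (made-up values).
vector = build_input_vector([0.1, -0.2, 0.3], pos="NOUN", ne="LOCATION", dep="nsubj")
print(len(vector))  # 3 embedding dims + 4 + 4 + 4 feature dims = 15
```

With this scheme the network sees one longer vector per token, so only the size of the input layer changes. Learned feature embeddings would be an equally plausible alternative to one-hot vectors.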

Comments:	6 pages
Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.7
Journal reference:	2019 International Conference of Advanced Informatics: Concepts, Theory and Applications (ICAICTA)
DOI:	10.1109/ICAICTA.2019.8904194
Cite as:	arXiv:2009.07119 [cs.CL]
	(or arXiv:2009.07119v1 [cs.CL] for this version)

Submission history

From: Sidik Soleman [view email]
[v1] Tue, 15 Sep 2020 14:20:04 GMT (80kb)
