We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Compositional Sentence Representation from Character within Large Context Text

Abstract: This paper describes a Hierarchical Composition Recurrent Network (HCRN) consisting of a 3-level hierarchy of compositional models: character, word and sentence. This model is designed to overcome two problems of representing a sentence on the basis of a constituent word sequence. The first is a data-sparsity problem in word embedding, and the other is a no usage of inter-sentence dependency. In the HCRN, word representations are built from characters, thus resolving the data-sparsity problem, and inter-sentence dependency is embedded into sentence representation at the level of sentence composition. We adopt a hierarchy-wise learning scheme in order to alleviate the optimization difficulties of learning deep hierarchical recurrent network in end-to-end fashion. The HCRN was quantitatively and qualitatively evaluated on a dialogue act classification task. Especially, sentence representations with an inter-sentence dependency are able to capture both implicit and explicit semantics of sentence, significantly improving performance. In the end, the HCRN achieved state-of-the-art performance with a test error rate of 22.7% for dialogue act classification on the SWBD-DAMSL database.
Comments: 13pages
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1605.00482 [cs.CL]
  (or arXiv:1605.00482v3 [cs.CL] for this version)

Submission history

From: Geonmin Kim [view email]
[v1] Mon, 2 May 2016 13:41:38 GMT (5970kb)
[v2] Mon, 30 May 2016 05:57:09 GMT (3148kb)
[v3] Fri, 3 Jun 2016 13:35:17 GMT (3161kb)

Link back to: arXiv, form interface, contact.