Computer Science > Computation and Language
Title: Learning Context-Sensitive Convolutional Filters for Text Processing
(Submitted on 25 Sep 2017 (v1), last revised 30 Aug 2018 (this version, v3))
Abstract: Convolutional neural networks (CNNs) have recently emerged as a popular building block for natural language processing (NLP). Despite their success, most existing CNN models employed in NLP share the same learned (and static) set of filters for all input sentences. In this paper, we consider an approach that uses a small meta network to learn context-sensitive convolutional filters for text processing. The role of the meta network is to abstract the contextual information of a sentence or document into a set of input-aware filters. We further generalize this framework to model sentence pairs, where a bidirectional filter generation mechanism is introduced to encapsulate co-dependent sentence representations. In our benchmarks on four different tasks, including ontology classification, sentiment analysis, answer sentence selection, and paraphrase identification, our proposed model, a modified CNN with context-sensitive filters, consistently outperforms the standard CNN and attention-based CNN baselines. By visualizing the learned context-sensitive filters, we further validate and rationalize the effectiveness of the proposed framework.
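The idea of input-aware filters can be illustrated with a minimal sketch in plain Python. It is not the authors' architecture: the meta network here is a hypothetical stand-in (mean pooling followed by a single linear map), and every name (`mean_pool`, `generate_filters`, `conv_max_pool`, `encode`, `gen_weights`) and every size is an assumption chosen for illustration. The only point it shows is that the convolutional filters are computed from the input sentence itself rather than being shared across all sentences.

```python
import math
import random


def mean_pool(sentence):
    """Average the word vectors of a sentence into one context vector."""
    dim = len(sentence[0])
    return [sum(word[i] for word in sentence) / len(sentence) for i in range(dim)]


def generate_filters(context, gen_weights, num_filters, width, dim):
    """Hypothetical meta network: one linear map from context to filter weights."""
    flat = [sum(w * c for w, c in zip(row, context)) for row in gen_weights]
    size = width * dim
    # Cut the flat output into num_filters filters of width * dim weights each.
    return [flat[k * size:(k + 1) * size] for k in range(num_filters)]


def conv_max_pool(sentence, filters, width):
    """Slide each filter over the sentence, apply tanh, then max-pool over time."""
    features = []
    for filt in filters:
        responses = []
        for start in range(len(sentence) - width + 1):
            # Flatten the window of `width` consecutive word vectors.
            window = [x for word in sentence[start:start + width] for x in word]
            responses.append(math.tanh(sum(f * x for f, x in zip(filt, window))))
        features.append(max(responses))
    return features


def encode(sentence, gen_weights, num_filters, width):
    """Generate filters from the sentence, then convolve the sentence with them."""
    dim = len(sentence[0])
    context = mean_pool(sentence)
    filters = generate_filters(context, gen_weights, num_filters, width, dim)
    return conv_max_pool(sentence, filters, width), filters


# Toy setup: all sizes and random values are assumptions for illustration.
rng = random.Random(0)
DIM, NUM_FILTERS, WIDTH = 3, 4, 2
gen_weights = [[rng.uniform(-1, 1) for _ in range(DIM)]
               for _ in range(NUM_FILTERS * WIDTH * DIM)]
sent_a = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(5)]
sent_b = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(7)]

features_a, filters_a = encode(sent_a, gen_weights, NUM_FILTERS, WIDTH)
features_b, filters_b = encode(sent_b, gen_weights, NUM_FILTERS, WIDTH)
```

In this sketch only `gen_weights` would be learned; `filters_a` and `filters_b` differ because they are derived from different sentences, whereas a standard CNN would apply one fixed filter bank to both.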
Submission history
From: Dinghan Shen
[v1] Mon, 25 Sep 2017 02:29:26 GMT (523kb,D)
[v2] Wed, 29 Aug 2018 04:15:40 GMT (3597kb,D)
[v3] Thu, 30 Aug 2018 16:29:50 GMT (2317kb,D)