Computer Science > Computation and Language
Title: Rethinking Self-Attention: Towards Interpretability in Neural Parsing
(Submitted on 10 Nov 2019 (v1), last revised 29 Oct 2020 (this version, v3))
Abstract: Attention mechanisms have improved the performance of NLP tasks while allowing models to remain explainable. Self-attention is currently widely used; however, interpretability is difficult due to the numerous attention distributions. Recent work has shown that model representations can benefit from label-specific information, while facilitating interpretation of predictions. We introduce the Label Attention Layer: a new form of self-attention where attention heads represent labels. We test our novel layer by running constituency and dependency parsing experiments and show our new model obtains new state-of-the-art results for both tasks on both the Penn Treebank (PTB) and Chinese Treebank. Additionally, our model requires fewer self-attention layers compared to existing work. Finally, we find that the Label Attention heads learn relations between syntactic categories and show pathways to analyze errors.
Submission history
From: Khalil Mrini
[v1] Sun, 10 Nov 2019 08:17:11 GMT (484kb,D)
[v2] Sat, 2 May 2020 04:34:52 GMT (922kb,D)
[v3] Thu, 29 Oct 2020 06:17:11 GMT (7994kb,D)