Increasing the Interpretability of Recurrent Neural Networks Using Hidden Markov Models

Krakovna, Viktoriya; Doshi-Velez, Finale

(Submitted on 16 Jun 2016 (v1), last revised 30 Sep 2016 (this version, v2))

Abstract: As deep neural networks continue to revolutionize various application domains, there is increasing interest in making these powerful models more understandable and interpretable, and in narrowing down the causes of good and bad predictions. We focus on recurrent neural networks (RNNs), which are state-of-the-art models in speech recognition and translation. We increase interpretability by combining an RNN with a hidden Markov model (HMM), a simpler and more transparent model. We explore several combinations of RNNs and HMMs: an HMM trained on LSTM states; a hybrid model in which an HMM is trained first and a small LSTM is then given the HMM state distributions and trained to fill in gaps in the HMM's performance; and a jointly trained hybrid model. We find that the LSTM and HMM learn complementary information about the features in the text.
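
The "HMM state distributions" mentioned in the abstract can be illustrated with a minimal sketch of standard HMM forward filtering, which yields one distribution over hidden states per time step. This is not the authors' code: the abstract does not say how the distributions are computed or how they are passed to the LSTM, so the use of forward filtering, the function name `hmm_filter`, and the toy parameters below are all assumptions made for illustration.

```python
def hmm_filter(init, trans, emit, observations):
    """Return the filtered state distribution p(z_t | x_1..x_t) for each t.

    init[i]     : probability of starting in state i
    trans[i][j] : probability of moving from state i to state j
    emit[i][x]  : probability that state i emits symbol x
    Assumes every observation has nonzero probability under the model.
    """
    n = len(init)
    beliefs = []
    prev = None
    for x in observations:
        if prev is None:
            # First step: the prior over states is the initial distribution.
            prior = list(init)
        else:
            # Later steps: push the previous belief through the transitions.
            prior = [sum(prev[i] * trans[i][j] for i in range(n))
                     for j in range(n)]
        # Weight by the emission likelihood of the observed symbol, then normalize.
        unnorm = [prior[j] * emit[j][x] for j in range(n)]
        total = sum(unnorm)
        prev = [u / total for u in unnorm]
        beliefs.append(prev)
    return beliefs


# Hypothetical two-state HMM over a two-symbol alphabet (numbers are made up).
init = [0.5, 0.5]
trans = [[0.9, 0.1], [0.2, 0.8]]
emit = [[0.8, 0.2], [0.3, 0.7]]
beliefs = hmm_filter(init, trans, emit, [0, 0, 1])
```

Each entry of `beliefs` is a probability vector over the HMM states. In a hybrid of the kind the abstract describes, a vector like this would be one plausible per-step input to the small LSTM alongside the usual input features.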

Comments:	presented at 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, NY
Subjects:	Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1606.05320 [stat.ML]
	(or arXiv:1606.05320v2 [stat.ML] for this version)

Submission history

From: Viktoriya Krakovna
[v1] Thu, 16 Jun 2016 19:13:52 GMT (744kb,D)
[v2] Fri, 30 Sep 2016 22:20:39 GMT (744kb,D)
