Knowledge extraction from the learning of sequences in a long short term memory (LSTM) architecture

Kaadoud, Ikram Chraibi; Rougier, Nicolas P.; Alexandre, Frédéric

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1912

Computer Science > Machine Learning

Title: Knowledge extraction from the learning of sequences in a long short term memory (LSTM) architecture

Authors: Ikram Chraibi Kaadoud, Nicolas P. Rougier, Frédéric Alexandre

(Submitted on 6 Dec 2019)

Abstract: We introduce a general method to extract knowledge from a recurrent neural network (Long Short Term Memory) that has learnt to detect if a given input sequence is valid or not, according to an unknown generative automaton. Based on the clustering of the hidden states, we explain how to build and validate an automaton that corresponds to the underlying (unknown) automaton, and allows to predict if a given sequence is valid or not. The method is illustrated on artificial grammars (Reber's grammar variations) as well as on a real use-case whose underlying grammar is unknown.

Comments:	18 pages, 17 figures
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:1912.03126 [cs.LG]
	(or arXiv:1912.03126v1 [cs.LG] for this version)

Submission history

From: Nicolas Rougier [view email]
[v1] Fri, 6 Dec 2019 14:00:21 GMT (2853kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1912.03126

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Knowledge extraction from the learning of sequences in a long short term memory (LSTM) architecture

Submission history