Object Files and Schemata: Factorizing Declarative and Procedural Knowledge in Dynamical Systems

Goyal, Anirudh; Lamb, Alex; Gampa, Phanideep; Beaudoin, Philippe; Levine, Sergey; Blundell, Charles; Bengio, Yoshua; Mozer, Michael

Computer Science > Machine Learning

(Submitted on 29 Jun 2020 (v1), last revised 13 Nov 2020 (this version, v5))

Abstract: Modeling a structured, dynamic environment like a video game requires keeping track of the objects and their states (declarative knowledge) as well as predicting how objects behave (procedural knowledge). Black-box models with a monolithic hidden state often fail to apply procedural knowledge consistently and uniformly, i.e., they lack systematicity. For example, in a video game, correct prediction of one enemy's trajectory does not ensure correct prediction of another's. We address this issue via an architecture that factorizes declarative and procedural knowledge and that imposes modularity within each form of knowledge. The architecture consists of active modules called object files that maintain the state of a single object and invoke passive external knowledge sources called schemata that prescribe state updates. To use a video game as an illustration, two enemies of the same type will share schemata but will have separate object files to encode their distinct state (e.g., health, position). We propose to use attention to determine which object files to update, the selection of schemata, and the propagation of information between object files. The resulting architecture is a drop-in replacement conforming to the same input-output interface as normal recurrent networks (e.g., LSTM, GRU) yet achieves substantially better generalization on environments that have multiple object tokens of the same type, including a challenging intuitive physics benchmark.
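The factorization described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the names (`init_params`, `step`, `W_q`, `W_h`, `W_x`, `key`), the tanh update rule, the learned per-schema key vectors, and the hard argmax selection are all assumptions made for illustration. The sketch covers only schema selection by attention; choosing which object files to update and passing information between object files are omitted.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def random_matrix(rng, rows, cols, scale=0.1):
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


def init_params(num_schemata, state_dim, input_dim, key_dim, seed=0):
    """Hypothetical parameters: one shared query map plus a pool of schemata."""
    rng = random.Random(seed)
    return {
        "W_q": random_matrix(rng, key_dim, state_dim),
        "schemata": [
            {
                # Assumption: each schema is addressed by a fixed key vector.
                "key": [rng.gauss(0.0, 0.1) for _ in range(key_dim)],
                "W_h": random_matrix(rng, state_dim, state_dim),
                "W_x": random_matrix(rng, state_dim, input_dim),
            }
            for _ in range(num_schemata)
        ],
    }


def step(object_files, inputs, params):
    """Update every object file with the schema its attention scores favor."""
    new_states, choices = [], []
    for state, x in zip(object_files, inputs):
        # Declarative side: the object file's own state produces the query.
        query = matvec(params["W_q"], state)
        scores = [
            sum(q * k for q, k in zip(query, schema["key"])) / math.sqrt(len(query))
            for schema in params["schemata"]
        ]
        attn = softmax(scores)
        chosen = max(range(len(attn)), key=attn.__getitem__)  # hard selection
        # Procedural side: the chosen schema's shared weights do the update.
        schema = params["schemata"][chosen]
        pre = [
            a + b
            for a, b in zip(matvec(schema["W_h"], state), matvec(schema["W_x"], x))
        ]
        new_states.append([math.tanh(p) for p in pre])
        choices.append(chosen)
    return new_states, choices


# Two object files with identical state and input, plus one that differs.
params = init_params(num_schemata=3, state_dim=4, input_dim=2, key_dim=5)
states = [[0.5] * 4, [0.5] * 4, [-0.2, 0.1, 0.9, 0.0]]
inputs = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
new_states, choices = step(states, inputs, params)
```

Because the schema weights are shared across object files, the first two object files, which start in the same state and see the same input, select the same schema and receive the same update. That is the kind of uniform application of procedural knowledge the abstract calls systematicity, while each object file still keeps its own state.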

Comments:	Type/Token Distinction in Deep learning Framework
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2006.16225 [cs.LG]
	(or arXiv:2006.16225v5 [cs.LG] for this version)

Submission history

From: Anirudh Goyal [view email]
[v1] Mon, 29 Jun 2020 17:45:03 GMT (7034kb,D)
[v2] Tue, 30 Jun 2020 21:06:12 GMT (6466kb,D)
[v3] Mon, 5 Oct 2020 21:08:40 GMT (7667kb,D)
[v4] Thu, 12 Nov 2020 08:01:21 GMT (7668kb,D)
[v5] Fri, 13 Nov 2020 01:47:12 GMT (7668kb,D)
