References & Citations
Computer Science > Artificial Intelligence
Title: Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
(Submitted on 17 Jan 2022 (v1), last revised 21 Jun 2022 (this version, v2))
Abstract: We introduce a data-driven, model-agnostic technique for generating a human-interpretable summary of the salient points of contrast within an evolving dynamical system, such as the learning process of a control agent. It involves the aggregation of transition data along both spatial and temporal dimensions according to an information-theoretic divergence measure. A practical algorithm is outlined for continuous state spaces, and deployed to summarise the learning histories of deep reinforcement learning agents with the aid of graphical and textual communication methods. We expect our method to be complementary to existing techniques in the realm of agent interpretability.
Submission history
From: Tom Bewley [view email][v1] Mon, 17 Jan 2022 11:34:59 GMT (6256kb,D)
[v2] Tue, 21 Jun 2022 10:53:57 GMT (11722kb,D)
Link back to: arXiv, form interface, contact.