Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Li, Yihan; Ren, Jinsheng; Xu, Tianrun; Zhang, Tianren; Gao, Haichuan; Chen, Feng

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2202

Computer Science > Computation and Language

Title: Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Authors: Yihan Li, Jinsheng Ren, Tianrun Xu, Tianren Zhang, Haichuan Gao, Feng Chen

(Submitted on 26 Jan 2022)

Abstract: Recently, incorporating natural language instructions into reinforcement learning (RL) to learn semantically meaningful representations and foster generalization has caught many concerns. However, the semantical information in language instructions is usually entangled with task-specific state information, which hampers the learning of semantically invariant and reusable representations. In this paper, we propose a method to learn such representations called element randomization, which extracts task-relevant but environment-agnostic semantics from instructions using a set of environments with randomized elements, e.g., topological structures or textures, yet the same language instruction. We theoretically prove the feasibility of learning semantically invariant representations through randomization. In practice, we accordingly develop a hierarchy of policies, where a high-level policy is designed to modulate the behavior of a goal-conditioned low-level policy by proposing subgoals as semantically invariant representations. Experiments on challenging long-horizon tasks show that (1) our low-level policy reliably generalizes to tasks against environment changes; (2) our hierarchical policy exhibits extensible generalization in unseen new tasks that can be decomposed into several solvable sub-tasks; and (3) by storing and replaying language trajectories as succinct policy representations, the agent can complete tasks in a one-shot fashion, i.e., once one successful trajectory has been attained.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2202.00466 [cs.CL]
	(or arXiv:2202.00466v1 [cs.CL] for this version)

Submission history

From: Yihan Li [view email]
[v1] Wed, 26 Jan 2022 08:04:27 GMT (6265kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2202.00466

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Submission history