Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning

Spielberg, Yitzhak; Azaria, Amos

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2201

Computer Science > Machine Learning

Title: Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning

Authors: Yitzhak Spielberg, Amos Azaria

(Submitted on 13 Jan 2022)

Abstract: In the context of reinforcement learning we introduce the concept of criticality of a state, which indicates the extent to which the choice of action in that particular state influences the expected return. That is, a state in which the choice of action is more likely to influence the final outcome is considered as more critical than a state in which it is less likely to influence the final outcome.
We formulate a criticality-based varying step number algorithm (CVS) - a flexible step number algorithm that utilizes the criticality function provided by a human, or learned directly from the environment. We test it in three different domains including the Atari Pong environment, Road-Tree environment, and Shooter environment. We demonstrate that CVS is able to outperform popular learning algorithms such as Deep Q-Learning and Monte Carlo.

Comments:	arXiv admin note: text overlap with arXiv:1810.07254
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Journal reference:	International Journal on Artificial Intelligence Tools, vol. 30, 2021
Cite as:	arXiv:2201.05034 [cs.LG]
	(or arXiv:2201.05034v1 [cs.LG] for this version)

Submission history

From: Yitzhak Spielberg [view email]
[v1] Thu, 13 Jan 2022 15:46:59 GMT (347kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.05034

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning

Submission history