Temporal-Differential Learning in Continuous Environments

Bian, Tao; Jiang, Zhong-Ping

(Submitted on 1 Jun 2020)

Abstract: In this paper, a new reinforcement learning (RL) method known as the method of temporal differential is introduced. Compared to the traditional temporal-difference learning method, it plays a crucial role in developing novel RL techniques for continuous environments. In particular, the continuous-time least squares policy evaluation (CT-LSPE) and the continuous-time temporal-differential (CT-TD) learning methods are developed. Both theoretical and empirical evidence is provided to demonstrate the effectiveness of the proposed temporal-differential learning methodology.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
Cite as:	arXiv:2006.00997 [cs.LG]
	(or arXiv:2006.00997v1 [cs.LG] for this version)

Submission history

From: Tao Bian [view email]
[v1] Mon, 1 Jun 2020 15:01:03 GMT (3416kb,D)
