Q-Learning for Robust Satisfaction of Signal Temporal Logic Specifications

Aksaray, Derya; Jones, Austin; Kong, Zhaodan; Schwager, Mac; Belta, Calin

Full-text links:

Download:

Current browse context:

cs.SY

< prev | next >

new | recent | 1609

Computer Science > Systems and Control

Title: Q-Learning for Robust Satisfaction of Signal Temporal Logic Specifications

Authors: Derya Aksaray, Austin Jones, Zhaodan Kong, Mac Schwager, Calin Belta

(Submitted on 23 Sep 2016)

Abstract: This paper addresses the problem of learning optimal policies for satisfying signal temporal logic (STL) specifications by agents with unknown stochastic dynamics. The system is modeled as a Markov decision process, in which the states represent partitions of a continuous space and the transition probabilities are unknown. We formulate two synthesis problems where the desired STL specification is enforced by maximizing the probability of satisfaction, and the expected robustness degree, that is, a measure quantifying the quality of satisfaction. We discuss that Q-learning is not directly applicable to these problems because, based on the quantitative semantics of STL, the probability of satisfaction and expected robustness degree are not in the standard objective form of Q-learning. To resolve this issue, we propose an approximation of STL synthesis problems that can be solved via Q-learning, and we derive some performance bounds for the policies obtained by the approximate approach. The performance of the proposed method is demonstrated via simulations.

Comments:	This paper is accepted to IEEE CDC 2016
Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:1609.07409 [cs.SY]
	(or arXiv:1609.07409v1 [cs.SY] for this version)

Submission history

From: Derya Aksaray [view email]
[v1] Fri, 23 Sep 2016 15:51:34 GMT (229kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1609.07409

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Systems and Control

Title: Q-Learning for Robust Satisfaction of Signal Temporal Logic Specifications

Submission history