Computing Policies That Account For The Effects Of Human Agent Uncertainty During Execution In Markov Decision Processes

Gopalakrishnan, Sriram; Verma, Mudit; Kambhampati, Subbarao

Full-text links:

Download:

Current browse context:

cs.AI

< prev | next >

new | recent | 2109

Change to browse by:

Computer Science > Artificial Intelligence

Title: Computing Policies That Account For The Effects Of Human Agent Uncertainty During Execution In Markov Decision Processes

Authors: Sriram Gopalakrishnan, Mudit Verma, Subbarao Kambhampati

(Submitted on 15 Sep 2021 (v1), last revised 3 Mar 2022 (this version, v3))

Abstract: When humans are given a policy to execute, there can be policy execution errors and deviations in policy if there is uncertainty in identifying a state. This can happen due to the human agent's cognitive limitations and/or perceptual errors. So an algorithm that computes a policy for a human to execute ought to consider these effects in its computations. An optimal Markov Decision Process (MDP) policy that is poorly executed (because of a human agent) maybe much worse than another policy that is suboptimal in the MDP, but considers the human-agent's execution behavior. In this paper we consider two problems that arise from state uncertainty; these are erroneous state-inference, and extra-sensing actions that a person might take as a result of their uncertainty. We present a framework to model the human agent's behavior with respect to state uncertainty, and can be used to compute MDP policies that accounts for these problems. This is followed by a hill climbing algorithm to search for good policies given our model of the human agent. We also present a branch and bound algorithm which can find the optimal policy for such problems. We show experimental results in a Gridworld domain, and warehouse-worker domain. Finally, we present human-subject studies that support our human model assumptions.

Comments:	7 page paper, 6 pages supplemental material
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2109.07436 [cs.AI]
	(or arXiv:2109.07436v3 [cs.AI] for this version)

Submission history

From: Sriram Gopalakrishnan [view email]
[v1] Wed, 15 Sep 2021 17:10:46 GMT (1054kb,D)
[v2] Mon, 20 Sep 2021 21:24:20 GMT (1054kb,D)
[v3] Thu, 3 Mar 2022 22:00:30 GMT (8937kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2109.07436

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Artificial Intelligence

Title: Computing Policies That Account For The Effects Of Human Agent Uncertainty During Execution In Markov Decision Processes

Submission history