We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Building Safer Autonomous Agents by Leveraging Risky Driving Behavior Knowledge

Abstract: Simulation environments are good for learning different driving tasks like lane changing, parking or handling intersections etc. in an abstract manner. However, these simulation environments often restrict themselves to operate under conservative interaction behavior amongst different vehicles. But, as we know, real driving tasks often involve very high risk scenarios where other drivers often don't behave in the expected sense. There can be many reasons for this behavior like being tired or inexperienced. The simulation environment doesn't take this information into account while training the navigation agent. Therefore, in this study we especially focus on systematically creating these risk prone scenarios with heavy traffic and unexpected random behavior for creating better model-free learning agents. We generate multiple autonomous driving scenarios by creating new custom Markov Decision Process (MDP) environment iterations in the highway-env simulation package. The behavior policy is learnt by agents trained with the help from deep reinforcement learning models. Our behavior policy is deliberated to handle collisions and risky randomized driver behavior. We train model free learning agents with supplement information of risk prone driving scenarios and compare their performance with baseline agents. Finally, we casually measure the impact of adding these perturbations in the training process to precisely account for the performance improvement obtained from utilizing the learnings from these scenarios.
Comments: Published in CCCI 2021, Best Paper Award in Informatics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
DOI: 10.1109/CCCI52664.2021.9583209
Cite as: arXiv:2103.10245 [cs.LG]
  (or arXiv:2103.10245v3 [cs.LG] for this version)

Submission history

From: Ashish Rana [view email]
[v1] Tue, 16 Mar 2021 23:39:33 GMT (2783kb,D)
[v2] Wed, 31 Mar 2021 14:35:14 GMT (2781kb,D)
[v3] Sun, 17 Oct 2021 18:09:57 GMT (5202kb,D)

Link back to: arXiv, form interface, contact.