SAFE-GIL: SAFEty Guided Imitation Learning

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Behavior Cloning is a popular approach to Imitation Learning, in which a robot observes an expert supervisor and learns a control policy. However, behavior cloning suffers from the "compounding error" problem - the policy errors compound as it deviates from the expert demonstrations and might lead to catastrophic system failures, limiting its use in safety-critical applications. On-policy data aggregation methods are able to address this issue at the cost of rolling out and repeated training of the imitation policy, which can be tedious and computationally prohibitive. We propose SAFE-GIL, an off-policy behavior cloning method that guides the expert via adversarial disturbance during data collection. The algorithm abstracts the imitation error as an adversarial disturbance in the system dynamics, injects it during data collection to expose the expert to safety critical states, and collects corrective actions. Our method biases training to more closely replicate expert behavior in safety-critical states and allows more variance in less critical states. We compare our method with several behavior cloning techniques and DAgger on autonomous navigation and autonomous taxiing tasks and show higher task success and safety, especially in low data regimes where the likelihood of error is higher, at a slight drop in the performance.

Related collections

Author and article information

Journal

Publication date Created: 08 April 2024

Article

ArXiV ID: 2404.05249

SO-VID: 256e9c00-e6dd-4f80-95fc-c74cdb3f0cc7

License:

http://creativecommons.org/licenses/by/4.0/

History

Custom metadata

Categories cs.RO cs.LG cs.SY eess.SY

ScienceOpen disciplines: Performance, Systems & Control,Robotics,Artificial intelligence

Data availability:

ScienceOpen disciplines: Performance, Systems & Control, Robotics, Artificial intelligence

SAFE-GIL: SAFEty Guided Imitation Learning

Read this article at

Abstract

Related collections

Annual Reviews AI, Machine Learning, and Society

Author and article information

Journal

Article

History

Custom metadata

Comments

Comment on this article

Similar content 94