Have a personal or library account? Click to login
Open Access
|Sep 2021

Abstract

Reliability, efficiency and generalization are basic evaluation criteria for a vehicle automated driving system. This paper proposes an automated driving decision-making method based on the Wasserstein generative adversarial imitation learning–deep deterministic policy gradient (WGAIL–DDPG(λ)). Here the exact reward function is designed based on the requirements of a vehicle’s driving performance, i.e., safety, dynamic and ride comfort performance. The model’s training efficiency is improved through the proposed imitation learning strategy, and a gain regulator is designed to smooth the transition from imitation to reinforcement phases. Test results show that the proposed decision-making model can generate actions quickly and accurately according to the surrounding environment. Meanwhile, the imitation learning strategy based on expert experience and the gain regulator can effectively improve the training efficiency for the reinforcement learning model. Additionally, an extended test also proves its good adaptability for different driving conditions.

DOI: https://doi.org/10.34768/amcs-2021-0031 | Journal eISSN: 2083-8492 | Journal ISSN: 1641-876X
Language: English
Page range: 461 - 470
Submitted on: Jan 22, 2021
Accepted on: Jul 12, 2021
Published on: Sep 27, 2021
Published by: Sciendo
In partnership with: Paradigm Publishing Services
Publication frequency: 4 times per year

© 2021 Mingheng Zhang, Xing Wan, Longhui Gang, Xinfei Lv, Zengwen Wu, Zhaoyang Liu, published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.