IMPROVE THE APPLICATION OF THE TD3 ALGORITHM IN LONGITUDINAL CONTROL FOR AUTONOMOUS DRIVING

Liang Xiong; Yu Du

doi:10.2316/J.2025.201-0504

IMPROVE THE APPLICATION OF THE TD3 ALGORITHM IN LONGITUDINAL CONTROL FOR AUTONOMOUS DRIVING

Liang Xiong∗ and Yu Du∗

Keywords

Reinforcement learning, autonomous driving, longitudinal control,generating experience replay, adversarial reinforcement learning

Abstract

Currently, autonomous driving longitudinal control algorithms based on twin delayed deep deterministic policy gradient have been extensively researched and have achieved relatively stable results. However, these algorithms also face a series of issues, such as low training eﬃciency and insuﬃcient robustness in complex and challenging environments. To address these problems, an improved algorithm named enhanced twin delayed deep deterministic policy gradient has been proposed. This algorithm introduces feedforward neural networks to implement generative experience replay, enabling it to utilise experiences in the experience pool to generate additional virtual experiences. Experiences enhance the sampling eﬃciency of valuable experiences and alleviate the issue of low training eﬃciency caused by insuﬃcient experiences. Additionally, the concept of adversarial reinforcement learning is introduced, where intelligent agents are alternately updated to reﬁne their policies and value functions. This approach empowers the primary agent to better handle challenging scenarios, improve its generalisation capabilities, and enhance the algorithm’s robustness. A series of simulation experiments were conducted on the CARLA simulation platform, and the results showed that the algorithm exhibited signiﬁcant improvements in training eﬃciency and robustness compared to traditional algorithms, soft actor–critic, and deep deterministic policy gradient algorithms in the context of longitudinal control tasks for autonomous vehicles. This indicates that the algorithm can eﬀectively address issues related to low training eﬃciency and limited robustness caused by diﬀerences between simulation and real-world environments, ultimately enhancing the performance of longitudinal control in the ﬁeld of autonomous driving. ∗ School of Robotics, Beijing Union University, Beijing Key Laboratory of Information Services Engineering, Beijing, 100101, China; e-mail: [email protected]; [email protected] Corresponding author: Yu Du

Important Links:

DOI: 10.2316/J.2025.201-0504
From Journal (201) Mechatronic Systems and Control - 2025

Go Back