Variable Speed Limit Control Based on Deep Deterministic Policy
                    Gradient under Mixed Traffic Environment

Xibin Ding; Zhaolei Zhang; Zhizhen Liu; Feng Tang

doi:10.4271/12-09-01-0004

Features

Authors

Xibin Ding

Changsha University of Science and Technology, School of Transportation Engineering, China

Zhaolei Zhang

Changsha University of Science and Technology, China

Zhizhen Liu

Changsha University of Science and Technology, China

Feng Tang

Changsha University of Science and Technology, China

Abstract

Content: The existing variable speed limit (VSL) control strategies rely on variable message signs, leading to slow response times and sensitivity to driver compliance. These methods struggle to adapt to environments where both connected automated vehicles (CAVs) and manual vehicles coexist. This article proposes a VSL control strategy using the deep deterministic policy gradient (DDPG) algorithm to optimize travel time, reduce collision risks, and minimize energy consumption. The algorithm leverages real-time traffic data and prior speed limits to generate new control actions. A reward function is designed within a DDPG-based actor-critic framework to determine optimal speed limits. The proposed strategy was tested in two scenarios and compared against no-control, rule-based control, and DDQN-based control methods. The simulation results indicate that the proposed control strategy outperforms existing approaches in terms of improving TTS (total time spent), enhancing the throughput efficiency of the bottleneck area, and reducing the spatial and temporal extent of traffic congestion. Compared to the suboptimal DDQN-based VSL control, the proposed strategy improves TTS by 9.3% in Scenario 1 and by 11% in Scenario 2. The sensitivity analysis shows that the proposed control strategy improves performance as the penetration rate of CAVs increases. However, when the penetration rate reaches a certain threshold, the potential for further optimization becomes limited. Furthermore, higher time-to-collision (TTC) values, influenced by the reward function r ₂, enhance traffic safety.

Meta Tags

Topics: Automated vehicles
Intelligent transportation systems
Energy consumption
Reaction and response times
Congestion
Mathematical models
Simulation and modeling
Optimization
Data exchange
Vehicle drivers

Affiliated or Co-Author: Changsha University of Science and Technology, School of Transportation Engineering, China
Changsha University of Science and Technology, China

Details

DOI: https://doi.org/10.4271/12-09-01-0004

Pages: 16

Citation: Ding, X., Zhang, Z., Liu, Z., and Tang, F., "Variable Speed Limit Control Based on Deep Deterministic Policy Gradient under Mixed Traffic Environment," SAE Int. J. CAV 9(1):1-16, 2026, https://doi.org/10.4271/12-09-01-0004.

Additional Details

Publisher: SAE International

Published: May 06

Product Code: 12-09-01-0004

Content Type: Journal Article

Language: English

SAE International Journal of Connected and Automated Vehicles

Volume 9, Issue 1