Modeling Car Following Behavior of Autonomous Driving Vehicles Based on Deep Reinforcement Learning

CHEN Yue; JIAO Pengpeng; BAI Ruyu; LI Rujian

doi:10.3963/j.jssn.1674-4861.2023.02.007

Volume 41 Issue 2

Apr. 2023

Turn off MathJax

Article Contents

Article Navigation > Journal of Transport Information and Safety > 2023 > 41(2): 67-75

CHEN Yue, JIAO Pengpeng, BAI Ruyu, LI Rujian. Modeling Car Following Behavior of Autonomous Driving Vehicles Based on Deep Reinforcement Learning[J]. Journal of Transport Information and Safety, 2023, 41(2): 67-75. doi: 10.3963/j.jssn.1674-4861.2023.02.007

Citation:

CHEN Yue, JIAO Pengpeng, BAI Ruyu, LI Rujian. Modeling Car Following Behavior of Autonomous Driving Vehicles Based on Deep Reinforcement Learning[J]. Journal of Transport Information and Safety, 2023, 41(2): 67-75. doi: 10.3963/j.jssn.1674-4861.2023.02.007

Citation:

PDF( 1827 KB)

Modeling Car Following Behavior of Autonomous Driving Vehicles Based on Deep Reinforcement Learning

doi: 10.3963/j.jssn.1674-4861.2023.02.007

Beijing Key Laboratory of General Aviation Technology, Beijing University of Civil Engineering and Architecture, Beijing 100044, China

Received Date: 2022-09-14
Available Online: 2023-06-19

Abstract

Abstract

In order to enhance the performance of car following behavior of autonomous vehicles and mitigate the negative effects of traffic oscillations, a deep reinforcement learning-based car following model for automated driving is investigated. The existing reward function is improved by incorporating energy consumption, and the related terms for representing energy consumption are established based on the VT-Micro model. In addition, the method of using the time gap between vehicles to establish the reward function related to driving efficiency is improved by adding virtual speed to the time gap, in order to avoid computation overflow and unrealistic short following distance in the traffic oscillation scenario. To overcome the limitations of training on closed-loop simulated roads and simulated vehicle trajectories, human driver behavior extracted from the NGSIM trajectory data during traffic oscillation are used to develop the training environment. By applying the twin delayed deep deterministic policy gradient algorithm (TD3), a multi-objective car following model is then developed. A system for evaluating model performance is established to compare the performance of the TD3 model with traditional models in car following and traffic oscillations scenarios. Study results of car following scenarios show that the TD3 model and the traditional adaptive cruise control (ACC) model perform similarly in terms of comfort and driving efficiency, but both outperform the human drivers. In terms of safety, the TD3 model reduces safety hazards by 53.65% compared to the traditional ACC model, and 36.24% compared to the human drivers. Regarding energy consumption, the TD3 model reduces the energy consumption of the conventional ACC model and human drivers by 6.73% and 15.65%, respectively. Study results show that the TD3 model can reduce the negative impacts of traffic oscillations. In the scenario with a 100% TD3 model penetration rate, driving discomfort decreases by 55.95%, driving efficiency increases by 8.82%, crash risks reduce by 73.21%, and fuel consumption drops by 5.97%, compared to a 100% human-driven environment.
- intelligent transportation,
- autonomous vehicle,
- reinforcement learning,
- car following model,
- traffic oscillation

FullText(HTML)

References(23)

References

[1]	LI X, CUI J, SHI A, et al. Stop-and-go traffic analysis: theoretical properties, environmental impacts and oscillation mitigation[J]. Transportation Research Part B: Methodological, 2014(70): 319-339.
[2]	ZHENG Z, AHN S, MONSERE C M. Impact of traffic oscillations on freeway crash occurrences[J]. Accident Analysis & Prevention, 2010, 42(2): 626-636.
[3]	GOLOB T F, RECKER W W, ALVAREZ V M. Safety aspects of freeway weaving sections[J]. Transportation Research Part A: Policy & Practice, 2004, 38(1): 35-51.
[4]	韩雨, 郭延永, 张乐, 等. 消除高速公路运动波的可变限速控制方法[J]. 中国公路学报, 2022, 35(1): 151-158. doi: 10.19721/j.cnki.1001-7372.2022.01.013 HAN Y, GUO Y Y, ZHANG L, et al. An optimal variable speed limit control approach against freeway jam waves[J]. China Journal of Highway and Transport, 2022, 35(1): 151-158. (in Chinese) doi: 10.19721/j.cnki.1001-7372.2022.01.013
[5]	HE Z, LIANG Z, SONG L, et al. A jam-absorption driving strategy for mitigating traffic oscillations[J]. IEEE Transactions on Intelligent Transportation Systems, 2017, 18(4): 802-813. doi: 10.1109/TITS.2016.2587699
[6]	秦严严, 王昊, 何兆益, 等. 基于比功率的自动驾驶交通流油耗分析[J]. 交通运输系统工程与信息, 2020, 20(1): 91-96. doi: 10.16097/j.cnki.1009-6744.2020.01.014 QIN Y Y, WANG H, HE Z Y, et al. Fuel consumption analysis of automated driving traffic flow based on vehicle specific power[J]. Journal of Transportation Systems Engineering and Information Technology, 2020, 20(1): 91-96. (in Chinese) doi: 10.16097/j.cnki.1009-6744.2020.01.014
[7]	KESTING A, TREIBER M, SCHÖNHOF M, et al. Adaptive cruise control design for active congestion avoidance[J]. Transportation Research Part C: Emerging Technologies, 2008.16(6): 668-683. doi: 10.1016/j.trc.2007.12.004
[8]	LI T N, CHEN D J, ZHAO H, et al. Car-following behavior characteristics of adaptive cruise control vehicles based on empirical experiments[J]. Transportation Research Part B: Methodological, 2021.147: 67-91. doi: 10.1016/j.trb.2021.03.003
[9]	LIN X, MENG W, VAN AREM B. Realistic car-following models for microscopic simulation of adaptive and cooperative adaptive cruise control vehicles[J]. Transportation Research Record: Journal of the Transportation Research Board, 2017, 2623(1): 1-9. doi: 10.3141/2623-01
[10]	ZHOU M, QU X, LI X. A recurrent neural network based microscopic car following model to predict traffic oscillation[J]. Transportation Research Part C: Emerging Technologies, 2017, 84: 245-264. doi: 10.1016/j.trc.2017.08.027
[11]	HUANG X, SUN J, SUN J. A car-following model considering asymmetric driving behavior based on long short-term memory neural networks[J]. Transportation Research Part C: Emerging Technologies, 2018, 95: 346-362. doi: 10.1016/j.trc.2018.07.022
[12]	MA L, QU S. A sequence to sequence learning based car-following model for multi-step predictions considering reaction delay[J]. Transportation Research Part C: Emerging Technologies, 2020, 120: 102785. doi: 10.1016/j.trc.2020.102785
[13]	朱冰, 蒋渊德, 赵健, 等. 基于深度强化学习的车辆跟驰控制[J]. 中国公路学报, 2019, 32(6): 53-60. https://www.cnki.com.cn/Article/CJFDTOTAL-ZGGL201906006.htm ZHU B, JIANG Y D, ZHAO J, et al. A car-following control algorithm based on deep reinforcement learning[J]. China Journal of Highway and Transport, 2019, 32(6): 53-60. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-ZGGL201906006.htm
[14]	闫浩, 刘小珠, 石英. 基于REINFORCE算法和神经网络的无人驾驶车辆变道控制[J]. 交通信息与安全, 2021, 39(1): 164-172. doi: 10.3963/j.jssn.1674-4861.2021.01.0019 YAN H, LIU X Z, SHI Y. Lane-change control for unmanned vehicle based on REINFORCE algorithm and neural network[J]. Journal of Transport Information and Safety, 2021, 39(1): 164-172. (in Chinese) doi: 10.3963/j.jssn.1674-4861.2021.01.0019
[15]	李孟凡, 秦文虎, 云中华. 基于横纵向联合控制的多目标优化车辆跟驰研究[J]. 计算机应用研究, 2022, 39(8): 2409-2413. https://www.cnki.com.cn/Article/CJFDTOTAL-JSYJ202208028.htm LI M F, QIN W H, YUN Z H. Multi-objective optimal car-following model with lateral and longitudinal control[J]. ApplicationResearchofComputers, 2022, 39 (8): 2409-2413. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-JSYJ202208028.htm
[16]	KREIDIEH A R, WU C, BAYCN A M. Dissipating stop-and-go waves in closed and open networks via deep reinforcement learning[C]. 2018 IEEE International Conference on Intelligent Transportation Systems(ITSC), Hawaii, USA: IEEE, 2018.
[17]	QU X, YU Y, ZHOU M, et al. Jointly dampening traffic oscillations and improving energy consumption with electric, connected and automated vehicles: A reinforcement learning based approach[J]. Applied Energy, 2020(257): 114030
[18]	ZHU M X, WANG Y H, PU Z Y, et al. Safe, efficient, and comfortable velocity control based on reinforcement learning for autonomous driving[J]. Transportation Research Part C: Emerging Technologies, 2020(117): 102662.
[19]	BALAS V E, BALAS M M. Driver assisting by inverse time to collision[C]. 2006 World Automation Congress, Budapest, Hungary: IEEE, 2006.
[20]	YAO Z H, RONG H, JIANG Y S, et al. Stability and safety evaluation of mixed traffic flow with connected automated vehicles on expressways[J]. Journal of Safety Research, 2020(75): 262-274.
[21]	YAO Z H, XU T R, JIANG Y S, et al. Linear stability analysis of heterogeneous traffic flow considering degradations of connected automated vehicles and reaction time[J]. Physica A: Statistical Mechanics and Its Applications, 2021(561): 125218.
[22]	MONTANINO M, PUNZO V. Trajectory data reconstruction and simulation-based validation against macroscopic traffic patterns[J]. Transportation Research Part B: Methodological, 2015, 80: 82-106.
[23]	TREIBER M, HENNECKE A, HELBING D. Congested traffic states in empirical observations and microscopic simulations[J]. Physical Review E, 2000(62): 1805-1824.

Relative Articles

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(13) / Tables(3)

Get Citation

PDF

XML

Article Metrics

Article views (1001) PDF downloads(109)

Modeling Car Following Behavior of Autonomous Driving Vehicles Based on Deep Reinforcement Learning

doi: 10.3963/j.jssn.1674-4861.2023.02.007

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Modeling Car Following Behavior of Autonomous Driving Vehicles Based on Deep Reinforcement Learning

doi: 10.3963/j.jssn.1674-4861.2023.02.007

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content