Impact Factor:3.3
Journal:Transportmetrica A
Abstract:In this paper, we address the problem of controlling the sensitivity parameters (or control gains) of automated driving vehicles in an open heterogeneous traffic flow system. The automated driving vehicles are assumed to be equipped with adaptive cruise control and connectivity, while the conventional vehicles are characterized by a stochastic safe time headway. To optimize the sensitivity parameters, the natural policy gradient reinforcement learning algorithm is used to search for the best policy. Two performance indices are considered: the traffic breakdown probability and fuel consumption. Extensive simulations show that, for maximum performance, the sensitivity parameters should depend on both the flow and the penetration rate. In particular, a penetration rate as low as 5% can improve traffic performance. A comparison with other algorithms suggests that natural policy gradient and Q-learning yield a good approximation and significantly reduce the computational cost.
Authors:Marouane Bouadi, Bin Jia, Rui Jiang, Xingang Li, Ziyou Gao
Indexed by:Journal paper
Document Code:762-806
Volume:18
Translation or Not:no
Date of Publication:2021-03-11
Included Journals:SCI, SSCI