Authors
Elise van der Pol, Frans A Oliehoek
Publication date
2016
Conference
NIPS WS: Learning, Inference and Control of Multi-Agent Systems
Publisher
https://sites.google.com/site/malicnips2016/
Description
This paper investigates learning control policies for traffic lights. We introduce a new reward function for the traffic light control problem and propose combining the popular Deep Q-learning algorithm with a coordination algorithm, yielding a scalable approach to controlling coordinated traffic lights without the simplifying assumptions made in earlier work. We show that this approach reduces travel times compared to earlier work on reinforcement learning methods for traffic light control, and we investigate possible causes of instability in the single-agent case.
Total citations
Citations per year, 2017-2024 (bar chart)
Scholar articles
E Van der Pol, FA Oliehoek - Proceedings of learning, inference and control of multi …, 2016