Authors
Ke Zhang, Fang He, Zhengchao Zhang, Xi Lin, Meng Li
Publication date
2020/12/1
Journal
Transportation Research Part C: Emerging Technologies
Volume
121
Pages
102861
Publisher
Pergamon
Description
The multi-vehicle routing problem with soft time windows (MVRPSTW) is an indispensable constituent of urban logistics distribution systems. Over the past decade, numerous methods for MVRPSTW have been proposed, but most are based on heuristic rules that require a large amount of computation time. With the current rapid increase in logistics demand, traditional methods face a dilemma between computational efficiency and solution quality. To solve the problem efficiently, we propose a novel reinforcement learning algorithm called the Multi-Agent Attention Model, which can solve routing problems instantly by benefiting from lengthy offline training. Specifically, the vehicle routing problem is regarded as a vehicle tour generation process, and an encoder-decoder framework with attention layers is proposed to generate the tours of multiple vehicles iteratively. Furthermore, a multi-agent reinforcement learning method with an …
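The term "soft time windows" can be illustrated with a short sketch. It assumes the common formulation in which arriving before or after a customer's window is allowed but incurs a linear penalty added to the travel cost. The description above does not state the paper's exact cost function, so the function name, the parameters, and the penalty coefficients here are hypothetical.

```python
def soft_time_window_cost(arrivals, windows, travel_cost,
                          early_penalty=1.0, late_penalty=1.0):
    """Travel cost plus linear penalties for missing soft time windows.

    Assumed formulation, not taken from the paper: each customer has a
    window (start, end); arriving outside it is permitted but penalized
    in proportion to how early or late the vehicle is.
    """
    penalty = 0.0
    for arrival, (start, end) in zip(arrivals, windows):
        penalty += early_penalty * max(0.0, start - arrival)  # arrived too early
        penalty += late_penalty * max(0.0, arrival - end)     # arrived too late
    return travel_cost + penalty


# One vehicle, two customers: one hour early at the first, one hour late at the second.
cost = soft_time_window_cost(
    arrivals=[1.0, 5.0],
    windows=[(2.0, 4.0), (2.0, 4.0)],
    travel_cost=10.0,
)
print(cost)
```

Under a hard time window the same tour would be infeasible; the soft variant keeps it feasible and lets the optimizer trade lateness against travel distance.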
Total citations
[Chart: citations per year, 2020 to 2024]