View article

Exploiting multi-modal fusion for urban autonomous driving using latent deep reinforcement learning

Authors

Yasser H Khalil, Hussein T Mouftah

Publication date

2022/11/1

Journal

IEEE Transactions on Vehicular Technology

Volume

Issue

Pages

2921-2935

Publisher

IEEE

Description

Human driving decisions are the leading cause of road fatalities. Autonomous driving naturally eliminates such incompetent decisions and thus can improve traffic safety and efficiency. Deep reinforcement learning (DRL) has shown great potential in learning complex tasks. Recently, researchers investigated various DRL-based approaches for autonomous driving. However, exploiting multi-modal fusion to generate perception and motion prediction and then leveraging these predictions to train a latent DRL has not been targeted yet. To that end, we propose enhancing urban autonomous driving using multi-modal fusion with latent DRL. A single LIDAR sensor is used to extract bird's-eye view (BEV), range view (RV), and residual input images. These images are passed into LiCaNext, a real-time multi-modal fusion network, to produce accurate joint perception and motion prediction. Next, predictions are fed with …

Total citations

Cited by 14

2022202320242 6 6

Scholar articles

Exploiting multi-modal fusion for urban autonomous driving using latent deep reinforcement learning

YH Khalil, HT Mouftah - IEEE Transactions on Vehicular Technology, 2022