Bridging the Gap Between Modular and End-to-end Autonomous Driving Systems
Eric Leong
EECS Department, University of California, Berkeley
Technical Report No. UCB/EECS-2022-79
May 12, 2022
http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-79.pdf
We aim to bridge the gap between end-to-end learning and traditional pipeline-based approaches for autonomous vehicles (AVs). In this work, we replace the traditional planning and control algorithms of modular approaches with an end-to-end learned policy, developing a hybrid of the two approaches. Our learned policy takes a bird's-eye-view representation of the world as input, and produces control actions such as braking, steering, and acceleration. To support the development of this learned policy, we introduce caRLot, a novel OpenAI gym environment that builds atop the open-source Pylot AV platform to provide configurable abstractions in addition to an interface with the CARLA simulator. We use caRLot to learn a model-free reinforcement learning policy that replaces planning and control, and compare its performance and runtime against several state-of-the-art approaches. We find that our hybrid approach has a notable improvement in runtime over a modular driving system, while having a significant advantage in interpretability over end-to-end systems.
Advisors: Joseph Gonzalez
BibTeX citation:
@mastersthesis{Leong:EECS-2022-79, Author= {Leong, Eric}, Title= {Bridging the Gap Between Modular and End-to-end Autonomous Driving Systems}, School= {EECS Department, University of California, Berkeley}, Year= {2022}, Month= {May}, Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-79.html}, Number= {UCB/EECS-2022-79}, Abstract= {We aim to bridge the gap between end-to-end learning and traditional pipeline-based approaches for autonomous vehicles (AVs). In this work, we replace the traditional planning and control algorithms of modular approaches with an end-to-end learned policy, developing a hybrid of the two approaches. Our learned policy takes a bird's-eye-view representation of the world as input, and produces control actions such as braking, steering, and acceleration. To support the development of this learned policy, we introduce caRLot, a novel OpenAI gym environment that builds atop the open-source Pylot AV platform to provide configurable abstractions in addition to an interface with the CARLA simulator. We use caRLot to learn a model-free reinforcement learning policy that replaces planning and control, and compare its performance and runtime against several state-of-the-art approaches. We find that our hybrid approach has a notable improvement in runtime over a modular driving system, while having a significant advantage in interpretability over end-to-end systems.}, }
EndNote citation:
%0 Thesis %A Leong, Eric %T Bridging the Gap Between Modular and End-to-end Autonomous Driving Systems %I EECS Department, University of California, Berkeley %D 2022 %8 May 12 %@ UCB/EECS-2022-79 %U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-79.html %F Leong:EECS-2022-79