Learning to Walk: Legged Hexapod Locomotion from Simulation to the Real World

Maxime Kawawa-Beaudan

EECS Department, University of California, Berkeley

Technical Report No. UCB/EECS-2022-146

May 19, 2022

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-146.pdf

This thesis presents methods for training intelligent robotic systems to navigate challenging, diverse environments. In particular it focuses on legged locomotion for hexapods rather than well-studied bipedal or quadrupedal robots. We employ hierarchical reinforcement learning to integrate proprioception with a low-level gait controller, and train on-policy algorithms entirely in simulation before transferring to a real robot. We develop command-conditioned policies in PyBullet that learn to walk at commanded target velocities, and switch to Isaac Gym to train policies with multi-objective rewards both in rough and flat terrains. A comparison is established between gaits with motion priors and prior-free gaits. Both methods successfully surmount joist obstacles typically seen in attics in the real world. We propose novel approaches to the sim-to-real problem designed to address limitations of affordable hardware. We demonstrate methods in the Isaac Gym simulation for integrating vision.

Advisors: Avideh Zakhor

BibTeX citation:

@mastersthesis{Kawawa-Beaudan:EECS-2022-146,
    Author= {Kawawa-Beaudan, Maxime},
    Editor= {Zakhor, Avideh},
    Title= {Learning to Walk: Legged Hexapod Locomotion from Simulation to the Real World},
    School= {EECS Department, University of California, Berkeley},
    Year= {2022},
    Month= {May},
    Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-146.html},
    Number= {UCB/EECS-2022-146},
    Abstract= {This thesis presents methods for training intelligent robotic systems to navigate challenging,
diverse environments. In particular it focuses on legged locomotion for hexapods rather
than well-studied bipedal or quadrupedal robots. We employ hierarchical reinforcement
learning to integrate proprioception with a low-level gait controller, and train on-policy
algorithms entirely in simulation before transferring to a real robot. We develop command-conditioned
policies in PyBullet that learn to walk at commanded target velocities, and
switch to Isaac Gym to train policies with multi-objective rewards both in rough and flat
terrains. A comparison is established between gaits with motion priors and prior-free gaits.
Both methods successfully surmount joist obstacles typically seen in attics in the real world.
We propose novel approaches to the sim-to-real problem designed to address limitations of
affordable hardware. We demonstrate methods in the Isaac Gym simulation for integrating
vision.},
}

EndNote citation:

%0 Thesis
%A Kawawa-Beaudan, Maxime 
%E Zakhor, Avideh 
%T Learning to Walk: Legged Hexapod Locomotion from Simulation to the Real World
%I EECS Department, University of California, Berkeley
%D 2022
%8 May 19
%@ UCB/EECS-2022-146
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-146.html
%F Kawawa-Beaudan:EECS-2022-146