Temporal IK: Data-Driven Pose Estimation for Virtual Realiity
James Lin
EECS Department, University of California, Berkeley
Technical Report No. UCB/EECS-2019-59
May 17, 2019
http://www2.eecs.berkeley.edu/Pubs/TechRpts/2019/EECS-2019-59.pdf
High-quality human avatars are an important part of compelling virtual reality (VR) experiences. Animating an avatar to match the movement of its user, however, is a fundamentally difficult task, as most VR systems only track the user's head and hands, leaving the rest of the body undetermined. In this report, we introduce Temporal IK, a data-driven approach to predicting full-body poses from standard VR headset and controller inputs. We describe a recurrent neural network that, when given a sequence of positions and rotations from VR tracked objects, predicts the corresponding full-body poses in a manner that exploits the temporal consistency of human motion. To train and evaluate this model, we recorded several hours of motion capture data of subjects using VR. The model is integrated into an end-to-end solution within Unity, a popular game engine, for ease of use. Our model is found to do well in generating natural looking motion in the upper body.
Advisors: James O'Brien
BibTeX citation:
@mastersthesis{Lin:EECS-2019-59, Author= {Lin, James}, Editor= {O'Brien, James}, Title= {Temporal IK: Data-Driven Pose Estimation for Virtual Realiity}, School= {EECS Department, University of California, Berkeley}, Year= {2019}, Month= {May}, Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2019/EECS-2019-59.html}, Number= {UCB/EECS-2019-59}, Abstract= {High-quality human avatars are an important part of compelling virtual reality (VR) experiences. Animating an avatar to match the movement of its user, however, is a fundamentally difficult task, as most VR systems only track the user's head and hands, leaving the rest of the body undetermined. In this report, we introduce Temporal IK, a data-driven approach to predicting full-body poses from standard VR headset and controller inputs. We describe a recurrent neural network that, when given a sequence of positions and rotations from VR tracked objects, predicts the corresponding full-body poses in a manner that exploits the temporal consistency of human motion. To train and evaluate this model, we recorded several hours of motion capture data of subjects using VR. The model is integrated into an end-to-end solution within Unity, a popular game engine, for ease of use. Our model is found to do well in generating natural looking motion in the upper body.}, }
EndNote citation:
%0 Thesis %A Lin, James %E O'Brien, James %T Temporal IK: Data-Driven Pose Estimation for Virtual Realiity %I EECS Department, University of California, Berkeley %D 2019 %8 May 17 %@ UCB/EECS-2019-59 %U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2019/EECS-2019-59.html %F Lin:EECS-2019-59