Poselets and Their Applications in High-Level Computer Vision

Abstract

We address the classic problems of detection and segmentation using a part-based detector that operates on a novel part, which we refer to as a poselet. Poselets are tightly clustered both in appearance space (and thus are easy to detect) and in configuration space (and thus are helpful for localization and segmentation). We demonstrate that poselets are effective for detection, pose extraction, segmentation, action/pose estimation and attribute classification. Poselet construction requires extra annotations beyond the object bounds. To train poselets we have created H3D (Humans in 3D), a dataset of 1200+ person annotations. The annotations include the joints, the extracted 3D pose, keypoint visibility and region labels. We have also annotated the people in the training and validation sets of PASCAL VOC 2009.

Our poselet classifier achieves state-of-the-art results for the person category on PASCAL VOC 2007, 2008, 2009 and 2010 as well as on our dataset, H3D.
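
To make "tightly clustered in configuration space" concrete, here is a minimal sketch of one way to compare two keypoint configurations. It is an illustration only, not the procedure from the papers: the function name `configuration_distance`, the dictionary format for keypoints, and the choice of alignment (translation and scale only, no rotation) are all assumptions made for this example.

```python
import math


def configuration_distance(kp_a, kp_b):
    """Mean residual between two 2D keypoint configurations.

    kp_a, kp_b: dicts mapping a keypoint name to (x, y). A keypoint that
    is not visible is simply absent (hypothetical format for this sketch).
    Only keypoints present in both dicts are compared. Each set is
    centered on its centroid and divided by its RMS radius, so the result
    ignores translation and uniform scale. Returns math.inf when fewer
    than two keypoints are shared or a configuration is degenerate.
    """
    shared = sorted(set(kp_a) & set(kp_b))
    if len(shared) < 2:
        return math.inf

    def normalize(kp):
        pts = [kp[name] for name in shared]
        cx = sum(x for x, _ in pts) / len(pts)
        cy = sum(y for _, y in pts) / len(pts)
        centered = [(x - cx, y - cy) for x, y in pts]
        scale = math.sqrt(sum(x * x + y * y for x, y in centered) / len(pts))
        if scale == 0:
            return None  # all shared keypoints coincide
        return [(x / scale, y / scale) for x, y in centered]

    a = normalize(kp_a)
    b = normalize(kp_b)
    if a is None or b is None:
        return math.inf
    total = sum(math.hypot(ax - bx, ay - by) for (ax, ay), (bx, by) in zip(a, b))
    return total / len(shared)


# Example: the same pose, shifted and enlarged, versus a different pose.
pose = {"l_shoulder": (0.0, 0.0), "r_shoulder": (2.0, 0.0), "neck": (1.0, -1.0)}
moved = {name: (3 * x + 10, 3 * y + 5) for name, (x, y) in pose.items()}
other = {"l_shoulder": (0.0, 0.0), "r_shoulder": (2.0, 0.0), "neck": (1.0, 3.0)}

print(configuration_distance(pose, moved))  # close to zero
print(configuration_distance(pose, other))  # clearly larger
```

Under this kind of measure, patches whose keypoints yield a small distance would be candidates for the same cluster; the published method should be consulted for the actual distance and selection criteria.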

Browse Poselets

You can browse the 150 poselets for the person category.

Results

The following are results as of September 23, 2010 for the Person category of the PASCAL VOC challenges.

Challenge	Poselets	Second-highest score
VOC 2010	48.5	47.5 ***
VOC 2009	48.6	47.9 ***
VOC 2008	54.1	43.1 **
VOC 2007	46.9	43.2 *

* P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, Object Detection with Discriminatively Trained Part Based Models (Release 4, 2010)

** P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, Object Detection with Discriminatively Trained Part Based Models, PAMI (preprint, 2009)

*** P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, PASCAL VOC 2010 competition

In this comparison we included all methods participating in Comp 3 (trained on VOC data) and Comp 4 (trained on own data). Our method requires extra annotations, so we competed in Comp 4, where we were the only submission.
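
For a quick read of the table, the margin over the second-highest score can be tabulated per challenge. The snippet below is a small sketch that uses only the numbers listed on this page; the names `RESULTS` and `margins` are introduced for illustration.

```python
# Scores copied from the results table on this page:
# challenge -> (poselets score, second-highest score)
RESULTS = {
    "VOC 2010": (48.5, 47.5),
    "VOC 2009": (48.6, 47.9),
    "VOC 2008": (54.1, 43.1),
    "VOC 2007": (46.9, 43.2),
}


def margins(results):
    """Return the poselets score minus the second-highest score per challenge."""
    return {
        challenge: round(poselets - second, 1)
        for challenge, (poselets, second) in results.items()
    }


for challenge, margin in margins(RESULTS).items():
    print(f"{challenge}: +{margin:.1f}")
```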

Papers

Core papers:

Lubomir Bourdev, Jitendra Malik, Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations, ICCV 2009 BibTeX
Lubomir Bourdev, Subhransu Maji, Thomas Brox, Jitendra Malik, Detecting People Using Mutually Consistent Poselet Activations, ECCV 2010 BibTeX

Applications papers:

Subhransu Maji, Lubomir Bourdev, Jitendra Malik, Action Recognition from a Distributed Representation of Pose and Appearance, CVPR 2011 BibTeX
Thomas Brox, Lubomir Bourdev, Subhransu Maji, Jitendra Malik, Object Segmentation by Alignment of Poselet Activations to Image Contours, CVPR 2011 BibTeX
Bharath Hariharan, Pablo Arbelaez, Lubomir Bourdev, Subhransu Maji, Jitendra Malik, Semantic Contours from Inverse Detectors, ICCV 2011 BibTeX
Lubomir Bourdev, Subhransu Maji, Jitendra Malik, Describing People: Poselet-Based Approach to Attribute Classification, ICCV 2011 BibTeX

Slides

These slides describe the core poselet selection and training algorithm (the ICCV 2009 paper).

Code

Below is stand-alone code that takes an image and draws bounding boxes around the people in it; it can also perform interactive visualization of the poselets. Requirements: Matlab + Image Processing toolbox. The code is released under a non-commercial license. The released code and trained detector are similar to those we used in the PASCAL 2010 competition, which are slightly more accurate (but slower) than the version in our ECCV 2010 paper.

October 2010 release

June 2011 BETA release. The trained models are the same, but the code has been cleaned up and there are many visualization utilities. Please send email to lbourdev at eecs.berkeley.edu if you have problems with the new version.

Note: If you use WinZip and Matlab reports that your file is corrupt, please try WinRAR. If you need an older release, please let us know.

Datasets

H3D and the associated Matlab toolbox.
Keypoint annotations for the people in PASCAL 2009 training and validation images.
3D pose estimation (see project page for more).
Keypoint annotations for the PASCAL 2010 action dataset (see project page for more).
Attributes of people dataset.

H3D Annotation tool

The Java3D tool that we used to create H3D and a video tutorial are available here. There are no license restrictions on using the tool for your own annotations.

Contact us

For comments or questions about poselets, please email Lubomir Bourdev: lbourdev-at-eecs dot berkeley dot edu.