Finding Pictures of Objects in Large Collections of Images
David A. Forsyth and Jitendra Malik and Margaret M. Fleck and Hayit Greenspan and Thomas Leung and Serge Belongie and Chad Carson and Christoph Bregler
EECS Department, University of California, Berkeley
Technical Report No. UCB/CSD-96-905
, 1996
http://www2.eecs.berkeley.edu/Pubs/TechRpts/1996/CSD-96-905.pdf
Retrieving images from very large collections, using image content as a key, is becoming an important problem. Users prefer to ask for pictures using notions of content that are strongly oriented to the presence of abstractly defined objects. Computer programs that implement these queries automatically are desirable, but are hard to build because conventional object recognition techniques from computer vision cannot recognize very general objects in very general contexts. <p>This paper describes our approach to object recognition, which is structured around a sequence of increasingly specialized grouping activities that assemble coherent regions of image that can be shown to satisfy increasingly stringent constraints. The constraints that are satisfied provide a form of object classification in quite general contexts. <p>This view of recognition is distinguished by: far richer involvement of early visual primitives, including color and texture; hierarchical grouping and learning strategies in the classification process; the ability to deal with rather general objects in uncontrolled configurations and contexts. We illustrate these properties with four case-studies: one demonstrating the use of color and texture descriptors; one showing how trees can be described by fusing texture and geometric properties; one learning scenery concepts using grouped features; and one showing how this view of recognition yields a program that can tell, quite accurately, whether a picture contains naked people or not.
BibTeX citation:
@techreport{Forsyth:CSD-96-905, Author= {Forsyth, David A. and Malik, Jitendra and Fleck, Margaret M. and Greenspan, Hayit and Leung, Thomas and Belongie, Serge and Carson, Chad and Bregler, Christoph}, Title= {Finding Pictures of Objects in Large Collections of Images}, Year= {1996}, Month= {Jun}, Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/1996/5525.html}, Number= {UCB/CSD-96-905}, Abstract= {Retrieving images from very large collections, using image content as a key, is becoming an important problem. Users prefer to ask for pictures using notions of content that are strongly oriented to the presence of abstractly defined objects. Computer programs that implement these queries automatically are desirable, but are hard to build because conventional object recognition techniques from computer vision cannot recognize very general objects in very general contexts. <p>This paper describes our approach to object recognition, which is structured around a sequence of increasingly specialized grouping activities that assemble coherent regions of image that can be shown to satisfy increasingly stringent constraints. The constraints that are satisfied provide a form of object classification in quite general contexts. <p>This view of recognition is distinguished by: far richer involvement of early visual primitives, including color and texture; hierarchical grouping and learning strategies in the classification process; the ability to deal with rather general objects in uncontrolled configurations and contexts. We illustrate these properties with four case-studies: one demonstrating the use of color and texture descriptors; one showing how trees can be described by fusing texture and geometric properties; one learning scenery concepts using grouped features; and one showing how this view of recognition yields a program that can tell, quite accurately, whether a picture contains naked people or not.}, }
EndNote citation:
%0 Report %A Forsyth, David A. %A Malik, Jitendra %A Fleck, Margaret M. %A Greenspan, Hayit %A Leung, Thomas %A Belongie, Serge %A Carson, Chad %A Bregler, Christoph %T Finding Pictures of Objects in Large Collections of Images %I EECS Department, University of California, Berkeley %D 1996 %@ UCB/CSD-96-905 %U http://www2.eecs.berkeley.edu/Pubs/TechRpts/1996/5525.html %F Forsyth:CSD-96-905