IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0020002.html
   My bibliography  Save this article

A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes

Author

Listed:
  • Xiaofu He
  • Zhiyong Yang
  • Joe Z Tsien

Abstract

Humans can categorize objects in complex natural scenes within 100–150 ms. This amazing ability of rapid categorization has motivated many computational models. Most of these models require extensive training to obtain a decision boundary in a very high dimensional (e.g., ∼6,000 in a leading model) feature space and often categorize objects in natural scenes by categorizing the context that co-occurs with objects when objects do not occupy large portions of the scenes. It is thus unclear how humans achieve rapid scene categorization. To address this issue, we developed a hierarchical probabilistic model for rapid object categorization in natural scenes. In this model, a natural object category is represented by a coarse hierarchical probability distribution (PD), which includes PDs of object geometry and spatial configuration of object parts. Object parts are encoded by PDs of a set of natural object structures, each of which is a concatenation of local object features. Rapid categorization is performed as statistical inference. Since the model uses a very small number (∼100) of structures for even complex object categories such as animals and cars, it requires little training and is robust in the presence of large variations within object categories and in their occurrences in natural scenes. Remarkably, we found that the model categorized animals in natural scenes and cars in street scenes with a near human-level performance. We also found that the model located animals and cars in natural scenes, thus overcoming a flaw in many other models which is to categorize objects in natural context by categorizing contextual features. These results suggest that coarse PDs of object categories based on natural object structures and statistical operations on these PDs may underlie the human ability to rapidly categorize scenes.

Suggested Citation

  • Xiaofu He & Zhiyong Yang & Joe Z Tsien, 2011. "A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-15, May.
  • Handle: RePEc:plo:pone00:0020002
    DOI: 10.1371/journal.pone.0020002
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0020002
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0020002&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0020002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Natasha Sigala & Nikos K. Logothetis, 2002. "Visual categorization shapes feature selectivity in the primate temporal cortex," Nature, Nature, vol. 415(6869), pages 318-320, January.
    2. Marius V. Peelen & Li Fei-Fei & Sabine Kastner, 2009. "Neural mechanisms of rapid natural scene categorization in human visual cortex," Nature, Nature, vol. 460(7251), pages 94-97, July.
    3. Nicolas Pinto & David D Cox & James J DiCarlo, 2008. "Why is Real-World Visual Object Recognition Hard?," PLOS Computational Biology, Public Library of Science, vol. 4(1), pages 1-6, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sacha Bourgeois-Gironde & Catherine Tallon-Baudry & Florent Meyniel, 2011. "Fast and Automatic Activation of an Abstract Representation of Money in the Human Ventral Visual Pathway," Post-Print ijn_00713469, HAL.
    2. Pavel Škrabánek & Alexandra Zahradníková jr., 2019. "Automatic assessment of the cardiomyocyte development stages from confocal microscopy images using deep convolutional networks," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-18, May.
    3. Yuri Vankov & Aleksey Rumyantsev & Shamil Ziganshin & Tatyana Politova & Rinat Minyazev & Ayrat Zagretdinov, 2020. "Assessment of the Condition of Pipelines Using Convolutional Neural Networks," Energies, MDPI, vol. 13(3), pages 1-12, February.
    4. Dileep George & Jeff Hawkins, 2009. "Towards a Mathematical Theory of Cortical Micro-circuits," PLOS Computational Biology, Public Library of Science, vol. 5(10), pages 1-26, October.
    5. Iris I A Groen & Sennay Ghebreab & Victor A F Lamme & H Steven Scholte, 2012. "Spatially Pooled Contrast Responses Predict Neural and Perceptual Similarity of Naturalistic Image Categories," PLOS Computational Biology, Public Library of Science, vol. 8(10), pages 1-16, October.
    6. Qianli Yang & Edgar Walker & R. James Cotton & Andreas S. Tolias & Xaq Pitkow, 2021. "Revealing nonlinear neural decoding by analyzing choices," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    7. Sebastian Bach & Alexander Binder & Grégoire Montavon & Frederick Klauschen & Klaus-Robert Müller & Wojciech Samek, 2015. "On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation," PLOS ONE, Public Library of Science, vol. 10(7), pages 1-46, July.
    8. Pedro Malaca & Luis F. Rocha & D. Gomes & João Silva & Germano Veiga, 2019. "Online inspection system based on machine learning techniques: real case study of fabric textures classification for the automotive industry," Journal of Intelligent Manufacturing, Springer, vol. 30(1), pages 351-361, January.
    9. Hailay Hagos Entahabu & Amare Sewnet Minale & Emiru Birhane, 2023. "Modeling and Predicting Land Use/Land Cover Change Using the Land Change Modeler in the Suluh River Basin, Northern Highlands of Ethiopia," Sustainability, MDPI, vol. 15(10), pages 1-15, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0020002. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.