Search for Geometric Regularity in 2D Static Scenes

Click here for instructions to get Source Code

Keywords:

Bayesian networks, feature grouping, perceptual organization, voting, preattentive vision, attentive vision, integration of vision modules


Abstract

The evolution of perceptual organization in biological vision, and its necessity in advanced computer vision systems, arises from the characteristic that perception, the extraction of meaning from sensory input, is an intelligent process. This is particularly so for high order organisms and, analogically, for more sophisticated computational models. By perceptual organization we refer to the ability of a vision system to organize detected features in images based on viewpoint consistency and other Gestaltic perceptual phenomena. This imparts robustness, efficiency, and a qualitative and holistic nature to vision.

Our computational paradigm aims to organize features into highly plausible sets of higher level geometric features which are present in images of objects belonging to a large number of domains. Our organizational philosophy is hierarchical, with complex organizations being formed from simpler ones. Each level of the hierarchy is constructed using voting methods, graph operations, and knowledge based reasoning in a new extension of the Bayesian network we call the Perceptual Inference Network. Analogous to theories in human vision, our strategy divides broadly into two parts: detecting regularities and similarities in the tokens (preattentive vision) and reasoning, based on a knowledge base built from past experience, to enable one to go beyond the information provided (attentive vision). The voting method provides organizations based on Gestalt principles and the network reasons on those organizations to extract geometric features. The two steps of voting and evidential reasoning are repeated.

Previous approaches to perceptual organization have mostly been purely bottom up, without any top down knowledge base influence and therefore entirely dependent on the inputs, which may be imperfect. The knowledge base, besides coping with such input imperfections, also allows us to integrate multiple sources of information and to form a composite organization hypothesis.

Publications

  • S. Sarkar and K. L. Boyer, "Computing Perceptual Organization in Computer Vision ," World Scientific, 1994, ISBN 981-02-1832-X. This book describes in detail the complete perceptual organization system and covers the papers listed below

    (online order)

    Ordering Information: World Scientific Publishing Co ;
    1060 Main Street, River Edge NJ 07661, USA
    Phone: 201-487-9655 FAX: 201-487-9656

  • S. Sarkar and K. L. Boyer, ``Perceptual Organization in Computer Vision: A Review and a proposal for a Classificatory Structure,'' IEEE Transactions on Systems, Man, and Cybernetics , vol.~23, no.~2, pp.~382--399, Mar. 1993.
  • S. Sarkar and K. L. Boyer, ``A Computational Structure for Preattentive Perceptual Organization: Graphical Enumeration and Voting Methods,'' IEEE Transactions on Systems, Man, and Cybernetics , vol.~24, no.~2, pp.~246--267, Feb. 1994.
  • S. Sarkar and K. L. Boyer, ``Integration, Inference, and Management of Spatial Information Using Bayesian Networks: Perceptual Organization,'' IEEE Transactions on Pattern Analysis and Machine Intelligence (Special Section on Probabilistic Reasoning), vol.~15, no.~3, pp.~256--274, Mar. 1993.
  • S. Sarkar and K. L. Boyer, ``Using Perceputal Inference Networks to Manage Vision Processes,'' Accepted in Computer Vision, Graphics, and Image Processing: Image Understanding , July 1994.
  • S. Sarkar and K. L. Boyer, ``Automated Design of Bayesian Perceptual Inference Networks,'' Tech. Rep. SAMPL-93-03, SAMP-Lab, Dept. of EE, OSU, March 1993, also appeared in CVPR-1994

    Please contact Prof. Sudeep Sarkar (sarkar@cs.csee.usf.edu) for more information.
    USF Home Pages: University --- College --- Department --- Vision Lab