Opinion
TRENDS in Cognitive Sciences
Vol.7 No.6 June 2003
What is a visual object?
Jacob Feldman
Department of Psychology, Center for Cognitive Science, Rutgers University, New Brunswick, NJ 08903, USA
The concept of an ‘object’ plays a central role in cognitive science, particularly in vision, reasoning and conceptual development – but it has rarely been given a concrete formal definition. Here I argue that visual objects cannot be defined according to simple physical properties but can instead be understood in terms of the hierarchical organization of visual scene interpretations. Within the tree describing such a hierarchical description, certain nodes make natural candidates as the ‘joints’ between objects, representing division points between parts of the image that cohere internally but do not perceptually group with one another. Thus each subtree hanging from such a node corresponds to a single perceived ‘object’. This formal definition accords with several intuitions about the way objects behave. Objects are everywhere in cognitive science. Objects are thought to be the building blocks of children’s conception of the physical world [1,2]; to delineate the boundaries respected by visual attention [3 – 5]; and to influence neural processing even at the earliest stages of visual cortex [6,7]. But what exactly is an object? In a phrase due to the American jurist Potter Stewart, ‘we know one when we see one’ – but what does the word actually mean? How do we know where one object ends and the next begins? The premise of this article is that this is not (as it were) an ‘objective’ question, but rather one that relates to how we mentally divide the world up into coherent units. That is, it is less about physics and more about the mental assumptions lurking behind the word ‘coherent’. The division of the world into objects seems so intuitive and effortless, at least under everyday conditions, that we speak about this division as if the world
References: 1 Spelke, E.S. (1990) Principles of object perception. Cogn. Sci. 14, 29 – 56 2 Xu, F. and Carey, S. (1996) Infants’ metaphysics: the case of numerical identity. Cogn. Psychol. 30, 111 – 153 http://tics.trends.com 3 Baylis, G.C. (1994) Visual attention and objects: two-object cost with equal convexity. J. Exp. Psychol. Hum. Percept. Perform. 20, 208 – 212 4 Duncan, J. (1984) Selective attention and the organization of visual information. J. Exp. Psychol. Gen. 113, 501 – 517 5 Scholl, B. et al. (2001) What is a visual object? Evidence from target merging in multiple object tracking. Cognition 80, 159 – 177 6 Albright, T.D. (1994) Why do things look as they do? Trends Neurosci. 17, 175 – 177 7 Olson, C.R. (2001) Object-based vision and attention in primates. Curr. Opin. Neurobiol. 11, 171 – 179 8 Allen, W. (1972) Without Feathers, Ballantine Books 9 Elder, J. and Zucker, S. (1993) The effect of contour closure on the rapid discrimination of two-dimensional shapes. Vision Res. 33, 981 – 991 10 Pomerantz, J.R. et al. (1977) Perception of wholes and their component parts: some configural superiority effects. J. Exp. Psychol. Hum. Percept. Perform. 3, 422 – 435 11 Palmer, P. and Rock, I. (1994) Rethinking perceptual organization: the role of uniform connectedness. Psychon. Bull. Rev. 1, 29 – 55 12 Bertamini, M. (2001) The importance of being convex: an advantage for convexity when judging position. Perception 30, 1295– 1310 13 Liu, Z. et al. (1999) The role of convexity in perceptual completion: beyond good continuation. Vision Res. 39, 4244– 4257 14 Caelli, T.M. and Umansky, J. (1976) Interpolation in the visual system. Vision Res. 16, 1055– 1060 15 Feldman, J. (1997) Curvilinearity, covariance, and regularity in perceptual groups. Vision Res. 37, 2835– 2848 16 Foster, D.H. (1979) Discrete internal pattern representations and visual detection of small changes in pattern shape. Percept. Psychophys. 26, 459 – 468 17 Pizlo, Z. et al. (1997) Curve detection in a noisy image. Vision Res. 37, 1217– 1241 18 Smits, J.T. and Vos, P.G. (1987) The perception of continuous curves in dot stimuli. Perception 16, 121– 131 19 Feldman, J. (2000) Bias toward regular form in mental shape spaces. J. Exp. Psychol. Hum. Percept. Perform. 26, 1 – 14 20 Kanizsa, G. (1979) Organization in Vision: Essays on Gestalt Perception, Praeger Publishers 21 Treisman, A. (1986) Properties, parts and objects. In Handbook of Perception and Human Performance: Cognitive Processes and Performance (Vol. 2) (Boff, K.R., ed.), pp. 35-1 – 35-70, John Wiley and Sons 22 Baylis, G. and Driver, J. (1993) Visual attention and objects: evidence for hierarchical coding of location. J. Exp. Psychol. Hum. Percept. Perform. 19, 451– 470 23 Geisler, W.S. and Super, B.J. (2000) Perceptual organization of twodimensional patterns. Psychol. Rev. 107, 677 – 708 24 Palmer, S.E. (1977) Hierarchical structure in perceptual representation. Cogn. Psychol. 9, 441 – 474 25 Marr, D. and Nishihara, H.K. (1978) Representation and recognition of the spatial organization of three-dimensional shapes. Proc. R. Soc. Lond. Ser. B 200, 269 – 294 26 Amir, A. and Lindenbaum, M. (1998) A generic grouping algorithm and its quantitative analysis. IEEE Trans. Patt. Anal. Mach. Intell. 20, 168– 185 27 Shi, J. and Malik, M. (2000) Normalized cuts and image segmentation. IEEE Trans. Patt. Anal. Mach. Intell. 22, 888– 905 28 Pomerantz, J.R. and Pristach, E.A. (1989) Emergent features, attention, and perceptual glue in visual form perception. J. Exp. Psychol. Hum. Percept. Perform. 15, 635 – 649 29 Feldman, J. (1997) Regularity-based perceptual grouping. Comput. Intell. 13, 582 – 623 30 Feldman, J., Perceptual grouping by selection of a logically minimal model. Intl. J. Comput. Vis. (in press) 31 Feldman, J. (1999) The role of objects in perceptual grouping. Acta Psychol. (Amst.) 102, 137 – 163 32 Behrmann, M. et al. (1998) Object-based attention and occlusion: evidence from normal participants and a computational model. J. Exp. Psychol. Hum. Percept. Perform. 24, 1011 – 1036 33 Moore, C.M. et al. (1998) Object-based visual selection: evidence from perceptual completion. Psychol. Sci. 9, 104 – 110 34 Feldman, J. (2002) Perceptual grouping into visual ‘objects’: a detailed chronology. Proc.2nd Annu. Conf. Vis. Sci. Soc., p. 172 35 Marr, D. (1982) Vision: A Computational Investigation into the Human Representation and Processing of Visual Information, Freeman & Co.