Abstract

= PDF Reprint, = BibTeX entry, = Online Abstract

L. Elazary, L. Itti, Interesting objects in natural scenes are more salient, In: Proc. Vision Science Society Annual Meeting (VSS07), May 2007. (Cited by 372)

Abstract: How do we decide which objects in a visual scene are more interesting? Intuition suggests a complex process of recognizing different candidate scene elements in turn, evaluating their identity and other attributes against behavioral preferences and goals, and finally deciding which among the candidates are more relevant and interesting. Here we investigate the contributions of a much simpler process, saliency-based visual attention. We used the publicly available LabelMe database of 24,863 digital photographs in which 74,454 presumably interesting objects have been manually outlined. We evaluated how often these objects were among the few most salient locations by a computational model of bottom-up attention. We find that in 43 percent of all images the model's first fixation falls within a labeled region, twice above chance (21 percent). Furthermore, within three fixations, the saliency map is able to pick a labeled region over 85 percent of the time, with performance leveling off after six fixations. The bottom-up attention model has no notion of object nor of semantic relevance. Hence, our results indicate that selecting interesting objects in a scene is largely constrained by low-level visual properties of scene elements, rather than solely determined by recognition and higher cognitive processes. The saliency map is a strong predictor of what humans find interesting in complex natural scenes.

Themes: Computational Modeling, Model of Bottom-Up Saliency-Based Visual Attention, Human Psychophysics