Top Banner
A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998
18

A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Dec 26, 2015

Download

Documents

Jeffery Simmons
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

A Model of Saliency-BasedVisual Attention

for Rapid Scene Analysis

Laurent Itti, Christof Koch, and Ernst Niebur

IEEE PAMI, 1998

Page 2: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

What is Saliency?

● Something is said to be salient if it stands out

● E.g. road signs should have high saliency

Page 3: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Introduction

● Trying to model visual attention● Find locations of Focus of Attention in an

image● Use the idea of saliency as a basis for their

model● For primates focus of attention directed from:

● Bottom-up: rapid, saliency driven, task-independent

● Top-down: slower, task dependent

Page 4: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Results of the Model

• Only considering “Bottom-up”

task-independent

Page 5: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Model diagram

Page 6: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Model

● Input: static images (640x480)● Each image at 8 different scales

(640x480, 320x240, 160x120, …)● Use different scales for computing “centre-

surround” differences (similar to assignment)

+

-

Fine scale

Course scale

Page 7: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Feature Maps

1. Intensity contrast (6 maps)● Using “centre-surround”● Similar to neurons sensitive to dark centre,

bright surround, and opposite

2. Color (12 maps)● Similar to intensity map, but using different

color channels● E.g. high response to centre red, surround

green

Page 8: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Feature Maps

3. Orientation maps (24 maps)● Gabor filters at 0º, 45º, 90º, and 135º● Also at different scales

Total of 42 feature maps are combined into the saliency map

Page 9: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Saliency Map

● Purpose: represent saliency at all locations with a scalar quantity

● Feature maps combined into three “conspicuity maps”● Intensity (I)● Color (C)● Orientation (O)

● Before they are combined they need to be normalized

Page 10: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Normalization Operator

Page 11: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Example of operation

Leaky integrate-and-fire neurons“Inhibition of return”

Page 12: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Model diagram

Page 13: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Example of operation

• Using 2D “winner-take-all” neural network at scale 4

• FOA shifts every 30-70 ms

• Inhibition lasts 500-900 ms

Page 14: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Results

Image

SaliencyMap

High saliencyLocations(yellow circles)

Page 15: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Results

● Tested on both synthetic and natural images● Typically finds objects of interest, e.g. traffic

signs, faces, flags, buildings…● Generally robust to noise (less to

multicoloured noise)

Page 16: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Uses

● Real-time systems● Could be implemented in hardware● Great reduction of data volume

● Video compression (Parkhurst & Niebur)● Compress less important parts of images

Page 17: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

Summary

● Basic idea:● Find multiple saliency measures in parallel● Normalize ● Combine them to a single map● Use 2D integrate-and-fire layer of neurons to

determine position of FOA● Model appears to work accurately and

robustly (but difficult to evaluate)● Can be extended with other feature maps

Page 18: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis Laurent Itti, Christof Koch, and Ernst Niebur IEEE PAMI, 1998.

References

● Itti, Koch, and Niebur: “A Model of Saliency-Based Visual Attention for Rapid Scene Analysis” IEEE PAMI Vol. 20, No. 11, November (1998)

● Itti, Koch: “Computational Modeling of Visual Attention”, Nature Reviews – Neuroscience Vol. 2 (2001)

● Parkhurst, Law, Niebur: “Modeling the role of salience in the allocation of overt visual attention”, Vision Research 42 (2002)