Top Banner
Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5, MAY 2014 Zhixiang Ren, Shenghua Gao, liang-Tien Chia, and Ivor Wai-Hung Tsang 1
37

Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

Dec 13, 2015

Download

Documents

Edwina Joseph
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

1

Region-Based Saliency Detection and Its Application in Object Recognition

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24

NO. 5, MAY 2014

Zhixiang Ren, Shenghua Gao, liang-Tien Chia, and Ivor Wai-Hung Tsang

Page 2: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

2

Overview

• Introduction

• Related Work

• Proposed Method of Saliency

• Experiments For Saliency Detection

• Conclusion

Page 3: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

3

Introduction

Page 4: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

4

Introduction

• Visual Saliency

Measure to what extent a region attracts human attention.

• Potential Application

Adaptive compression, image retargeting, object detection

Page 5: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

5

Introduction (cont.)

• Many saliency detection algorithms (pixel-grid) have been proposed

[36]-[38], [57], [68]

• Drawbacks

• Perform poorly in the images with large salient regions

• Suffer from the messy background, e.g. natural scenes.

Page 6: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

6

Introduction (cont.)

• [17], [18] suggest that early feature like color, contrast, and orientation indirectly affect human attention

Human is attracted by objects not by individual pixels

• It is natural to work with those perceptually meaningful image regions in saliency detection

Concept of superpixel [54]

Page 7: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

7

Introduction (cont.)

• Proposed work applies two existing techniques to improve saliency detection

Superpixel representation – Used to represent the input image

PageRank algorithm – Applied to propagate saliency among similar clusters and refine saliency map

Page 8: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

8

Related Work

Page 9: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

9

Related Work

• Saliency detection methods can be divided into two categories

Top-down method : Task-dependent and based on prior knowledge about the object and their interrelations

Bottom-up method : Hypothesis for saliency is that salient stimulus is distinct from its surrounding stimuli. (contrast)

• For bottom-up method, research usually focus on identifying those regions with high contrast.

Page 10: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

10

Related Work (cont.)

• [38] proposed to determine the contrast by DoG

• [57] measured the likeness of a pixel to its surroundings by the local regression kernels

• [1] measured the saliency of each pixel by the difference between the feature of each pixel and mean of the whole image.

• [68] measured the global contrast with all the other pixels.

• [30] model both local and global contrast by taking the positional distance into account

• Most of approach represent the input image in pixel-grid manner, and these method may failed to detect the homogeneous and quite large salient objects

Page 11: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

11

Related Work (cont.)

Page 12: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

12

Proposed Method

Page 13: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

13

Proposed Method

• Superpixel Extraction and Clustering

• Salient Region Detection

• Saliency Refinement With Propagation

Page 14: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

14

Superpixel Extraction

• Given an input image, mean shift algorithm[13] will be performed in color space to extract superpixels.

• In mean shift algorithm, , , and are needed.

Spatial Radius

Range Radius

Minimum Point Density

Maximum range radius

Average color variance

Color variance

Page 15: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

15

Superpixel Extraction (cont.)

• Mean shift

Mean shift is a procedure for locating the maxima of a density function given discrete data sampled from that function.

Scale parameter

Page 16: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

16

Superpixel Extraction (cont.)

m(x)

Page 17: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

17

Superpixel Clustering

• After mean shift, every superpixel will obtain a unique RGB color

• GMM is introduced to cluster superpixels in RGB color space

• The RGB value of this superpixel will be set as a 3-D vector to represent the superpixel during GMMR 139

G 160

B 127

Page 18: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

18

Superpixel Clustering (cont.)

• K-means is used to initialize the GMM

• Expectation maximization algorithm is used to train GMM parameter [5]

• The probability of the th superpixel belong to the th cluster

Page 19: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

19

Superpixel Clustering (cont.)

Page 20: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

20

Superpixel Clustering (cont.)

• Mixture density

Assume the set of N training samples is drawn from a mixture of models

Page 21: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

21

Superpixel Clustering (cont.)

• How to estimate and

EM algorithm

Page 22: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

22

Superpixel Clustering (cont.)

Page 23: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

23

Superpixel Clustering (cont.)

Page 24: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

24

Superpixel Clustering (cont.)

Page 25: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

25

Superpixel Clustering (cont.)

Page 26: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

26

Salient Region Detection

• Idea : background has larger spread in spatial domain

• i.e., the more compact the clusters are spread, the more salient they will be

• [32] proposed compactness metric to evaluate the spread of cluster

• Inter-cluster distances defined as

Cluster Spatial Center

Page 27: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

27

Refinement With Propagation

• In some situation, the perceptually meaningful regions are less than the cluster number.

• That is, some regions, which should belong to one cluster, will be grouped into several clusters.

R 139

G 160

B 127

Page 28: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

28

Refinement With Propagation (cont.)

• If one cluster is over-segmented into several subclusters, the compactness may be highly distorted.

• Thus PageRank algorithm is proposed to propagate saliency between similar clusters.

• Original PageRank algorithm

• Question : How the original PageRank come from?

Page 29: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

29

Refinement With Propagation (cont.)

• Idea : A page linked by many pages with high PageRank receives high rank as well.

• Modified algorithm

Page 30: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

30

Refinement With Propagation (cont.)

Page 31: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

31

Experiment Result

Page 32: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

32

Experimental DataSet and Compared Method

• Dataset

EPFL dataset [1], CMU dataset [4], MSRA dataset [46] ,Itti’s method (ITTI) [38]

• Method

Spectral residual method (SR) [37]

Graph-based saliency method (GB) [36]

Frequency-tuned method (FT) [37]

Method based on color and orientation distributions (COD) [32]

Region contrast method (RC) [12] (Region-based)

Context-based method (CB) [39] (Region-based)

Page 33: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

33

Experiment Result

Page 34: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

34

Experiment Result (cont.)

• Linear Correlation Coefficient

Page 35: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

35

Experiment Result (cont.)

Page 36: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

36

Conclusion

Page 37: Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,

37

Conclusion

• The paper proposes a promising saliency detection approach, which can generate accurate saliency maps with well-defined object boundary.

• Mean shift, GMM are used to extract meaningful superpixel.

• Saliency value is refined as well with a modified PageRank algorithm.