Top Banner
Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of Geneva [email protected]
26

Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

Jan 16, 2016

Download

Documents

Violet Gilmore
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

Characterizing activity in video shots based on

salient points

Nicolas Moënne-LoccozViper groupComputer vision & multimedia laboratory University of Geneva

[email protected]

Page 2: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 2

Outline

• Context

• Video Activity extraction– Spatial salient points– Spatio-temporal salient points– Spatio-temporal salient regions

• Results

• Conclusion

Page 3: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 3

Context• Describe visual content of video

Index, retrieve and browse video database

• Requirements– Generic approach (v.s. domain oriented)– Local approach (v.s. global description of the content)– Computationally efficient approach

• Video activity : salient region of the video 3D space

Page 4: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 4

Context0 50 70Frames

Activity #1

Activity #2

Description in space and time of video activity Inference based on video object and event relationships High level indexing

Page 5: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 5

Context

• Related approaches : Spatio-temporal segmentation– Segmentation problem– Computational efficiency

• Our approach :– Spatio-temporal salient points– Spatial grouping of salient points– Temporal matching of salient regions

Set of activities

Page 6: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 6

Overview

Salient points& trajectories

Global motionestimation

Motion outliers

Spatial grouping

Video stream

Salient extraction

Temporal matching

Salient extraction

Page 7: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 7

Salient points

• Points in the image space– Repetitive (robust)– High information content

Scale invariant interest points (Mikolajczyk, Schmid 2001)

– One of the most robust– Salient points with characteristic scale

Page 8: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 8

Salient point extraction

• Linear Scale-Space :

• Harris function :

• Salient points (image space) : local maxima h(v,s)

• Laplacian over scale :

• Salient points (scale space) : local maxima l(v,s) & h(v,s)

H žv , s Ÿ= s 2G žv , ? ŸL x

2 žv , s Ÿ L x L yžv , sŸ

L x L yžv , sŸ L x2 žv , sŸ

Page 9: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 9

Salient point extraction

• Example :

scale

Page 10: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 10

Salient point extraction

scale

Page 11: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 11

Motion estimation

• Goal :– Find points having salient temporal behaviour

Estimate background motion model Select points that do not follow this background motion

model

• Estimation :– Compute salient point trajectories– Estimate corresponding affine motion model

Page 12: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 12

Trajectories

• Point descriptors : Local Grayvalue Invariants

• Point distance : Mahalanobis distance

Page 13: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 13

Trajectories

• Goodness of match :

• Candidate matching points– Matches with spatial distance below a threshold

• Relaxation process :– Disambiguating set of candidate matches– Greedy Winner-Takes-All algorithm

Page 14: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 14

Motion estimation

• Affine motion model :

• Estimate model from trajectories– Iterative least square error estimate (Tukey M-Estimator)

select points that belong to the global motion model

Assumption : +50% points belong to the background

Page 15: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 15

Motion estimation

Points of the background and their motion estimated using the presented approach

All points and their motion estimated by a dense motion estimator

Page 16: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 16

Spatio-temporal salient points

• Points whose trajectory does not fit the global motion model

Outliers (moving objects)

• Points without trajectory (no matching point) New points (appearing or deformable objects)

Page 17: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 17

Spatio-temporal salient points

Fixed camera Moving camera

Page 18: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 18

Salient regions

• Set of spatio-temporal salient points

Feature distribution of points (RGB colour features) Spatial distribution of points

• Grouping process : Estimate salient region models

Page 19: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 19

Feature model

• Feature description– A salient point is characterized by the feature distribution of its

neighbourhood

– Assumption : maximum of four regions in the neighbourhood of the points

– Compute the corresponding colour distributions :• K-means clustering• Gaussian model

• Gaussian models clustering– Greedy algorithm (AHC)

Set of Gaussian distributions representing the distribution of the neighbourhood of the salient points :

Page 20: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 20

Salient region model

• Feature models– Mixture of Gaussians

Corresponding weight of each Gaussian

• Spatial model : – Estimate spatial pdf from salient points & associated scale

Page 21: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 21

Salient region models

• Iterate a RanSaC algorithm

• Estimate salient region model– Robust estimation (Tukey M-estimator)– Cost function :

Page 22: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 22

Salient regions

Fixed camera Moving camera

Page 23: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 23

Temporal matching

• Spatio-temporal salient regions of arbitrary length

Matching of salient regions

• Use salient points trajectories

1. Match regions with the highest number of matching points

Page 24: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 24

Results - Meetings

Page 25: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 25

Results – Misc

Page 26: Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.

NML - CVML - UniGe 26

Conclusion

• Contribution– Highly informational content descriptor– Generic content descriptor– Local in space and time content descriptor

• Limitation– Noisy & short activity

• Ongoing work– Temporal filtering of activity– Indexing of videos through the set of activity