Top Banner
People Counting and Human Detection in a Challenging Situation Ya-Li Hou and Grantham K. H. Pang IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART A: SYSTEMS AND HUMANS, VOL. 41, NO. 1, JANUARY 2011
32

People Counting and Human Detection in a Challenging Situation Ya-Li Hou and Grantham K. H. Pang IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART.

Dec 14, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
  • Slide 1

People Counting and Human Detection in a Challenging Situation Ya-Li Hou and Grantham K. H. Pang IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICSPART A: SYSTEMS AND HUMANS, VOL. 41, NO. 1, JANUARY 2011 Slide 2 Outline Introduction Related Work Proposed Algorithm Experiment Results Conclusion 2 Slide 3 Introduction Related Work Proposed Algorithm Experiment Results Conclusion 3 Slide 4 Object People counting is a crucial and challenging problem in visual surveillance. This paper aims to develop an effective method for estimating the number of people in a complicated outdoor scene. 4 Slide 5 Introduction Related Work Proposed Algorithm Experiment Results Conclusion 5 Slide 6 Related Work Detection-based methods Segment the foreground blobs into individuals based on prior knowledge of human shapes and the characteristics of the foreground contour. Detect individuals directly from the image. Map-based methods 6 Slide 7 Introduction Related Work Proposed Algorithm Experiment Results Conclusion 7 Slide 8 Framework 8 Slide 9 Introduction Related Work Proposed Algorithm People Counting Individual Detection Experiment Results Conclusion 9 Slide 10 People Counting A robust adaptive background estimation method based on the Gaussian Mixture Model [23], [24] is employed in this paper. The foreground image is then binarized based on a threshold to obtain the foreground pixels. [23] W. E. L. Grimson, C. Stauffer, R. Romano, and L. Lee, Using adaptive tracking to classify and monitor activities in a site, in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 1998, pp. 2229. [24] C. Stauffer and W. E. L. Grimson, Learning patterns of activity using real-time tracking, IEEE Trans. Pattern Anal.Mach. Intell., vol. 22, no. 8, pp. 747757, Aug. 2000. 10 Slide 11 People Counting Perspective correction is an important step for foreground pixels-based estimation[13]. The number of foreground pixels is computed with (2) [13] R. Ma, L. Li,W. Huang, and Q. Tian, On pixel count based crowd density estimation for visual surveillance, in Proc. IEEE Conf. Cybern. Intell. Syst., 2004, pp. 170173. 11 Slide 12 People Counting To determine the relationship between foreground pixels and the number of people, some manually annotated training images from a similar scene are needed. Method 1) Based on Foreground Pixels Method 2) Based on Closed Foreground Pixels Method 3) Based on Both Foreground Pixels and Closed Foreground Pixels 12 Slide 13 Based on Foreground Pixels The relationship between the number of foreground pixels after perspective correction and the number of people will be found directly. 13 Slide 14 Based on Closed Foreground Pixels Some people show solid foreground blobs, while others only show some scattered pixels in the foreground image. To reduce the difference between moving people and stationary people, a closing operation is employed. 14 Slide 15 Based on Both Foreground Pixels and Closed Foreground Pixels To keep more information about the original image, both foreground pixels and closed foreground pixels will be injected into the neural network. 15 Slide 16 Introduction Related Work Proposed Algorithm People Counting Individual Detection Experiment Results Conclusion 16 Slide 17 Individual Detection Feature Detection Foreground Mask Cluster Model EM Clustering Postprocessing 17 Slide 18 Feature Detection Kanade-Lucas-Tomasi (KLT) [25] is a popular corner detector and shows good performance for tracking. 18 [25] C. Tomasi and T. Kanade, Detection and tracking of point features, Carnegie Mellon Univ., Pittsburgh, PA, Tech. Rep. CMU-CS-91-132, 1991. Slide 19 Foreground Mask The foreground mask is obtained from the foreground pixel image after a closing operation. After filtering with the foreground mask, almost all feature points from the background will be removed. 19 Slide 20 Cluster Model 20 Slide 21 Cluster Model To facilitate the computation, we assume the ellipse is coincident with the 30% ellipse of the Gaussian distribution. 21 Slide 22 EM Clustering EM algorithm is used to cluster the feature points into each individual person. 22 Slide 23 Postprocessing The EM clustering results may contain some redundant ellipses. The candidate ellipses are checked one by one and the redundant ellipses removed. A very simple occlusion analysis is performed in this step. In our evaluations, occlusion is simply defined as a 30% overlap of two ellipses. Humans not occluded by others should have more than three feature points, while two feature points are acceptable for those who are occluded. 23 Slide 24 Introduction Related Work Proposed Algorithm Experiment Results Conclusion 24 Slide 25 Evaluation 1 25 Slide 26 26 Slide 27 Evaluation 2 27 Slide 28 28 Slide 29 Evaluation 3 29 Slide 30 30 Slide 31 Introduction Related Work Proposed Algorithm Experiment Results Conclusion 31 Slide 32 Conclusion In this paper, foreground pixels from both moving people and near stationary people have been considered to estimate their number. The best estimation results, with a 10% average error, were achieved when both foreground pixels and closed foreground pixels are learned in a neural network. 32