People detection and tracking using stereo vision and color

Similar documents
People detection and tracking using stereo vision and color q

REAL-TIME TRACKING WITH AN EMBEDDED 3D CAMERA WITH FPGA PROCESSING. Alessandro Muscoloni and Stefano Mattoccia

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Vision-based Mobile Robot Localization and Mapping using Scale-Invariant Features

Occlusion Detection of Real Objects using Contour Based Stereo Matching

Reliable Human Detection and Tracking in Top-View Depth Images

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION

Stereo 3D camera supports automated storage of packaging Out of the box

Tracking Under Low-light Conditions Using Background Subtraction

W4. Perception & Situation Awareness & Decision making

Real Time Face Detection using Geometric Constraints, Navigation and Depth-based Skin Segmentation on Mobile Robots

CS 4758: Automated Semantic Mapping of Environment

Topic 4 Image Segmentation

Measurement of 3D Foot Shape Deformation in Motion

A Texture-based Method for Detecting Moving Objects

Brand Identification Using Gaussian Derivative Histograms

Measurement of Pedestrian Groups Using Subtraction Stereo

Stereo Vision A simple system. Dr. Gerhard Roth Winter 2012

Feature Transfer and Matching in Disparate Stereo Views through the use of Plane Homographies

Human Motion Detection and Tracking for Video Surveillance

3D Time-of-Flight Image Sensor Solutions for Mobile Devices

A Simple Interface for Mobile Robot Equipped with Single Camera using Motion Stereo Vision

Detection of Small-Waving Hand by Distributed Camera System

Detection and Classification of Vehicles

COMP 102: Computers and Computing

Vision-based Localization of an Underwater Robot in a Structured Environment

Computer and Machine Vision

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Robot Localization based on Geo-referenced Images and G raphic Methods

All human beings desire to know. [...] sight, more than any other senses, gives us knowledge of things and clarifies many differences among them.

An Object Detection System using Image Reconstruction with PCA

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

Human Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg

EE795: Computer Vision and Intelligent Systems

FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO

arxiv: v1 [cs.cv] 1 Jan 2019

Gesture Recognition using Temporal Templates with disparity information

COMS W4735: Visual Interfaces To Computers. Final Project (Finger Mouse) Submitted by: Tarandeep Singh Uni: ts2379

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth

Stereo Vision Computer Vision (Kris Kitani) Carnegie Mellon University

Theory of Stereo vision system

Pose estimation using a variety of techniques

Robot localization method based on visual features and their geometric relationship

Text-Tracking Wearable Camera System for the Blind

How to Use Passive Infrared Sensor Cameras

Study on the Signboard Region Detection in Natural Image

Scale Invariant Feature Transform

Dense Tracking and Mapping for Autonomous Quadrocopters. Jürgen Sturm

DPM Score Regressor for Detecting Occluded Humans from Depth Images

Interactive Collision Detection for Engineering Plants based on Large-Scale Point-Clouds

Multi-View Stereo for Static and Dynamic Scenes

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Multimedia Technology CHAPTER 4. Video and Animation

Real-time Stereo Vision for Urban Traffic Scene Understanding

Towards Autonomous Vehicle. What is an autonomous vehicle? Vehicle driving on its own with zero mistakes How? Using sensors

A Street Scene Surveillance System for Moving Object Detection, Tracking and Classification

Tecnologie per la ricostruzione di modelli 3D da immagini. Marco Callieri ISTI-CNR, Pisa, Italy

Automatic Colorization of Grayscale Images

Computationally Efficient Serial Combination of Rotation-invariant and Rotation Compensating Iris Recognition Algorithms

Scale Invariant Feature Transform

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

DEPTH ESTIMATION USING STEREO FISH-EYE LENSES

Depth Camera for Mobile Devices

ECE 172A: Introduction to Intelligent Systems: Machine Vision, Fall Midterm Examination

The XH-map algorithm: A method to process stereo video to produce a real-time obstacle map.

Fast Denoising for Moving Object Detection by An Extended Structural Fitness Algorithm

To Appear in IEEE International Symposium on Intelligent Vehicles in Parma, Italy, June 2004.

3D Environment Measurement Using Binocular Stereo and Motion Stereo by Mobile Robot with Omnidirectional Stereo Camera

L2 Data Acquisition. Mechanical measurement (CMM) Structured light Range images Shape from shading Other methods

Indoor Object Recognition of 3D Kinect Dataset with RNNs

MIME: A Gesture-Driven Computer Interface

PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE

An Implementation on Object Move Detection Using OpenCV

Introduction to 3D Machine Vision

Optical Flow-Based Person Tracking by Multiple Cameras

Avatar Communication: Virtual Instructor in the Demonstration Exhibit

DEVELOPMENT OF REAL TIME 3-D MEASUREMENT SYSTEM USING INTENSITY RATIO METHOD

Chapter 3 Image Registration. Chapter 3 Image Registration

Advance Shadow Edge Detection and Removal (ASEDR)

Complex Sensors: Cameras, Visual Sensing. The Robotics Primer (Ch. 9) ECE 497: Introduction to Mobile Robotics -Visual Sensors

ФУНДАМЕНТАЛЬНЫЕ НАУКИ. Информатика 9 ИНФОРМАТИКА MOTION DETECTION IN VIDEO STREAM BASED ON BACKGROUND SUBTRACTION AND TARGET TRACKING

CSE 4392/5369. Dr. Gian Luca Mariottini, Ph.D.

FOREGROUND DETECTION ON DEPTH MAPS USING SKELETAL REPRESENTATION OF OBJECT SILHOUETTES

3D scanning. 3D scanning is a family of technologies created as a means of automatic measurement of geometric properties of objects.

CS 4758 Robot Navigation Through Exit Sign Detection

A490 Machine Vision and Computer Graphics

A Real Time System for Detecting and Tracking People. Ismail Haritaoglu, David Harwood and Larry S. Davis. University of Maryland

Realtime Omnidirectional Stereo for Obstacle Detection and Tracking in Dynamic Environments

A method for depth-based hand tracing

Data-driven Depth Inference from a Single Still Image

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Website:

DETECTION OF SCREWS ON METAL CEILING STRUCTURES FOR DISMANTLING SYSTEMS

A Statistical Approach to Culture Colors Distribution in Video Sensors Angela D Angelo, Jean-Luc Dugelay

STEREO-VISION SYSTEM PERFORMANCE ANALYSIS

Generating and Learning from 3D Models of Objects through Interactions. by Kiana Alcala, Kathryn Baldauf, Aylish Wrench

A Robust Two Feature Points Based Depth Estimation Method 1)

Agent Mobility. Overview. mobile agents and mobile computing. technical issues agent languages, distributed execution, environment, security

Segmentation and Tracking of Partial Planar Templates

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview

Transcription:

People detection and tracking using stereo vision and color Rafael Munoz-Salinas, Eugenio Aguirre, Miguel Garcia-Silvente. In Image and Vision Computing Volume 25 Issue 6 (2007) 995-1007. Presented by Reid Sawtell

Introduction The system presented by this paper is designed to take advantage of newer stereo cameras to improve person detection and tracking. Additionally, color information is used to create a more robust tracking system. Finally, all of this must be done quickly so the system can be used in real time applications.

Overview Design Specifications Building the Environment Stereo Processing Background Foreground Detecting and Tracking People Division of Tasks Color Modeling Detecting People Tracking People Experimental Results Future Work

Design Specifications Stereo camera system (instead of the popular single camera system) Color information is used to improve tracking reliability Under-head camera position

Why Stereo? The price has decreased and availability has increased The added depth information allows for more accurate detection and tracking Foreground objects can be separated from background objects Can filter objects to reduce false positives Added filtering also reduces the need to apply costly face detection

Why use color? Systems based only on position can easily confuse objects in close proximity Luminance can vary greatly from one location to another

Why use an under-head camera? Camera position is problem dependent It can be mounted on mobile platforms It is ideal for gesture recognition Smaller robots are less intimidating

Building the Environment Stereo Processing Background Modeling Foreground Extraction

Stereo Processing Stereo System provides a disparity map Pixel location is calculated using known parameters Points are translated into a world model, which is likely to contain large volumes of data

Simplifying the Model 3D data is projected onto a 2D surface composed of uniform cells with height information Cell size must be carefully selected to balance memory/performance and reliability Filter points based on height to exclude ceiling or floor.

Scene reconstruction

Background Modeling People move, so separating them from the background is advantageous The heightmap is a model of the environment people move through Generating a reliable heightmap cannot be done instantaneously Must be updated more frequently than people are detected and tracked

Generating the heightmap over time

Detecting Foreground Objects Foreground points are easily isolated using the heightmap Occupancy map shows the distribution of foreground points over the scene Points are scaled based on proximity to camera A person is likely to be found in areas with high occupancy

Occupancy Map

Detecting and Tracking People Division of Tasks Color Modeling Detecting People Tracking People

Division of Tasks Face detection is computationally costly Tracked people can first be assigned to objects detected in a scene Remaining objects can then be studied as possible candidates for new people

How objects are Classified Generate a head region based on highest point in occupancy map Place a body region underneath the head region

Modeling Color The color of each object is modeled as a histogram Some illuminance information is held since color is not reliable at extreme values Pixels near the center are given more weight to reduce error Color models can be compared to compute a similarity value Color model for a tracked object is updated over time

Similarity of Body Regions

Detecting People Detected object not associated with tracked people may be new people in the scene Each head region is analyzed to determine if it is similar to a human head If it is, apply face detection based on common library routines, ignore it otherwise Pruning of regions reduces computational cost and occurrence of false positives

Identifying People

Tracking People Once a person is detected, they must be tracked Assign tracked people to detected objects using Kuhn s Hungarian Method Color similarity and estimated position from the Kalman Filter are used to compute how likely each object is a given person Color similarity is important when people are in close proximity If a person is not detected for some time, they are removed from the list

Estimating Position with the Kalman Filter Even though the filter uses a linear movement model, it works well because the time interval is small Worst error occurs when people turn

Experimental Setup Used 320*240 resolution images Operational frequency is 10Hz on a 3.2 Ghz Pentium 4 laptop Configured to detect people between.5 and 2.5 meters Cameras with 4mm and 6mm focal lengths are used Number of people vary from 2 to 4 People interact in a variety of ways to try and trick the system into confusing them Tests were run with and without the use of color modeling

Results In every test case, the use of color modeling improved tracking success rate The 6mm camera could not track as many people as the 4mm camera Increasing the number of people decreases the success rate of tracking, especially when color is not used The use of color may allow the system to correct itself once the interacting people have separated

Example Scene

Tracking Success #People f(mm) #conflicts #NC (%) #c (%) 2 6 30 86 100 3 6 17 58 82 3 4 52 69 100 4 4 13 50 100

Future Work Use multi-scale techniques when creating the height and occupancy maps Avoid the use of a unique cell size in hopes of improving success without hurting performance Use additional features aside from color and position to distinguish objects, such as face identification Test the system on a mobile platform such as an autonomous robot