A System for Real-time Detection and Tracking of Vehicles from a Single Car-mounted Camera

Similar documents
Robustifying the Flock of Trackers

Automatic Initialization of the TLD Object Tracker: Milestone Update

Tracking. Hao Guan( 管皓 ) School of Computer Science Fudan University

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

On-Road Vehicle and Lane Detection

2 OVERVIEW OF RELATED WORK

Machine learning based automatic extrinsic calibration of an onboard monocular camera for driving assistance applications on smart mobile devices

Tracking-Learning-Detection

Detection and Classification of Vehicles

Multi camera tracking. Jan Baan. Content. VBM Multicamera VBM A270 test site Helmond

Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Aircraft Tracking Based on KLT Feature Tracker and Image Modeling

Definition, Detection, and Evaluation of Meeting Events in Airport Surveillance Videos

Learning a Fast Emulator of a Binary Decision Process

Evaluation of a laser-based reference system for ADAS

Fusion Framework for Moving-Object Classification. Omar Chavez, Trung-Dung Vu (UJF) Trung-Dung Vu (UJF) Olivier Aycard (UJF) Fabio Tango (CRF)

HOG-based Pedestriant Detector Training

Pedestrian Detection Using Correlated Lidar and Image Data EECS442 Final Project Fall 2016

Deformable Part Models

Cs : Computer Vision Final Project Report

Camera-based Vehicle Velocity Estimation using Spatiotemporal Depth and Motion Features

A Street Scene Surveillance System for Moving Object Detection, Tracking and Classification

Tracking driver actions and guiding phone usage for safer driving. Hongyu Li Jan 25, 2018

Object Detection. Part1. Presenter: Dae-Yong

Training models for road scene understanding with automated ground truth Dan Levi

TxDOT Video Analytics System User Manual

Tightly-Coupled LIDAR and Computer Vision Integration for Vehicle Detection

Calibration of a rotating multi-beam Lidar

Multiple-Person Tracking by Detection

On Board 6D Visual Sensors for Intersection Driving Assistance Systems

Face detection, validation and tracking. Océane Esposito, Grazina Laurinaviciute, Alexandre Majetniak

Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance

AUTOMATED GENERATION OF VIRTUAL DRIVING SCENARIOS FROM TEST DRIVE DATA

Human Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg

Min-Hashing and Geometric min-hashing

Classification of objects from Video Data (Group 30)

Laserscanner Based Cooperative Pre-Data-Fusion

P-N Learning: Bootstrapping Binary Classifiers by Structural Constraints

7. Boosting and Bagging Bagging

Real-Time Human Detection using Relational Depth Similarity Features

Robert Collins CSE598G. Intro to Template Matching and the Lucas-Kanade Method

Beyond Bags of Features

Lecture 12 Recognition. Davide Scaramuzza

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Spatio temporal Segmentation using Laserscanner and Video Sequences

IN computer vision develop mathematical techniques in

Detection of a Single Hand Shape in the Foreground of Still Images

Comparison of Local Feature Descriptors

Multi-Object Tracking Based on Tracking-Learning-Detection Framework

CLASSIFICATION FOR ROADSIDE OBJECTS BASED ON SIMULATED LASER SCANNING

CSE 586 Final Programming Project Spring 2011 Due date: Tuesday, May 3

Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement

Large Scale Image Retrieval

Active learning for visual object recognition

W4. Perception & Situation Awareness & Decision making

3D reconstruction how accurate can it be?

A Statistical Approach to Culture Colors Distribution in Video Sensors Angela D Angelo, Jean-Luc Dugelay

Postprint.

CS 231A Computer Vision (Fall 2012) Problem Set 3

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Category-level localization

Motion-based obstacle detection and tracking for car driving assistance

Multi-Perspective Vehicle Detection and Tracking: Challenges, Dataset, and Metrics

Using temporal seeding to constrain the disparity search range in stereo matching

Collaborative Mapping with Streetlevel Images in the Wild. Yubin Kuang Co-founder and Computer Vision Lead

CSc Topics in Computer Graphics 3D Photography

Synscapes A photorealistic syntehtic dataset for street scene parsing Jonas Unger Department of Science and Technology Linköpings Universitet.

People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features

ETISEO, performance evaluation for video surveillance systems

Sensory Augmentation for Increased Awareness of Driving Environment

Motion estimation of unmanned marine vehicles Massimo Caccia

Team Aware Perception System using Stereo Vision and Radar

The Pennsylvania State University. The Graduate School. College of Engineering ONLINE LIVESTREAM CAMERA CALIBRATION FROM CROWD SCENE VIDEOS

An Introduction to Pattern Recognition

Automatic visual recognition for metro surveillance

Object Detection Design challenges

CS 223B Computer Vision Problem Set 3

Advanced Processing Techniques and Classification of Full-waveform Airborne Laser...

Category vs. instance recognition

Pedestrian Detection and Tracking in Images and Videos

Histogram of Oriented Gradients for Human Detection

Computer Vision with MATLAB MATLAB Expo 2012 Steve Kuznicki

Lecture 12 Recognition

Racing Bib Number Recognition

Rigorous Scan Data Adjustment for kinematic LIDAR systems

FACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU

CAP 6412 Advanced Computer Vision

Creating Affordable and Reliable Autonomous Vehicle Systems

Bus Detection and recognition for visually impaired people

Domain Adaptation For Mobile Robot Navigation

Real-Time License Plate Recognition on an Embedded DSP-Platform

Object Category Detection: Sliding Windows

Vehicle Detection Method using Haar-like Feature on Real Time System

Fusing shape and appearance information for object category detection

Chapter 9 Object Tracking an Overview

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Action recognition in videos

Object Localization, Segmentation, Classification, and Pose Estimation in 3D Images using Deep Learning

Designing a Site with Avigilon Self-Learning Video Analytics 1

Transcription:

A System for Real-time Detection and Tracking of Vehicles from a Single Car-mounted Camera Claudio Caraffi, Tomas Vojir, Jiri Trefny, Jan Sochman, Jiri Matas Toyota Motor Europe Center for Machine Perception, University of Prague ITS Conference, Anchorage, 18 Sep 2012

An easy problem Vehicle detection & tracking on motorways: Only vehicle rear, rigid object Limited street furniture, no pedestrian Single camera Variable lighting condition Quantitative evaluation: Extended dataset expensive annotation Laser-scanner for semi-automatic annotation of ~30000 frames. Dataset released.

Algorithm architecture Target initialized by vehicle detector (Wald Boost) KLT features tracking (Flock of Trackers) LrD: Learn specific target and re-detect to correct tracker drift (Randomized Forest) Wald Boost detections are also used to correct known targets Precise bounding box In case of tracker failure, WB & LrD try to recover the target in a Kalman-filter predicted position long-term vehicle identity maintenance

Vehicle Detection & Tracking

Vehicle detection and tracking

WaldBoost: WaldBoost Detector 1 Sliding window algorithm Sequence of weak classifier (AdaBoost) Focus on time to decision After each classifier, verify if we can take a decision not object (Asymmetric)

WaldBoost Detector 2 5000 car images (and a few trucks ) training dataset One billion negative samples Performance on test sequence: 1.74 average classifiers evaluated per window for rear vehicle detection Limits: longer time to decision in crowded traffic scenes. J. Sochman and J. Matas, WaldBoost - Learning for Time Constrained Sequential Detection, in CVPR 2005. J. Trefny and J. Matas, Extended Set of Local Binary Patterns for Rapid Object Detection, in CVWW 10: Proceedings of the Computer Vision Winter Workshop 2010.

Vehicle detection and tracking

Flock of trackers (FOT) 1 Target divided in cells One KLT feature tracker per region KLT feature pre-extraction skipped Tracker evenly placed, naturally converge to a good feature Trackers are evaluated (in/outlier): - Previous: Forward-backward KLT check, cost is twice - Neighbourhood consistency (Nh) - Markov model predictor (Mp) - Norm. cross correlation (NCC) - Combination: 10% KLT time Estimation of translation & scaling

Flock of trackers (FOT) 2 Fast computation (4.8 ms) Comparison with recently published methods on public sequences Effective for rigid objects Failure in case of strong illumination changes T. Vojir and J. Matas, Robustifying the Flock of Trackers, in Computer Vision Winter Workshop, 2011.

Vehicle detection and tracking

Learn & re-detect At target confirmation (after 3 detections) create a specific random forest classifier (RF): Insert as positive samples the current target window and its affine warps Keep updating the RF, collect: Negative samples: RF detector firing outside the tracked object Positive samples: current target window if similarity (NCC) > 0.75 Limit: Similar vehicles in the scene Z. Kalal, J. Matas, and K. Mikolajczyk, P-N Learning: Bootstrapping Binary Classifiers by Structural Constraints, in Conference on Computer Vision and Pattern Recognition (CVPR), 2010.

Scheduling (10 Hz objective) Only one expensive operation per frame: WB detector (every 3 rd frame) Random forest generation (frame after 1 st det.) LrD (other frames): correct position for each target, collect negative samples for one target Note: WB runs every 3 frames, 3 detections to confirm a target delay up to 0.9 s

Results 1

Quantitative evaluation: Dataset and annotation Acquired in December 2011 in cooperation with VisLab, University of Parma, Italy 28 clips for a total of ~27 minutes Two 1024x768 color cameras, IBEO 4-layer laser-scanner, ego-trajectory Idea: Use laser-scanner data to extract automatically vehicles Project vehicles into the image to create an approximated Ground Truth (GT )

Laser-scanner detections Easy scenario: Extract surfaces perpendicular to the driving direction, discard static objects Project into the image using fixed calibration parameters and a flat-ground assumption

Features of GT Consistent ID available 3D position available Object width estimated over full time of observation Car Truck classification based on width

No motorbike Limits of GT Unreliable beyond 60-70 meters ( < 3 laser reflections) Imprecise target side boundaries (quantization and noise) No vehicle length (although possible) No vehicle height (arbitrarily set) Static calibration insufficient for projection (oscillation and non-flat ground)

Oscillation: best pitch matching Vision algorithm GT pitch correction GT fixed calibration

Match GT Algorithm results Given the mentioned limitations, we introduce a custom overlap measurement AREA TERM POSITION TERM

Different sensor positions A correct detection can be classified false positive An object not visible in the image can be classified false negative

Different sensor positions The vehicle marked in yellow is not considered in the statistics

Quantitative evaluation Daylight: Very few false positives Delay of detection reduces recall rate Poor results for trucks Statistics beyond 70 meters not significant

Quantitative evaluation Daylight: Sunset:

Results

Conclusions Real-time reliable detection and tracking of cars Next steps: measure & reduce confirmation time focus on trucks & other classes different scenarios Dataset is made public Images available, ground truth + software Oct 12 http://cmp.felk.cvut.cz/data/motorway/ or google: motorway dataset

Thanks! http://cmp.felk.cvut.cz/data/motorway/