Visual Perception for Robots


Sven Behnke
Computer Science Institute VI, Autonomous Intelligent Systems

Our Cognitive Robots
- Complete systems for example scenarios
- Equipped with rich sensors
- Flying robot, soccer robot, communication robot, service robot, exploration robot

Our Humanoid Soccer Robots
- Dynaped, Copedo, NimbRo-OP
- Size: 95-114 cm, weight: 6.6-8 kg
- 13-20 articulated joints
- PC, wide-angle camera(s), IMU

Visual Perception
- YUV color segmentation
- Recognition of field, ball, goals, obstacles, field lines, corners
- Egocentric modeling
- Probabilistic localization
[Schulz & Behnke, Advanced Robotics 2012]
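As a rough illustration of YUV color segmentation, the sketch below converts an RGB pixel to YUV with the standard BT.601 weights and assigns it to a color class by box thresholds. The class names and all threshold values in `CLASS_RANGES` are assumptions made up for this example; the slides do not state how the robots' color classes are defined or calibrated.

```python
# Hypothetical class boxes in YUV space (assumed values, not from the slides).
# Real systems typically calibrate such ranges for the lighting at each venue.
CLASS_RANGES = {
    "field": {"y": (30, 200), "u": (-80, -20), "v": (-110, -40)},
    "ball": {"y": (80, 220), "u": (-110, -40), "v": (50, 130)},
    "line": {"y": (200, 256), "u": (-15, 15), "v": (-15, 15)},
}


def rgb_to_yuv(r, g, b):
    """Convert 8-bit RGB to YUV using the BT.601 luma weights."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    u = 0.492 * (b - y)
    v = 0.877 * (r - y)
    return y, u, v


def classify_pixel(r, g, b):
    """Return the first color class whose YUV box contains the pixel."""
    y, u, v = rgb_to_yuv(r, g, b)
    for label, box in CLASS_RANGES.items():
        if (box["y"][0] <= y <= box["y"][1]
                and box["u"][0] <= u <= box["u"][1]
                and box["v"][0] <= v <= box["v"][1]):
            return label
    return "unknown"
```

Separating brightness (Y) from chroma (U, V) is the usual motivation for this color space: the chroma bounds can stay tight while the Y bounds absorb lighting changes.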

Features for Localization
- Goals
- Field lines
- Corners of lines
- Side poles
Figure panels: egocentric view, localization

Observation Likelihood
Figure panels: lines, side poles, line corners, all features
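One common way to obtain an observation likelihood for a pose hypothesis is to compare the expected and the measured distance and bearing of each detected feature and to multiply the per-feature terms. The sketch below assumes point landmarks with known field positions, independent Gaussian noise, and hypothetical noise parameters `sigma_d` and `sigma_a`; it is not taken from the cited paper.

```python
import math


def angle_diff(a, b):
    """Smallest signed difference a - b, wrapped to [-pi, pi]."""
    return math.atan2(math.sin(a - b), math.cos(a - b))


def gaussian(error, sigma):
    """Unnormalized Gaussian likelihood of an error."""
    return math.exp(-0.5 * (error / sigma) ** 2)


def expected_observation(pose, landmark):
    """Distance and egocentric bearing of a landmark seen from pose (x, y, theta)."""
    x, y, theta = pose
    dx, dy = landmark[0] - x, landmark[1] - y
    return math.hypot(dx, dy), angle_diff(math.atan2(dy, dx), theta)


def observation_likelihood(pose, observations, sigma_d=0.5, sigma_a=0.2):
    """Product of per-feature likelihoods; observations = [(landmark, (dist, bearing))]."""
    likelihood = 1.0
    for landmark, (d_obs, a_obs) in observations:
        d_exp, a_exp = expected_observation(pose, landmark)
        likelihood *= gaussian(d_obs - d_exp, sigma_d)
        likelihood *= gaussian(angle_diff(a_obs, a_exp), sigma_a)
    return likelihood
```

Multiplying the terms of several feature types narrows the set of plausible poses compared with any single feature type, which is consistent with the "all features" panel.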

RoboCup 2013 Final
- NimbRo 4:0 CIT Brains
- Won for the fifth time in a row

Intuitive Multimodal Communication
- Not keyboard, mouse, and screen, but eye contact, facing with head and trunk, facial expressions, gestures, speech, and body language
- Transfer established human communication techniques to the man-machine interface
- Application: museum guide

Perception of Communication Partners
- Detection and tracking of faces
- Head pose estimation
[Bennewitz, Behnke: Humanoids 05]
[Vatahska, Bennewitz, Behnke: Humanoids 07]
- Gesture recognition
[Axenbeck, Bennewitz, Behnke, Burgard: Humanoids 08]
- Speech recognition (Loquendo)

Robotinho in Deutsches Museum Bonn
[Nieuwenhuisen & Behnke, Journal of Social Robotics (SORO), 2013]

Our Service Robots
- Dynamaid, Cosero
- Size: 100-180 cm, weight: 30-35 kg
- 36 articulated joints
- PC, laser scanner, Kinect, microphone, ...

2D Mapping of the Environment

3D Mapping with Surfels

3D Mapping and Localization
- Registration of 3D laser scans
- Representation of point distributions in voxels
- Drivability assessment through region growing
- Robust localization using 2D laser scans
[Kläß, Stückler, Behnke: Robotik 2012]
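Representing point distributions in voxels can be sketched as follows: points are binned into a regular grid, and each occupied voxel keeps the point count, sample mean, and sample covariance of its points. This is a minimal single-resolution sketch; the function names and the fixed voxel size are assumptions, and the maps on the slides add further structure such as multiple resolutions.

```python
import math
from collections import defaultdict


def voxel_key(point, size):
    """Integer grid index of the voxel that contains the point."""
    return tuple(int(math.floor(c / size)) for c in point)


def voxel_statistics(points, size):
    """Map each occupied voxel to (count, mean, 3x3 covariance) of its points."""
    buckets = defaultdict(list)
    for p in points:
        buckets[voxel_key(p, size)].append(p)

    stats = {}
    for key, pts in buckets.items():
        n = len(pts)
        mean = tuple(sum(p[i] for p in pts) / n for i in range(3))
        cov = [
            [sum((p[i] - mean[i]) * (p[j] - mean[j]) for p in pts) / n
             for j in range(3)]
            for i in range(3)
        ]
        stats[key] = (n, mean, cov)
    return stats
```

Storing only these statistics keeps memory bounded per voxel regardless of how many points fall into it.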

3D Mapping by RGB-D SLAM
- Modeling of shape and color distributions in voxels
- Local multiresolution
- Efficient registration of views on CPU
- Global optimization
[Stückler, Behnke: Journal of Visual Communication and Image Representation 2013]
Figure labels: 2.5 cm, 5 cm
- Multi-camera SLAM
[Stoucken, diploma thesis 2013]

Learning and Tracking Object Models
- Modeling of objects by RGB-D SLAM
- Real-time registration with current RGB-D image

Transfer of Object Knowledge
- Non-rigid registration of known models and actual object
- Transfer of grasp and end-effector
[Stückler, Behnke: submitted to ICRA]

Analysis of Table-top Scenes and Grasp Planning
- Detection of clusters above horizontal plane
- Two grasps (top, side)
- Flexible grasping of many unknown objects
[Stückler, Steffens, Holz, Behnke, Robotics and Autonomous Systems 2012]
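Detecting clusters above a horizontal plane can be sketched in two steps: keep the points that lie higher than the support plane, then group them by Euclidean proximity. The sketch assumes the plane height `plane_z` is already known (for example from a prior plane fit, which is not shown), and the parameters `min_height`, `max_gap`, and `min_points` are hypothetical. The quadratic neighbor search is only suitable for small point sets.

```python
import math


def clusters_above_plane(points, plane_z, min_height=0.01,
                         max_gap=0.05, min_points=3):
    """Group points lying above z = plane_z into proximity-connected clusters."""
    above = [p for p in points if p[2] - plane_z > min_height]
    unvisited = set(range(len(above)))
    clusters = []

    while unvisited:
        seed = unvisited.pop()
        cluster, frontier = [seed], [seed]
        # Grow the cluster by repeatedly adding unvisited points within max_gap.
        while frontier:
            current = frontier.pop()
            near = [i for i in unvisited
                    if math.dist(above[current], above[i]) <= max_gap]
            for i in near:
                unvisited.remove(i)
            cluster.extend(near)
            frontier.extend(near)
        if len(cluster) >= min_points:
            clusters.append([above[i] for i in cluster])
    return clusters
```

Each returned cluster is an object candidate for which a grasp planner could then evaluate top and side grasps.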

Tool Use: Bottle Opener
- Perception of tool tip
- Extension of arm kinematics
- Perception of crown cap

Tool Use: Pair of Tongs
- Perception of tool tip
- Extension of arm kinematics
- Estimation of sausage pose
- Our team NimbRo has won the last three international RoboCup@Home competitions

Perception of Persons
- Detection in laser scans and tracking
- Visual verification and identification (VeriLook)
- Systematic exploration
- Speech recognition and synthesis (Loquendo)
[Stückler & Behnke, RoboCup 2010]
- Gesture recognition
- Natural gaze control
[Droeschel et al., ICRA 2011]
Figure labels: 30 cm, 1 m

Visual Object Recognition
- Object detection with laser or Kinect
- Recognition based on color and texture features (SURF)
- Object tracking

Semantic Mapping
- Pixel-wise classification of RGB-D images by random forests
- Inner nodes compare color / depth of regions
- Size normalization
- Training and recall on GPU
- 3D fusion through RGB-D SLAM
- Evaluation on own data set and NYU Depth v2
[Stückler, Biresev, Behnke: IROS 2012]

Accuracy in %
Method                  Average over classes   Average over pixels
Silberman et al. 2012   59.6                   58.6
Couprie et al. 2013     63.5                   64.5
Random forest           65.9                   68.6
3D fusion               67.0                   70.9

Figure panels: ground truth, segmentation
[Stückler et al., accepted with minor revision for Journal of Real-Time Image Processing]
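The two accuracy columns can differ noticeably when classes are unbalanced. The sketch below computes both figures from a confusion matrix under the common definitions (pixel accuracy as the fraction of all pixels labeled correctly, class-average accuracy as the mean of the per-class recalls). The slides do not spell out the definitions, so treating them this way is an assumption.

```python
def pixel_accuracy(confusion):
    """Fraction of all pixels labeled correctly.

    confusion[i][j] counts pixels of true class i predicted as class j.
    """
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return correct / total


def class_average_accuracy(confusion):
    """Mean per-class recall, ignoring classes with no ground-truth pixels."""
    recalls = [
        row[i] / sum(row)
        for i, row in enumerate(confusion)
        if sum(row) > 0
    ]
    return sum(recalls) / len(recalls)
```

For example, with `[[90, 10], [5, 5]]` the frequent class dominates the pixel accuracy (95 of 110 pixels correct), while the class average is pulled down by the rare class, which is only half correct.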

Learning Depth-Sensitive CRFs
- SLIC+depth superpixels
- Unary features: random forest
- Height feature
- Pairwise features: color contrast, directed angle, depth difference, normal differences
- Results (figure panels): similarity between superpixel normals, random forest, CRF prediction, ground truth
[Müller and Behnke, submitted to ICRA]

Object Class Detection in RGB-D
- Hough forests not only decide the object class but also describe the object center
- RGB-D objects data set
- Color and depth features
- Training with rendered scenes
- Detection of object position and orientation
- Depth helps a lot
Figure panels: scene, class probability, object centers, orientation, detected objects
[Badami, Stückler, Behnke: SPME 2013]
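The voting idea behind Hough forests can be sketched without the forest itself: each image patch casts weighted votes for the object center at its own position plus an offset, and the accumulator cell with the highest score is taken as the detection. In this sketch the offsets and weights are passed in directly as hypothetical inputs; in a real Hough forest they would come from the leaves reached by each patch. The 2D accumulator and the `cell` size are also assumptions.

```python
from collections import Counter


def vote_for_centers(patches, cell=4):
    """Accumulate center votes on a grid.

    patches: list of ((px, py), [(dx, dy, weight), ...]) with integer pixel
    coordinates; every offset casts one vote at (px + dx, py + dy).
    """
    accumulator = Counter()
    for (px, py), votes in patches:
        for dx, dy, weight in votes:
            key = ((px + dx) // cell, (py + dy) // cell)
            accumulator[key] += weight
    return accumulator


def best_center(accumulator, cell=4):
    """Return the center of the strongest accumulator cell and its score."""
    key, score = max(accumulator.items(), key=lambda item: item[1])
    return ((key[0] + 0.5) * cell, (key[1] + 0.5) * cell), score
```

Because patches from different object parts agree only near the true center, consistent votes pile up there while spurious votes stay scattered.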

Bin Picking
- Known objects in transport box
- Matching of graphs of 2D and 3D shape primitives
- Grasp and motion planning
Figure labels: 3D, 2D, offline, online
[Nieuwenhuisen et al.: ICRA 2013]

Articulated Objects: Doors
- Door motion is important
- Detection of changes between maps
- Instantiation of door models
- Estimation of opening angle from laser scan
- Localization more reliable, more precise
- Navigation planning can use door opening state
[Nieuwenhuisen, Stückler, Behnke, ICRA 10]
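Once a door model with a known hinge position exists, the opening angle can be estimated from the laser endpoints that fall on the door leaf. The sketch below takes the direction from the hinge to the centroid of those endpoints and compares it with the closed-door direction. It assumes the endpoints have already been associated with the door and expressed in the map frame; the function and parameter names are hypothetical and the estimator is a simple stand-in, not the method of the cited paper.

```python
import math


def door_opening_angle(hinge, closed_direction, door_points):
    """Signed opening angle in radians, wrapped to [-pi, pi].

    hinge: (x, y) of the hinge; closed_direction: heading of the closed leaf
    in radians; door_points: laser endpoints (x, y) lying on the door leaf.
    """
    n = len(door_points)
    cx = sum(p[0] for p in door_points) / n - hinge[0]
    cy = sum(p[1] for p in door_points) / n - hinge[1]
    # The centroid of points on a line through the hinge lies on that line.
    leaf_direction = math.atan2(cy, cx)
    diff = leaf_direction - closed_direction
    return math.atan2(math.sin(diff), math.cos(diff))
```

The resulting angle is the kind of door state that the localization and navigation planning bullets refer to.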

Adaptive Person Model
- Model: geometric primitives, connected by joints
- Registration through articulated ICP
- Adaptation of primitive parameters to body proportions
[Droeschel, Behnke: ICIRA 2011]

Hierarchical Object Discovery through Motion Segmentation
- Motion is a strong segmentation cue
- Both camera and object motion
- Segment-wise registration of a sequence
- Inference of a segment hierarchy
[Stückler, Behnke: IJCAI 2013]

Autonomous Flight near Obstacles
- Octocopter with many sensors and strong computer
- Multimodal obstacle detection: 3D laser scanner, stereo cameras, ultrasound
- Local obstacle avoidance
[Nieuwenhuisen et al., ECMR 2013]

Exploration in Rough Terrain
- Wheeled robot with 4th-generation Intel Core i7 quad-core PC
- Omnidirectional RGB-D sensor
- 3D laser scanner

3D Mapping and 6D Localization
- Efficient registration of multiresolution surfel maps
- Global optimization
- 6D localization with 2D laser scan using particle filter
[Schadler, Stückler, Behnke: accepted for SSRR 2013]
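A particle filter alternates between moving pose hypotheses, weighting them by how well the laser scan fits the map, and resampling. The sketch below shows only the resampling step, using systematic (low-variance) resampling, a common choice for this step. The particles can be any pose representation, for example 6D poses; the `start` argument is a hypothetical parameter that replaces the usual random draw so that the behavior is reproducible.

```python
def systematic_resample(particles, weights, start):
    """Draw len(particles) samples with probability proportional to weight.

    start: value in [0, 1) that places the first of n evenly spaced pointers;
    normally drawn at random, passed explicitly here for reproducibility.
    """
    n = len(particles)
    step = sum(weights) / n
    position = start * step
    resampled = []
    index = 0
    cumulative = weights[0]
    for _ in range(n):
        # Advance to the particle whose cumulative weight covers the pointer.
        while position > cumulative and index < n - 1:
            index += 1
            cumulative += weights[index]
        resampled.append(particles[index])
        position += step
    return resampled
```

With weights `[1, 1, 7, 1]` and `start=0.5`, the dominant third particle is selected three times and the lightest hypotheses are mostly dropped, which concentrates the particle set on likely poses.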

Conclusion
- Robot operation in complex environments is challenging
- Simple skills realized
- Autonomous control is limited
- Often perception is the problem
- 3D sensors are helpful
- Need for further research
- Possibilities with robots: multimodal sensor fusion, active perception, interactive perception

Thanks for your attention! Questions?