Learning Visual Semantics: Models, Massive Computation, and Innovative Applications. Tutorial at CVPR 2014 June 23rd, 1:00pm-5:00pm, Columbus, OH

Similar documents
Class 5: Attributes and Semantic Features

Shifting from Naming to Describing: Semantic Attribute Models. Rogerio Feris, June 2014

Visual Recognition and Search

Additional Remarks on Designing Category-Level Attributes for Discriminative Visual Recognition

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications

Big Data - Some Words BIG DATA 8/31/2017. Introduction

Challenges and Opportunities with Big Data. By: Rohit Ranjan

A global technology leader approaching $42B in sales with 57,000 people, and customers in 160+ countries LENOVO. ALL RIGHTS RESERVED

Visual AI in Milliwatts: A Grand Challenge and a Massive Opportunity

The Connected World and Speech Technologies: It s About Effortless

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Additional Remarks on Designing Category-Level Attributes for Discriminative Visual Recognition

Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance

Class 9 Action Recognition

DL Tutorial. Xudong Cao

ARKive-ERA Project Lessons and Thoughts

Opportunities of Scale

Web-Scale Image Search and Their Applications

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

Wonder of Mobile Social Multimedia. Shih-Fu Chang June 2013, New York

Experiments of Image Retrieval Using Weak Attributes

Computing Yi Fang, PhD

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

IMOTION. Heiko Schuldt, University of Basel, Switzerland

Welcome to CS120 Fall 2012

Version 11

CAS CS 460/660 Introduction to Database Systems. Fall

A Deep Learning Framework for Authorship Classification of Paintings

The Establishment of Large Data Mining Platform Based on Cloud Computing. Wei CAI

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc.

EMC ACADEMIC ALLIANCE

Database Management Systems

Mixtures of Gaussians and Advanced Feature Encoding

EMEA/Africa/Middle East - Tuesday June 25th, :00:00 a.m. - 1:00pm BST / 10:00:00 a.m. - 2:00 p.m.cest /

Next-Generation Distributed Satellite Bus Information Systems

The Four Key Trends Driving the Proliferation of Visual Perception

Three Verticals, One Data Backup Solution: A Revolutionary Unified Approach to Data Protection

An Exploration of Computer Vision Techniques for Bird Species Classification

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

The Analysis of Faces in Brains and Machines

Recognition Tools: Support Vector Machines

Media, Multimedia & Digital Media. Basic Concepts

T chnology chnology Ma turity turity for fo Adaptiv Adaptiv Massively Massiv ely Pa P ra r llel llel Computing F rst rst Wo W rksho p 2009

Trends in Mobile Forensics from Cellebrite

The Academia-Industry-Government Innovation Cycle

Lecture 12 Recognition

Big Data Analytics: What is Big Data? Stony Brook University CSE545, Fall 2016 the inaugural edition

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

Transfer Learning. Style Transfer in Deep Learning

Next-Generation Distributed Satellite Bus Information Systems

Data Structures and Algorithm Analysis (CSC317) Introduction

New research on Key Technologies of unstructured data cloud storage

IBM Power Systems: Open innovation to put data to work Dexter Henderson Vice President IBM Power Systems

Object Recognition. Lecture 11, April 21 st, Lexing Xie. EE4830 Digital Image Processing

Introduction to Data Management. Lecture #1 (Course Trailer )

! References: ! Computer eyesight gets a lot more accurate, NY Times. ! Stanford CS 231n. ! Christopher Olah s blog. ! Take ECS 174!

Solutions for the Crossroads:

Bilinear Models for Fine-Grained Visual Recognition

Signs of Intelligent Life: AI Simplifies IoT

Graph-based Techniques for Searching Large-Scale Noisy Multimedia Data

Introduction to Text Mining. Hongning Wang

COMP 102: Computers and Computing Lecture 1: Introduction!

5G networks use-cases in 4G networks

AI Application and Development in ehealth Field. MIN Dong

SCU SEEDs Workshop Angela Musurlian

CG: Computer Graphics

Renovating your storage infrastructure for Cloud era

Requesting changes to the grades, please write to me in an with descriptions. Move my office hours to the morning. 11am - 12:30pm Thursdays

YOLO9000: Better, Faster, Stronger

: IBM T.J. Watson Research Center Research Staff Member, IBM Research AI

Future X Network. Sanjay Kamat Managing Partner, Bell Labs Consulting Nokia

Distributed Systems Intro and Course Overview

ACM MM Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang

State of the Internet Operating System

CS 378: Autonomous Intelligent Robotics. Instructor: Jivko Sinapov

CS5950 / CS6030 Cloud Computing

The SD-WAN security guide

CS230: Lecture 3 Various Deep Learning Topics

What s the Big Deal with Big Data?

Engaged Learning. Collection of perspectives on digital innovations in online learning

Bigtable. A Distributed Storage System for Structured Data. Presenter: Yunming Zhang Conglong Li. Saturday, September 21, 13

The Future of the Data Center: Software Will Lead The Way by, David Vellante

Introduction to Web Design & Computer Principles

INTERNET OF THINGS CAPACITY BUILDING CHALLENGES OF BIG DATA AND PLANNED SOLUTIONS BY ITU. ICTP Workshop 17 March 2016

Encoder-Decoder Networks for Semantic Segmentation. Sachin Mehta

Rapid Modeling of Digital City Based on Sketchup

Copyright 2012 EMC Corporation. All rights reserved. Obrigado

Behind Today s Trends The Technologies Driving Change. Paul Smith Director Consulting Services

Welcome to Your Data

GOOGLE PHOTOS. Ron Brown.

24 hours of Photo Sharing. installation by Erik Kessels

CS 523: Multimedia Systems

Technology Trend : Green IT and Virtualizaiton. Education and Research Sun Microsystems(Thailand)

CS 4510/9010 Applied Machine Learning. Deep Learning. Paula Matuszek Fall copyright Paula Matuszek 2016

volley: automated data placement for geo-distributed cloud services

RECORD. Published : License : None

The FIRST Project and the Future Internet National Research Programme

Why data science is the new frontier in software development

Deep Learning for Computer Vision II

Computing as a Service

Transcription:

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Tutorial at CVPR 2014 June 23rd, 1:00pm-5:00pm, Columbus, OH

Introduction Instructors: Shih-Fu Chang John Smith Rogerio Feris Liangliang Cao Columbia University IBM T. J. Watson Research Center

Introduction 1970s Early Days of Computer Vision

Introduction First Digital Camera (1975) 0.01 Megapixels 23 seconds to record a photo to cassette

Introduction Datasets with 5 or 10 images Large-Scale Experiment: 800 photos (Takeo Kanade Thesis, 1973) [D. Marr, 1976]

Introduction Today Visual Data is Exploding!

Introduction Announcement of Pope Benedict in 2005

Introduction Announcement of Pope Francis in 2013 Rapid proliferation of mobile devices equipped with cameras

Introduction Era of Big Visual Data Billions of cell phones equipped with cameras ~500 billion consumer photos are taken each year world-wide ~500 million photos taken per year in NYC alone Hundreds of millions of Facebook photo uploads per day

Introduction

Introduction Exciting Time for Computer Vision + DATA + Computational Processing + Advances in Computer Vision and Machine Learning Major opportunities for systems that automatically extract visual semantics from images and videos

Examples of Practical Application Areas

Examples of Application Areas Smart Surveillance Show me all images of people matching the suspect description from time X to time Y from all cameras in area Z. Visual Semantics: Fine-grained person attributes Slide credit: Rogerio Feris

Examples of Application Areas Medical Imaging MRI Brain Axial MRI Knee PET Color DX Appendage Visual Semantics: Medical Image Modality and Anatomy Slide credit: John Smith DX Torso DX Cervical Spine

Examples of Application Areas Astronomy [Cui et al, WACV 2015] http://www.galaxyzoo.org/ Visual Semantics: morphological galaxy attributes (important to understand star formation, gas fraction, galaxy evolution, ) Huge dataset of galaxy images makes manual labeling infeasible Slide credit: Rogerio Feris

Examples of Application Areas Nature / Ecology Snapshot Serengeti http://www.youtube.com/watch?v=aul03ivs8by http://www.snapshotserengeti.org/ Visual Semantics: species of animals from camera traps Understanding how competing species coexist is a fundamental theme in ecology, with important implications for biodiversity, and the sustainability of life on Earth Slide credit: Rogerio Feris

Examples of Application Areas Nature / Ecology Plant Species Bird Species [Kumar et al, ECCV 2012] Used by botanists, educators, http://www.vision.caltech.edu/visipedia/ Understanding of migration, conservation, Slide credit: Rogerio Feris

Examples of Application Areas Social Media: Visual Sentiment Analysis [Borth et al, ACM MM 2013] Colorful clouds Misty night Colorful butterfly Crying Baby Slide credit: Rogerio Feris

Many more applications Google Goggles [Kovashka et al, CVPR 2012] Amazon Slide credit: Rogerio Feris

Tutorial Overview

Tutorial Overview Objectives: Cover state-of-the-art techniques for learning visual semantics from images and videos Focus on intuitive, semantic visual representations Provide tools for scalable learning of semantic models Cover innovative and practical applications Provide pointers to related source code and datasets

Tutorial Overview Part I: Feature Extraction, Coding, and Pooling (Liangliang) Brief Introduction to local feature descriptors, coding,and pooling Focus on modern representations such as Fisher Vector and Sparse Coding

Tutorial Overview Part I: Feature Extraction, Coding, and Pooling (Liangliang) Connections to feature learning approaches (e.g., deep convolutional neural networks) Picture credit: Kai Yu

Tutorial Overview Part II: Large-Scale Semantic Modeling (John Smith) Semantic Concept Modeling: Historic Overview Picture credit: John Smith

Tutorial Overview Part II: Large-Scale Semantic Modeling (John Smith) How to deal with class imbalance? How to scale to millions of semantic unit models? Picture credit: John Smith

Tutorial Overview Part III: Shifting from naming to describing: semantic attribute models (Rogerio Feris) Scalable learning with Attribute Models / Zero-Shot Learning [Lampert et al, CVPR 2009]

Tutorial Overview Part III: Shifting from naming to describing: semantic attribute models (Rogerio Feris) Attribute-based Search Slide credit: Rogerio Feris

Tutorial Overview Part IV: High-level Semantic Modeling: Visual Sentiment Analysis (Shih-Fu Chang) Semantic models for encoding emotions in social media