Lecture 7: Introduction to Multimedia Content Description. Reji Mathew & Jian Zhang NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2009

Similar documents
MPEG-7. Multimedia Content Description Standard

Extraction, Description and Application of Multimedia Using MPEG-7

Lecture 3: Multimedia Metadata Standards. Prof. Shih-Fu Chang. EE 6850, Fall Sept. 18, 2002

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 5: Multimedia description schemes

Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards

Management of Multimedia Semantics Using MPEG-7

The MPEG-7 Description Standard 1

Internet Streaming Media. Reji Mathew NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2007

Internet Streaming Media

Internet Streaming Media. Reji Mathew NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2006

Video Search and Retrieval Overview of MPEG-7 Multimedia Content Description Interface

Internet Streaming Media

Lesson 6. MPEG Standards. MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21

Lesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval

M PEG-7,1,2 which became ISO/IEC 15398

USING METADATA TO PROVIDE SCALABLE BROADCAST AND INTERNET CONTENT AND SERVICES

MPEG-7 Audio: Tools for Semantic Audio Description and Processing

Interoperable Content-based Access of Multimedia in Digital Libraries

An Intelligent System for Archiving and Retrieval of Audiovisual Material Based on the MPEG-7 Description Schemes

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

Overview of the MPEG-7 Standard and of Future Challenges for Visual Information Analysis

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 2: Description definition language

Lecture 7: Internet Streaming Media. Reji Mathew NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2007

Lecture 7: Internet Streaming Media

MPEG-7 MULTIMEDIA TECHNIQUE

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

Binju Bentex *1, Shandry K. K 2. PG Student, Department of Computer Science, College Of Engineering, Kidangoor, Kottayam, Kerala, India

Lecture 5: Error Resilience & Scalability

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 1: Systems

8 Description of a Single Multimedia Document

Video Summarization Using MPEG-7 Motion Activity and Audio Descriptors

Scalable Hierarchical Summarization of News Using Fidelity in MPEG-7 Description Scheme

Overview of the MPEG-7 Standard and of Future Challenges for Visual Information Analysis

Lecture 1: Introduction & Image and Video Coding Techniques (I)

3. Technical and administrative metadata standards. Metadata Standards and Applications

Offering Access to Personalized Interactive Video

Multimedia Database Systems. Retrieval by Content

Multimedia Databases. Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig

Using the MPEG-7 Audio-Visual Description Profile for 3D Video Content Description

ISO/IEC Information technology Multimedia content description interface Part 7: Conformance testing

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia application format (MPEG-A) Part 4: Musical slide show application format

Delivery Context in MPEG-21

MPEG-4. Today we'll talk about...

Optimal Video Adaptation and Skimming Using a Utility-Based Framework

Searching Video Collections:Part I

Multimedia Databases. 9 Video Retrieval. 9.1 Hidden Markov Model. 9.1 Hidden Markov Model. 9.1 Evaluation. 9.1 HMM Example 12/18/2009

Video Compression Standards (II) A/Prof. Jian Zhang

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

Compression and File Formats

Overview of MPEG-7. Outline of contents. From MPEG-1 to MPEG-7. Terms. Why is MPEG-7 needed. MPEG Family

Title: Automatic event detection for tennis broadcasting. Author: Javier Enebral González. Director: Francesc Tarrés Ruiz. Date: July 8 th, 2011

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)

Lecture 3 Image and Video (MPEG) Coding

The ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents 1

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia application format (MPEG-A) Part 13: Augmented reality application format

Lecture 10: Tutorial Sakrapee (Paul) Paisitkriangkrai Evan Tan A/Prof. Jian Zhang

Video search requires efficient annotation of video content To some extent this can be done automatically

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 4: Audio

SEARCHING MULTIMEDIA DATA USING MPEG-7 DESCRIPTIONS IN A BROADCAST TERMINAL

How to retrieve multimedia documents described by MPEG-7

Multimedia Information Retrieval

INTERACTIVE CONTENT-BASED VIDEO INDEXING AND BROWSING

Multimedia Information Retrieval The case of video

CHAPTER 8 Multimedia Information Retrieval

Introduzione alle Biblioteche Digitali Audio/Video

Overview of the MPEG-7 Standard

Semantic-Based Surveillance Video Retrieval

Multimedia Systems. Lehrstuhl für Informatik IV RWTH Aachen. Prof. Dr. Otto Spaniol Dr. rer. nat. Dirk Thißen

Peter van Beek, John R. Smith, Touradj Ebrahimi, Teruhiko Suzuki, and Joel Askelof

MPEG-7 Context and Objectives

EE 6882 Statistical Methods for Video Indexing and Analysis

Using MILOS to build a Multimedia Digital Library Application: The PhotoBook experience

Adaptive Multimedia Messaging based on MPEG-7 The M 3 -Box

Region Feature Based Similarity Searching of Semantic Video Objects

Contend Based Multimedia Retrieval

GraphOnto: OWL-Based Ontology Management and Multimedia Annotation in the DS-MIRF Framework

Lecture 12: Video Representation, Summarisation, and Query

Lecture 5: Video Compression Standards (Part2) Tutorial 3 : Introduction to Histogram

The BilVideo video database management system

AUTOMATIC VIDEO INDEXING

Lecture 6: Internet Streaming Media

Using Multimedia Metadata

Differential Compression and Optimal Caching Methods for Content-Based Image Search Systems

Multimedia Modeling Using MPEG-7 for Authoring Multimedia Integration

Tips on DVD Authoring and DVD Duplication M A X E L L P R O F E S S I O N A L M E D I A

0 MPEG Systems Technologies- 27/10/2007. MPEG Systems and 3DGC Technologies Olivier Avaro Systems Chairman

This document is a preview generated by EVS

A MPEG-4/7 based Internet Video and Still Image Browsing System

Search Framework for a Large Digital Records Archive DLF SPRING 2007 April 23-25, 25, 2007 Dyung Le & Quyen Nguyen ERA Systems Engineering National Ar

Material Exchange Format (MXF) Mapping Type D-10 Essence Data to the MXF Generic Container

Vannotea Real-time collaborative indexing, annotation, discussion tools for Film/video, images, 3D objects

ISO/IEC INTERNATIONAL STANDARD. Information technology JPEG 2000 image coding system Part 3: Motion JPEG 2000

MATRIX BASED INDEXING TECHNIQUE FOR VIDEO DATA

PROPOSED SMPTE STANDARD for Television Material Exchange Format (MXF) Operational pattern 1A (Single Item, Single Package)

About MPEG Compression. More About Long-GOP Video

Efficient Image Retrieval Using Indexing Technique

Shikha Sharma RCET,Bhilai 1

MPEG-21: The 21st Century Multimedia Framework

A survey of technologies and algorithms for parsing and indexing multimedia databases. Augustine Kureva Damba

Transcription:

Lecture 7: Introduction to Multimedia Content Description Reji Mathew & Jian Zhang NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2009

Outline Why do we need to describe multimedia content? Low level descriptors High level descriptors Why standardize description of multimedia? Application areas International Standard : MPEG-7 Overview Multimedia Descriptions Schemes (MDS) Visual and Audio descriptors examples System Queries of video, image database COMP9519 Multimedia Systems Lecture 8 Slide 2 R. Mathew & J. Zhang

Describing multimedia content Explosion in the availability of digital media content Individuals now creators and producers of content Digital cameras, increased storage capability, internet Large collections of media items Images, video, animation, audio recordings,.. Problem : How to search and discover multimedia contents? How to index video and audio sequences? How to easily browse contents? COMP9519 Multimedia Systems Lecture 8 Slide 3 R. Mathew & J. Zhang

Describing multimedia content Searching and Discovering content Text annotation Text-based Search Engine Find : red car Image storage < white car > < red car > < blue car > < yellow car > COMP9519 Multimedia Systems Lecture 8 Slide 4 R. Mathew & J. Zhang

Describing multimedia content Text based annotation is not always suitable Requires manual description to label content Not suitable for large collections of content Subjective, description may vary from person to person Desirable to have objective features to describe multimedia contents Objective features can be automatically generated Examples colour histogram, level of motion in video Framework still required for textual descriptions High level or semantic descriptions and relationships Example photo of two people shaking hands COMP9519 Multimedia Systems Lecture 8 Slide 5 R. Mathew & J. Zhang

Describing multimedia content Text based annotation is not always suitable Desirable to have objective features to describe multimedia content Framework still required for textual descriptions A need exists for an architecture That can integrate low-level and high-level descriptors Able to describe content from many application domains Rich set of descriptions MPEG-7 : multimedia content description interface COMP9519 Multimedia Systems Lecture 8 Slide 6 R. Mathew & J. Zhang

Outline Why do we need to describe multimedia content? Low level descriptors High level descriptors Why standardize description of multimedia? Application areas International Standard : MPEG-7 Overview Multimedia Descriptions Schemes (MDS) Visual and Audio descriptors examples System Queries of video, image database COMP9519 Multimedia Systems Lecture 8 Slide 7 R. Mathew & J. Zhang

Content Description Standard MPEG-7 : multimedia content description interface An international standard for descriptions and description systems Goal of MPEG-7 Standard Allow interoperable searching, indexing, filtering and access of multimedia content Enable interoperability among devices that deal with multimedia content description Why standardize? To enable interoperability Examples : Search across different repositories Content exchange between different databases COMP9519 Multimedia Systems Lecture 8 Slide 8 R. Mathew & J. Zhang

MPEG-7 Introduction The MPEG-7 descriptions of content that may include: Information describing creation & production process of the content director, title, short feature movie Information related to the usage of the content copyright pointers, usage history, broadcast schedule Information of the storage features of the content storage format, encoding Structural information on spatial, temporal or spatio-temporal components of the content scene cuts for video, segmented regions for image COMP9519 Multimedia Systems Lecture 8 Slide 9 R. Mathew & J. Zhang

MPEG-7 Introduction The MPEG-7 descriptions of content that may include: (continued) Information about low level features in the content colors, textures, sound timbres, melody description Conceptual information of the reality captured by the content objects and events, interactions among objects Information about how to browse the content in an efficient way summaries, variations Information about collections of objects. Information about the interaction of the user with the content user preferences, usage history COMP9519 Multimedia Systems Lecture 8 Slide 10 R. Mathew & J. Zhang

MPEG-7 Introduction MPEG-7 enables description of content from several viewpoints director, title copyright pointers storage format, encoding info on scene cuts. Info on spatial regions low level features (colors) objects and events. interactions among objects Info to browse content MPEG-7 Description Document Media Content (eg mpeg-4 audio-visual file) MPEG-7 descriptions do not depend on the ways the described content is coded or stored COMP9519 Multimedia Systems Lecture 8 Slide 11 R. Mathew & J. Zhang

Application Areas Digital libraries Searching through bio-medical imaging catalogues Play a few notes on a keyboard and retrieve similar music segments from musical repository Journalism Search radio archives based on name of a politician Home Entertainment Search digital photo collection based on an example image Search based on an example colour or sketch Surveillance Store detected events for searching / indexing Example : accompany surveillance video with metadata of locations and time of detected motion regions COMP9519 Multimedia Systems Lecture 8 Slide 12 R. Mathew & J. Zhang

MPEG-7 : Normative Elements Four types of normative elements Descriptors (D): describe individual features of multimedia content Describe low-level features : colour, motion, audio energy Describe high-level features of semantic object Description Schemes (DS) : descriptions by integrating together multiple descriptors and description schemes Combining D and DS within more complex structures Defining relationships between D and DS Description Definition Language (DDL) : used to define D and DS, an extension of the XML Schema language. System Tools : binary coded representation for efficient storage and transmission,. COMP9519 Multimedia Systems Lecture 8 Slide 13 R. Mathew & J. Zhang

MPEG-7 : Normative Elements Four types of normative elements (continued) Descriptors (D) Description Schemes (DS) Description Tools Description Definition Language (DDL) Based on XML Schema Language Consists of XML Schema Structural Components XML Schema Data Types MPEG-7 Specific Extensions System Tools COMP9519 Multimedia Systems Lecture 8 Slide 14 R. Mathew & J. Zhang

MPEG-7 : Normative Elements [6] MPEG-7 allows to create descriptions, which is a set of instantiated Description Schemes and their corresponding Descriptors and to deploy the descriptions using System tools. COMP9519 Multimedia Systems Lecture 8 Slide 15 R. Mathew & J. Zhang [6]

MPEG-7 : Scope [1] Extraction of features MPEG-7 allows max flexibility Only the description format, the syntax and semantics, is standardized Consumption of descriptions Not specified by MPEG-7 Max flexibility for application e.g. search engine, filtering COMP9519 Multimedia Systems Lecture 8 Slide 16 R. Mathew & J. Zhang

MPEG-7 : Example [6] <Mpeg7> <Description xsi:type="semanticdescriptiontype"> <Semantics> <Label> <Name> Car </Name> </Label> <Definition> <FreeTextAnnotation> Four wheel motorized vehicle </FreeTextAnnotation> </Definition> <MediaOccurrence> <MediaLocator> <MediaUri> image.jpg </MediaUri> </MediaLocator> </MediaOccurrence> </Semantics> </Description> </Mpeg7> COMP9519 Multimedia Systems Lecture 8 Slide 17 R. Mathew & J. Zhang

MPEG-7 : Example MPEG-7 description of the event of handshake between two people: See next slide, example taken from [6] [6] COMP9519 Multimedia Systems Lecture 8 Slide 18 R. Mathew & J. Zhang

<Mpeg7> [6] <Description xsi:type="semanticdescriptiontype"> <Semantics> <Label> <Name> Shake hands </Name> </Label> <SemanticBase xsi:type="agentobjecttype" id="a"> <Label href="urn:example:acs"> <Name> Person A </Name> </Label> </SemanticBase> <SemanticBase xsi:type="agentobjecttype" id="b"> <Label href="urn:example:acs"> <Name> Person B </Name> </Label> </SemanticBase> <SemanticBase xsi:type="eventtype"> <Label><Name> Handshake </Name></Label> <Definition> <FreeTextAnnotation> Clasping of right hands by two people </FreeTextAnnotation> </Definition> <Relation type="urn:mpeg:mpeg7:cs:semanticrelationcs:2001:agent" target="#a"/> <Relation type="urn:mpeg:mpeg7:cs:semanticrelationcs:2001:accompanier target="#b"/> </SemanticBase> </Semantics> </Description> COMP9519 Multimedia Systems Lecture 8 Slide 19 R. Mathew & J. Zhang </Mpeg7>

MPEG-7 Parts Systems : the tools needed to prepare MPEG-7 descriptions for efficient transport and storage and the terminal architecture. Description Definition Language : the language for defining the syntax of the MPEG-7 Description Tools and for defining new Description Schemes. Visual : the Description Tools dealing with (only) Visual descriptions. Audio : the Description Tools dealing with (only) Audio descriptions. Multimedia Description Schemes : the Description Tools dealing with generic features and multimedia descriptions. COMP9519 Multimedia Systems Lecture 8 Slide 20 R. Mathew & J. Zhang

MPEG-7 Parts (continued) Systems Description Definition Language Visual Audio Multimedia Description Schemes Reference Software : a software implementation of relevant parts of the MPEG-7 Standard with normative status. Conformance Testing : guidelines and procedures for testing conformance of MPEG-7 implementations Extraction and use of descriptions informative material about the extraction and use of some of the Description Tools. COMP9519 Multimedia Systems Lecture 8 Slide 21 R. Mathew & J. Zhang

MPEG-7 : MDS Multimedia Description Schemes (MDS) Description Tools dealing with generic features and multimedia descriptions Metadata structures for describing and annotating multimedia content NOT specific to image, video or audio but general to multimedia content. MDS is organized into the following areas Basic Elements Content Description Content Management Content Organization Navigation and Access User Interaction COMP9519 Multimedia Systems Lecture 8 Slide 22 R. Mathew & J. Zhang

MPEG-7 : MDS [1] Content organization Collections Models User interaction Media Creation & Production Content management Content description Usage Navigation & Access Summaries Views User Preferences User History Structural aspects Semantic aspects Variations Basic elements Schema Tools Basic datatypes Links & media localization Basic Tools [1] COMP9519 Multimedia Systems Lecture 8 Slide 23 R. Mathew & J. Zhang

MPEG-7 : MDS Basic Elements Essentials of multimedia content description Used repeatedly in descriptions of multimedia content Used by other parts of MPEG-7 (Visual and Audio) Examples Schema Tools: XML-like Language Basic data types : for describing matrices Linking and localization tools : link MPEG-7 descriptions to media Basic Tools : graph tool to represent relation : text annotation : description schemes for describing people & places More Info : www.chiariglione.org/mpeg/standards/mpeg-7/mpeg-7.htm Introduction to MPEG-7, Multimedia Content Description Interface, John Wiley & Sons, 2002 COMP9519 Multimedia Systems Lecture 8 Slide 24 R. Mathew & J. Zhang

MPEG-7 : MDS Description Schemes for Content Management Creation Info : Title, creators, creation location & dates, genre category, age classification,... Usage Info : Usage rights,... Links to rights holders & rights management Media description : compression, coding and storage format of multimedia content COMP9519 Multimedia Systems Lecture 8 Slide 25 R. Mathew & J. Zhang

MPEG-7 : MDS Content Management <CreationInformation> <Creation> <Creator> <Role><Name xml:lang="en">photographer</name></role> <Agent xsi:type= PersonType > <Name> <GivenName>Seungyup</GivenName> </Name> </Agent> </Creator> <CreationCoordinates> <Location> <Name xml:lang="en">columbia University</Name> <Region>us</Region> </Location> <Date> <TimePoint>1998-09-19</TimePoint> </Date> </CreationCoordinates> </Creation> </CreationInformation> COMP9519 Multimedia Systems Lecture 8 Slide 26 R. Mathew & J. Zhang XML example of content management descriptions

MPEG-7 : MDS Content Management <MediaFormat> <Content href=" urn:mpeg:mpeg7:cs:contentcs:2001:1"> <Name xml:lang="en">image</name> </Content> <FileFormat href=" urn:mpeg:mpeg7:cs:fileformatcs:2001:1"> <Name xml:lang="en">jpeg</name> </FileFormat> <VisualCoding> <Format colordomain="color" href=" urn:mpeg:mpeg7:cs:visualcodingformatcs:2001:1"> JPEG </Format> <Frame height="480" width="704"/> </VisualCoding> </MediaFormat> COMP9519 Multimedia Systems Lecture 8 Slide 27 R. Mathew & J. Zhang XML example of content management descriptions

MPEG-7 : MDS Content Description Description Schemes for Content Description content description tools describe the structure and semantics of multimedia data Structure : segments will explore this first Describe :Objects in image, video shot, audio segment Semantics : describing semantic entities in the narrative world Describe :People, Actions, Concepts, Relation between people and actions, actions and concepts. COMP9519 Multimedia Systems Lecture 8 Slide 28 R. Mathew & J. Zhang

MPEG-7 : MDS Content Description Description Schemes for Content Description Describe structure of content Describes content by using the notion of Segments Image regions, video frames, audio segments Describe segments : using low level descriptors, text annotation, Example A single image decomposed into a set of segments (or regions) Each image region can then be further described using other tools A single video / audio clip can be decomposed into a set temporal segments E.g. Segment a video clip into video shots COMP9519 Multimedia Systems Lecture 8 Slide 29 R. Mathew & J. Zhang

MPEG-7 : MDS Content Description [2] Describing Structure using StillRegion segments (spatial portions) Decompose the image (SR1) into two segments corresponding to the two people in the image (SR2 and SR3). Further describe segments using colour feature and text annotation Describe spatial relation between SR2 and SR3 MPEG-7 structural relations include left (spatial), precedes (temporal),. COMP9519 Multimedia Systems Lecture 8 Slide 30 R. Mathew & J. Zhang

MPEG-7 : MDS Content Description Similarly, temporal portions of video can constitute segments Decompose one video clip into segments, with or without overlap. Each segment can then be described further (e.g. using level of motion in video and text annotations). [1] Suited to video shot boundary detection and indexing COMP9519 Multimedia Systems Lecture 8 Slide 31 R. Mathew & J. Zhang

[3] MPEG-7 : MDS Content Description [3] Spatio-temporal segments or moving regions Decompose video segment into various moving regions (spatiotemporal segments). Further descriptions of moving regions possible Structural relation tools to describe more general segment structures Example segment relationship graph, see next slide COMP9519 Multimedia Systems Lecture 8 Slide 32 R. Mathew & J. Zhang

MPEG-7 : MDS Content Description [3] [3] Precedes Structural relationship for video segments and moving regions Standardized structural relations As well as non-normative relations COMP9519 Multimedia Systems Lecture 8 Slide 33 R. Mathew & J. Zhang

MPEG-7 : MDS Content Description Description Schemes for Content Description content description tools describe the structure and semantics of multimedia data Structure : segments Semantics : describing semantic entities in the narrative world Examples include : agent objects (eg person or a group of people) events : perceivable event that takes place in time and space in the narrative world semantic relation : describe general relations between entities hasagentof - initiates the action of an event hasaccompanierof - object that is a join agent in an event COMP9519 Multimedia Systems Lecture 8 Slide 34 R. Mathew & J. Zhang

MPEG-7 : MDS Content Description Structure : describing structure of content Semantics : describing semantics and concepts Semantic description scheme Objects (person, car, ) Events (perceivable occurrence) Abstract concepts Relationships Multimedia content can be described by both content structure and semantics Related together by a set of links Example : see next slide COMP9519 Multimedia Systems Lecture 8 Slide 35 R. Mathew & J. Zhang

MPEG-7 : MDS Content Description [2] [2] Structure Description and Semantic Description COMP9519 Multimedia Systems Lecture 8 Slide 36 R. Mathew & J. Zhang

MPEG-7 : MDS Navigation & Access Description schemes for enabling browsing & retrieval Summarization tools : Summarize a long video clip to highlight important segments Allows fast browsing of content Example highlights of a soccer game (just shots at goals) View tools : Different partitions and decompositions of image, video & audio Variations : Describe different variations of content available Example low resolution version, video only version,. COMP9519 Multimedia Systems Lecture 8 Slide 37 R. Mathew & J. Zhang

MPEG-7 : MDS Navigation & Access Description schemes for enabling browsing & retrieval Summarization tools : Summarize a long video clip to highlight important segments Allows fast browsing of content Example highlights of a soccer game (just shots at goals) Example Describe summarizations of video content that allows fast and flexible browsing Applications include summarizing sports events, Key frames or shots of long periods of surveillance video COMP9519 Multimedia Systems Lecture 8 Slide 38 R. Mathew & J. Zhang

MPEG-7 : MDS Navigation & Access [1] MPEG-7 enables that above summary (of audio-video content) to be captured in XML format COMP9519 Multimedia Systems Lecture 8 Slide 39 R. Mathew & J. Zhang

Outline Why do we need to describe multimedia content? Low level descriptors High level descriptors Why standardize description of multimedia? Application areas International Standard : MPEG-7 Overview Multimedia Descriptions Schemes (MDS) Visual and Audio descriptors examples System Queries of video, image database COMP9519 Multimedia Systems Lecture 8 Slide 40 R. Mathew & J. Zhang

MPEG-7 : Visual Descriptors and Description Schemes exclusively for visual information Descriptors : describe low-level features of visual content, such as colour, texture, motion,. Example : colour histogram [4] [4] COMP9519 Multimedia Systems Lecture 8 Slide 41 R. Mathew & J. Zhang

MPEG-7 : Audio Descriptors and Description Schemes exclusively for audio information Low Level Descriptors : describe low-level features of audio content, such as instantaneous waveform and power values, power spectrum and spectral features, High Level Descriptors : application-specific tools COMP9519 Multimedia Systems Lecture 8 Slide 42 R. Mathew & J. Zhang

MPEG-7 System : Client Server architecture MPEG-7 Indexing & Searching: Semantics-based (people, places, events, objects, scenes) Content-based (color, texture, motion, melody, timbre) Metadata (title, author, dates) Search Engine Query target sounds like, looks like MPEG-7 Database Search by Event Query Response List of matching content clip1.mp4 clip2.mp4 Request for Content RTSP Media Server Streaming Media RTP/RTCP COMP9519 Multimedia Systems Lecture 8 Slide 43 R. Mathew & J. Zhang

MPEG-7 System : Client Server architecture Compare colour histograms of Target with those in the database Search Engine Query target XML instantiation Color histogram XML instantiations Colour histogram of all stored images MPEG-7 Database Search by Example Query Response List of matching content img1.jpg img2.jpg Jpeg images Request for Content HTTP Media Server JPEG Images HTTP COMP9519 Multimedia Systems Lecture 8 Slide 44 R. Mathew & J. Zhang

MPEG 7 System : Search by Example Colour Histogram Descriptor Used in the demonstration system Example below, showing part of the XML document <VisualDescriptor xsi:type="scalablecolortype" numofcoeff="32" numofbitplanesdiscarded="0"> <Coeff> 62 17-127 47-8 13 22 30-31 -33 3 13-25 -11 13 20 2-13 -1 3-11 -10 1 6 2-1 0 0-9 5 1-4 </Coeff> </VisualDescriptor> COMP9519 Multimedia Systems Lecture 8 Slide 45 R. Mathew & J. Zhang

MPEG-7 : Demonstration MMVC Demonstration Select Image region Calculate Colour Histogram for selected region Generate XML instantiation Submit target to search engine Perform matching between histograms of target and stored content Return list of best matching content (in order) Retrieve images from Image Database COMP9519 Multimedia Systems Lecture 8 Slide 46 R. Mathew & J. Zhang

Outline Why do we need to describe multimedia content? Low level descriptors High level descriptors Why standardize description of multimedia? Application areas International Standard : MPEG-7 Overview Multimedia Descriptions Schemes (MDS) Visual and Audio descriptors examples System Queries of video, image database COMP9519 Multimedia Systems Lecture 8 Slide 47 R. Mathew & J. Zhang

References and Further Reading 1. www.chiariglione.org/mpeg/standards/mpeg-7/mpeg-7.htm 2. Introduction to MPEG-7, Multimedia Content Description Interface, John Wiley & Sons, 2002 3. Lecture 14, MPEG-7, SIMS 202: Information Organization and Retrieval, Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS, www.sims.berkeley.edu/academics/courses/is202/f03/ 4. The MPEG-7 Visual Standard for Content Description An Overview, Thomas Sikora, IEEE Trans. on Circuits and Systems for Video Technology, Vol. 11, No. 6, June 2001 5. MPEG-7: The Generic Multimedia Content Description Standard, Part 1," IEEE MultiMedia, vol. 09, no. 2, pp. 78-87, April-June 2002 6. MPEG-7 Multimedia Content Description Standard, John R. Smith, Pervasive Media Management Group, IBM, January 8, 2003 COMP9519 Multimedia Systems Lecture 8 Slide 48 R. Mathew & J. Zhang