TBM Standards, Specifications, Identifiers, and Data Models etc Identified

Similar documents
USING METADATA TO PROVIDE SCALABLE BROADCAST AND INTERNET CONTENT AND SERVICES

Utilizing PBCore as a Foundation for Archiving and Workflow Management

MPEG-21: The 21st Century Multimedia Framework

Delivery Context in MPEG-21

MPEG-7. Multimedia Content Description Standard

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 5: Multimedia description schemes

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 2: Description definition language

Digitisation Standards

! Provide the secretariat for International ISBN Agency! Standards development and management

ISO/IEC Information technology Multimedia framework (MPEG-21) Part 3: Digital Item Identification

Robin Wilson Director. Digital Identifiers Metadata Services

SUBTITLE EXCHANGE FORMAT (DPP-EBU-TT) Version 2.0

XML BASED DICTIONARIES FOR MXF/AAF APPLICATIONS

Metadata Workshop 3 March 2006 Part 1

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 1: Systems

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia framework (MPEG-21) Part 21: Media Contract Ontology

Increazing interactivity in IPTV using MPEG-21 descriptors

Digital repositories as research infrastructure: a UK perspective

Data Exchange and Conversion Utilities and Tools (DExT)

ISO 3901 INTERNATIONAL STANDARD. Information and documentation International Standard Recording Code (ISRC)

METADATA SYSTEMS ARCHITECTURE

Content structure breakdown, re-use of components and the importance of metadata Carl Mann

BIBLIOGRAPHIC REFERENCE DATA STANDARD

ISAN: the Global ID for AV Content

JPEG 2000 Archive Profiles

An Introduction to PREMIS. Jenn Riley Metadata Librarian IU Digital Library Program

Affixing the ISAN. Possibilities Technical specifications

The Dublin Core Metadata Element Set

ISO/IEC INTERNATIONAL STANDARD

Introduction SCONE and IRIScotland Scaling General retrieval Interoperability between SCONE and IRIScotland Context

ISO/IEC Information technology Multimedia content description interface Part 7: Conformance testing

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

The RAMLET project Use cases

ISO INTERNATIONAL STANDARD. Information and documentation The Dublin Core metadata element set

MPEG-4. Today we'll talk about...

ISO/IEC INTERNATIONAL STANDARD. Information technology Coding of audio-visual objects Part 18: Font compression and streaming

3. Technical and administrative metadata standards. Metadata Standards and Applications

An Intelligent System for Archiving and Retrieval of Audiovisual Material Based on the MPEG-7 Description Schemes

Adopting HTML5 for Television: Next Steps

PROPOSED SMPTE STANDARD for Television Material Exchange Format (MXF) Operational pattern 1A (Single Item, Single Package)

Development of an Ontology-Based Portal for Digital Archive Services

ISO 2146 INTERNATIONAL STANDARD. Information and documentation Registry services for libraries and related organizations

Ontology Servers and Metadata Vocabulary Repositories

Customising Location of Knowledge. Ann Apps and Ross MacIntyre MIMAS, The University of Manchester, UK

RDF and Digital Libraries

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Data is the new Oil (Ann Winblad)

A Standards-Based Registry/Repository Using UK MOD Requirements as a Basis. Version 0.3 (draft) Paul Spencer and others

Wendy Thomas Minnesota Population Center NADDI 2014

Internet Engineering Task Force (IETF) Obsoletes: 7302 September 2016 Category: Informational ISSN:

Introduction

Metadata and Encoding Standards for Digital Initiatives: An Introduction

ISO/IEC TR TECHNICAL REPORT. Information technology Coding of audio-visual objects Part 24: Audio and systems interaction

Software Requirements Specification for the Names project prototype

INTERNATIONAL STANDARD

Comparison and mapping of VRA Core and IMS Meta-data

Core DOI Specification

A service oriented view of the JISC Information Environment

Lesson 6. MPEG Standards. MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21

Digital TV Metadata. VassilisTsetsos

Summary of Bird and Simons Best Practices

Metadata. Week 4 LBSC 671 Creating Information Infrastructures

Reusability and Adaptability of Interactive Resources in Web-Based Educational Systems. 01/06/2003

Metadata Elements Comparison: Vetadata and ANZ-LOM

Million Book Universal Library Project :Manual for Metadata Capture, Digitization, and OCR

Toward Horizon 2020: INSPIRE, PSI and other EU policies on data sharing and standardization

Big Data, exploiter de grands volumes de données

LORE: A Compound Object Authoring and Publishing Tool for Literary Scholars based on the FRBR. Anna Gerber, Jane Hunter

Data encoding protocol using key-length-value

Archivists Toolkit: Description Functional Area

Object-based audio production. Chris Baume EBU-PTS - 27th January 2016

Persistent identifiers, long-term access and the DiVA preservation strategy

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

Material Exchange Format (MXF) Mapping Type D-10 Essence Data to the MXF Generic Container

1. CONCEPTUAL MODEL 1.1 DOMAIN MODEL 1.2 UML DIAGRAM

From Open Data to Data- Intensive Science through CERIF

ISO INTERNATIONAL STANDARD. Document management Engineering document format using PDF Part 1: Use of PDF 1.6 (PDF/E-1)

The MIND Approach. Fabio Crestani University of Strathclyde, Glasgow, UK. Open Archive Forum Workshop Berlin, Germany, March 2003

For many years, the creation and dissemination

Report on the European Resolution Discovery Service (ERDS) Meeting (Feb 17/18, 2010)

Dexterity: Data Exchange Tools and Standards for Social Sciences

Extraction, Description and Application of Multimedia Using MPEG-7

Towards a joint service catalogue for e-infrastructure services

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia application format (MPEG-A) Part 4: Musical slide show application format

ISO/IEC INTERNATIONAL STANDARD

MISB RP May Security Metadata Universal and Local Sets for Digital Motion Imagery. 1. Scope. 2. References

An aggregation system for cultural heritage content

7.3. In t r o d u c t i o n to m e t a d a t a

Consolidation Team INSPIRE Annex I data specifications testing Call for Participation

Material Exchange Format Timecode Implementation

SERIES J: CABLE NETWORKS AND TRANSMISSION OF TELEVISION, SOUND PROGRAMME AND OTHER MULTIMEDIA SIGNALS Digital transmission of television signals

Using Dublin Core to Build a Common Data Architecture

MESH. Multimedia Semantic Syndication for Enhanced News Services. Project Overview

Design & Manage Persistent URIs

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

ICT & Digital Cinema A new start? Roberto Cencioni. DG Information Society and Media. Unit INFSO.E2 - Content & Knowledge

Why CERIF? Keith G Jeffery Scientific Coordinator ERCIM Anne Assserson eurocris. Keith G Jeffery SDSVoc Workshop Amsterdam

The European Commission s science and knowledge service. Joint Research Centre

COALA: CONTENT-ORIENTED AUDIOVISUAL LIBRARY ACCESS

The UK Marine Environmental Data and Information Network MEDIN

Transcription:

TBM Standards, Specifications, Identifiers, and Data Models etc Identified... 1 1. MPEG & METS... 2 2. EBU (European Broadcasting Union)... 3 3. AudioMD and VideoMD... 3 4. Identifiers... 3 5. SMEF... 4 6. Simple Application Profile combining Dublin Core and MPEG-7... 6 7. SMIL (Synchronous Multimedia Integration Language)... 7 8. JISC-SURF StreamTeam Partners... 7 9. JISC ARCHES Project... 8 TBM Standards, Specifications, Identifiers, and Data Models etc Identified Expert input to the project has so far helped to identify the following as relevant to the TBM media domain. This working document provides some background information on the reference material, and attempts to make suggestions & seek further information on what way these identified standards, specification, identifiers, data models may be relevant to the Purpose Statement for the Time-based Media Application Profile project and in what way the references could perhaps be usefully accommodated, or otherwise. The document also supports an attempt to come to some conclusions around related questions posed in the Functional Requirements. We would seek to constrain rather than to add, but not at the expense of rendering the subsequent work non-functional or severely restricting the usefulness of the profile. FRBR and Dublin Core are not described here in their own right as they are generic to the profile development projects, and this material is to determine what TBM specific input might be required and how it may be handled. The purpose behind ongoing investigations is to understand the various motives behind existing TBM metadata and standards approaches, their relationship to a Search & Discovery profile and alignment with the existing FR set, rather than to add to the FR or deviate from the development approach the project has been assigned to investigate. This document will be periodically updated to include responses and further information. Relevant resource references will also be posted to the project wiki. Project comments are noted in [RED] Summary List of Ongoing Investigations: TVAnytime, MIX, IPOD spec (for rss container), Wrapper formats (e.g. MXF, Flash, avi, qti), Vocabularies (supply support for but not recommend solutions), OAIS (metadata roles). TBMAP v1.0 June 2008 1

1. MPEG & METS http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=42114 http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=40392 There is a nice summary tutorial on the five MPEG ISO standards (MPEG1, MPEG2, MPEG4, MPEG7, and MPEG21) by Larry Bouthillier February 18, 2004 at: http://www.streamingmedia.com/ In short, the standards are not sequentially numbered and mp3 is not MPEG-3. MPEG1 became an official standard for encoding audio and video in 1992. It is the simplest of the MPEG standards, and describes a way to encode audio and video data streams, along with a way to decode them. MPEG2 relates to digital television, designed to scale up to broadcast and high-definition quality and operating requirements. MPEG4 is designed to scale down to dial-up internet bandwidths, cell phones and PDAs while remaining viable for high-quality desktop streaming, allowing synchronisation of all kinds of media objects with standard interfaces to allow plugging in a DRM scheme. MPEG7 (Multimedia Content Description Interface) is about metadata (and is not a video or audio coding scheme or delivery mechanism). MPEG21 is a Multimedia Framework to let content distributors have complete control over content at all parts of the delivery chain and on all kinds of networks and devices, and allow the right item in a file to be played for the user. MPEG21 is still being developed and doesn t appear to make reference to new metadata or profiles so far. MPEG7 describes three profiles: Simple Metadata Profile (SMP) User Description Profile (UDP) Core Description Profile (CDP). These are collected together from the rest of the standard in Part 9 of the 11 parts. Part 11 provides the XML Schemas for the same three MPEG 7 profiles. As a whole, MPEG 7 is contained in ISO/IEC 15398, and part 9 is referred to as ISO/IEC 15938 9:2005 with part 11 as ISO/IEC 15938 11:2005. [Will this information be carried with the file, through its format type rather than represented in the profile itself? The AHDS report suggests that a reliance on extraction tools at present is unwise: Sophisticated metadata standards (e.g. MPEG-7, TV-Anytime, SMIL) and container schemas (MPEG-21 and METS) are now available for the archiving of moving image and sound resources. However, manual metadata creation is expensive and, in reality, little file-level metadata is likely to exist separately from the content other than that which can be auto-generated. Few metadata extraction tools currently exist for this purpose and further research and development work is needed in this area. In addition, the archival-oriented element set proposed in this study makes no assumption as to the way the metadata is stored or contained (e.g. MPEG-21 or METS) and does not distinguish between whole or parts by assuming equal application of the element set to any level of granularity (e.g. whole, segment, still). TBMAP v1.0 June 2008 2

Taken from AHDS Digital Moving Images & sound Archiving Study Chapter 7 Metadata Review and Requirements 7.7 Conclusion p81 retrieved from: http://www.jisc.ac.uk/media/documents/programmes/preservation/moving%20pictures%20and%20sound%2 0archiving%20final%20version.doc ] 2. EBU (European Broadcasting Union) http://www.ebu.ch/cmsimages/en/tec_doc_t3293_tcm6-10494.pdf http://www.ebu.ch/en/technical/metadata/specifications/notes_on_tech3295.php EBU has a core metadata set for radio archives which is relatively simple and aligns with EBU/SMPTE/AES and Dublin Core. [This is possibly in line with the scope of the project.] P_Meta and its XML Schema provide metadata on common production practices realised through application-based specifications including, taking into particular account the point of view of broadcasters, e.g. programme exchange between broadcasters and possibly between production processes. [As P_Meta is both broadcast industry and production specific, we believe it is out-ofscope of this project.] 3. AudioMD and VideoMD AudioMD Draft data dictionary: http://www.loc.gov/rr/mopic/avprot/dd_amd.html AudioMD Draft schema: http://www.loc.gov/rr/mopic/avprot/amd_020409.xsd VideoMD Draft data dictionary: http://www.loc.gov/rr/mopic/avprot/dd_vmd.html VideoMD Draft schema: http://lcweb-2.loc.gov/mets/schemas/vmd.xsd These are draft METS extension metadata schema and data dictionaries for audio and video resources respectively. AudioMD contains 37 technical metadata elements for describing an audio object. VideoMD contains 16 technical metadata elements. It has been suggested (p81 AHDS report) that these represent interim measures developed by the Library of Congress Audi-Visual Prototyping Project with little subsequent development. [The project has not yet ascertained the relevance or mapping concepts associated with these approaches, but the elements may possibly be in line with the scope of the project in the same way as EBU core metadata set, and within the caveats of the Metadata.net simple application profile notes.] 4. Identifiers The project scope for identifiers specifies as: In scope: What are the various identifiers and how these could be used with time based media. Out of Scope: Recommendation of a particular identifier solution. TBMAP v1.0 June 2008 3

[At which points will the profile require identifiers? What hooks needed?] SMEF recognises three particular ISO Editorial Identifiers and makes provision for them as attributes. Very basic descriptions are given here: ISAN (International Standard Audio Visual Number, ISO 15706:2002, permanently issued to an audiovisual work and never altered replaced or reused) V-ISAN (Version-ISAN, ISO 20925, registers a version. The first part is an identifier, the second part is registration, resolution and supporting systems. ISRC is used for soundtracks.) ISRC (International Standard Recording Code, unique and permanent identifier for a specific recording that can be encoded permanently into a product. See http://www.ifpi.org/isrc/)) SMPTE UMID (Unique Material Identifier, specified in SMPTE 330M. Identifies an instance of audiovisual material uniquely, to enable the material instance to be linked with its associated metadata. The extended UMID allows a source pack to be attached which carries data for time, date, geospatial location, etc.) The JISC Standards Catalogue: Identifier Standards (available at: http://standards-catalogue.ukoln.ac.uk/index/category:identifier_standards) notes the following, along with fuller descriptions and further information: ARK, (Archival Resource Key) COinS (ContextObjects in Spans, a simple, ad hoc community specification for publishing OpenURL references in HTML) DOI (Digital Object Identifier, a permanent identifier given to a networked resource so that if its address changes, users will be redirected to its new address) Handle (a distributed information system for assigning, managing and resolving persistent identifiers) INFO (a Uniform Resource Identifier (URI) scheme created for information assets that have identifiers in public namespaces but that are not part of the URI allocation (e.g. LCCNs) OpenURL (NISO standard Z39.88-2004, the OpenURL Framework for Context-Sensitive Services provides a means of describing a referenced resource along with a description of the context of the reference, known as a ContextObject) PURL (Persistent Uniform Resource Locator) URI (Uniform Resource Identifier, a generic name for any of a class of ways of identifying resources on the Internet) 5. SMEF http://www.bbc.co.uk/guidelines/smef/ TBMAP v1.0 June 2008 4

The SMEF data model the BBC s Standard Media Exchange Framework. The Data Model (currently v1.10) is available free-of-charge subject to a specific licence agreement and eventual return of a questionnaire as to how it has been used and what benefits have been derived from its use. SMEF appears to be useful for helping to understand the different technical relationships across time-based media, and as a reference to standards, specifications and identifiers that may be relevant to the project remit. For Dublin Core, SMEF recommends projects use SMEF to establish their information analysis while ensuring DC descriptors are set up at the same time. For HE genres where, for example, production information or televisions attributes, for example, are important to search and discovery, SMEF may help with extending any application profile to better serve such a group. It may also perhaps have some scope as a source for additional elements related to the current project. Certainly, not all of SMEF will be relevant to post-production search & discovery, and what is relevant may not perhaps present in the exact structure as developed here. However, SMEF could be useful more generally to demonstrate how different types and complexities of domain model are relevant for different approaches to the media type, and as a particular view of modelling the TBM world. [Any comment on this analysis?] By way of further information, SMEF is designed to provide a set of definitions for the information required in production, distribution and management of media assets, currently expressed as a data dictionary and set of Entity Relationship Diagrams. It is designed to be comprehensive (the model stretches to 8 pages, and v 1.10 stretches to nearly 400 pages), and is specific to a particular industry sector, although the definitions are designed to be organisation independent. The scope is wide in that the model is based on metadata associated with media essence. It is applicable across television, radio, web and supports both analogue and digital services. The eight sections of SMEF cover: Editorial Objects (e.g. complete programme or item, work, episode) Image Format Type (e.g. geometric characteristics of an image or display device or constraints on the use of the image area) Acquisition Blocks (group of audio items in a published form e.g. CDs or records) Media Objects and Unique Material Instances (description of a component of an editorial object where each media type is a single object e.g. video is one, subtitles another) Location, Story & Classification (summary of how items or programmes can be referenced, classified or catalogued) Transmission and Publication of Editorial Objects (e.g. planned & actual transmission) People & Organisations (e.g. roles in media asset management such as cameraman or rights holder) Programme Production ( commissions, bids, projects etc) TBMAP v1.0 June 2008 5

6. Simple Application Profile combining Dublin Core and MPEG-7 http://www.metadata.net/harmony/video_appln_profile.html Metadata.Net was maintained until August 2006 by the MAENAD Project of ITEE at the University of Queensland and holds a draft simple combination Application Profile with a mapping between 15 DC elements DC and MPEG-7. The exercise concludes that the mapping is possible but complex and the MPEG-7 Descriptors corresponding to each DC element: are distributed across a range of Description Schemes (DSs); are embedded at a low level within nested XML descriptions; often occur in multiple alternative Description Schemes/contexts. There is often no simple or obvious one-to-one mapping or exact semantic equivalent within MPEG-7 to the Dublin Core elements. Illustrated examples are used to indicate the respective strengths and weaknesses of DC and MPEG-7 noting: Dublin Core provides a relatively simple, light-weight and concise method for resource discovery of composite resources over the Internet. It is not designed to describe the temporal, spatial or spatio-temporal aspects or the visual or audio features associated with multimedia resources. In comparison, MPEG-7 provides a rather heavy-weight approach to describing the creation and classification properties of an audiovisual resource. MPEG-7's strengths lie in its ability to specify detailed media-specific formatting and encoding information, and its fine-grained descriptions of temporal, spatial and spatio-temporal components of audiovisual content and their associated audio and visual features. Rather than duplicating the DC elements within MPEG-7 (by introducing a MPEG-7 Dublin Core Description Scheme) or using programmatic code or XSLT to implement dynamic mappings, we propose a hybrid approach which combines the complementary aspects of the DC and MPEG-7 metadata schemes in an application profile. Providing there is an XML Schema [5-7] representation of each vocabulary, defined in their own domain-specific namespaces [8], then implementors can import elements from both the Dubl n Core and MPEG-7 namespaces and combine them into a single description scheme (or "application profile") capable of supporting both cross-domain (Dubl n Core-based) resource discovery as well as fine-grained, content-based, search, retrieval and presentation of audiovisual content (using MPEG-7). [5] XML Schema Part 0: Primer, W3C Recommendation, 2 May 2001 http://www.w3.org/tr/xmlschema-0/ [6] XML Schema Part 1: Structures, W3C Recommendation, 2 May 2001 http://www.w3.org/tr/xmlschema-1/ [7] XML Schema Part 2: Datatypes, W3C Recommendation, 2 May 2001. http://www.w3.org/tr/xmlschema-2/ [8] Namespaces in XML, W3C Recommendation 14 January, 1999.http://www.w3.org/TR/REC-xml-names TBMAP v1.0 June 2008 6

[Suggests a DC approach for Search & Discovery of this resource type remains sound and that MPEG-7 may be suitable for augmentation in media specific areas that may be required for some purposes or user group needs.] 7. SMIL (Synchronous Multimedia Integration Language) http://www.w3.org/audiovideo/ SMIL (pronounced smile ) is an XML-based language that enables simple authoring of interactive audiovisual presentations for the web, and is typically used for choreographing "rich media" presentations which integrate streaming audio and video with images, text or any other media type. Using a small range of elements it is possible to specify and control the time given resources appear in relation to each other, for example a spoken sentence and an image, or a segment of a video with its transcript. [Although the availability of metadata is relied upon to identify the media being choreographed, SMIL itself is a mechanism to use this information to provide presentation and co-ordination of the media on screen. This makes SMIL itself unlikely to be useful in contributing to a Search & Discovery application profile.] 8. JISC-SURF StreamTeam Partners http://www.intute.ac.uk/publications/rdn-ltsn-ap/ http://www.uen.org/dms/umap/images/lomtree.gif User Generated Content TRIPLE L Project Dr Jan T. Goldschmeding, VU University Amsterdam, Learning on Screen 2008, York National Science Learning Centre, University of York, 18-19 March 2008 Also with thanks to Johan Ooman and Sylvia Moes of StreamTeam. Work in the Netherlands related to federated searching for video use in learning engages the concept of collecting other materials alongside video used for learning. This falls closer to the domain of learning objects than TBM and so the RDN/LTSN LOM Application Profile or RLLOMAP has been drawn on. As part of a presentation of RET (Reaching European Teachers) at StreamTeam 2008, a report on investigations into CAM (Contextualised Attention Metadata) was made. CAM is concerned with serving personalised content to teachers, particularly where this can be achieved directly into the learning environment such as a VLE. CAM appears to be metric-based and relate to information concerning use of material after it has been retrieved, and so forming a layer separate from content-based metadata that supports federated searching. The CAM operates to directly support user services and operations by working with user profiles to further customise the federated search results against user specified preferences (e.g. smart rankings, smart recommendations). Examples of CAM concepts include: annotations, task/type information, ratings, social network information, duration of use. The requirement for translation of material was highlighted, noting services such as ATHENA Web which provide translation to English of deposited materials. StreamTeam and RET Jan T Goldschmeding StreamTeam 2008 held at DIVERSE, Inholland university, Haarlem, NL, 1 July 2008 http://www.inholland.nl/diverse2008 TBMAP v1.0 June 2008 7

Athena Web, a video portal and workspace for European audiovisual communication professionals in the areas of science and scientific information, see: http://www.athenaweb.org/ [This suggests confirmation of the Functional Requirements of the profile to offer Extensibility for tailoring to genre-specific requirements and added-value services Clarity on which aspects of complex objects are in and out of scope and how those in scope are expected to be handled To assist with identification and retrieval of accessible versions and supporting material in this case different language versions. ] 9. JISC ARCHES Project ARCHES Final Report Version 1.1 18.4.05 The Arches project was funded by JISC as part of the x4l programme, and used UK LOM Core and VRA Core to develop a profile to service large numbers of visual resources including audio and video treated as learning objects. [This falls closer to the domain of learning objects and suggests that in certain cases a Learning Object-oriented Application Profile may be more suited for TBM material depending on its intended audience, and how the material is packaged. Use of VRA Core and the visual aspects of material handled by the project suggest an overlap with image requirements may also occur in cases where learning objects are being handled.] TBMAP v1.0 June 2008 8