XML information Packaging Standards for Archives

Similar documents
XFDU packaging contribution to an implementation of the OAIS reference model

1 st Int. Workshop on Standards and Technologies in Multimedia Archives and Records (STAR), Lausanne, /27

An overview of the OAIS and Representation Information

The OAIS Reference Model: current implementations

Content. Introduction Multimedia framework Seven elements of Mpeg-21

MPEG-21: The 21st Century Multimedia Framework

STANDARD ARCHIVE FORMAT FOR EUROPE

Using XFDU for CASPAR information packaging Matthew Dunckley Science & Technology Facilities Council, Oxford, UK

Implementing OAIS Reference Model Concepts: Information Packages and Producer-Archive Interface Standards

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 5: Multimedia description schemes

MPEG-21 Current Work Plan

ISO/IEC INTERNATIONAL STANDARD. Information technology Coding of audio-visual objects Part 18: Font compression and streaming

OAIS: What is it and Where is it Going?

Archival Information Package (AIP) E-ARK AIP version 1.0

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia service platform technologies Part 3: Conformance and reference software

adore: a modular, standards-based Digital Object Repository

Comparing Open Source Digital Library Software

Registry Interchange Format: Collections and Services (RIF-CS) explained

MPEG-21 IPMP. Jeong Hyun Kim, Seong Oun Hwang, Ki Song Yoon, Chang Soon Park

Delivery Context in MPEG-21

Increazing interactivity in IPTV using MPEG-21 descriptors

Opus: University of Bath Online Publication Store

Building for the Future

Metadata Standards and Applications. 4. Metadata Syntaxes and Containers

Metadata and Encoding Standards for Digital Initiatives: An Introduction

ISO ARCHIVE STANDARDS: STATUS REPORT

XML Structure and Construction Rules Concept Paper. 31Aug 2003

B. Assets are shared-by-copy by default; convert the library into *.jar and configure it as a shared library on the server runtime.

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 2: Description definition language

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 1: Systems

Session Two: OAIS Model & Digital Curation Lifecycle Model

Response to the CCSDS s DAI Working Group s call for corrections to the OAIS Draft for Public Examination

XML FORMATTED DATA UNIT (XFDU) STRUCTURE AND CONSTRUCTION RULES

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia framework (MPEG-21) Part 21: Media Contract Ontology

MPEG-21 MULTIMEDIA FRAMEWORK

Material Exchange Format Timecode Implementation

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

ISO INTERNATIONAL STANDARD. Information and documentation Managing metadata for records Part 2: Conceptual and implementation issues

Use of XML Schema and XML Query for ENVISAT product data handling

Draft Digital Preservation Policy for IGNCA. Dr. Aditya Tripathi Banaras Hindu University Varanasi

ISO/IEC Information technology Multimedia content description interface Part 7: Conformance testing

Assessment of product against OAIS compliance requirements

Metadata Issues in Long-term Management of Data and Metadata

Ontology Servers and Metadata Vocabulary Repositories

MPEG-21 SESSION MOBILITY FOR HETEROGENEOUS DEVICES

Introduction to. Digital Curation Workshop. March 14, 2013 SFU Wosk Centre for Dialogue Vancouver, BC

3. Technical and administrative metadata standards. Metadata Standards and Applications

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG systems technologies Part 5: Bitstream Syntax Description Language (BSDL)

Metadata Workshop 3 March 2006 Part 1

An introduction to the Metadata Encoding and Transmission Standard METS)

Defining OAIS requirements by Deconstructing the OAIS Reference Model Date last revised: August 28, 2005

MPEG-21 Overview. Xin Wang. Dept. Computer Science, University of Southern California

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia application format (MPEG-A) Part 4: Musical slide show application format

Information technology Multimedia service platform technologies. Part 3: Conformance and reference software

Metadata: The Theory Behind the Practice

Earth Observation Payload Data Ground Systems Infrastructure Evolution LTDP SAFE. EO Collection and EO Product metadata separation Trade-Off

For many years, the creation and dissemination

MPEG's Dynamic Adaptive Streaming over HTTP - An Enabling Standard for Internet TV. Thomas Stockhammer Qualcomm Incorporated

Susan Thomas, Project Manager. An overview of the project. Wellcome Library, 10 October

The RAMLET project Use cases

METS. Metadata Encoding and. Standard. ...standard for encoding descriptive, administrative, and structural metadata...

PROPOSED SMPTE STANDARD for Television Material Exchange Format (MXF) Operational pattern 1A (Single Item, Single Package)

DAITSS Demo Virtual Machine Quick Start Guide

Data formats for exchanging classifications UNSD

Assessment of product against OAIS compliance requirements

Development of an Ontology-Based Portal for Digital Archive Services

Fedora. CS 431 April 17, 2006 Carl Lagoze Cornell University. Acknowledgements: Sandy Payette (Cornell)

A Collaboration Model between Archival Systems to Enhance the Reliability of Preservation by an Enclose-and-Deposit Method

ISO/IEC Information technology Multimedia framework (MPEG-21) Part 3: Digital Item Identification

What do you do when your file formats become obsolete? Lydia T. Motyka Florida Center for Library Automation USETDA 2011

Reusability and Adaptability of Interactive Resources in Web-Based Educational Systems. 01/06/2003

Digitisation Standards

The Storage Networking Industry Association (SNIA) Data Preservation and Metadata Projects. Bob Rogers, Application Matrix

Combining SNIA Cloud, Tape and Container Format Technologies for the Long Term Retention of Big Data

The Extensible Markup Language (XML) and Java technology are natural partners in helping developers exchange data and programs across the Internet.

IPR Issues (2/2) Standardisation initiatives around Digital Rights Management. IPR Issues. Multimedia content. Representation: Metadata

MDA & Semantic Web Services Integrating SWSF & OWL with ODM

Digital Library Curriculum Development Module 4-b: Metadata Draft: 6 May 2008

Digital TV Metadata. VassilisTsetsos

Information Technology Document Schema Definition Languages (DSDL) Part 1: Overview

An Introduction to PREMIS. Jenn Riley Metadata Librarian IU Digital Library Program

ASAM MCD-2 D (ODX) Data Model for ECU Diagnostics (Open Diagnostic Data Exchange) Data Model Specification. Base Standard

Teiid Designer User Guide 7.5.0

Copyright 2008, Paul Conway.

User Stories : Digital Archiving of UNHCR EDRMS Content. Prepared for UNHCR Open Preservation Foundation, May 2017 Version 0.5

Compound or complex object: a set of files with a hierarchical relationship, associated with a single descriptive metadata record.

The ESA CASPAR Scientific Testbed and the combined approach with GENESI-DR

Digital Preservation with Special Reference to the Open Archival Information System (OAIS) Reference Model: An Overview

Mellon Fedora Technical Specification. (December 2002)

JPEG 2000 Archive Profiles

Affixing the ISAN. Possibilities Technical specifications

Archivists Toolkit: Description Functional Area

This document is a preview generated by EVS

Its All About The Metadata

ISO/IEC INTERNATIONAL STANDARD

ISO/IEC INTERNATIONAL STANDARD. Information technology JPEG 2000 image coding system Part 14: XML representation and reference

Chinese-European Workshop on Digital Preservation. Beijing (China), July 14 16, 2004

B2SAFE metadata management

Research Data Repository Interoperability Primer

Metadata. for the Long Term Preservation of Electronic Publications. Catherine Lupovici Julien Masanès. Bibliothèque nationale de France

Transcription:

XML information Packaging Standards for Archives Lou Reich/CSC Long Term Knowledge Retention Workshop March15,2006 15 March 2006 1

XML Packaging Standards Growing interest in XML-based representation of Information Object in Digital Library architectures: Platform-independence, Industry-support Longevity, potential migration paths Processing tools, validation capabilities XML-based Compound Object formats: ISO/IEC 21000-2 MPEG-21 DID & DIDL METS IMS/CP CCDS XFDU Typical functionality: By-Value (base64) and/or By-Reference provision of constituent datastreams By-Value and/or By-Reference provision of metadata Provision of identifiers 15 March 2006 2

From MPEG 1 to MPEG-21 Potential impact of MPEG-21 MPEG: ISO/IEC Committee MPEG-1/MPEG-2/MPEG-4/MPEG-7/MPEG-21/MPEG-A Expected industry support MPEG-21 defines a normative open framework for multimedia delivery and consumption for use by all the players in the delivery and consumption chain Applicability to Digital Libraries Ability to accomodate any media type and genre MPEG-21 is modular: Part 2: DID representation of digital objects Part 3: DII identification of digital objects Part 4: IPMP enforcement of rights expressions Part 5: REL declaration of rights expressions Part 7: DIA transcoding based on contextual information Part 10: DIP association of behaviors Part 16: BF binary representation of digital objects 15 March 2006 3

MPEG-21 DID ISO/IEC 21000-2: Digital Item Declaration First edition: ISO Standard March 2004 Second edition: FDIS January 2005 (based on LANL/UGent proposals related to DL use) Representation of Digital Objects Within the MPEG-21 framework referred to as Digital Items 3 distinct sections: Abstract Model DID Representation of the Model in XML (DIDL) DIDL document W3C XML Schema Other representation of DID may emerge: ISO/IEC 21000-16: Binary Format RDF? Reference software: ISO/IEC 21000-8 15 March 2006 4

Abstract Model basic entities-mpeg21 container: grouping of items and descriptor/statement constructs pertaining to the container item: represents a Digital Item aka Digital Object aka asset. Descriptor/statement constructs convey information about the Digital Item component: binding of descriptor/statements to datastreams resource: datastream descriptor/statement descriptor/statement container item item descriptor/statement component component descriptor/statement resource resource resource 15 March 2006 5

The METS Format Create a single document format for encoding digital library objects which can fulfill roles of SIP, AIP and DIP within the OAIS reference model Initial scope limited to objects comprised of text, image, audio & video files Promote interoperability of descriptive, administrative and technical metadata while supporting flexibility in local practice 6

Information Package Concepts XFDU Package Interchange File Manifest Document File File File File Package Interchange File File Registry 15 March 2006 7

Logical View of XFDU Manifest packageheader dataobjectsec dataobject xml Id metadatasec metadataobject behaviorsec s behaviorobjects xml Id informationpackagemap ContentUnit xml Id metadata Category Pointers (xml Ids) category class REP DED,SYNTAX, OTHER PDI CONTEXT, PROVENANCE REFERENCE, FIXITY, OTHER DMD DESCRIPTION, OTHER OTHER OTHER ANY xfdu 15 March 2006 8

Schema Diagram of XFDU Manifest 15 March 2006 9

XFDU Content Unit 15 March 2006 10

XFDUMetadata Section Type 15 March 2006 11

XFDU MANIFEST OVERVIEW Package Header (packageheader): Administrative metadata for the whole XFDU Package, such as version, operating system, hardware, author, etc, and metadata about transformations and behaviours that must be understood, in particular, transformations which must be reversed to access the data content. Metadata Section (metadatasection): This section records all of the metadata for all items in the XFDU package. Multiple metadata objects are allowed so that the metadata can be recorded for each separate item within the XFDU object. The metadata schema allows the package designer to define any metadata model by providing attributes for both metadata categories and a classification scheme for finer definition withincategories. The XFDU also provides predefined metadata categories and classes via enumerate attributes that follow the OAIS information model Information Package Map (informationpackagemap) outlines a hierarchical structure for the original object being encoded, by a series of nested contentunit elements. Content units contain pointers to the data objects and to the metadata associated with those objects. Data Object Section (dataobjectsection) contains a number of dataobject elements. A Data Object contains a bytestream and any data required to allow the information consumer to reverse any transformations that have been performed on the object and restore it to the byte stream intended for the original designated community and described by the Representation metadata in the Content Unit Behavior Section (behaviorsection) may contain any number of behavior objects. Each behavior object can be used to associate executable behaviors with content in the XFDU object. A behavior object contains an interface definition element that represents an abstract definition of the set of behaviors 15 March 2006 12

Lessons Learned in Testing XML schema versioning and extensibility is difficult at best XPath (JAVA implementation) does not scales to thousands of object 15 March 2006 13

XFDU Toolkit Library Java class library implemented in parallel with specification development XFDU API library can be used within any other application that needs to perform XFDU packaging. To use XFDU toolkit in this manner simply put all the jar files inside of the lib directory on the CLASSPATH of your application. Javadoc documentation for the library can found at http://sindbad.gsfc.nasa.gov/xfdu XFDU API Library conceptually consists of two layers First layer (xfdu.core.* )is low level API representing each structure in XFDU schema. Second layer (xfdu.packaging.*) is more high level API. It aggregates parts of functionality from the first layer; thus, allowing easier access to constructing and manipulating an XFDU package. A crude GUI is also supplied 15 March 2006 14

SAFE I/O API XFDU Java API: this API provides the general features common to all XFDU packages. This API has been developed jointly with the SAFE Java API and is presently used in the framework of the activity of the CCSDS Information Packaging and Registries (IPR) Working Group in support to the validation of the XFDU recommendation developed by the group XFDU C++ Wrapper: a C++ wrapper on top of the XFDU Java API. Most of the functionality is preserved from the Java API, apart the capability of browsing the binary content Data Request Broker (DRB): from which SAFE inherits the capability to access data independently from their formats. DRB supports in particular the SDF markup language used for annotating the XML Schema documents acting as representation information of the SAFE product objects. 15 March 2006 15