XML STANDARDS FOR ARCHIVING LEGISLATIVE RECORDS

Similar documents
XML STANDARDS FOR ARCHIVING LEGISLATIVE RECORDS

Preserving State Government Digital Information All Partners Meeting Summary

Preserving State Government Digital Information Core Legislative XML Schema Meeting. Minnesota Historical Society

Preserving State Government Digital Information Core Legislative XML Schema Meeting. Minnesota Historical Society

Preserving State Government Digital Information XML Working Group Meeting Summary

Structured documents

Preserving State Government Digital Information. NDIIPP Final All Partners Meeting. Meeting Summary December 6, 2011

Metadata and Encoding Standards for Digital Initiatives: An Introduction

UELMA Preservation. Jason Judt & Daniel Kruse Office of the Revisor of Statutes, Minnesota

Thanks to Connie for arranging this meeting and getting us a room. Thanks for your time.

Preserving State Government Digital Information Core Legislative XML Schema Meeting. Minnesota Historical Society

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 5: Multimedia description schemes

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 1: Systems

ALBERTA ADVERSE EVENT FOLLOWING IMMUNIZATION(AEFI) HL7 MESSAGING SPECIFICATION

The Uniform Electronic Legal Material Act

This work is licensed under the Creative Commons Attribution 4.0 International License. Page 1 of 10

Media Types. Web Architecture and Information Management [./] Spring 2009 INFO (CCN 42509) Contents. Erik Wilde, UC Berkeley School of

HTML: The Basics & Block Elements

European Legislation Identifier (ELI) Uniform Resource Identifier (URI) Standard for the electronic Irish Statute Book (eisb).

3. Technical and administrative metadata standards. Metadata Standards and Applications

IBM. XML and Related Technologies Dumps Braindumps Real Questions Practice Test dumps free

Step: 9 Conduct Data Standardization

Why HTML5? Why not XHTML2? Learning from history how to drive the future of the Web

Integrated Assessment Record (IAR)

ebxml Technical Architecture San Jose, CA USA Wednesday, 9 August 2000

Chapter 10: Understanding the Standards

XML Update. Royal Society of the Arts London, December 8, Jon Bosak Sun Microsystems

An Introduction to PREMIS. Jenn Riley Metadata Librarian IU Digital Library Program

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia content description interface Part 2: Description definition language

Document-Centric Computing

Its All About The Metadata

Publishing Technology 101 A Journal Publishing Primer. Mike Hepp Director, Technology Strategy Dartmouth Journal Services

Distributed Multitiered Application

Selecting a Schema for EvaluationWeb File Uploads

DCMI Abstract Model - DRAFT Update

Introduction to Archivists Toolkit Version (update 5)

ISO/IEC Information technology Multimedia content description interface Part 7: Conformance testing

Lisa Banks Distributed Systems Subcommittee

Welcome to the ACN Form ADV Narrative Brochure Application. Form ADV Part 2 Help / User Guide

A Breakthrough In the Science of Proposal Development: P-XML. APMP TM Southern California Fall Seminar. October 22, 2004.

THE OECD GLOBAL HARMONIZED SUBMISSION TRANSPORT STANDARD (GHSTS) PROJECT

[MS-PICSL]: Internet Explorer PICS Label Distribution and Syntax Standards Support Document

Agenda. Summary of Previous Session. XML for Java Developers G Session 6 - Main Theme XML Information Processing (Part II)

HTML: Introduction CISC 282. September 11, What is HTML?

Report of the working group on meteorological visualisation applications. November Convener: Jens Daabeck Report: Iain Russell.

Selecting a Schema for EvaluationWeb File Uploads

References differences between SVG 1.1 Full and SVG 1.2 Tiny

Tribhuvan University Institute of Engineering Pulchowk, Campus

CKO2 - XML Protocols. CKO2 outputs - A2A general description. NBB - IT Department. IT Applications - PRSM. Date: 12/05/2011. Document version: V0.

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

This document is a preview generated by EVS

Introduction to HTML5

Dictionary Driven Exchange Content Assembly Blueprints

A tutorial report for SENG Agent Based Software Engineering. Course Instructor: Dr. Behrouz H. Far. XML Tutorial.

Use of XML Schema and XML Query for ENVISAT product data handling

Markup Languages SGML, HTML, XML, XHTML. CS 431 February 13, 2006 Carl Lagoze Cornell University

Records Management Metadata Standard

WSIA and WSRP are new Web

Some more XML applications and XML-related standards (XLink, XPointer, XForms)

Data Science Services Dirk Engfer Page 1 of 5

1997 Minna Laws Chap. February 1, The Honorable Jesse Ventura Governor 130 State Capitol Building

Internet Standards for the Web: Part II

Data Partnerships to Improve Health Frequently Asked Questions. Glossary...9

Ontolog Forum. Gunar Penikis. Sr. Product Manager Adobe Systems Adobe Systems Incorporated. All Rights Reserved.

XML BASED DICTIONARIES FOR MXF/AAF APPLICATIONS

XML Applications. Introduction Jaana Holvikivi 1

HTML Overview. With an emphasis on XHTML

SyncML Overview. Noel Poore, Psion Computers PLC

Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University

Adaptive Multimedia Messaging based on MPEG-7 The M 3 -Box

BRA BIHAR UNIVERSITY, MUZAFFARPUR DIRECTORATE OF DISTANCE EDUCATION

Overview. Introduction. Introduction XML XML. Lecture 16 Introduction to XML. Boriana Koleva Room: C54

Introducing Manakin: Overview & Architecture. Scott Phillips, Cody Green, Alexey Maslov, Adam Mikeal, and John Leggett

2/24/2015. What are File Formats? Types of File Formats. Sustainable Formats. File formats can be grouped into three categories:

Oracle BI Publisher 10g R3: Fundamentals

XML for Java Developers G Session 8 - Main Theme XML Information Rendering (Part II) Dr. Jean-Claude Franchitti

Digital Imaging and Communications in Medicine (DICOM) Supplement 174: RESTful Rendering. Public Comment Draft

Microdata Management Toolkit (MMT) National Data Archive (NADA)

2015 Ed-Fi Alliance Summit Austin Texas, October 12-14, It all adds up Ed-Fi Alliance

SysML Past, Present, and Future. J.D. Baker Sparx Systems Ambassador Sparx Systems Pty Ltd

Kalaivani Ananthan Version 2.0 October 2008 Funded by the Library of Congress

RDF and Digital Libraries

Introduction to XML. Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University

NIEM. National. Information. Exchange Model. NIEM and Information Exchanges. <Insert Picture Here> Deploy. Requirements. Model Data.

Full file at New Perspectives on HTML and CSS 6 th Edition Instructor s Manual 1 of 13. HTML and CSS

Oracle BI Publisher 11g R1: Fundamentals

Exchange Network Schema Conformance Report Preparation and Review Process

Content Management for the Defense Intelligence Enterprise

Realisation of SOA using Web Services. Adomas Svirskas Vilnius University December 2005

Working Group Charter: Web Services Basic Profile

CSI 3140 WWW Structures, Techniques and Standards. Representing Web Data: XML

Quick Guide to CAM Dictionaries

The Identity Web An Overview of XNS and the OASIS XRI TC

SOAP Messages with Attachments

The Adobe XML Architecture

Using Drupal to create a digital patient education experience at Memorial Sloan Kettering Cancer Center (MSK)

Introduction to XML 3/14/12. Introduction to XML

Archiving and Preserving the Web. Kristine Hanna Internet Archive November 2006

extensible Markup Language (XML) Basic Concepts

Obsoletes: 2070, 1980, 1942, 1867, 1866 Category: Informational June 2000

Transcription:

XML STANDARDS FOR ARCHIVING LEGISLATIVE RECORDS NDIIPP ALL PARTNERS MEETING DANIEL DODGE DECEMBER 6, 2011

TODAY'S OVERVIEW Proposed XML Standard A Working Group developed a proposed XML standard to support archiving legislative documents. Proof-of-Concept Website The Minnesota Revisor's Office developed a website to test the proposed XML standard.

XML SCHEMA WORKING GROUP Goal: Develop a data model to support archiving digital legislative artifacts http://www.mnhs.org/preserve/records/legislativerecords/xml1.htm Started in June 2008

REQUIREMENTS FOR SOURCE XML Initial tentative goal: Develop schema that covers all requirements for all participating jurisdictions. (This also implies every jurisdiction would convert their source into the common format for archiving.) Revised goal: Focus on archival goals: Develop schema to wrap "native" XML from jurisdiction no transformation to write no loss of semantic markup no mapping of structures to common structure

SCHEMA GOALS SUPPORT STANDARD METADATA TRANSPORT DATA PROVIDER XML HTML RENDERING BINARY ATTACHMENTS SUPPORT LONG-TERM ARCHIVAL AND RETRIEVAL

BASIC DOCUMENT STRUCTURE metadata metadata essential to interchange recipients XML source XML source in provider's native syntax HTML rendition HTML rendition of document being interchanged text attachments (encoded) PDF Word... one or more attachments for other renderings (encoded as Base64 or other plain-text format) n

SCHEMA CORE Main elements: <metadata> <xml.source> <html.rendition> <attachments>

STANDARD METADATA Identifier Title Type Jurisdiction Agent Date Session Description Subject Relation Governor Action Management History Rights

METADATA EXAMPLE 1 <meta.identifier.block> <meta.identifier>200800sf2377-1</meta.identifier> <!-- YYYYssBT####-E where: YYYY = year ss = session number. 0 is regular session B = body. H(ouse) S(enate) T = type. F(ile) R(esolution) #### = document number E = engrossment/version number --> </meta.identifier.block> <meta.title.block> <meta.title>sf2377-1</meta.title> </meta.title.block> <meta.type.block> <meta.type>bill</meta.type> </meta.type.block>

METADATA EXAMPLE 2 <meta.subject.block> <meta.subject>emergency and 911 Services</meta.subject> <meta.subject>health and Health Department</meta.subject> <meta.subject>hospitals and Health Facilities</meta.subject> <meta.subject>boards</meta.subject> </meta.subject.block> <meta.relation.block> <meta.relation> <meta.version.of>hf1111</meta.version.of> <!-- 'version of' another document --> <meta.companion.bill>hf2591</meta.companion.bill> <meta.chapter>2008 0 22</meta.chapter> <!-- Session Law: Year Session_Number Chapter --> </meta.relation> </meta.relation.block>

REQUIREMENTS FOR SOURCE XML Preserve original text content Preserve original markup

SOURCE XML EXAMPLE <xml.source><![cdata[ <bill engrossed = "y" type = "bill" xmlns:xsi = "http://www.w3.org/2001/xmlschema-instance" xmlns:atipl = "http://www.arbortext.com/namespace/pagelayout" xsi:nonamespaceschemalocation = "bill.xsd">... <btitle> <btitle_prolog>a bill for an act</btitle_prolog> <btitle_summary>relating to health; modifying basic life support ambulance staffing requirements under certain circumstances; allowing a hardship waiver of advanced life support ambulance staffing requirements; modifying advanced life support ambulance staffing requirements under certain circumstances;</btitle_summary> <btitle_action>amending Minnesota Statutes 2006, section 144E.101, subdivision 7; Minnesota Statutes 2007 Supplement, section 144E.101, subdivision 6.</btitle_action> </btitle>... </bill> ]]></xml.source>

REQUIREMENTS FOR HTML Allow optional HTML rendition to be included HTML XHTML Data with and without named entities (e.g. )

SOURCE HTML EXAMPLE <html.rendition><![cdata[ <html xmlns:atipl="http://www.arbortext.com/namespace/pagelayout" xmlns="http://www.w3.org/1999/xhtml" xmlns:pl="http://revisor.leg.state.mn.us/namespace"> <head> <meta http-equiv="content-type" content="text/html;charset=utf-8"/> <title>s2377-1</title> </head> <body> <div class="xtend">... <var>3.35</var> <u>a community with a population of less than 1,000 persons.</u> <br/> </div> </body></html> ]]></html.rendition>

REQUIREMENTS FOR ATTACHMENTS Encode binary file to allow to be included Format types (very extensible) Microsoft Word (application/msword) PDF (application/pdf) compressed (application/zip) Postscript (application/eps) RTF (text/rtf) Text (text/plain) Image (image/jpg)

EXAMPLE DATA FOR ATTACHMENT <attachments> <binary ID="thing1" encoding="base64" source="application/pdf"> <![CDATA[ JVBERi0xLjQKJeTjz9IKMSAwIG9iagpbL1BERi9JbWFnZUIvSW1 hz2vdl0ltywdlss9uzxh0xqplbmrv... DEzOTUKJSVFT0YK ]]> </binary> <binary ID="thing2" ><![CDATA[ ]]></binary> </attachments>

EXAMPLE ATTACHMENT RENDERING

PROOF OF CONCEPT: MINNESOTA Minnesota Revisor's Office Tim Orr Isaac Holmlund Online on November 11, 2009 Documentation/Interface: [For more information please see the project webpage at http://www.mnhs.org/preserve/records/legislativerecords/xml1.htm]

XML FILE DOWNLOAD

PROOF OF CONCEPT: CALIFORNIA Legislative Counsel Bureau, State of California November 7, 2010 An XML instance of a Chaptered Bill using wrapper schema Filename: ndiipp-2009ab 009296CHP.xml Metadata XML Source: caml namespace Identified several gaps, primarily metadata values

NEXT STEPS Complete additional features for proof-of-concept Secure interface for aggregation proof-of-concept Centralized archival repository Authentication/verification of transmitted data Engage other states for next proof-of-concept

POSSIBILITIES Toolkit for jump-starting website interfaces for additional states Toolkit for viewing contents of archivable package Proof-of-Concept for an archive repository (centralized aggregation process) Legislative content XML schema template (for easier adoption of XML by states)

QUESTIONS? Email: daniel.dodge@thomsonreuters.com References: http://www.digitalpreservation.gov http://www.mnhs.org/preserve/records/legislativerecords http://www.mnhs.org/preserve/records/legislativerecords/xml1.htm