Moving to XML: The Investment

Similar documents
Getting to JATS and BITS. Presented by Bruce D. Rosenblum CEO Inera Incorporated

XF Rendering Server 2008

A Case Study Webinar: How Wiley-Blackwell Accelerated Digital Production by 75% webinar. aptaracorp.com

Journals Manuscript Editing at the University of Chicago Press

Automating Publishing Workflows through Standardization. XML Publishing with SDL

Mulberry Classes Guide to Using the Oxygen XML Editor (v20.0)

LiXuid Manuscript. Sean MacRae, Business Systems Analyst

SECURITY AUTOMATION BEST PRACTICES. A Guide on Making Your Security Team Successful with Automation SECURITY AUTOMATION BEST PRACTICES - 1

Professional outputs with ODS LATEX

The DITA business case

Mission Possible: Move to a Content Management System to Deliver Business Results from Legacy Content

Quark XML Author October 2017 Update with Business Documents

COLUMN. Choosing the right CMS authoring tools. Three key criteria will determine the most suitable authoring environment NOVEMBER 2003

Tennessee. Business Technology Course Code Web Design Essentials. HTML Essentials, Second Edition 2010

Security Automation Best Practices

Quark XML Author September 2016 Update for Platform with Business Documents

SECURITY AUTOMATION BEST PRACTICES. A Guide to Making Your Security Team Successful with Automation

XML in Book Publishing

Integrated S1000D & ATA ispec 2200 Publications Lifecycle Management System

Pragma ADE systematically invests in development of text manipulation and text processing tools, most of which are available for free.

ProcSet. Workflow Solution Typical roles and scenarios

Automated Classification. Lars Marius Garshol Topic Maps

Understanding JDF The Graphic Enterprise It s all about Connectivity

A tutorial report for SENG Agent Based Software Engineering. Course Instructor: Dr. Behrouz H. Far. XML Tutorial.

The Adobe XML Architecture

Quark XML Author 2015 October Update with Business Documents

CYBER SECURITY FOR BUSINESS COUNTING THE COSTS, FINDING THE VALUE

Stelae Technologies ... Data Conversion a Walkthrough. «Extracting the Intelligence from Content»

Comp 336/436 - Markup Languages. Fall Semester Week 2. Dr Nick Hayward

Putting Open Access-into Interlibrary LoanJoe Mcarthur

Taming the Wild PDF within the In-Plant Printer

How to Get the Most Out of Content Migration to DITA

The ROI of UI Toolkit Standardization

XML for Java Developers G Session 2 - Sub-Topic 1 Beginning XML. Dr. Jean-Claude Franchitti

EPUB 3 An Insider's Look: How Will Your ebook Operations be Affected? webinar. aptaracorp.com

Where's the Beef from Enterprise Structured Content

Xyleme Studio Data Sheet

XML. Jonathan Geisler. April 18, 2008

Beyond DTP. Saving Time and Money with SDL Knowledge Center. Beyond DTP Saving Time and Money with SDL Knowledge sdl.com Center

Introduction to XML. Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University

Quark XML Author October 2017 Update for Platform with Business Documents

Digital Marketing Manager, Marketing Manager, Agency Owner. Bachelors in Marketing, Advertising, Communications, or equivalent experience

Virtualization. Q&A with an industry leader. Virtualization is rapidly becoming a fact of life for agency executives,

M359 Block5 - Lecture12 Eng/ Waleed Omar

Introduction to XML 3/14/12. Introduction to XML

XML Update. Royal Society of the Arts London, December 8, Jon Bosak Sun Microsystems

IMPROVING YOUR JOURNAL WORKFLOW


Chapter 11: Editorial Workflow

The vision provider. The vision provider. The newbie. The newbie. The vision provider is often a creative director for a digital team.

Markup Languages SGML, HTML, XML, XHTML. CS 431 February 13, 2006 Carl Lagoze Cornell University

QUICKBOOKS TO ACCOUNTEDGE CONVERSION GUIDE

WHITE PAPER. Comparison Guide: Choosing Between Help Authoring Tools and CCMSs

How to Evaluate a Next Generation Mobile Platform

Anchovy User Guide. Copyright Maxprograms

Collaborative Colour Production in Wide Format Printing. This is the third article in this part of the Wild Format Series. It is supported by

Opening up new opportunities through Cross-selling and Upselling. GMC Software Technology

NOT FOR SALE Microsoft Outlook Microsoft Office 2010 Specialist and Expert Certifications

IBM. XML and Related Technologies Dumps Braindumps Real Questions Practice Test dumps free

PTC Employs Its Own Arbortext Software to Improve Delivery of PTC University Learning Content Materials

Technical Intro Part 1

Content Management System Development Approach

Océ PRISMA archive software. Archiving made easy. Powerful, high-volume. archiving software

King County Housing Authority Delivers Multimedia Online Help with MadCap Doc-To-Help

UCSD Extension. Fundamentals of Web Services. Instructor: John Pantone. 2007, Objectech Corporation. All rights reserved

Publishing Technology 101 A Journal Publishing Primer. Mike Hepp Director, Technology Strategy Dartmouth Journal Services

Introducing Computer Programming

How To Validate An Xml File Against A Schema Using Xmlspy

Dreamweaver CS6. Level 1. Topics Workspaces Basic HTML Basic CSS

Content Development Reference. Including resources for publishing content on the Help Server

Overview 14 Table Definitions and Style Definitions 16 Output Objects and Output Destinations 18 ODS References and Resources 20

Editors. Getting Started

The main website for Henrico County, henrico.us, received a complete visual and structural

The Trail of Electrons

QUARK AUTHOR THE SMART CONTENT TOOL. INFO SHEET Quark Author

Structured Content and Personalization

Battle of Desktop Publishing Programs

PRODUCTION METRICS AND METADATA

Proposals for a New Workflow for Level-4 Content

Inera s Reference Processing Tools: extyles versus Edifix

Quantum, a Data Storage Solutions Leader, Delivers Responsive HTML5-Based Documentation Centers Using MadCap Flare

Modernized Fed-State E-File "101

Bringing Product Datasheets into the Information Age: Improving Time to Market and Driving Revenue Growth through Smart Publishing

Marketer s Guide to Website Redesign

XML Documentation for Adobe Experience Manager

Meet our Example Buyer Persona Adele Revella, CEO

The Definitive Guide to Automating Content Migration

BOV89296 SUSE Best Practices Sharing Expertise, Experience and Knowledge. Christoph Wickert Technical Writer SUSE /

Oracle BI Publisher 11g R1: Fundamentals

Implementing Web Content

Easing into DITA Publishing with TopLeaf

Information Technology Web Solution Services

Guidance on the appropriateness of the information technology solution

XML-based production of Eurostat publications

InDesign CS4 is the sixth version of Adobe s flagship publishing tool,

JATS, BITS, and STS. Journal Article Tag Suite Book Interchange Tag Suite Standards Tag Suite

<title> An XML based web service for an electronic logbook </title>

Proposal: [Product Name] User Documentation

ECE Senior Design Team 1702 Project Proposal

- What we actually mean by documents (the FRBR hierarchy) - What are the components of documents

Transcription:

B. Tommie Usdin Mulberry Technologies Inc. 17 West Jefferson St. Suite 207 Rockville MD 20850 Phone: 301/315-9631 Fax: 301/315-8285 info@mulberrytech.com http://www.mulberrytech.com Version 1.0 (January 2006) 2006 Mulberry Technologies, Inc.

Moving to XML: The Investment...1 Content Providers and Vendors Investment...1 XML Gives Both Content Providers and Vendors...2 Content Provider Investment in XML...2 XML Will Change the Way Publishers Work...3 Added Value Means Added Work...3 Costs and Benefits Not Equitable...4 Service Vendor Investment in XML...4 So Service Providers Need to...5 Vendors Must...5 Placing XML in the Workflow...6 When is the Data Tagged...6 XML Workflow Means Changes in Staffing/Jobs...7 Warning: XML Does Not Reduce Staff...8 What it Takes to Make XML happen...8 Select (or Create) a Model...9 Basic XML Skills For Everyone...9 Tagging Knowledge Varies for Different Models...10 XML Management Training...10 XML Wrangler Training...11 Prepress Skill Set Includes...11 DTD and Schema Skills...12 New Data Manipulation Skills...12 Manipulating XML is Prepress Life Blood...13 Parsing and Delivering Clean XML...13 Publishers Can Use Transformation Skills Too...14 New Tools and Tool Training...14 XML Validation Tools...15 XML-Aware Editors...15 Transformation Language and Processors...16 Page i

A New Option: Making Print Directly From XML...16 Repositories/Content Management Systems...17 Conclusion The Bad News: There is no Free Lunch...17 The Good News: You Can Do XML and Benefit...18 Content Providers Get the Benefits They Want...18 Conversion Vendors Would Make a Killing...18 There are Real XML Benefits for Page Production...19 Colophon...19 Page ii

slide 1 Moving to XML: The Investment Many benefits to XML Doesn t come for free Changes to Work flow Staff skills Balance of power New opportunities slide 2 Two (Different) Investments in XML Moving to XML means very different things to Content owner / content provider (publisher) Service vendor (printer, typesetter, data conversion vendor) The two viewpoints have very different responsibilities / roles different first steps and workflow different software (some overlap) similar validation strategies/techniques Page 1

slide 3 XML Gives Both Content Providers and Vendors Some similarity in production advantages/gains Mostly different problems Similar skill sets/training requirements (some overlap) New ways to do what they always did New things to do slide 4 Content Provider Investment in XML Publisher must decide: What the XML is for business case (advantages) ROI what to do with the XML What does the XML look like (tag set and validation rules) How XML will be created How to store/manage the XML Where XML falls in the production workflow Page 2

XML Will Change the Way Publishers Work Line between authoring, content edit, and copy edit shifts/blurs Proofing and checking changes level changes (less worry about transposed letter and more about missing structures) specifics change (don t look for comma, look for <author>) new QA possibilities (like false-color proofs) one error may show many times but still be just one error Multiple vendors may use one file Machines do some work once done by people People do some work never done before slide 5 Added Value Means Added Work slide 6 Added metadata Repository storage and rights/permissions may need metadata that does not print or display Semantic tagging Value-added tags for searching may require subject knowledge Index terms Adding inline indexing is the same huge effort as a back-of-the-book index Writing changes If structures are to be used in parallel ways; they must be parallel Page 3

Costs and Benefits Not Equitable slide 7 (Warning: Costs in budget of Department A, while benefits accrue to Departments C and D or to the company as a whole) Some groups will have increased workload, for others production is faster and easier Expense and hard work are immediate (group-specific) Most benefits Increase the bottom line for a different group Are long term Are corporate-wide or related to sales Are cumulative; One job takes 2 months longer, but other jobs are done in half the time, are much cheaper, or are only possible now) Service Vendor Investment in XML (it s not your data, it s your service) If Content Provider has XML content, they will already have DTDs or schemas (not necessarily appropriate for publication) They will expect you to accept XML source (minimally) work in XML (better) work in their XML They will expect a compositor/typesetter to Provide round-trip services (even when it is not clear if that is possible) slide 8 Page 4

So Service Providers Need to Be able to produce backend XML from a backfile of hardcopy pages from a backfile of computer files in addition to pages or websites they make Be able to receive customer-specific XML and produce pages produce websites return it (updated and uninjured) when through (delivery of cleanly parsing, correctly tagged material) slide 9 Vendors Must Work with a variety of XML tag sets customers have varying requirements / XML tagsets vendors must use many and transform more Work with good, bad and indifferent tagging, therefore know the difference when to ask for changes when to offer help how to fix problems Conclusion: Vendors need better XML skills than content providers slide 10 Page 5

slide 11 Placing XML in the Workflow (Both Publisher and Vendor may be Involved) How XML will be created when in the workflow by whom (inhouse or outsourced to vendor) how will newly created XML be validated How to store/manage the XML content management system database (XML native or not) file system to manage documents Using XML for production and QA What products made from the XML (and how) slide 12 When is the Data Tagged XML on the front end During creation/authoring Between author and any edit XML as a production step as part of editing before page production as part of page production Post Production (XML on the back end) as extension to process (for example by compositor before return to publisher) when placed into repository long after production, possibly by years Page 6

XML Brings New Proofing/QA Possibilities More intelligent proofing than Does it look right? List any element / combination Print content of reference next to place where reference is made Bibliographic references checked and made live before publication False-color proofs Add color to make things stand out Add numbering Add links to what needs to be checked slide 13 XML Workflow Means Changes in Staffing/Jobs Much rekeying eliminated Proofing/rekeying cycles shortened or eliminated (exception math! and chemistry) Changes the nature of the grunt work More time devoted to content instead of format Transformation becomes a major activity More post processors More potential output products slide 14 Page 7

Warning: XML Does Not Reduce Staff XML can increase Accuracy Opportunities in sales and new products Customer satisfaction Timeliness (reduce time to market) But not likely to be done with fewer people (Not likely to increase staff dramatically either, the few can do more!) slide 15 What it Takes to Make XML happen Tag set (DTDs and schemas) Skills and training basic wrangler prepress Software slide 16 Page 8

slide 17 Select (or Create) a Model (DTD or schemas) Sources of tagsets/models Off-the-shelf Publicly-available Government, industry, interest area, corporate groups, standards bodies Altered-to-fit Started with off-the-shelf model Modified to meet organization needs Designed and developed for an organization s specific requirements (Good documentation is the key to success) Basic XML Training for All XML Users Principles of generic markup and document modeling Tagging basics Format-specific vs. generic Structure vs. content Logical structure of documents How to handle XML how to modify and create new tagged data how to apply a document model use of XML software tools parsing and other validation error resolution slide 18 Page 9

slide 19 Tagging Knowledge Varies for Different Models For all taggers and validators, basic XML concepts For structural tagging good language skills (in the appropriate language) knowledge of how your documents work For bibliographic tagging, reference knowledge (how to tell a journal from a book from a thesis) For subject tagging (legal, medical, pharmaceutical) subject-matter expertise user expectations publisher budget XML Management Training For anyone approving budgets, buying tools, developing production schedules or goals, or managing the process Basic concepts and vocabulary Costs and scheduling Risk and opportunities Production concerns and personnel factors An understanding of how capacity and opportunity have changed slide 20 Page 10

XML Wrangler Training XML authoring Tagging or conversion from word-processing Checking and fixing XML documents Distinguishing tagset problems from tagging problems (and fixing both) DTD/schema skills read XML models teach models to vendors and new employees maintain tagset and documentation Reading and writing typesetting and conversion specifications slide 21 Prepress Skill Set Includes Produce customer-specific XML Check and fix XML documents, models Flow XML into current tools QuarkXPress, InDesign, etc. Microsoft Word and other word-processors Extract XML from current tools QuarkXPress, InDesign, etc. Microsoft Word and other word-processors and then that XML real XML Handle customer expectations of round-tripping slide 22 Page 11

DTD and Schema Skills slide 23 Identify conflicts between the tagset and the content Modify a DTD or schema to solve production problems without killing downstream processing versioning and configuration management Cheat the markup to resolve production problems (during a production crisis) New Data Manipulation Skills XML customers believe the XML hype Assume that if you have tags you can work wonders Assume you have skills beyond theirs that your old customers did not assume you had slide 24 Page 12

Manipulating XML is Prepress Life Blood slide 25 The tasks XML extraction from other files Transforming XML tags to typesetting driver code (multiple outputs) Rearranging content (into publication order) Inserting generated text (chapter numbers, footnote symbols, automatic Table of Contents) Adding metadata from other sources Vendors do QA too (they get paid for quality) The skill set XML transformation string manipulation Parsing and Delivering Clean XML You would think is basic But it may not be possible document model requires elements you cannot know publication date DOI full name of author (you have initials) Welcome to creative tag abuse <doi>xxx DOI here when assigned XXX</doi> <pub-date>xxxxxx</pub-date> These will happen so work out preferred solutions with your publisher (in advance!) slide 26 Page 13

slide 27 Publishers Can Use Transformation Skills Too Reports, checklists, false-color proofs for QA which figures do not have captions? what are the key terms in each section? any footnotes not referenced? make all the surname pink (for human checking) collect all abbreviated journal titles (software check with authority file) Almost-instant galleys or editors proofs Generated text is managed separately: (And publishers will need one or two XML gurus ) slide 28 New Tools and Tool Training (software costs are not the big issue*) Validation tools Creation and editing applications Transformation language and engine(s) Software to make print pages and webpages New delivery applications *Except maybe Content Management Systems Page 14

slide 29 XML Validation Tools (validation is central to all human-created XML processing) XML parser checks an XML document against a DTD or schema don t buy these, they are free or built in Schematron processors are free Validation beyond DTDs and schemas is typically programming (Perl, C++,.Net, javascript, etc.) (Remember that an XML document is just an editable file) XML-Aware Editors Like an XML word-processor Understands tags and knows your model Typical features styled representation (font, colors, forms-entry) choose to view tags or not DTDs and schemas guide authoring: context-sensitive tag choices validation attribute editing pick lists and shortcuts slide 30 Page 15

slide 31 Transformation Language and Processors The need is to pour XML into desktop packages select, excise, rearrange for reuse and repurposing rework XML for QA make HTML for delivery Solution = a transformation language and software standalone transformer (XSLT engines are free) built into developers environments, XML editors, GUI application-generators commercial packages (OmniMark) slide 32 A New Option: Making Print Directly From XML Using XSL-FO (Extensible Stylesheet Language-Formatting Objects) A way to make PDF, Postscript, RTF, TeX, etc. directly from XML Application-independent vocabulary for specifying text formatting page-layout Special-purpose XSL-FO rendering software takes XML as input gives PDF, Postscript (RTF, MIF, etc.) as output (Typesetters understand XSL-FO better than do most people who work with XML) Page 16

Repositories/Content Management Systems slide 33 Will you need one publishers probably/possibly prepress vendors probably not There are many things called CMS (Content Management Systems) Free advice: start without one determine your real checkin, checkout, workflow, routing, reuse requirements with your requirements known, then look at a lots of them Conclusion The Bad News: There is no Free Lunch Training is required Added value (that s richer tagging) means more work Just because it's XML doesn't mean it's good The benefits of XML are available only to those who Plan Play by the rules Work hard slide 34 Page 17

slide 35 The Good News: You Can Do XML and Benefit Training and books are available Many good tools are free, many others are affordable Service vendors rapidly becoming XML savvy The XML marketplace is growing fast Users of electronic products more sophisticated every day, demanding better electronic products slide 36 Content Providers Get the Benefits They Want Re-use and repurpose content Internationalization, localization, customization Quality checking and consistency does every chapter have a title are all the cited works referenced Print and screen publications publish at the same time Vendor and software independence slide 37 Conversion Vendors Would Make a Killing If there were not so many of them Page 18

There are Real XML Benefits for Page Production Content is in a single, familiar form: few structural surprises no unique and one-time-only styling Content is cleaner when typesetting starts Multiple outputs produced simultaneously from one source Pre-processing reports on what the document contains: do we have a graphic for every map? how many tables are there? are there equations? Generated content can be error-free slide 38 Colophon Slides and handouts created from single XML source Slides projected from HTML which was created from XML using XSLT Handouts created from XML: Source XML transformed to Open Office XML Open Office XML opened in Open Office Pagination normally adjusted Saved as PDF Slideshow materials available at: http://www.mulberrytech.com/slideshow slide 39 Page 19