Standards and Paradigms

Similar documents
Linport: Interoperable Translation and Localization Project Containers (A Progress Report) Alan Melby and Jeremy Coombs

Understand Localization Standards and Use Them Effectively. John Watkins, President, ENLASO

STANDARD FORMATS IN TRANSLATION

Transport Mechanisms: Making it Possible to Share Your Health Story. Monday, April 4 th 12:00-1:00 pm ET

GEOFLUENT VS. STRAIGHT MACHINE TRANSLATION: UNDERSTANDING THE DIFFERENCES

TMX Wish List. FEISGILTT 2016 Dublin. Angelika Zerfaß

Copyright 2018 Maxprograms

Cisco Unified Presence 8.0

Deliver robust products at reduced cost by linking model-driven software testing to quality management.

Archive and Preservation for Media Collections

UniTerm Formats and Terminology Exchange

IBM Workplace Web Content Management

Lionbridge Connector for Episerver CMS 11. User Guide

PROMT Flexible and Efficient Management of Translation Quality

Anchovy User Guide. Copyright Maxprograms

supported file formats & compatibility with other tools

Swordfish III User Guide. Copyright Maxprograms

DITA 1.2 Feature Article: Using XLIFF to Translate DITA Projects

HPE BRIDGES TRADITIONAL AND NEW IT

Product Roadmap Structured Content Technologies

Network Data Recorder. Chapters and Chapters 9-11 Discussions and a look toward 2019

Federal Government. Each fiscal year the Federal Government is challenged CATEGORY MANAGEMENT IN THE WHAT IS CATEGORY MANAGEMENT?

Translation Tools. What they can and cannot do. Angelika Zerfass

An introduction to the VDI landscape

IP Video Network Gateway Solutions

Open and Flexible Metadata in Localization

Your Data Center is Everywhere. Unified Computing System Data Center Campaign Overview Marketing Cheat Sheet

Lionbridge Connector for Sitecore. User Guide

Lionbridge Connector for Episerver. User Guide. EpiServer 10. Version

Analysis of Effectiveness of Open Service Architecture for Fixed and Mobile Convergence

Sterling Talent Solutions Automates DevOps and Orchestrates Data Center Operations. SaltStack Enterprise case study

Getting Started for Translators Training Course

XLIFF 2.0. Dr. David Filip. Multilingual Web Workshop CNR PISA, April 4, 2011

Artop (AUTOSAR Tool Platform) Whitepaper

XLIFF. An XML standard for localisation Tony Jewtushenko Oracle Peter Reynolds Bowne Global Solutions. Tuesday, November 12, 2002 LRC 2002 Conference

BUILDING A WEBSITE FOR THE NUMBER ONE CHILDREN S HOSPITAL IN THE U.S. May 10, 2011

Web Services For Translation

Fast Innovation requires Fast IT

CT and IT architecture reconstruction based on software_. Global CTO

Lionbridge Connector for Sitecore. User Guide

Fluenta DITA Translation Manager. Copyright Maxprograms

Tech Talk Nutanix Calm. Greg Smith, VP Product Marketing Gil Haberman, Director of Product Marketing

Part I: Future Internet Foundations: Architectural Issues

Upgrade Your MuleESB with Solace s Messaging Infrastructure

Lionbridge Connector for Drupal 8 TMGMT. Setup and User Guide

SYSPRO s Fluid Interface Design

Introduction to RESTful Web Services. Presented by Steve Ives

Copyright 2014 Blue Net Corporation. All rights reserved

Activity Report at SYSTRAN S.A.

Quantum, a Data Storage Solutions Leader, Delivers Responsive HTML5-Based Documentation Centers Using MadCap Flare

A number of optimizations are already in use by the majority of companies in industry, notably:

Pervasive PSQL Summit v10 Highlights Performance and analytics

R&D White Paper WHP 018. The DVB MHP Internet Access profile. Research & Development BRITISH BROADCASTING CORPORATION. January J.C.

XML Documentation for Adobe Experience Manager

MICROSOFT VISUAL STUDIO 2010 Overview

WHITE PAPER Cloud Technology for TV Broadcasters Baskar Subramanian Co-founder Amagi Media Labs Pvt. Ltd.

IMI WHITE PAPER INFORMATION MAPPING AND DITA: TWO WORLDS, ONE SOLUTION

Best Practices to Transition to the Cloud. Five ways to improve IT agility and speed development by adopting a Cloud DevOps approach

Automating SAP System Copy Pre- and Post processing of the System Copy WHITE PAPER

MadCap Lingo at HPE Big Data

ONAP CCVPN Blueprint Overview. ONAP CCVPN Blueprint Improves Agility and Provides Cross-Domain Connectivity. ONAP CCVPN Blueprint Overview 1

Agent-Enabling Transformation of E-Commerce Portals with Web Services

The Case for Connectivity

GNU Radio Technical Update

Vortex Whitepaper. Simplifying Real-time Information Integration in Industrial Internet of Things (IIoT) Control Systems

The Technologies of Translation. The need-to-know about three popular content localization technologies: proxy, connectors and APIs.

WHITEPAPER. Enterprise Cyber Risk Management Protecting IT Assets that Matter

Reverse Software Engineering Using UML tools Jalak Vora 1 Ravi Zala 2

Understanding JDF The Graphic Enterprise It s all about Connectivity

An introduction to Headless Content Management Systems

How to integrate data into Tableau

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered.

Taming Rave: How to control data collection standards?

REALIZE YOUR. DIGITAL VISION with Digital Private Cloud from Atos and VMware

Spotfire and Tableau Positioning. Summary

Legacy Transaction Integration TM In a Service-oriented Architecture (SOA)

Seven Decision Points When Considering Containers

FOUR INDEPENDENT TOOLS TO MANAGE COMPLEXITY INHERENT TO DEVELOPING STATE OF THE ART SYSTEMS. DEVELOPER SPECIFIER TESTER

From Windows CE to an IoT RTOS. Microsoft s Evolution and the Rise of Real-Time Applications

THE STATE OF IT TRANSFORMATION FOR RETAIL

Transformation Through Innovation

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight

Introduction to NetBorder Suite for Contact Centers

J a v a C o n t e n t R e p o s i t o r y. Senior Software Engineer Object Computing, Inc.

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Dell EMC ScaleIO Ready Node

XLIFF Manager User Guide

Introduction of the Industrial Internet Consortium. May 2016

XLIFF: Theory and Reality Lessons Learned by Medtronic in 4 Years of Everyday XLIFF Use

Introduction to Computer-Assisted Translation

HOW TO ENABLE AFFORDABLE ENTERPRISE VIDEO FOR EVERYONE

4. TECHNOLOGICAL DECISIONS

SysML Past, Present, and Future. J.D. Baker Sparx Systems Ambassador Sparx Systems Pty Ltd

Top 4 considerations for choosing a converged infrastructure for private clouds

Bizagi Process Management Suite as an Application of the Model Driven Architecture Approach for Developing Information Systems

Introduction to Data Science

Automate Transform Analyze

Your guide to embedding SDN in data centers, from start to finish

Presence and IM: Reduce Distractions and Increase Productivity

Part 1: XLIFF Generators

Transcription:

Standards and Paradigms The difference made by standards oriented processes Joachim Schurig, Senior Technical Director Language Technology, Chair ETSI ISG LIS Global Customer Lifecycle SM 1

How do we produce today?

How do we produce today? Document based approach LIONBRIDGE MANAGED SERVICES Available Technology Components Helped by the following translation standards: XLIFF 1.2 TMX (ITS 2.0, SRX) PORTAL Freeway CONNECTIVITY Freeway Web Services WORKFLOW Lionbridge TMS PROXY Translation Proxy TRANSLATION Translation Workspace LANGUAGE QUALITY Linguistic Toolbox MACHINE TRANSLATION Hybrid Engine Principal agents in data flow PORTAL Freeway CONNECTIVITY Freeway Web Services WORKFLOW Translation Workspace W. TRANSLATION Translation Workspace MACHINE TRANSLATION Hybrid Engine LANGUAGE QUALITY Linguistic Toolbox 3

How do we produce today? Portal/Order System (customer facing) Receives any type of content Handles quotes and orders Passes content on to workflow system Workflow System (production facing) Breaks content down into work items Controls conversions to pivotal format (XLIFF 1.2) Streams work items through the translation process Assigns resources Translation Memory / Editing environment Keeps TUs of every translation ever done Real time globally shared TM access 4

The benefits of using XLIFF 1.2 internally No character corruption issues Automation works without unexpected failures No document corruption Control over editing environment (including version control) No back conversion failures 5

Case Study: Visual Studio (Orcas) Orcas was the first big project translated using the XLIFF file format and Xliff Editor During the project we tracked the wordcount differences between RTF and XLIFF The results confirmed the expectations of reduced cost and reduced cycle times. The tracked data helped us to convince the client about the benefit of switching to the XLIFF file format. Wordcount reduction Weighed Wordcounts using XLIFF file format lower by ~10% Improved Productivity XLIFF file format allowed a reduction in translation time Improved productivity by avoiding format issues. 6

What about the promise of interchangeability of XLIFF 1.2? It was just that a promise. In practice, working with 3rd party XLIFF 1.x documents was causing more difficulties than parsing plain XML or binary formats (where we knew at least what to expect) Reasons for bad interchange: Erroneous implementations Ambiguous and incomplete specification 7

The benefits still let XLIFF 1.2 fly! Adoption in Lionbridge initially happened by word- of- mouth of success stories Heavily campaigning XLIFF transition since 2010 Early 2012 showed 85% transition to XLIFF, which slowly converged to 95% nowadays Remaining 5% RTF due to Office centric, complex, legacy workflows No language preference on XLIFF adoption No vendors ever rejected working in XLIFF 8

And we do more than 1,2 billion words in XLIFF 1.2 per year! 1.4 Billion words (XLIFF) 1.2 1 0.8 0.6 Billion words (XLIFF) 0.4 0.2 0 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 9

Standardization v1.0 representing the traditional process This was an example for an internal translation process which profits from standardization Pivotal document format is XLIFF 1.2, and TM data are exchanged with TMX 1.4.2. (TMX 1.4.2, not 1.4b? Yes: http://uri.etsi.org/lis/002/v1.4.2/gs_lis002v010402p.pdf ) Internal standardization allows to streamline and automate processes as long as all tools are controlled by the same authority Benefits can be huge our lower limit acceptance level for a new client came down from several thousand US$ to $100 due to reduced transaction cost! But benefits from XLIFF 1.2 end when interoperating with 3rd party software Similarly, TMX shows its limitations when exchanging with 3rd party software 10

Standardization v1.0 representing the traditional process So this represents the traditional workflow, where we get files/documents from the customer, and create a TM on our side. Our industry s standards suite has been focused on this traditional workflow, too: TMX: document/tm centric SRX: document/tm centric in the way that it assumes there is more than one correct segmentation TBX: document centric in the way that it uses a document to transport the complete set of terms XLIFF: document centric as it is the document 11

Now what about the CMS systems? Starting in the mid- 90s, CMS systems have by now become the ubiquitous way of managing content electronically Over the past few years, CMS system count has ten- fold from the hundreds to the thousands Unfortunately, a large part of the CMS systems do not specifically support translation processes (although there are great exceptions) 12 Copyright 2013. Confiden4al Distribu4on prohibited without permission

Copy and Paste? Accessing content data for translation in CMS systems is often difficult Sometimes one gets the impression that content is held hostage We see many situations where data is transported via copy and paste Automation requires Connector functionality, which interfaces between CMS and translation service Each CMS requires its own Connector implementation There are > 1200 CMS on the market in 2014! 13

And the segmentation? Even with a successful source/target roundtrip process, CMS systems lose the association between source and target segments because their content block size typically is larger than one sentence. When target content now gets modified by a subject matter expert, it is extremely difficult to represent the modifications back in a sentence segmented translation memory. This is a major issue, as it makes the information flow unidirectional, forbids in- context reviews, or creates an insane amount of manual adjustment. As an industry, we need a technical solution for this problem. As an economy, we need a technical solution for this problem as it unproportionally increases cost of business for multilingual regions. 14

Is this the last millennium's Translation Model? Is the translation model with a TM external to the content data still the right one? Should we not see the CMS as one of the TMs, including versioning for old translations? Should the CMS not offer online APIs to push new translatable content and allow queries for old translations? Is variable segmentation an idea of the past, and ULI s universal segmentation initiative the future? We interconnect already today up to hundred TMs for one large query base, different segmentation rules only split the content into silos. 15

We need a new type of standards! Opening interoperability: tighter specification! Getting away from the document centric model! Focusing on services and protocols! Including the king of content of today the CMS Need incentives and motivation for CMSs to adapt to multilingualism All content and support data of today is dynamic, and should not be forced anymore into a document container 16

Whish List: XLIFF 2.0 adopted by all content creators for a translation interface Interoperability part of standards conformance tests Standardized connector protocol for CMS systems (COTI as a blueprint for the orders, Linport for the content?) CMS maintaining sentence segmentation/association can it be supported by some formalism? ITS 2.o alongside all exchanged content, persistently stored in CMS Standard protocol definitions for TM query, Terminology, Content/Segment exchange Hard competition on a unified market! We need the best products and services possible to bridge the language gap! 17

Thank you! Joachim Schurig, Senior Technical Director Language Technology, Chair ETSI ISG LIS Global Customer Lifecycle SM Email: joachim.schurig@lionbridge.com Mobile: +33 6 14 50 27 75 www.lionbridge.com http://blog.lionbridge.com http://twitter.com/lionbridge http://www.facebook.com/l10nbridge 18