Stelae Technologies ... Data Conversion a Walkthrough. «Extracting the Intelligence from Content»

Size: px
Start display at page:

Download "Stelae Technologies ... Data Conversion a Walkthrough. «Extracting the Intelligence from Content»"

Transcription

1 1

2 Stelae Technologies «Extracting the Intelligence from Content» Data Conversion a Walkthrough... 2

3 Khemeia the product Product: Khemeia - converts unstructured information into structured semantically tagged content Utilised by organizations who undertake: Data conversion and transform content Enrich content with metadata Position: Unique on the market over 70 algorithms combining multiple analysis methodologies Competition: Mainly solutions with a large manual workflow 3

4 Khemeia - What does it do? Technical Content types: Maintenance manuals, reference documents, catalogues Inputs: PDF, ASCII, ATF, Word Outputs: S1000D, ATA, XML, HTML, SGML, DITA Legal Content types: Judgments, legislation, regulation, contracts Inputs: PDF, OCR, HTML, RTF, Word Outputs: XML, EPUB, HTML Financial Content types: Company accounts, financial statements Inputs: PDF, OCR, Excel, Word HTML Outputs: ixbrl, XBRL, customer-specific XML Taxonomy: UK GAAP, US GAAP, Indian GAAP, Irish GAAP Publishing Content types: STM, newspapers, magazines, books Inputs: PDF, Word, InDesign, QuarkXPress Outputs: XML, EPUB, NITF, DITA, DocBook 4

5 Content conversion the problem Competing solutions Input: PDF Word Text. Output: XML S1000D XBRL/iXBRL Mobile devices Outsourced Semi-automated scripts Expensive Time consuming Error prone Extensive QA required 5

6 Khemeia the solution Customers benefits Enriched content for users Improved indexing Better search results Features Cloud based Automatic processing Rapid deployment Ultra-fast conversion times One product for multiple content types (legal, technical) Multi-language Faster speed to market XBRL filing of company accounts S1000D technical documentation for defense, aerospace 6

7 Khemeia - Inputs and outputs Input types include: PDF ATF (ASCII Technical Format) OCR (optical character recognition) formats Microsoft Word RTF HTML Excel CSV ASCII XML SGML InDesign, QuarkXPress Output types include: XML SGML HTML RDFa PDF JPEG XMP NITF, NewsML XBRL / ixbrl S1000D DITA EPUB, e-book reader, tablet, smartphone formats. customer-specific DTDs 7

8 Types of content 8

9 Application: Legal judgment Input document: US District Court Output: XML generated automatically per customer specification 9

10 Application: Technical documentation 10

11 Application: Reference information 11

12 Application: Investment bulletin 12

13 Application: Directory listing 13

14 Application: Contracts processing <?xml version="1.0" encoding="utf-8"?> <files> <agreement>channel Alliance Program Agreement </agreement> <effect>september 15st, 2000</effect> <party>masterway Telecomunicacaes Ltda</party> <address>rua do Ouvidor, 161 / 603, Rio de Janeiro, RJ, Brazil and Av. Brigadeiro Faria Lima, Cjs. 1005/1010, Sao Paulo, SP, Brazil. </address> <region>brazil</region> <termination>two (2) years</termination> <govlaw>the State of California</govlaw> <date>september 15st</date> </files> 14

15 Application: Financial accounts 15

16 Accounts output ixbrl extract 16

17 Application: Invoice processing Inputs OCR PDF Analysis and Extraction Analysis and extraction of metadata: customer name, supplier names, product type, quantity, amount, VAT numbers,... Outputs Integration into ERP applications: SAP,... 17

18 Case studies 18

19 Customer example: Aerospace Legacy data Aircraft maintenance manuals ASCII text and scanned paper Khemeia processing Automated structuring into linked S1000D data modules Images linked to part numbers Benefits Speed of deployment Automation and accuracy Security considerations: NATO & MOD classified information Khemeia identified as only viable solution 19

20 Customer example: Legal publisher Real time information crawled from over 40+ sources - PDF, Word, HTML Khemeia automated metadata extraction, structuring according top different XML schema Output delivered to content mining and CMS solutions (e.g. Temis & Documentum) Benefits One input multiple outputs (archive, CMS, web publishing, content mining) Real time information publishing - 2 seconds/page 70% cost reduction versus other solutions 20

21 Customer example: Financial statements Company accounts in PDF (native digital & image), Word, Excel, HTML, InDesign Khemeia automated conversion of financial data to XML XBRL taxonomy tags automatically queued to relevant financial values Processed and validated by an operator utilizing pdf2xbrl editor and output for filing as XBRL/iXBRL Benefits Unique PDF to XBRL conversion solution 2-4 hours of processing time per account set versus 18 hours 21

22 Behind the scenes 22

23 Workflow Scan OCR Analysis Style Structure Validation QA Image creation from paper PDF, JPEG, TIFF Optical Character Recognition Content analysis Structure Metadata Font Size Bold, italics Hierarchy Tables Images Equations Semantic tagging XML DTD/XML schema Quality control Checking Correction Error Control Module for Quality Control Scan & OCR Khemeia QA 23

24 Khemeia - The technology is unique Utilizes software algorithms that combine multiple analysis methodologies UNIQUE ON THE MARKET Visual Analysis (font, color, size,.) Structure/Hierarchy (e.g. titles, sub-titles, paragraphs, footnotes, etc.) Geometric Positioning (pinpoints the position of content on the page) Khemeia Keyword Analysis (matches specific terms i.e. key words or phrases) Regular Expressions (elements identified by matching specific logic against content patterns) Integration of Dictionaries/Indexes (match against customer-specific taxonomies e.g. legal terms) 24

25 Khemeia : User Interface 25

26 . 26

27 Business problem Khemeia solves Khemeia enables: Increased productivity Improved quality and enrichment Rapid deployment times Reduces customer costs by up to 70% For organizations who undertake: Data conversion and transform content Enrich content with metadata Have mainly manual workflows Applications: Conversion of content into XML Generating metadata to define, describe and enrich the content Providing indexed and searchable content Repurposing legacy XML 27

28 Khemeia partial client list 28

29 Thank you 29

Khemeia Case Study: Automation of Large Scale Legacy Data Conversion

Khemeia Case Study: Automation of Large Scale Legacy Data Conversion Khemeia Case Study: Automation of Large Scale Legacy Data Conversion Aruna Schwarz, CEO Stelae Technologies S1000D User Forum San Diego, 22 nd September 2015 1 Khemeia by Stelae Technologies Who are we?

More information

ABBYY FineReader 14. User s Guide ABBYY Production LLC. All rights reserved.

ABBYY FineReader 14. User s Guide ABBYY Production LLC. All rights reserved. ABBYY FineReader 14 User s Guide 2017 ABBYY Production LLC All rights reserved Information in this document is subject to change without notice and does not bear any commitment on the part of ABBYY The

More information

PROCESSING AND CATALOGUING DATA AND DOCUMENTATION - QUALITATIVE

PROCESSING AND CATALOGUING DATA AND DOCUMENTATION - QUALITATIVE PROCESSING AND CATALOGUING DATA AND DOCUMENTATION - QUALITATIVE....... INGEST SERVICES UNIVERSITY OF ESSEX... HOW TO SET UP A DATA SERVICE, 8-9 NOVEMBER 2012 PRE - PROCESSING Liaising with depositor: consent

More information

Scanshare Sales Guide V1.2

Scanshare Sales Guide V1.2 Scanshare Sales Guide V1.2 What is Scanshare? The document business critical data, currently locked in paper form The MFD the on ramp to an organisation s digital information workflow Scanshare the middleware/bridge

More information

+44 (0)

+44 (0) I N T R O D U C I N G N E X T G E N The Librios NextGen platform lets you monetise and repurpose your content and improve your production workflows. It is designed to help small to medium publishers repurpose

More information

PROCESSING AND CATALOGUING DATA AND DOCUMENTATION: QUALITATIVE

PROCESSING AND CATALOGUING DATA AND DOCUMENTATION: QUALITATIVE PROCESSING AND CATALOGUING DATA AND DOCUMENTATION: QUALITATIVE.... LIBBY BISHOP... INGEST SERVICES UNIVERSITY OF ESSEX... HOW TO SET UP A DATA SERVICE, 3 4 JULY 2013 PRE - PROCESSING Liaising with depositor:

More information

DOCUMENT NAVIGATOR SALES GUIDE ADD NAME. KONICA MINOLTA Document Navigator Sales Guide

DOCUMENT NAVIGATOR SALES GUIDE ADD NAME. KONICA MINOLTA Document Navigator Sales Guide DOCUMENT NAVIGATOR SALES GUIDE ADD NAME WHAT IS DOCUMENT NAVIGATOR? The document business critical data, currently locked in paper form The MFD the on ramp to an organisation s digital information workflow

More information

The Functional Extension Parser (FEP) A Document Understanding Platform

The Functional Extension Parser (FEP) A Document Understanding Platform The Functional Extension Parser (FEP) A Document Understanding Platform Günter Mühlberger University of Innsbruck Department for German Language and Literature Studies Introduction A book is more than

More information

XBRL Design and Modeling Methodology in Practice

XBRL Design and Modeling Methodology in Practice XBRL Design and Modeling Methodology in Practice speaker: co-author: Herm Fischer Developer, Mark V Systems Timothy Randle Senior Advising Architect, Data Modeler and XBRL Taxonomist Evolution of practices

More information

ABBYY FineReader 10. Professional Edition Corporate Edition Site License Edition. Small and medium-sized businesses or individual departments

ABBYY FineReader 10. Professional Edition Corporate Edition Site License Edition. Small and medium-sized businesses or individual departments ABBYY FineReader 10 Compare Versions Professional Edition Corporate Edition Site License Edition Ideal For Single users and business professionals Small and medium-sized businesses or individual departments

More information

A Case Study Webinar: How Wiley-Blackwell Accelerated Digital Production by 75% webinar. aptaracorp.com

A Case Study Webinar: How Wiley-Blackwell Accelerated Digital Production by 75% webinar. aptaracorp.com webinar Q&A A Case Study Webinar: How Wiley-Blackwell Accelerated Digital Production by 75% How would you characterize the capabilities of Wiley's solution...were they primarily due to (a) out-of-the-box

More information

The Case of the 35 Gigabyte Digital Record: OCR and Digital Workflows

The Case of the 35 Gigabyte Digital Record: OCR and Digital Workflows Florida International University FIU Digital Commons Works of the FIU Libraries FIU Libraries 8-14-2015 The Case of the 35 Gigabyte Digital Record: OCR and Digital Workflows Kelley F. Rowan Florida International

More information

Mission Possible: Move to a Content Management System to Deliver Business Results from Legacy Content

Mission Possible: Move to a Content Management System to Deliver Business Results from Legacy Content Mission Possible: Move to a Content Management System to Deliver Business Results from Legacy Content Greg Fagan, Sales Director Data Conversion Laboratory So you ve decided you need a system to migrate,

More information

Getting to JATS and BITS. Presented by Bruce D. Rosenblum CEO Inera Incorporated

Getting to JATS and BITS. Presented by Bruce D. Rosenblum CEO Inera Incorporated Getting to JATS and BITS Presented by Bruce D. Rosenblum CEO Inera Incorporated Basic Assumption XML is needed for scholarly Journals and Books HTML presentation Responsive design Rich hyperlinks Long-term

More information

ABBYY FineReader 14 Full Feature List

ABBYY FineReader 14 Full Feature List ABBYY FineReader 14 Full Feature List Productivity and Ease of Use Working with PDF NEW Read and search Review and comment Text extraction and quotation Edit and modify Form filling Security Prepare document

More information

XML, Metadata and More!

XML, Metadata and More! XML, Metadata and More! What is XML? A robust and useful mark-up language Meta-language Allows for reformatting of data through style sheets XML defines the structure of a document DTD - Document Type

More information

Scan to PC Desktop Professional v9 vs. Scan to PC Desktop SE v9 + SE

Scan to PC Desktop Professional v9 vs. Scan to PC Desktop SE v9 + SE Scan to PC Desktop Professional v9 PaperPort Desktop Page Thumbnails on the Desktop for Image and PDF files T (Scanner Enhancement) Tools on the Desktop Search by Document Name and Metadata PageViewer

More information

How to Build a Digital Library

How to Build a Digital Library How to Build a Digital Library Ian H. Witten & David Bainbridge Contents Preface Acknowledgements i iv 1. Orientation: The world of digital libraries 1 One: Supporting human development 1 Two: Pushing

More information

- What we actually mean by documents (the FRBR hierarchy) - What are the components of documents

- What we actually mean by documents (the FRBR hierarchy) - What are the components of documents Purpose of these slides Introduction to XML for parliamentary documents (and all other kinds of documents, actually) Prof. Fabio Vitali University of Bologna Part 1 Introduce the principal aspects of electronic

More information

Adobe. Using DITA XML for Instructional Documentation. Andrew Thomas 08/10/ Adobe Systems Incorporated. All Rights Reserved.

Adobe. Using DITA XML for Instructional Documentation. Andrew Thomas 08/10/ Adobe Systems Incorporated. All Rights Reserved. Adobe Using DITA XML for Instructional Documentation Andrew Thomas 08/10/2005 2005 Adobe Systems Incorporated. All Rights Reserved. Publishing & localization at Adobe Direct localization of software, documentation,

More information

PDFelement 6 Solutions Comparison

PDFelement 6 Solutions Comparison duct Data Sheet Solutions Comparison Our latest release comes stacked with all the productivity-enhancing functionality you ve come to know and love. Compatibility DC Compatible with Microsoft Windows

More information

Moving to XML: The Investment

Moving to XML: The Investment B. Tommie Usdin Mulberry Technologies Inc. 17 West Jefferson St. Suite 207 Rockville MD 20850 Phone: 301/315-9631 Fax: 301/315-8285 info@mulberrytech.com http://www.mulberrytech.com Version 1.0 (January

More information

Laserfiche Document Management at a Glance

Laserfiche Document Management at a Glance Document Management Laserfiche s desktop, web and mobile clients enable users to access and make changes to documents in the repository from any device. Laserfiche Document Management at a Glance Enable

More information

Features & Functionalities

Features & Functionalities Features & Functionalities Release 3.0 www.capture-experts.com Import FEATURES Processing TIF CSV EML Text Clean-up Email HTML ZIP TXT Merge Documents Convert to TIF PST RTF PPT XLS Text Recognition Barcode

More information

QUARK AUTHOR THE SMART CONTENT TOOL. INFO SHEET Quark Author

QUARK AUTHOR THE SMART CONTENT TOOL. INFO SHEET Quark Author QUARK AUTHOR THE SMART CONTENT TOOL Quark Author is Web-based software that, together with Quark Publishing Platform, enables business and IT leaders to streamline and automate high-value customer communications

More information

Quick Reference Guide What s New in NSi AutoStore TM 6.0

Quick Reference Guide What s New in NSi AutoStore TM 6.0 Quick Reference Guide What s New in NSi AutoStore TM 6.0 Notable Solutions, Inc. System requirements Hardware Windows operating system (OS) running on computer with at least a 2 GHz Processor Minimum 2

More information

Adobe Tech Comm Survey Findings. Explore key trends shaping the Technical Communication industry

Adobe Tech Comm Survey Findings. Explore key trends shaping the Technical Communication industry Explore key trends shaping the Technical Communication industry Adobe Tech Comm Survey 2017-2018 Findings The people The 2017-2018 edition of the world s biggest Tech Comm survey is powered by 2000+ respondents

More information

CGM v SVG. Computer Graphics Metafile v Scalable Vector Graphic. David Manock

CGM v SVG. Computer Graphics Metafile v Scalable Vector Graphic. David Manock It shall not be communicated to any third party without the owner s written consent. All rights reserved. CGM v SVG Computer Graphics Metafile v Scalable Vector Graphic David Manock VP Sales and Marketing

More information

File Format Considerations in the Preservation of e- Books

File Format Considerations in the Preservation of e- Books File Format Considerations in the Preservation of e- Books Sheila Morrissey Senior Research Developer, Portico NISO Webinar: Heritage Lost? Ensuring the Preservation of E-books May 23, 1012 Portico - Third

More information

A tool for Entering Structural Metadata in Digital Libraries

A tool for Entering Structural Metadata in Digital Libraries A tool for Entering Structural Metadata in Digital Libraries Lavanya Prahallad, Indira Thammishetty, E.Veera Raghavendra, Vamshi Ambati MSIT Division, International Institute of Information Technology,

More information

ABBYY FineReader 14 YOUR DOCUMENTS IN ACTION

ABBYY FineReader 14 YOUR DOCUMENTS IN ACTION YOUR DOCUMENTS IN ACTION Combining powerful OCR with essential PDF capabilities, FineReader provides a single solution for working with PDFs and scanned paper documents. Content Your Single Solution for

More information

Overview. What is TCM? TCM Supported File Types A Day in the Life of a Document Using TCM in Munis Using TCM without Munis TCM extra Features Q&A *

Overview. What is TCM? TCM Supported File Types A Day in the Life of a Document Using TCM in Munis Using TCM without Munis TCM extra Features Q&A * TCM 101 Overview What is TCM? TCM Supported File Types A Day in the Life of a Document Using TCM in Munis Using TCM without Munis TCM extra Features Q&A * 2 What is Tyler Content Manager? Provides Munis

More information

GUIDELINES FOR CREATION AND PRESERVATION OF DIGITAL FILES

GUIDELINES FOR CREATION AND PRESERVATION OF DIGITAL FILES GUIDELINES FOR CREATION AND PRESERVATION OF DIGITAL FILES October 2018 INTRODUCTION This document provides guidelines for the creation and preservation of digital files. They pertain to both born-digital

More information

Features & Functionalities

Features & Functionalities Features & Functionalities Release 2.1 www.capture-experts.com Import FEATURES OVERVIEW Processing TIF CSV EML Text Clean-up Email HTML ZIP TXT Merge Documents Convert to TIF PST RTF PPT XLS Text Recognition

More information

XBRL: Beyond Basic XML

XBRL: Beyond Basic XML XBRL: Beyond Basic XML Working Paper Series 08-11 August 2008 Craig A. VanLengen Professor of Computer Information Systems/Accounting Northern Arizona University The W. A. Franke College of Business PO

More information

WEB-BASED COLLECTION MANAGEMENT FOR ARCHIVES

WEB-BASED COLLECTION MANAGEMENT FOR ARCHIVES WEB-BASED COLLECTION MANAGEMENT FOR ARCHIVES Comprehensive Collections Management Systems You Can Access Anytime, Anywhere AXIELL COLLECTIONS FOR ARCHIVES Axiell Collections is a webbased CMS designed

More information

Learn Html Pdf Converter Software Full Version Windows 7

Learn Html Pdf Converter Software Full Version Windows 7 Learn Html Pdf Converter Software Full Version Windows 7 The Free DOC to PDF Converter is software that has been widely used by many Learn more Version: 1.0. Total Downloads: 40,498. Date Added: Feb. 02,

More information

XML Documentation for Adobe Experience Manager

XML Documentation for Adobe Experience Manager XML Documentation for Adobe Experience Manager Solution brief XML Documentation for Adobe Experience Manager An enterprise-class CCMS to manage documentation from creation to delivery It s a component

More information

Choosing DITA and Componize

Choosing DITA and Componize Choosing DITA and Componize Linear writing versus structured & modular writing (DITA) Drawbacks of linear writing Authoring Cross-references inserted and maintained manually Copy and paste information

More information

A Guide to Automation Services 8.5.1

A Guide to Automation Services 8.5.1 A Guide to Automation Services 8.5.1 CONTENTS Contents Introduction...4 Where we're coming from...4 Conventions in this book...4 Understanding Automation Services...6 What is Automation Services?...6 Process

More information

Export out report results in multiple formats like PDF, Excel, Print, , etc.

Export out report results in multiple formats like PDF, Excel, Print,  , etc. Edition Comparison DOCSVAULT Docsvault is full of features that can help small businesses and large enterprises go paperless. The feature matrix below displays Docsvault s abilities for its Enterprise

More information

Digitizing Historic Newspapers

Digitizing Historic Newspapers Digitizing Historic Newspapers the University of Utah Way Presented by Scott Christensen iarchives, Inc. July 14, 2005 Agenda 3 Keys to a Quality Digitized Product Processing Methodology Q&A 3 Keys - Introduction

More information

Integrated S1000D & ATA ispec 2200 Publications Lifecycle Management System

Integrated S1000D & ATA ispec 2200 Publications Lifecycle Management System Integrated S1000D & ATA ispec 2200 Publications Lifecycle Management System WebX Systems Overview XML Software Solution Provider for Structured Document Creation, Multichannel Publishing, Content Management

More information

ISO PDF/A -Standard Archive file format standard for long-term preservation

ISO PDF/A -Standard Archive file format standard for long-term preservation ISO PDF/A -Standard Archive file format standard for long-term preservation Marc Straat 22 March 2005 Project ArchiSafe Arbeitskreise Nationale&Internationale Standards: Rechtliche Rahmenbedingungen, Verfahren,

More information

Improved automatic restart and failed job recovery 64-bit support for improved memory utilisation

Improved automatic restart and failed job recovery 64-bit support for improved memory utilisation NUANCE The experience speaks for itself Comparison Chart Solution Comparison Chart Ease of Use How-to-Guides teach you the key steps to use 3 3 Launchpad 3 Image Capture Scanners and All-in-Ones Scanner

More information

Lingotek Client Command Line Tool

Lingotek Client Command Line Tool DATA SHEET 03 01 2016 Lingotek Client Command Line Tool What can Lingotek Client do? Lingotek Client can do almost anything the TMS can do. Connect to Lingotek Create a project Upload documents Request

More information

PDF/A - The Basics. From the Understanding PDF White Papers PDF Tools AG

PDF/A - The Basics. From the Understanding PDF White Papers PDF Tools AG White Paper PDF/A - The Basics From the Understanding PDF White Papers PDF Tools AG Why is PDF/A necessary? What is the PDF/A standard? What are PDF/A-1a, PDF/A-1b, PDF/A2? How should the PDF/A Standard

More information

Consider the Source Structured Authoring for XML-based Documentation

Consider the Source Structured Authoring for XML-based Documentation Consider the Source Structured Authoring for XML-based Documentation Ellen McDaniel Manager of User Services and Web Coordinator College of Engineering North Carolina State University mcdaniel@ncsu.edu

More information

Xyleme Studio Data Sheet

Xyleme Studio Data Sheet XYLEME STUDIO DATA SHEET Xyleme Studio Data Sheet Rapid Single-Source Content Development Xyleme allows you to streamline and scale your content strategy while dramatically reducing the time to market

More information

Managing Information Resources

Managing Information Resources Managing Information Resources 1 Managing Data 2 Managing Information 3 Managing Contents Concepts & Definitions Data Facts devoid of meaning or intent e.g. structured data in DB Information Data that

More information

AGCO s Multi-National, Multi-language Conversion to DITA

AGCO s Multi-National, Multi-language Conversion to DITA AGCO s Multi-National, Multi-language Conversion to DITA Center for Information Development CMS/DITA NA April 2016 Abstract In 2015 AGCO, working with DCL, successfully converted over 70,000 pages to DITA

More information

Database of historical places, persons, and lemmas

Database of historical places, persons, and lemmas Database of historical places, persons, and lemmas Natalia Korchagina Outline 1. Introduction 1.1 Swiss Law Sources Foundation as a Digital Humanities project 1.2 Data to be stored 1.3 Final goal: how

More information

DOWNLOAD OR READ : WORD 10 FOR MAC OS X VISUAL QUICKSTART GUIDES PDF EBOOK EPUB MOBI

DOWNLOAD OR READ : WORD 10 FOR MAC OS X VISUAL QUICKSTART GUIDES PDF EBOOK EPUB MOBI DOWNLOAD OR READ : WORD 10 FOR MAC OS X VISUAL QUICKSTART GUIDES PDF EBOOK EPUB MOBI Page 1 Page 2 word 10 for mac os x visual quickstart guides word 10 for mac pdf word 10 for mac os x visual quickstart

More information

Laserfiche Product Suite 2011

Laserfiche Product Suite 2011 Laserfiche Product Suite 2011 The Laserfiche enterprise content management system is designed to be straightforward to purchase, deploy, extend, administer and support. Our solutions give IT managers central

More information

INDIVIDUAL bizhub ENHANCEMENT

INDIVIDUAL bizhub ENHANCEMENT INDIVIDUAL bizhub ENHANCEMENT Advanced functionality with i-option Streamlining user operation and increasing workflow capabilities are important requirements in today s corporate environments. Taking

More information

Publishing Technology 101 A Journal Publishing Primer. Mike Hepp Director, Technology Strategy Dartmouth Journal Services

Publishing Technology 101 A Journal Publishing Primer. Mike Hepp Director, Technology Strategy Dartmouth Journal Services Publishing Technology 101 A Journal Publishing Primer Mike Hepp Director, Technology Strategy Dartmouth Journal Services mike.hepp@sheridan.com Publishing Technology 101 AGENDA 12 3 EVOLUTION OF PUBLISHING

More information

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered.

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered. Content Enrichment An essential strategic capability for every publisher Enriched content. Delivered. An essential strategic capability for every publisher Overview Content is at the centre of everything

More information

Making Accessible Documents. PDF: Adobe Acrobat X & XI

Making Accessible Documents. PDF: Adobe Acrobat X & XI Making Accessible Documents PDF: Adobe Acrobat X & XI Purpose of Instruction Provide tips and strategies on creating documents accessible to individuals with disabilities. Accessibility tools and simple

More information

Scan to PC Desktop Professional v7.0 Orientation Guide

Scan to PC Desktop Professional v7.0 Orientation Guide Scan to PC Desktop Professional v7.0 Orientation Guide Maximizing Your Productivity with Scanning and Your Xerox WorkCentre Pro Multifunction Device Topics Included Scanning to the Desktop Scanning to

More information

3 Publishing Technique

3 Publishing Technique Publishing Tool 32 3 Publishing Technique As discussed in Chapter 2, annotations can be extracted from audio, text, and visual features. The extraction of text features from the audio layer is the approach

More information

White Paper: ABBYY Recognition Server Web Service API Example

White Paper: ABBYY Recognition Server Web Service API Example White Paper: ABBYY Recognition Server Web Service API Example By: Joe Hill Published: June 2017 Summary ABBYY Recognition Server converts paper or electronic documents into compressed, searchable, archive

More information

Question No: 2 Which part of the structured FrameMaker application controls how long SGML and FrameMaker element names can be by default?

Question No: 2 Which part of the structured FrameMaker application controls how long SGML and FrameMaker element names can be by default? Volume: 60 Questions Question No: 1 Which is necessary to create new data and markup text that will be inserted into an XML or SGML document when a structured FrameMaker document is exported? A. a read/write

More information

presentation design: Kat Kamp

presentation design: Kat Kamp presented by: Alicia Sell Vice President, Digitization Services Eric Larson Project Manager, Digitization Services Jocelyn Dunlop Account Representative, Midwest presentation design: Kat Kamp What is OCR

More information

How to use TRANSKRIBUS a very first manual

How to use TRANSKRIBUS a very first manual How to use TRANSKRIBUS a very first manual A simple standard workflow for humanities scholars and volunteers (screenshots below) 0.1.6, 2015-04-24 0. Introduction a. Transkribus is an expert tool. As with

More information

Using PDF Files in CONTENTdm

Using PDF Files in CONTENTdm Using PDF Files in CONTENTdm CONTENTdm uses the Adobe PDF Library to provide features for efficient processing of born-digital documents in Portable Document Format (PDF). PDF files and PDF compound objects

More information

SMEWEBSITE. How it all Works - The Dotser Process 01. Setup & Content Editing 02. The Dotser Content Management System 03

SMEWEBSITE.   How it all Works - The Dotser Process 01. Setup & Content Editing 02. The Dotser Content Management System 03 How it all Works - The Dotser Process 01 Setup & Content Editing 02 The Dotser Content Management System 03 Layout & Design 04 Responsive / Mobile Devices 05 Search Engine Optimisation (S.E.O.) Tool 06

More information

The DMS provides a web browser, a desktop client and a mobile browser as standard features.

The DMS provides a web browser, a desktop client and a mobile browser as standard features. Key System Requirements The DMS is a highly available, scalable platform on which to support a library containing millions of files and documents. All Administrative functionality can be accessed remotely

More information

Everyday Activity. Course Content. Objectives of Lecture 13 Search Engine

Everyday Activity. Course Content. Objectives of Lecture 13 Search Engine Web Technologies and Applications Winter 2001 CMPUT 499: Search Engines Dr. Osmar R. Zaïane University of Alberta Everyday Activity We use search engines whenever we look for resources on the Internet

More information

Mass Digitisation Enabling Access, Use and Reuse

Mass Digitisation Enabling Access, Use and Reuse Mass Digitisation Enabling Access, Use and Reuse National Digitisation Centre, Mikkeli, National Library of Finland Triangelipäivät 30.10.2008 Tiina Ison, Senior Analyst, Project Manager Organisation of

More information

Automating Publishing Workflows through Standardization. XML Publishing with SDL

Automating Publishing Workflows through Standardization. XML Publishing with SDL Automating Publishing Workflows through. XML Publishing with SDL sdl.com Automating Publishing Workflows through This white paper provides our perspective on the use of XML standards in managing styles

More information

AAB UNIVERSITY. Lecture 5. Use of technology in translation process. Dr.sc. Arianit Maraj

AAB UNIVERSITY. Lecture 5. Use of technology in translation process. Dr.sc. Arianit Maraj AAB UNIVERSITY Lecture 5 Use of technology in translation process Dr.sc. Arianit Maraj Arianit.maraj@universitetiaab.com 044 425 159 1 SDL Trados Studio 2014 Getting Started 2 Agenda Introducing SDL Trados

More information

Océ PRISMA archive software. Archiving made easy. Powerful, high-volume. archiving software

Océ PRISMA archive software. Archiving made easy. Powerful, high-volume. archiving software Océ PRISMA archive software Archiving made easy Powerful, high-volume archiving software Automate and accelerate archiving Flexible by design Secure access to archived documents Choose the solution that

More information

Accessible and Usable PDF Documents: Techniques for Document Authors Fourth Edition

Accessible and Usable PDF Documents: Techniques for Document Authors Fourth Edition Accessible and Usable PDF Documents: Techniques for Document Authors Fourth Edition Karen McCall, M.Ed. Contents From the Author... 4 Dedication... 4 Introduction... 20 What is PDF?... 21 History of PDF

More information

USER S GUIDE Software/Hardware Module: ADOBE ACROBAT 7

USER S GUIDE Software/Hardware Module: ADOBE ACROBAT 7 University of Arizona Information Commons Training 1 USER S GUIDE Software/Hardware Module: ADOBE ACROBAT 7 Objective: Scan and create PDF Documents using Adobe Acrobat Software p.1 Introduction p.2 Scanning

More information

is an electronic document that is both user friendly and library friendly

is an electronic document that is both user friendly and library friendly is an electronic document that is both user friendly and library friendly is easy to read and to navigate it has bookmarks and an interactive table-of-contents is practical to consult and arouses more

More information

Automatic Reader. Multi Lingual OCR System.

Automatic Reader. Multi Lingual OCR System. Automatic Reader Multi Lingual OCR System What is the Automatic Reader? Sakhr s Automatic Reader transforms scanned images into a grid of millions of dots, optically recognizes the characters found in

More information

Chapter 11: Editorial Workflow

Chapter 11: Editorial Workflow Chapter 11: Editorial Workflow Chapter 11: Editorial Workflow In this chapter, you will follow as submission throughout the workflow, from first submission to final publication. The workflow is divided

More information

What s New in QuarkXPress 2018

What s New in QuarkXPress 2018 What s New in QuarkXPress 2018 Contents What s New in QuarkXPress 2018...1 Digital publishing...2 Export as Android App...2 HTML5 enhancements...3 Configuration changes...5 Graphics...7 Transparency blend

More information

Part III: Survey of Internet technologies

Part III: Survey of Internet technologies Part III: Survey of Internet technologies Content (e.g., HTML) kinds of objects we re moving around? References (e.g, URLs) how to talk about something not in hand? Protocols (e.g., HTTP) how do things

More information

Chapter 9 Section 3. Digital Imaging (Scanned) And Electronic (Born-Digital) Records Process And Formats

Chapter 9 Section 3. Digital Imaging (Scanned) And Electronic (Born-Digital) Records Process And Formats Records Management (RM) Chapter 9 Section 3 Digital Imaging (Scanned) And Electronic (Born-Digital) Records Process And Formats Revision: 1.0 GENERAL 1.1 The success of a digitized document conversion

More information

SharePoint Archival Storage Strategies & Technologies January Porter-Roth Associates 1

SharePoint Archival Storage Strategies & Technologies January Porter-Roth Associates 1 SharePoint Archival Storage Strategies & Technologies January 2009 Porter-Roth Associates 1 Bud Porter-Roth Porter-Roth Associates 415-381-6217 budpr@erms.com http://www.erms.com Porter-Roth Associates

More information

Proposals for a New Workflow for Level-4 Content

Proposals for a New Workflow for Level-4 Content University of Michigan Deep Blue deepblue.lib.umich.edu 2006-02-13 Proposals for a New Workflow for Level-4 Content Hawkins, Kevin http://hdl.handle.net/2027.42/78536 Hawkins 8/12/2008 10:15:40 AM Page

More information

Accessibility 101. Things to Consider. Text Documents & Presentations: Word, PDF, PowerPoint, Excel, and General D2L Accessibility Guidelines.

Accessibility 101. Things to Consider. Text Documents & Presentations: Word, PDF, PowerPoint, Excel, and General D2L Accessibility Guidelines. Accessibility 101 Things to Consider Text Documents & Presentations: Word, PDF, PowerPoint, Excel, and General D2L Accessibility Guidelines. Things to Consider Structure Figures Hyperlinks Lists Columns

More information

Search Engine Optimization

Search Engine Optimization Search Engine Optimization A necessary campaign for heightened corporate awareness What is SEO? Definition: The practice of building or transforming a Web site so that its content is seen as highly readable,

More information

Contents. Page 2. delivering solutions for your environment

Contents. Page 2. delivering solutions for your environment Contents Introduction 3 Preparation 3 XBRL Process (CaseWare AFS) 4 Open the Financial Statements 4 Complete the mandatory information 5 XBRL process continued (Both for CaseWare AFS and non-caseware AFS)

More information

AIM. 10 September

AIM. 10 September AIM These two courses are aimed at introducing you to the World of Web Programming. These courses does NOT make you Master all the skills of a Web Programmer. You must learn and work MORE in this area

More information

Achieving Accessibility with PDF: Getting from Here to There

Achieving Accessibility with PDF: Getting from Here to There Achieving Accessibility with PDF: Getting from Here to There Featuring Adobe Acrobat 8 Pete DeVasto, Andrew Kirkpatrick, Greg Pisocky Adobe Systems CSUN 2007 March 23, 2007 2007 Adobe Systems Incorporated.

More information

FineReader Engine Overview & New Features in V10

FineReader Engine Overview & New Features in V10 FineReader Engine Overview & New Features in V10 Semyon Sergunin ABBYY Headquarters September 2010 Michael Fuchs ABBYY Europe GmbH September 2010 FineReader Engine Processing Steps Step 1: Image/Document

More information

WEB-BASED COLLECTION MANAGEMENT FOR LIBRARIES

WEB-BASED COLLECTION MANAGEMENT FOR LIBRARIES WEB-BASED COLLECTION MANAGEMENT FOR LIBRARIES Comprehensive Collections Management Systems You Can Access Anytime, Anywhere AXIELL COLLECTIONS FOR LIBRARIES Axiell Collections is a web-based CMS designed

More information

DOWNLOAD OR READ : WORD AND IMAGE IN ARTHURIAN LITERATURE PDF EBOOK EPUB MOBI

DOWNLOAD OR READ : WORD AND IMAGE IN ARTHURIAN LITERATURE PDF EBOOK EPUB MOBI DOWNLOAD OR READ : WORD AND IMAGE IN ARTHURIAN LITERATURE PDF EBOOK EPUB MOBI Page 1 Page 2 word and image in arthurian literature word and image in pdf word and image in arthurian literature pdf converter,

More information

The Journey to Globalization: Building a Successful and Scalable S1000D Authoring and Data Delivery Methodology

The Journey to Globalization: Building a Successful and Scalable S1000D Authoring and Data Delivery Methodology It shall not be communicated to any third party without the owner s written consent. All rights reserved. The Journey to Globalization: Building a Successful and Scalable S1000D Authoring and Data Delivery

More information

Advanced-Forms solution overview

Advanced-Forms solution overview Advanced-Forms solution overview Advanced-Forms is a unique solution in the Output Management market, because of its unique and modern user interfacing and modern and high quality level technology for

More information

Structured Content and Personalization

Structured Content and Personalization Structured Content and Personalization Presented by: - Su-Laine Yeo, Solutions Consultant, JustSystems - Chip Gettinger, VP XML Solutions, SDL - Tom Smith, Product Marketing Executive, SDL Our Presenters

More information

ERPANET Seminar Fontainebleau

ERPANET Seminar Fontainebleau ERPANET Seminar Fontainebleau Jan 29-30, 2003 Archiving policies at the hartmut.burghard@cec.eu.int Archiving policies Table of content 1. Statute and operational tasks 2. Framework for partnerships 3.

More information

Automated Classification. Lars Marius Garshol Topic Maps

Automated Classification. Lars Marius Garshol Topic Maps Automated Classification Lars Marius Garshol Topic Maps 2007 2007-03-21 Automated classification What is it? Why do it? 2 What is automated classification? Create parts of a topic map

More information

USER GUIDE. MADCAP FLARE 2017 r3. Import

USER GUIDE. MADCAP FLARE 2017 r3. Import USER GUIDE MADCAP FLARE 2017 r3 Import Copyright 2018 MadCap Software. All rights reserved. Information in this document is subject to change without notice. The software described in this document is

More information

Nuance AutoStore route destinations

Nuance AutoStore route destinations Data Sheet Nuance AutoStore route destinations is a server-based application which orchestrates the capture and secure delivery of paper and electronic documents into business applications. Once documents

More information

Full Text Service. User Guide. Version 6.1

Full Text Service. User Guide. Version 6.1 Full Text Service User Guide Version 6.1 DocuPhase Corporation 1499 Gulf to Bay Boulevard, Clearwater, FL 33755 Tel: (727) 441-8228 Fax: (727) 444-4419 Email: Support@DocuPhase.com Web: www.docuphase.com

More information

Paraben s Network Examiner 7.0 Release Notes

Paraben s Network  Examiner 7.0 Release Notes Paraben s Network E-mail Examiner 7.0 Release Notes 1 Paraben Corporation Welcome to Paraben s Network E-mail Examiner 7.0! Paraben s Network E-mail Examiner-NEMX is an advanced network e-mail archive

More information

Accessible Document Practices in Adobe Acrobat

Accessible Document Practices in Adobe Acrobat Accessible Document Practices in Adobe Acrobat Todd M. Weissenberger, University of Iowa Adobe Acrobat lets you create documents in Portable Document Format (PDF) from a variety of sources. Acrobat PDFs

More information