Marcello Federico MMT Srl / FBK Trento, Italy

Size: px
Start display at page:

Download "Marcello Federico MMT Srl / FBK Trento, Italy"

Transcription

1 Marcello Federico MMT Srl / FBK Trento, Italy Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 207

2 Symbiotic Human and Machine Translation MT seamlessly adapts to user data learns from post-editing user enjoys enhanced productivity better user experience 3 Usable technology for the translation industry easy to install and deploy fast to set-up for a new project effective, also on small projects scalable with data and users works with commodity hardware 4 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 208

3 The Modern MT way (1) connect your CAT with a plug-in (2) drag & drop your private TMs (3) start translating! 5 Modern MT in a nutshell zero training time adapts to context learns from user corrections scales with data and users 6 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 209

4 Training data is a dynamic collection of Translation Memories At any time: new TMs are added existing TMs are extended Training time comparable to uploading time! 7 Context aware translation SENTENCE party CONTEXT We are going out. CONTEXT We approved the law TRANSLATION fête TRANSLATION parti 8 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 210

5 requests suggestions Machine Translation post-edits Incremental Learning 9 Core technology [original plan] context analyser phrase-based decoder adaptive models incremental structures parallel processing Simple. Adaptive. Neural. 10 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 211

6 Language support 45 languages fast pre-/post-processing simple interfaces tags and XML management localization of expressions TM cleaning Simple. Adaptive. Neural. Context Analyzer analyze input text A B C 5% 45% 50% retrieve best matching TMs compute matching scores dynamic structure Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 212

7 Adaptive Phrase Table 1000 Suffix Array with Ranked Sampling suffix array indexed with TMs phrases sampled on demand priority sampling over TMs dynamic structure Simple. Adaptive. Neural. Adaptive Language Model A B C w p large static background model n-grams stats indexed with TMs combination of active TM LMs TM LMs computed on the fly dynamic structure Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 213

8 TED Talks English-French M. Cettolo, et al. (2016), The IWSLT 2016 Evaluation Campaign, IWSLT. 15 Second Prototype (0.14 January 2017) Open benchmark: - Training speed: 12x Moses - 100x NMT - MT quality (BLEU): +1 vs Moses -0.5 vs NMT Ada Domains: ECB, Gnome, JRC, KDE, OpenOffice, PHP, Ubuntu, UN-TM Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 214

9 What happened Research on adaptive neural MT Believed PBMT was competitive on technical translation Finally realised superiority of NMT quality Completed PBMT release and switched to NMT Data collection for 14 translation directions Simple. Adaptive. Neural. Roadmap from last review meeting technology switch 2015 Q Q Q Q4 minimum viable product first alpha release first beta release final release context aware 1 lang pair fast training, context aware, distributed, 1 lang pair online learning plug-in, 3 lang pairs neural MT, enterprise ready, 14 lang pairs Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 215

10 Multi-user scenario Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 216

11 Multi-user scenario Multi-user scenario Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 217

12 All we need is a memory 24 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 218

13 All we need is a memory 25 All we need is a memory 26 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 219

14 All we need is a memory 27 All we need is a memory 28 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 220

15 Multi-user adaptive NMT 29 Multi-user adaptive NMT Instances are selected by combining context scores and similarity scores 30 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 221

16 Adaptation, too! Source: Farajian et al, Multi-Domain Neural MT through Unsupervised Adaptation, Proc. WMT Farajian et al. (2017) Multi-domain NMT through unsupervised adaptation, WMT. 31 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 222

17 Sep: integration of MateCat Oct: NMT code released Nov: co-development release of 14 engines Dec: performance boost 33 Relative BLEU scores wrt Google Translate 34 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 223

18 Progression in one month on English-Italian Performance of generic MMT 1-6 scale (w/o adaptation) 35 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 224

19 37 38 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 225

20 Noisy training data EN: What history teaches us IT: === Storia ================================== Simple. Adaptive. Neural. Data Cleaning We added a simple QE module to filter out bad examples: Apply Fast-Align in two directions Compute Model 1 scores in two directions Combine and normalize scores Filter out on the distribution of scores Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 226

21 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 227

22 43 44 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 228

23 We compare: Generic MT: production engine [En-It] Custom MT: Generic MT tuned on TM [takes hours] +Adaptive MT: Generic MT adapted on TM [real-time] +Incremental MT: TM updated with simulated PE Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 229

24 47 48 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 230

25 49 50 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 231

26 Use post-editing as new training instances Perform one/more iterations Can be combined with a priori adaptation Updates generic or adapted model Turchi et al. (2017), Continuous learning from human post-edits for NMT, EAMT. 52 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 232

27 53 54 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 233

28 55 56 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 234

29 57 Online-learning contribution is consistent Does it scale with number of domains? Incremental learning contributes marginally Probably depends on test set size We are not always able to beat specialized models How to improve further adaptation? 58 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 235

30 Can improve MT without touching it inside We can adapt an external MT service! Similar to NMT: two inputs (src,mt), one output (ape) Can be trained with less data than NMT We can deploy instance based adaptation Chatterjee et al. (2017), Multi-source Neural APE: FBK s participation., WMT. 60 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 236

31 61 Neural APE uses two encoders and two attention models, which are merged and used by one decoder. 62 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 237

32 63 64 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 238

33 65 66 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 239

34 67 Can improve on top of static and adaptive engine! Uses incremental learning, adaptation and online learning Portable (in principle) on the multi-domain setting Limited gain on top of full-fledged adaptive NMT Can be an extra component to manage 68 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 240

35 Multi-user scenario goes beyond simple domain adaptation We need to handle multiple evolving domains Domain customization is not an option Real-time adaptation/learning works! But, there is still room for improvement! 70 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 241

36 Thank You Website Github github.com/modernmt/mmt Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 242

D5.6 - Evaluation Benchmarks

D5.6 - Evaluation Benchmarks Th is document is part of the Project Machine Tr a n sla tion En h a n ced Com pu ter A ssisted Tr a n sla tion (Ma teca t ), fu nded by the 7th Framework Programme of th e Eu r opea n Com m ission th

More information

MMT Modern Machine Translation

MMT Modern Machine Translation MMT Modern Machine Translation Second Design and Specifications Report Author(s): Davide Caroselli, Nicola Bertoldi, Mauro Cettolo, Marcello Federico Dissemination Level: Public Date: July 1 st, 2016 Grant

More information

D6.1.2: Second report on scientific evaluations

D6.1.2: Second report on scientific evaluations D6.1.2: Second report on scientific evaluations UPVLC, XEROX, JSI-K4A, RWTH, EML and DDS Distribution: Public translectures Transcription and Translation of Video Lectures ICT Project 287755 Deliverable

More information

An efficient and user-friendly tool for machine translation quality estimation

An efficient and user-friendly tool for machine translation quality estimation An efficient and user-friendly tool for machine translation quality estimation Kashif Shah, Marco Turchi, Lucia Specia Department of Computer Science, University of Sheffield, UK {kashif.shah,l.specia}@sheffield.ac.uk

More information

iems Interactive Experiment Management System Final Report

iems Interactive Experiment Management System Final Report iems Interactive Experiment Management System Final Report Pēteris Ņikiforovs Introduction Interactive Experiment Management System (Interactive EMS or iems) is an experiment management system with a graphical

More information

An Unsupervised Model for Joint Phrase Alignment and Extraction

An Unsupervised Model for Joint Phrase Alignment and Extraction An Unsupervised Model for Joint Phrase Alignment and Extraction Graham Neubig 1,2, Taro Watanabe 2, Eiichiro Sumita 2, Shinsuke Mori 1, Tatsuya Kawahara 1 1 Graduate School of Informatics, Kyoto University

More information

NTT SMT System for IWSLT Katsuhito Sudoh, Taro Watanabe, Jun Suzuki, Hajime Tsukada, and Hideki Isozaki NTT Communication Science Labs.

NTT SMT System for IWSLT Katsuhito Sudoh, Taro Watanabe, Jun Suzuki, Hajime Tsukada, and Hideki Isozaki NTT Communication Science Labs. NTT SMT System for IWSLT 2008 Katsuhito Sudoh, Taro Watanabe, Jun Suzuki, Hajime Tsukada, and Hideki Isozaki NTT Communication Science Labs., Japan Overview 2-stage translation system k-best translation

More information

INF5820/INF9820 LANGUAGE TECHNOLOGICAL APPLICATIONS. Jan Tore Lønning, Lecture 8, 12 Oct

INF5820/INF9820 LANGUAGE TECHNOLOGICAL APPLICATIONS. Jan Tore Lønning, Lecture 8, 12 Oct 1 INF5820/INF9820 LANGUAGE TECHNOLOGICAL APPLICATIONS Jan Tore Lønning, Lecture 8, 12 Oct. 2016 jtl@ifi.uio.no Today 2 Preparing bitext Parameter tuning Reranking Some linguistic issues STMT so far 3 We

More information

OpenMaTrEx: A Free/Open-Source Marker-Driven Example-Based Machine Translation System

OpenMaTrEx: A Free/Open-Source Marker-Driven Example-Based Machine Translation System To be presented at IceTAL (Reykjavík, 16-18 Aug. 2010) OpenMaTrEx: A Free/Open-Source Marker-Driven Example-Based Machine Translation System Sandipan Dandapat, Mikel L. Forcada, Declan Groves, Sergio Penkale,

More information

LSTM for Language Translation and Image Captioning. Tel Aviv University Deep Learning Seminar Oran Gafni & Noa Yedidia

LSTM for Language Translation and Image Captioning. Tel Aviv University Deep Learning Seminar Oran Gafni & Noa Yedidia 1 LSTM for Language Translation and Image Captioning Tel Aviv University Deep Learning Seminar Oran Gafni & Noa Yedidia 2 Part I LSTM for Language Translation Motivation Background (RNNs, LSTMs) Model

More information

MMT Modern Machine Translation. First Evaluation Plan

MMT Modern Machine Translation. First Evaluation Plan This project has received funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No 645487. MMT Modern Machine Translation First Evaluation Plan Author(s):

More information

Introduction to MateCat

Introduction to MateCat Introduction to MateCat Luca De Franceschi - Associate Product Manager Le Mans, 12 June 2018 A little bit of history 1940s and 50s: Machine Translation is all the rage 1960s: Automatic Language Processing

More information

Better translations with user collaboration - Integrated MT at Microsoft

Better translations with user collaboration - Integrated MT at Microsoft Better s with user collaboration - Integrated MT at Microsoft Chris Wendt Microsoft Research One Microsoft Way Redmond, WA 98052 christw@microsoft.com Abstract This paper outlines the methodologies Microsoft

More information

A General Overview of Machine Translation System Development

A General Overview of Machine Translation System Development A General Overview of Machine Translation System Development Mārcis Pinnis, Tilde May 23, 2018, Lisbon, Portugal Who am I? My name is Mārcis Pinnis I work as a researcher in Tilde My current research focuses

More information

A Smorgasbord of Features to Combine Phrase-Based and Neural Machine Translation

A Smorgasbord of Features to Combine Phrase-Based and Neural Machine Translation A Smorgasbord of Features to Combine Phrase-Based and Neural Machine Translation Benjamin Marie Atsushi Fujita National Institute of Information and Communications Technology, 3-5 Hikaridai, Seika-cho,

More information

Kyoto-NMT: a Neural Machine Translation implementation in Chainer

Kyoto-NMT: a Neural Machine Translation implementation in Chainer Kyoto-NMT: a Neural Machine Translation implementation in Chainer Fabien Cromières Japan Science and Technology Agency Kawaguchi-shi, Saitama 332-0012 fabien@pa.jst.jp Abstract We present Kyoto-NMT, an

More information

TALP: Xgram-based Spoken Language Translation System Adrià de Gispert José B. Mariño

TALP: Xgram-based Spoken Language Translation System Adrià de Gispert José B. Mariño TALP: Xgram-based Spoken Language Translation System Adrià de Gispert José B. Mariño Outline Overview Outline Translation generation Training IWSLT'04 Chinese-English supplied task results Conclusion and

More information

Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets

Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets Rahul Potharaju (Purdue University) Navendu Jain (Microsoft Research) Cristina Nita-Rotaru (Purdue University) April

More information

Balancing Awareness and Interruption: Investigation of Notification Deferral Policies

Balancing Awareness and Interruption: Investigation of Notification Deferral Policies Balancing Awareness and Interruption: Investigation of Notification Deferral Policies Eric Horvitz 1, Johnson Apacible 1, and Muru Subramani 1 1 Microsoft Research, One Microsoft Way Redmond, Washington

More information

RLAT Rapid Language Adaptation Toolkit

RLAT Rapid Language Adaptation Toolkit RLAT Rapid Language Adaptation Toolkit Tim Schlippe May 15, 2012 RLAT Rapid Language Adaptation Toolkit - 2 RLAT Rapid Language Adaptation Toolkit RLAT Rapid Language Adaptation Toolkit - 3 Outline Introduction

More information

THE CUED NIST 2009 ARABIC-ENGLISH SMT SYSTEM

THE CUED NIST 2009 ARABIC-ENGLISH SMT SYSTEM THE CUED NIST 2009 ARABIC-ENGLISH SMT SYSTEM Adrià de Gispert, Gonzalo Iglesias, Graeme Blackwood, Jamie Brunning, Bill Byrne NIST Open MT 2009 Evaluation Workshop Ottawa, The CUED SMT system Lattice-based

More information

Tuning. Philipp Koehn presented by Gaurav Kumar. 28 September 2017

Tuning. Philipp Koehn presented by Gaurav Kumar. 28 September 2017 Tuning Philipp Koehn presented by Gaurav Kumar 28 September 2017 The Story so Far: Generative Models 1 The definition of translation probability follows a mathematical derivation argmax e p(e f) = argmax

More information

D1.6 Toolkit and Report for Translator Adaptation to New Languages (Final Version)

D1.6 Toolkit and Report for Translator Adaptation to New Languages (Final Version) www.kconnect.eu D1.6 Toolkit and Report for Translator Adaptation to New Languages (Final Version) Deliverable number D1.6 Dissemination level Public Delivery date 16 September 2016 Status Author(s) Final

More information

High School PLTW Computer Science A Curriculum

High School PLTW Computer Science A Curriculum Grade 9th - 12th, 1 Credit Elective Course Prerequisites: PLTW Computer Science and Software Engineering High School PLTW Computer Science A Curriculum Course Description: Computer Science A (CSA) builds

More information

Statistical Machine Translation Part IV Log-Linear Models

Statistical Machine Translation Part IV Log-Linear Models Statistical Machine Translation art IV Log-Linear Models Alexander Fraser Institute for Natural Language rocessing University of Stuttgart 2011.11.25 Seminar: Statistical MT Where we have been We have

More information

Corpus Acquisition from the Internet

Corpus Acquisition from the Internet Corpus Acquisition from the Internet Philipp Koehn partially based on slides from Christian Buck 26 October 2017 Big Data 1 For many language pairs, lots of text available. Text you read in your lifetime

More information

Reviewer WordPress Plugin

Reviewer WordPress Plugin Reviewer WordPress Plugin Reviews and Comparison Tables for Your Posts Version: 2.3.0 - by Michele Ivani - evographics.net This is a how-to guide to install your plugin copy. If you have any questions

More information

Experimenting in MT: Moses Toolkit and Eman

Experimenting in MT: Moses Toolkit and Eman Experimenting in MT: Moses Toolkit and Eman Aleš Tamchyna, Ondřej Bojar Institute of Formal and Applied Linguistics Faculty of Mathematics and Physics Charles University, Prague Tue Sept 8, 2015 1 / 30

More information

SCOTTISH PARLIAMENT WEB AND ONLINE ROADMAP PROJECT (INCLUDING DEVELOPING A NEW IMPROVED WEBSITE PRESENCE FOR THE SCOTTISH PARLIAMENT)

SCOTTISH PARLIAMENT WEB AND ONLINE ROADMAP PROJECT (INCLUDING DEVELOPING A NEW IMPROVED WEBSITE PRESENCE FOR THE SCOTTISH PARLIAMENT) SPCB(2018)Paper 37 3 May 2018 SCOTTISH PARLIAMENT WEB AND ONLINE ROADMAP PROJECT (INCLUDING DEVELOPING A NEW IMPROVED WEBSITE PRESENCE FOR THE SCOTTISH PARLIAMENT) Executive summary 1. The paper has been

More information

Corpus Acquisition from the Interwebs. Christian Buck, University of Edinburgh

Corpus Acquisition from the Interwebs. Christian Buck, University of Edinburgh Corpus Acquisition from the Interwebs Christian Buck, University of Edinburgh There is no data like more data (Bob Mercer, 1985) Mining Bilingual Text "Same text in different languages" Usually: one side

More information

DCU System Report on the WMT 2017 Multi-modal Machine Translation Task

DCU System Report on the WMT 2017 Multi-modal Machine Translation Task DCU System Report on the WMT 2017 Multi-modal Machine Translation Task Iacer Calixto and Koel Dutta Chowdhury and Qun Liu {iacer.calixto,koel.chowdhury,qun.liu}@adaptcentre.ie Abstract We report experiments

More information

Moses version 2.1 Release Notes

Moses version 2.1 Release Notes Moses version 2.1 Release Notes Overview The document is the release notes for the Moses SMT toolkit, version 2.1. It describes the changes to toolkit since version 1.0 from January 2013. The aim during

More information

A General-Purpose Rule Extractor for SCFG-Based Machine Translation

A General-Purpose Rule Extractor for SCFG-Based Machine Translation A General-Purpose Rule Extractor for SCFG-Based Machine Translation Greg Hanneman, Michelle Burroughs, and Alon Lavie Language Technologies Institute Carnegie Mellon University Fifth Workshop on Syntax

More information

Overview. Performance Breakthrough Features Impact Roadmap

Overview. Performance Breakthrough Features Impact Roadmap What s New? Overview Performance Breakthrough Features Impact Roadmap VS Secret Transparency Transparency No other vendor talks freely about real limits and product usability Performance is the key! Why?

More information

First Steps Towards Coverage-Based Document Alignment

First Steps Towards Coverage-Based Document Alignment First Steps Towards Coverage-Based Document Alignment Luís Gomes 1, Gabriel Pereira Lopes 1, 1 NOVA LINCS, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Portugal ISTRION BOX, Translation

More information

Joining the Conversation

Joining the Conversation Joining the Conversation Reddit, Maptime, Slack for GIS March 2, 2016 Atlanta Regional GIS User Group Chris Rice Langdon Sanders Reddit.com/r/GIS reddit.com /r/gis Reddit.com/r/GIS Reddit.com/r/GIS User

More information

NLPL - The Nordic Language Processing Laboratory.

NLPL - The Nordic Language Processing Laboratory. NLPL - The Nordic Language Processing Laboratory http://nlpl.eu/ What is NLPL? Vision virtual laboratory for large-scale NLP research share high-performance computing and data resources across language

More information

PHP Composer 9 Benefits of Using a Binary Repository Manager

PHP Composer 9 Benefits of Using a Binary Repository Manager PHP Composer 9 Benefits of Using a Binary Repository Manager White Paper Copyright 2017 JFrog Ltd. March 2017 www.jfrog.com Executive Summary PHP development has become one of the most popular platforms

More information

The QCN System for Egyptian Arabic to English Machine Translation. Presenter: Hassan Sajjad

The QCN System for Egyptian Arabic to English Machine Translation. Presenter: Hassan Sajjad The QCN System for Egyptian Arabic to English Machine Translation Presenter: Hassan Sajjad QCN Collaboration Ahmed El Kholy Wael Salloum Nizar Habash Ahmed Abdelali Nadir Durrani Francisco Guzmán Preslav

More information

jamf Nation - London Roadshow

jamf Nation - London Roadshow jamf Nation - London Roadshow Sachin Parmar Workplace Technology Manager 17 May 2018 About Just Eat Our vision Creating the world s greatest food community 3 About Me About Me Sachin Parmar 8+ years professionally

More information

Same-language MT for local flavours/flavors

Same-language MT for local flavours/flavors Same-language MT for local flavours/flavors Janice Campbell - Adobe Gema Ramírez - Prompsit Proceedings of AMTA 2018, vol. 2: MT Users' Track Boston, March 17-21, 2018 Page 35 Intro: brief history of AltLang

More information

BA Insight Federator. How the BA Insight Federator Extends SharePoint Search

BA Insight Federator. How the BA Insight Federator Extends SharePoint Search How the BA Insight Federator Extends SharePoint Search 20 Park Plaza, Suite 1115 Boston, MA 02116, USA 1.339.368.7234 info@bainsight.com www.bainsight.com BA Insight Federator The BA Insight Federator

More information

Convolutional Sequence to Sequence Learning. Denis Yarats with Jonas Gehring, Michael Auli, David Grangier, Yann Dauphin Facebook AI Research

Convolutional Sequence to Sequence Learning. Denis Yarats with Jonas Gehring, Michael Auli, David Grangier, Yann Dauphin Facebook AI Research Convolutional Sequence to Sequence Learning Denis Yarats with Jonas Gehring, Michael Auli, David Grangier, Yann Dauphin Facebook AI Research Sequence generation Need to model a conditional distribution

More information

Sparse Feature Learning

Sparse Feature Learning Sparse Feature Learning Philipp Koehn 1 March 2016 Multiple Component Models 1 Translation Model Language Model Reordering Model Component Weights 2 Language Model.05 Translation Model.26.04.19.1 Reordering

More information

AMT 2014 A TUTORIAL. MateCat: an Open Source CAT Tool for MT Post-Editing. Vancouver, BC. October amta2014.amtaweb.org

AMT 2014 A TUTORIAL. MateCat: an Open Source CAT Tool for MT Post-Editing. Vancouver, BC. October amta2014.amtaweb.org AMT 2014 A MateCat: an Open Source CAT Tool for MT Post-Editing Marcello Federico Nicola Bertoldi Marco Trombetti Alessandro Cattelan TUTORIAL The 11th Conference of the Association for Machine Translation

More information

APNIC Update. German Valdez External Relations Program Director, APNIC ARIN XXX 26-October-2012

APNIC Update. German Valdez External Relations Program Director, APNIC ARIN XXX 26-October-2012 APNIC Update German Valdez External Relations Program Director, APNIC ARIN XXX 26-October-2012 Overview Stats IPv4 transfers Training APNIC Labs Policies ISIF Public Affairs APNIC Member Survey APNIC Coming

More information

AMTA PRESENTATION. How Much Cake is Enough: The Case for Domain-Specific Engines. ALEXYANISHEVSKY Welocalize October 2015

AMTA PRESENTATION. How Much Cake is Enough: The Case for Domain-Specific Engines. ALEXYANISHEVSKY Welocalize October 2015 AMTA PRESENTATION How Much Cake is Enough: The Case for Domain-Specific Engines ALEXYANISHEVSKY Welocalize October 2015 Proceedings of MT Summit XV, vol. 2: MT Users' Track Miami, Oct 30 - Nov 3, 2015

More information

Mobile Apps Why & How

Mobile Apps Why & How Mobile Apps Why & How MOBIKATS Phillip Hunt CEO philliphunt@mobikats.com 07590 277045 Putting your Content in Everyone s Pocket. Copyright Mobikats 2012 What do we do? Consultancy on developing or reviewing

More information

Machine Translation Research in META-NET

Machine Translation Research in META-NET Machine Translation Research in META-NET Jan Hajič Institute of Formal and Applied Linguistics Charles University in Prague, CZ hajic@ufal.mff.cuni.cz Solutions for Multilingual Europe Budapest, Hungary,

More information

GEOFLUENT VS. STRAIGHT MACHINE TRANSLATION: UNDERSTANDING THE DIFFERENCES

GEOFLUENT VS. STRAIGHT MACHINE TRANSLATION: UNDERSTANDING THE DIFFERENCES SOLUTION BRIEF GEOFLUENT VS. STRAIGHT MACHINE TRANSLATION: UNDERSTANDING THE DIFFERENCES by EXECUTIVE SUMMARY Real-time translation (RTT) technology is an effective solution for instantly translating content

More information

Overview. Websites. Brand Workshop 1-2 weeks. Research 1 week. Content 2-3 weeks. Design 4-6 weeks. Development 6-8 weeks. Complete weeks

Overview. Websites. Brand Workshop 1-2 weeks. Research 1 week. Content 2-3 weeks. Design 4-6 weeks. Development 6-8 weeks. Complete weeks Our Process Overview Custom design Branding workshop Developed using HTML5, PHP and a custom CMS Designed and developed to be easily managed & edited Content written by professional copywriter Free Google

More information

Joint Decoding with Multiple Translation Models

Joint Decoding with Multiple Translation Models Joint Decoding with Multiple Translation Models Yang Liu, Haitao Mi, Yang Feng, and Qun Liu Institute of Computing Technology, Chinese Academy of ciences {yliu,htmi,fengyang,liuqun}@ict.ac.cn 8/10/2009

More information

Tuning Machine Translation Parameters with SPSA

Tuning Machine Translation Parameters with SPSA Tuning Machine Translation Parameters with SPSA Patrik Lambert, Rafael E. Banchs TALP Research Center, Jordi Girona Salgado 1 3. 0803 Barcelona, Spain lambert,rbanchs@gps.tsc.upc.edu Abstract Most of statistical

More information

ITS 2.0-ENABLED STATISTICAL MACHINE TRANSLATION AND TRAINING WEB SERVICES

ITS 2.0-ENABLED STATISTICAL MACHINE TRANSLATION AND TRAINING WEB SERVICES ITS 2.0-ENABLED STATISTICAL MACHINE TRANSLATION AND TRAINING WEB SERVICES Ankit Srivastava & Declan Groves Centre for Next Generation Localisation (CNGL), School of Computing, Dublin City University FEISGILTT

More information

Programming Assignment #2

Programming Assignment #2 Programming Assignment #2 Due: 11:59pm, Wednesday, Feb. 13th Objective: This assignment will provide further practice with classes and objects, and deepen the understanding of basic OO programming. Task:

More information

The Tenth Biennial Conference of the Association for Machine Translation in the Americas Practical Domain Adaptation

The Tenth Biennial Conference of the Association for Machine Translation in the Americas Practical Domain Adaptation AMT 2012 20 Years A The Tenth Biennial Conference of the Association for Machine Translation in the Americas Practical Domain Adaptation Marcello Federico Fondazione Bruno Kessler Nicola Bertoldi Fondazione

More information

A Space-Efficient Phrase Table Implementation Using Minimal Perfect Hash Functions

A Space-Efficient Phrase Table Implementation Using Minimal Perfect Hash Functions A Space-Efficient Phrase Table Implementation Using Minimal Perfect Hash Functions Marcin Junczys-Dowmunt Faculty of Mathematics and Computer Science Adam Mickiewicz University ul. Umultowska 87, 61-614

More information

OCR and Automated Translation for the Navigation of non-english Handsets: A Feasibility Study with Arabic

OCR and Automated Translation for the Navigation of non-english Handsets: A Feasibility Study with Arabic OCR and Automated Translation for the Navigation of non-english Handsets: A Feasibility Study with Arabic Jennifer Biggs and Michael Broughton Defence Science and Technology Organisation Edinburgh, South

More information

verapdf Industry supported PDF/A validation

verapdf Industry supported PDF/A validation verapdf Industry supported PDF/A validation About this webinar What we ll be showing you: our current development status; the Consortium s development plans for 2016; how we ve been testing the software

More information

Work like a startup! TUT Software Startup Day. Harri Kiljander, Dr. Tech

Work like a startup! TUT Software Startup Day. Harri Kiljander, Dr. Tech TUT Software Startup Day Work like a startup! Harri Kiljander, Dr. Tech Product guy and UX designer, who also wrote real software a couple of decades ago Director of F-Secure Lokki 11 th of December 2013,

More information

Start downloading our playground for SMT: wget

Start downloading our playground for SMT: wget Before we start... Start downloading Moses: wget http://ufal.mff.cuni.cz/~tamchyna/mosesgiza.64bit.tar.gz Start downloading our playground for SMT: wget http://ufal.mff.cuni.cz/eman/download/playground.tar

More information

A Weighted Finite State Transducer Implementation of the Alignment Template Model for Statistical Machine Translation.

A Weighted Finite State Transducer Implementation of the Alignment Template Model for Statistical Machine Translation. A Weighted Finite State Transducer Implementation of the Alignment Template Model for Statistical Machine Translation May 29, 2003 Shankar Kumar and Bill Byrne Center for Language and Speech Processing

More information

Multimedia Information Retrieval

Multimedia Information Retrieval Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis Multimedia Information Retrieval 1. What are

More information

CACAO PROJECT AT THE 2009 TASK

CACAO PROJECT AT THE 2009 TASK CACAO PROJECT AT THE TEL@CLEF 2009 TASK Alessio Bosca, Luca Dini Celi s.r.l. - 10131 Torino - C. Moncalieri, 21 alessio.bosca, dini@celi.it Abstract This paper presents the participation of the CACAO prototype

More information

UELMA Preservation. Jason Judt & Daniel Kruse Office of the Revisor of Statutes, Minnesota

UELMA Preservation. Jason Judt & Daniel Kruse Office of the Revisor of Statutes, Minnesota UELMA Preservation Jason Judt & Daniel Kruse Office of the Revisor of Statutes, Minnesota UELMA System Overview Server-side authentication Complete versioning system Tracking and auditing tools developed

More information

The Continuous SEO Process. WebExpo 2016

The Continuous SEO Process. WebExpo 2016 The Continuous SEO Process WebExpo 2016 SEO needs TLC SEO needs TLC The nature of SEO SEO is continuous by nature. So why aren t we? SEO Authority & Trust Content Content Content Technical Foundation SEO

More information

Next Generation Platform (NGP) Marketing Plan

Next Generation Platform (NGP) Marketing Plan Next Generation Platform (NGP) Marketing Plan Digital Services Christine Ierano March 2015 Background & Rationale What's NGP? Next Generation Platform or NGP is a strategic solution to a business problem

More information

Maximising Search Engine Marketing. Search Marketing and Google Analytics

Maximising Search Engine Marketing. Search Marketing and Google Analytics Maximising Search Engine Marketing Search Marketing and Google Analytics Why Digital Marketing? UK Business Digital Index 2016 52% state cost savings is one of the advantages to being online, a significant

More information

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS Topics AGENDA Challenges with Big Data Analytics How SAS can help you to minimize time to value with

More information

Open Document Format. It s Your Information Make the Choice to Ensure It Stays That Way ODF ALLIANCE. The OpenDocument Format: It's Your Information 1

Open Document Format. It s Your Information Make the Choice to Ensure It Stays That Way ODF ALLIANCE. The OpenDocument Format: It's Your Information 1 Open Document Format It s Your Information Make the Choice to Ensure It Stays That Way ODF ALLIANCE The OpenDocument Format: It's Your Information 1 Imagine if: your government forced you to buy a word

More information

Application for Summer of Code 2008: Jimmy Berry Usability Testing Suite 1

Application for Summer of Code 2008: Jimmy Berry Usability Testing Suite 1 Application for Summer of Code 2008: Jimmy Berry Usability Testing Suite 1 Purpose My project will provide a method of measuring the user experience 2 which has been stated as a very important goal of

More information

EXPERIMENTAL SETUP: MOSES

EXPERIMENTAL SETUP: MOSES EXPERIMENTAL SETUP: MOSES Moses is a statistical machine translation system that allows us to automatically train translation models for any language pair. All we need is a collection of translated texts

More information

WHITEPAPER WHAT TO CONSIDER TO SUCCESSFULLY LAUNCH A MOBILE APP! By RG Infotech WHITEPAPER. How to launch a MOBILE APP. Successfully!

WHITEPAPER WHAT TO CONSIDER TO SUCCESSFULLY LAUNCH A MOBILE APP! By RG Infotech WHITEPAPER. How to launch a MOBILE APP. Successfully! WHITEPAPER How to launch a MOBILE APP Successfully! By RG Infotech Since the launch of smartphones, mobile industry embarks several milestones that attracts several industries and developers to enter with

More information

1

1 1 2 3 6 7 8 9 10 Storage & IO Benchmarking Primer Running sysbench and preparing data Use the prepare option to generate the data. Experiments Run sysbench with different storage systems and instance

More information

Recurrent Neural Networks

Recurrent Neural Networks Recurrent Neural Networks 11-785 / Fall 2018 / Recitation 7 Raphaël Olivier Recap : RNNs are magic They have infinite memory They handle all kinds of series They re the basis of recent NLP : Translation,

More information

Simile Tools Workshop Summary MacKenzie Smith, MIT Libraries

Simile Tools Workshop Summary MacKenzie Smith, MIT Libraries Simile Tools Workshop Summary MacKenzie Smith, MIT Libraries Intro On June 10 th and 11 th, 2010 a group of Simile Exhibit users, software developers and architects met in Washington D.C. to discuss the

More information

CASE STUDY. Inbound Marketing Engagement With Stepping Stones Centre, UAE. July Ongoing

CASE STUDY. Inbound Marketing Engagement With Stepping Stones Centre, UAE. July Ongoing CASE STUDY Inbound Marketing Engagement With Stepping Stones Centre, UAE July 216 - Ongoing Presentation Agenda! Deliverables Agreed To Activities Taken Up Results What s Next 3 About Stepping Stones Stepping

More information

A Guide to Using MailChimp

A Guide to Using MailChimp v 1.4 Updated October 11, 2018 Table of Contents 1. Introduction.......................................................................3 2. Account Set-Up...................................................................4

More information

Build Better Business Models for Machine Translation

Build Better Business Models for Machine Translation Build Better Business Models for Machine Translation Executive Summary KantanAnalytics is a newly developed, Machine Translation (MT) quality scoring technology that it is anticipated to have a considerable

More information

The Prague Bulletin of Mathematical Linguistics NUMBER 91 JANUARY

The Prague Bulletin of Mathematical Linguistics NUMBER 91 JANUARY The Prague Bulletin of Mathematical Linguistics NUMBER 9 JANUARY 009 79 88 Z-MERT: A Fully Configurable Open Source Tool for Minimum Error Rate Training of Machine Translation Systems Omar F. Zaidan Abstract

More information

Guide to Marketing

Guide to  Marketing Guide to Email Marketing Why Email Marketing? Isn t Email Dead? Email is not dead. In fact, it s very much alive. More and more businesses are using email as an effective form of communication, and more

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

Industrial Machinery. Search Marketing Case Study

Industrial Machinery. Search Marketing Case Study Industrial Machinery Search Marketing Case Study 2015-2016 A performance digital marketing consultancy & training academy. Our team comprises of digital researchers, strategists, consultants, usability

More information

LoonyBin: Making Empirical MT Reproduciple, Efficient, and Less Annoying

LoonyBin: Making Empirical MT Reproduciple, Efficient, and Less Annoying LoonyBin: Making Empirical MT Reproduciple, Efficient, and Less Annoying Jonathan Clark Alon Lavie CMU Machine Translation Lunch Tuesday, January 19, 2009 Outline A (Brief) Guilt Trip What goes into a

More information

Did you know that SEO increases traffic, leads and sales? SEO = More Website Visitors More Traffic = More Leads More Leads= More Sales

Did you know that SEO increases traffic, leads and sales? SEO = More Website Visitors More Traffic = More Leads More Leads= More Sales 1 Did you know that SEO increases traffic, leads and sales? SEO = More Website Visitors More Traffic = More Leads More Leads= More Sales What is SEO? Search engine optimization is the process of improving

More information

DIGITAL SIGNAGE SOFTWARE

DIGITAL SIGNAGE SOFTWARE DIGITAL SIGNAGE SOFTWARE MODERN. POWERFUL. EFFECTIVE. The most complete solution for visual communication JADE GIVES LIFE TO YOUR DIGITAL SCREENS Jade is the modern and reliable digital signage content

More information

CS839: Probabilistic Graphical Models. Lecture 22: The Attention Mechanism. Theo Rekatsinas

CS839: Probabilistic Graphical Models. Lecture 22: The Attention Mechanism. Theo Rekatsinas CS839: Probabilistic Graphical Models Lecture 22: The Attention Mechanism Theo Rekatsinas 1 Why Attention? Consider machine translation: We need to pay attention to the word we are currently translating.

More information

/ Cloud Computing. Recitation 7 October 10, 2017

/ Cloud Computing. Recitation 7 October 10, 2017 15-319 / 15-619 Cloud Computing Recitation 7 October 10, 2017 Overview Last week s reflection Project 3.1 OLI Unit 3 - Module 10, 11, 12 Quiz 5 This week s schedule OLI Unit 3 - Module 13 Quiz 6 Project

More information

WordPress & Web Development. Student Curriculum

WordPress & Web Development. Student Curriculum WordPress & Web Development Student Curriculum Intro to HTML Coding (Week 1) Class discussion on why HTML is important. Each Student will complete 13 lessons in basic HTML coding and submit them for a

More information

Mobile Query Interfaces

Mobile Query Interfaces Mobile Query Interfaces Matthew Krog Abstract There are numerous alternatives to the application-oriented mobile interfaces. Since users use their mobile devices to manage personal information, a PIM interface

More information

JU_CSE_TE: System Description 2010 ResPubliQA

JU_CSE_TE: System Description 2010 ResPubliQA JU_CSE_TE: System Description QA@CLEF 2010 ResPubliQA Partha Pakray 1, Pinaki Bhaskar 1, Santanu Pal 1, Dipankar Das 1, Sivaji Bandyopadhyay 1, Alexander Gelbukh 2 Department of Computer Science & Engineering

More information

Module 3.1. The Numbers Game - Call Tracking

Module 3.1. The Numbers Game - Call Tracking Module 3.1 The Numbers Game - Why Local Consumers want something, and they want it now... If your toilet starts leaking and you need a plumber... Are you going to fill out a web form? NO! People Want to

More information

Spice UK. Susan Hallam. Susan Hallam Page 1. Spice UK. Agenda for Today

Spice UK. Susan Hallam. Susan Hallam Page 1. Spice UK. Agenda for Today UK UK www.shcl.co.uk susan@shcl.co.uk Agenda for Today Getting Found in Google Social Media Marketing Adwords Pay Per Click Advertising Promotion Techniques Google Analytics susan@shcl.co.uk Page 1 UK

More information

A PRACTICE BUILDERS white paper. 8 Ways to Improve SEO Ranking of Your Healthcare Website

A PRACTICE BUILDERS white paper. 8 Ways to Improve SEO Ranking of Your Healthcare Website A PRACTICE BUILDERS white paper 8 Ways to Improve SEO Ranking of Your Healthcare Website More than 70 percent of patients find their healthcare providers through a search engine. This means appearing high

More information

Outline GIZA++ Moses. Demo. Steps Output files. Training pipeline Decoder

Outline GIZA++ Moses. Demo. Steps Output files. Training pipeline Decoder GIZA++ and Moses Outline GIZA++ Steps Output files Moses Training pipeline Decoder Demo GIZA++ A statistical machine translation toolkit used to train IBM Models 1-5 (moses only uses output of IBM Model-1)

More information

Enriching Lifelong User Modelling with the Social e- Networking and e-commerce Pieces of the Puzzle

Enriching Lifelong User Modelling with the Social e- Networking and e-commerce Pieces of the Puzzle Enriching Lifelong User Modelling with the Social e- Networking and e-commerce Pieces of the Puzzle Demetris Kyriacou Learning Societies Lab School of Electronics and Computer Science, University of Southampton

More information

Translation Quality Estimation: Past, Present, and Future

Translation Quality Estimation: Past, Present, and Future Translation Quality Estimation: Past, Present, and Future André Martins MT Marathon, Lisbon, August 31st, 2017 André Martins (Unbabel) Quality Estimation MTM, 31/8/17 1 / 69 This Talk First part: largely

More information

The GREYC Machine Translation System for the IWSLT 2008 campaign

The GREYC Machine Translation System for the IWSLT 2008 campaign The GREYC Machine Translation System for the IWSLT 2008 campaign Yves Lepage Adrien Lardilleux Julien Gosme Jean-Luc Manguin GREYC, University of Caen, France (GREYC@IWSLT 2008) 1 / 12 The system The system

More information

Engineering Data Intensive Scalable Systems

Engineering Data Intensive Scalable Systems Engineering Data Intensive Scalable Systems Introduction Internet services companies such as Google, Yahoo!, Amazon, and Facebook, have pioneered systems that have achieved unprecedented scale while still

More information

The Multi-perspective Process Explorer

The Multi-perspective Process Explorer The Multi-perspective Process Explorer Felix Mannhardt 1,2, Massimiliano de Leoni 1, Hajo A. Reijers 3,1 1 Eindhoven University of Technology, Eindhoven, The Netherlands 2 Lexmark Enterprise Software,

More information