Marcello Federico MMT Srl / FBK Trento, Italy
|
|
- Amelia Summers
- 5 years ago
- Views:
Transcription
1 Marcello Federico MMT Srl / FBK Trento, Italy Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 207
2 Symbiotic Human and Machine Translation MT seamlessly adapts to user data learns from post-editing user enjoys enhanced productivity better user experience 3 Usable technology for the translation industry easy to install and deploy fast to set-up for a new project effective, also on small projects scalable with data and users works with commodity hardware 4 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 208
3 The Modern MT way (1) connect your CAT with a plug-in (2) drag & drop your private TMs (3) start translating! 5 Modern MT in a nutshell zero training time adapts to context learns from user corrections scales with data and users 6 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 209
4 Training data is a dynamic collection of Translation Memories At any time: new TMs are added existing TMs are extended Training time comparable to uploading time! 7 Context aware translation SENTENCE party CONTEXT We are going out. CONTEXT We approved the law TRANSLATION fête TRANSLATION parti 8 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 210
5 requests suggestions Machine Translation post-edits Incremental Learning 9 Core technology [original plan] context analyser phrase-based decoder adaptive models incremental structures parallel processing Simple. Adaptive. Neural. 10 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 211
6 Language support 45 languages fast pre-/post-processing simple interfaces tags and XML management localization of expressions TM cleaning Simple. Adaptive. Neural. Context Analyzer analyze input text A B C 5% 45% 50% retrieve best matching TMs compute matching scores dynamic structure Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 212
7 Adaptive Phrase Table 1000 Suffix Array with Ranked Sampling suffix array indexed with TMs phrases sampled on demand priority sampling over TMs dynamic structure Simple. Adaptive. Neural. Adaptive Language Model A B C w p large static background model n-grams stats indexed with TMs combination of active TM LMs TM LMs computed on the fly dynamic structure Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 213
8 TED Talks English-French M. Cettolo, et al. (2016), The IWSLT 2016 Evaluation Campaign, IWSLT. 15 Second Prototype (0.14 January 2017) Open benchmark: - Training speed: 12x Moses - 100x NMT - MT quality (BLEU): +1 vs Moses -0.5 vs NMT Ada Domains: ECB, Gnome, JRC, KDE, OpenOffice, PHP, Ubuntu, UN-TM Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 214
9 What happened Research on adaptive neural MT Believed PBMT was competitive on technical translation Finally realised superiority of NMT quality Completed PBMT release and switched to NMT Data collection for 14 translation directions Simple. Adaptive. Neural. Roadmap from last review meeting technology switch 2015 Q Q Q Q4 minimum viable product first alpha release first beta release final release context aware 1 lang pair fast training, context aware, distributed, 1 lang pair online learning plug-in, 3 lang pairs neural MT, enterprise ready, 14 lang pairs Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 215
10 Multi-user scenario Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 216
11 Multi-user scenario Multi-user scenario Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 217
12 All we need is a memory 24 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 218
13 All we need is a memory 25 All we need is a memory 26 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 219
14 All we need is a memory 27 All we need is a memory 28 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 220
15 Multi-user adaptive NMT 29 Multi-user adaptive NMT Instances are selected by combining context scores and similarity scores 30 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 221
16 Adaptation, too! Source: Farajian et al, Multi-Domain Neural MT through Unsupervised Adaptation, Proc. WMT Farajian et al. (2017) Multi-domain NMT through unsupervised adaptation, WMT. 31 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 222
17 Sep: integration of MateCat Oct: NMT code released Nov: co-development release of 14 engines Dec: performance boost 33 Relative BLEU scores wrt Google Translate 34 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 223
18 Progression in one month on English-Italian Performance of generic MMT 1-6 scale (w/o adaptation) 35 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 224
19 37 38 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 225
20 Noisy training data EN: What history teaches us IT: === Storia ================================== Simple. Adaptive. Neural. Data Cleaning We added a simple QE module to filter out bad examples: Apply Fast-Align in two directions Compute Model 1 scores in two directions Combine and normalize scores Filter out on the distribution of scores Simple. Adaptive. Neural. Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 226
21 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 227
22 43 44 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 228
23 We compare: Generic MT: production engine [En-It] Custom MT: Generic MT tuned on TM [takes hours] +Adaptive MT: Generic MT adapted on TM [real-time] +Incremental MT: TM updated with simulated PE Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 229
24 47 48 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 230
25 49 50 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 231
26 Use post-editing as new training instances Perform one/more iterations Can be combined with a priori adaptation Updates generic or adapted model Turchi et al. (2017), Continuous learning from human post-edits for NMT, EAMT. 52 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 232
27 53 54 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 233
28 55 56 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 234
29 57 Online-learning contribution is consistent Does it scale with number of domains? Incremental learning contributes marginally Probably depends on test set size We are not always able to beat specialized models How to improve further adaptation? 58 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 235
30 Can improve MT without touching it inside We can adapt an external MT service! Similar to NMT: two inputs (src,mt), one output (ape) Can be trained with less data than NMT We can deploy instance based adaptation Chatterjee et al. (2017), Multi-source Neural APE: FBK s participation., WMT. 60 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 236
31 61 Neural APE uses two encoders and two attention models, which are merged and used by one decoder. 62 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 237
32 63 64 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 238
33 65 66 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 239
34 67 Can improve on top of static and adaptive engine! Uses incremental learning, adaptation and online learning Portable (in principle) on the multi-domain setting Limited gain on top of full-fledged adaptive NMT Can be an extra component to manage 68 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 240
35 Multi-user scenario goes beyond simple domain adaptation We need to handle multiple evolving domains Domain customization is not an option Real-time adaptation/learning works! But, there is still room for improvement! 70 Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 241
36 Thank You Website Github github.com/modernmt/mmt Proceedings for AMTA 2018 Workshop: Translation Quality Estimation and Automatic Post-Editing Boston, March 21, 2018 Page 242
D5.6 - Evaluation Benchmarks
Th is document is part of the Project Machine Tr a n sla tion En h a n ced Com pu ter A ssisted Tr a n sla tion (Ma teca t ), fu nded by the 7th Framework Programme of th e Eu r opea n Com m ission th
More informationMMT Modern Machine Translation
MMT Modern Machine Translation Second Design and Specifications Report Author(s): Davide Caroselli, Nicola Bertoldi, Mauro Cettolo, Marcello Federico Dissemination Level: Public Date: July 1 st, 2016 Grant
More informationD6.1.2: Second report on scientific evaluations
D6.1.2: Second report on scientific evaluations UPVLC, XEROX, JSI-K4A, RWTH, EML and DDS Distribution: Public translectures Transcription and Translation of Video Lectures ICT Project 287755 Deliverable
More informationAn efficient and user-friendly tool for machine translation quality estimation
An efficient and user-friendly tool for machine translation quality estimation Kashif Shah, Marco Turchi, Lucia Specia Department of Computer Science, University of Sheffield, UK {kashif.shah,l.specia}@sheffield.ac.uk
More informationiems Interactive Experiment Management System Final Report
iems Interactive Experiment Management System Final Report Pēteris Ņikiforovs Introduction Interactive Experiment Management System (Interactive EMS or iems) is an experiment management system with a graphical
More informationAn Unsupervised Model for Joint Phrase Alignment and Extraction
An Unsupervised Model for Joint Phrase Alignment and Extraction Graham Neubig 1,2, Taro Watanabe 2, Eiichiro Sumita 2, Shinsuke Mori 1, Tatsuya Kawahara 1 1 Graduate School of Informatics, Kyoto University
More informationNTT SMT System for IWSLT Katsuhito Sudoh, Taro Watanabe, Jun Suzuki, Hajime Tsukada, and Hideki Isozaki NTT Communication Science Labs.
NTT SMT System for IWSLT 2008 Katsuhito Sudoh, Taro Watanabe, Jun Suzuki, Hajime Tsukada, and Hideki Isozaki NTT Communication Science Labs., Japan Overview 2-stage translation system k-best translation
More informationINF5820/INF9820 LANGUAGE TECHNOLOGICAL APPLICATIONS. Jan Tore Lønning, Lecture 8, 12 Oct
1 INF5820/INF9820 LANGUAGE TECHNOLOGICAL APPLICATIONS Jan Tore Lønning, Lecture 8, 12 Oct. 2016 jtl@ifi.uio.no Today 2 Preparing bitext Parameter tuning Reranking Some linguistic issues STMT so far 3 We
More informationOpenMaTrEx: A Free/Open-Source Marker-Driven Example-Based Machine Translation System
To be presented at IceTAL (Reykjavík, 16-18 Aug. 2010) OpenMaTrEx: A Free/Open-Source Marker-Driven Example-Based Machine Translation System Sandipan Dandapat, Mikel L. Forcada, Declan Groves, Sergio Penkale,
More informationLSTM for Language Translation and Image Captioning. Tel Aviv University Deep Learning Seminar Oran Gafni & Noa Yedidia
1 LSTM for Language Translation and Image Captioning Tel Aviv University Deep Learning Seminar Oran Gafni & Noa Yedidia 2 Part I LSTM for Language Translation Motivation Background (RNNs, LSTMs) Model
More informationMMT Modern Machine Translation. First Evaluation Plan
This project has received funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No 645487. MMT Modern Machine Translation First Evaluation Plan Author(s):
More informationIntroduction to MateCat
Introduction to MateCat Luca De Franceschi - Associate Product Manager Le Mans, 12 June 2018 A little bit of history 1940s and 50s: Machine Translation is all the rage 1960s: Automatic Language Processing
More informationBetter translations with user collaboration - Integrated MT at Microsoft
Better s with user collaboration - Integrated MT at Microsoft Chris Wendt Microsoft Research One Microsoft Way Redmond, WA 98052 christw@microsoft.com Abstract This paper outlines the methodologies Microsoft
More informationA General Overview of Machine Translation System Development
A General Overview of Machine Translation System Development Mārcis Pinnis, Tilde May 23, 2018, Lisbon, Portugal Who am I? My name is Mārcis Pinnis I work as a researcher in Tilde My current research focuses
More informationA Smorgasbord of Features to Combine Phrase-Based and Neural Machine Translation
A Smorgasbord of Features to Combine Phrase-Based and Neural Machine Translation Benjamin Marie Atsushi Fujita National Institute of Information and Communications Technology, 3-5 Hikaridai, Seika-cho,
More informationKyoto-NMT: a Neural Machine Translation implementation in Chainer
Kyoto-NMT: a Neural Machine Translation implementation in Chainer Fabien Cromières Japan Science and Technology Agency Kawaguchi-shi, Saitama 332-0012 fabien@pa.jst.jp Abstract We present Kyoto-NMT, an
More informationTALP: Xgram-based Spoken Language Translation System Adrià de Gispert José B. Mariño
TALP: Xgram-based Spoken Language Translation System Adrià de Gispert José B. Mariño Outline Overview Outline Translation generation Training IWSLT'04 Chinese-English supplied task results Conclusion and
More informationJuggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets
Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets Rahul Potharaju (Purdue University) Navendu Jain (Microsoft Research) Cristina Nita-Rotaru (Purdue University) April
More informationBalancing Awareness and Interruption: Investigation of Notification Deferral Policies
Balancing Awareness and Interruption: Investigation of Notification Deferral Policies Eric Horvitz 1, Johnson Apacible 1, and Muru Subramani 1 1 Microsoft Research, One Microsoft Way Redmond, Washington
More informationRLAT Rapid Language Adaptation Toolkit
RLAT Rapid Language Adaptation Toolkit Tim Schlippe May 15, 2012 RLAT Rapid Language Adaptation Toolkit - 2 RLAT Rapid Language Adaptation Toolkit RLAT Rapid Language Adaptation Toolkit - 3 Outline Introduction
More informationTHE CUED NIST 2009 ARABIC-ENGLISH SMT SYSTEM
THE CUED NIST 2009 ARABIC-ENGLISH SMT SYSTEM Adrià de Gispert, Gonzalo Iglesias, Graeme Blackwood, Jamie Brunning, Bill Byrne NIST Open MT 2009 Evaluation Workshop Ottawa, The CUED SMT system Lattice-based
More informationTuning. Philipp Koehn presented by Gaurav Kumar. 28 September 2017
Tuning Philipp Koehn presented by Gaurav Kumar 28 September 2017 The Story so Far: Generative Models 1 The definition of translation probability follows a mathematical derivation argmax e p(e f) = argmax
More informationD1.6 Toolkit and Report for Translator Adaptation to New Languages (Final Version)
www.kconnect.eu D1.6 Toolkit and Report for Translator Adaptation to New Languages (Final Version) Deliverable number D1.6 Dissemination level Public Delivery date 16 September 2016 Status Author(s) Final
More informationHigh School PLTW Computer Science A Curriculum
Grade 9th - 12th, 1 Credit Elective Course Prerequisites: PLTW Computer Science and Software Engineering High School PLTW Computer Science A Curriculum Course Description: Computer Science A (CSA) builds
More informationStatistical Machine Translation Part IV Log-Linear Models
Statistical Machine Translation art IV Log-Linear Models Alexander Fraser Institute for Natural Language rocessing University of Stuttgart 2011.11.25 Seminar: Statistical MT Where we have been We have
More informationCorpus Acquisition from the Internet
Corpus Acquisition from the Internet Philipp Koehn partially based on slides from Christian Buck 26 October 2017 Big Data 1 For many language pairs, lots of text available. Text you read in your lifetime
More informationReviewer WordPress Plugin
Reviewer WordPress Plugin Reviews and Comparison Tables for Your Posts Version: 2.3.0 - by Michele Ivani - evographics.net This is a how-to guide to install your plugin copy. If you have any questions
More informationExperimenting in MT: Moses Toolkit and Eman
Experimenting in MT: Moses Toolkit and Eman Aleš Tamchyna, Ondřej Bojar Institute of Formal and Applied Linguistics Faculty of Mathematics and Physics Charles University, Prague Tue Sept 8, 2015 1 / 30
More informationSCOTTISH PARLIAMENT WEB AND ONLINE ROADMAP PROJECT (INCLUDING DEVELOPING A NEW IMPROVED WEBSITE PRESENCE FOR THE SCOTTISH PARLIAMENT)
SPCB(2018)Paper 37 3 May 2018 SCOTTISH PARLIAMENT WEB AND ONLINE ROADMAP PROJECT (INCLUDING DEVELOPING A NEW IMPROVED WEBSITE PRESENCE FOR THE SCOTTISH PARLIAMENT) Executive summary 1. The paper has been
More informationCorpus Acquisition from the Interwebs. Christian Buck, University of Edinburgh
Corpus Acquisition from the Interwebs Christian Buck, University of Edinburgh There is no data like more data (Bob Mercer, 1985) Mining Bilingual Text "Same text in different languages" Usually: one side
More informationDCU System Report on the WMT 2017 Multi-modal Machine Translation Task
DCU System Report on the WMT 2017 Multi-modal Machine Translation Task Iacer Calixto and Koel Dutta Chowdhury and Qun Liu {iacer.calixto,koel.chowdhury,qun.liu}@adaptcentre.ie Abstract We report experiments
More informationMoses version 2.1 Release Notes
Moses version 2.1 Release Notes Overview The document is the release notes for the Moses SMT toolkit, version 2.1. It describes the changes to toolkit since version 1.0 from January 2013. The aim during
More informationA General-Purpose Rule Extractor for SCFG-Based Machine Translation
A General-Purpose Rule Extractor for SCFG-Based Machine Translation Greg Hanneman, Michelle Burroughs, and Alon Lavie Language Technologies Institute Carnegie Mellon University Fifth Workshop on Syntax
More informationOverview. Performance Breakthrough Features Impact Roadmap
What s New? Overview Performance Breakthrough Features Impact Roadmap VS Secret Transparency Transparency No other vendor talks freely about real limits and product usability Performance is the key! Why?
More informationFirst Steps Towards Coverage-Based Document Alignment
First Steps Towards Coverage-Based Document Alignment Luís Gomes 1, Gabriel Pereira Lopes 1, 1 NOVA LINCS, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Portugal ISTRION BOX, Translation
More informationJoining the Conversation
Joining the Conversation Reddit, Maptime, Slack for GIS March 2, 2016 Atlanta Regional GIS User Group Chris Rice Langdon Sanders Reddit.com/r/GIS reddit.com /r/gis Reddit.com/r/GIS Reddit.com/r/GIS User
More informationNLPL - The Nordic Language Processing Laboratory.
NLPL - The Nordic Language Processing Laboratory http://nlpl.eu/ What is NLPL? Vision virtual laboratory for large-scale NLP research share high-performance computing and data resources across language
More informationPHP Composer 9 Benefits of Using a Binary Repository Manager
PHP Composer 9 Benefits of Using a Binary Repository Manager White Paper Copyright 2017 JFrog Ltd. March 2017 www.jfrog.com Executive Summary PHP development has become one of the most popular platforms
More informationThe QCN System for Egyptian Arabic to English Machine Translation. Presenter: Hassan Sajjad
The QCN System for Egyptian Arabic to English Machine Translation Presenter: Hassan Sajjad QCN Collaboration Ahmed El Kholy Wael Salloum Nizar Habash Ahmed Abdelali Nadir Durrani Francisco Guzmán Preslav
More informationjamf Nation - London Roadshow
jamf Nation - London Roadshow Sachin Parmar Workplace Technology Manager 17 May 2018 About Just Eat Our vision Creating the world s greatest food community 3 About Me About Me Sachin Parmar 8+ years professionally
More informationSame-language MT for local flavours/flavors
Same-language MT for local flavours/flavors Janice Campbell - Adobe Gema Ramírez - Prompsit Proceedings of AMTA 2018, vol. 2: MT Users' Track Boston, March 17-21, 2018 Page 35 Intro: brief history of AltLang
More informationBA Insight Federator. How the BA Insight Federator Extends SharePoint Search
How the BA Insight Federator Extends SharePoint Search 20 Park Plaza, Suite 1115 Boston, MA 02116, USA 1.339.368.7234 info@bainsight.com www.bainsight.com BA Insight Federator The BA Insight Federator
More informationConvolutional Sequence to Sequence Learning. Denis Yarats with Jonas Gehring, Michael Auli, David Grangier, Yann Dauphin Facebook AI Research
Convolutional Sequence to Sequence Learning Denis Yarats with Jonas Gehring, Michael Auli, David Grangier, Yann Dauphin Facebook AI Research Sequence generation Need to model a conditional distribution
More informationSparse Feature Learning
Sparse Feature Learning Philipp Koehn 1 March 2016 Multiple Component Models 1 Translation Model Language Model Reordering Model Component Weights 2 Language Model.05 Translation Model.26.04.19.1 Reordering
More informationAMT 2014 A TUTORIAL. MateCat: an Open Source CAT Tool for MT Post-Editing. Vancouver, BC. October amta2014.amtaweb.org
AMT 2014 A MateCat: an Open Source CAT Tool for MT Post-Editing Marcello Federico Nicola Bertoldi Marco Trombetti Alessandro Cattelan TUTORIAL The 11th Conference of the Association for Machine Translation
More informationAPNIC Update. German Valdez External Relations Program Director, APNIC ARIN XXX 26-October-2012
APNIC Update German Valdez External Relations Program Director, APNIC ARIN XXX 26-October-2012 Overview Stats IPv4 transfers Training APNIC Labs Policies ISIF Public Affairs APNIC Member Survey APNIC Coming
More informationAMTA PRESENTATION. How Much Cake is Enough: The Case for Domain-Specific Engines. ALEXYANISHEVSKY Welocalize October 2015
AMTA PRESENTATION How Much Cake is Enough: The Case for Domain-Specific Engines ALEXYANISHEVSKY Welocalize October 2015 Proceedings of MT Summit XV, vol. 2: MT Users' Track Miami, Oct 30 - Nov 3, 2015
More informationMobile Apps Why & How
Mobile Apps Why & How MOBIKATS Phillip Hunt CEO philliphunt@mobikats.com 07590 277045 Putting your Content in Everyone s Pocket. Copyright Mobikats 2012 What do we do? Consultancy on developing or reviewing
More informationMachine Translation Research in META-NET
Machine Translation Research in META-NET Jan Hajič Institute of Formal and Applied Linguistics Charles University in Prague, CZ hajic@ufal.mff.cuni.cz Solutions for Multilingual Europe Budapest, Hungary,
More informationGEOFLUENT VS. STRAIGHT MACHINE TRANSLATION: UNDERSTANDING THE DIFFERENCES
SOLUTION BRIEF GEOFLUENT VS. STRAIGHT MACHINE TRANSLATION: UNDERSTANDING THE DIFFERENCES by EXECUTIVE SUMMARY Real-time translation (RTT) technology is an effective solution for instantly translating content
More informationOverview. Websites. Brand Workshop 1-2 weeks. Research 1 week. Content 2-3 weeks. Design 4-6 weeks. Development 6-8 weeks. Complete weeks
Our Process Overview Custom design Branding workshop Developed using HTML5, PHP and a custom CMS Designed and developed to be easily managed & edited Content written by professional copywriter Free Google
More informationJoint Decoding with Multiple Translation Models
Joint Decoding with Multiple Translation Models Yang Liu, Haitao Mi, Yang Feng, and Qun Liu Institute of Computing Technology, Chinese Academy of ciences {yliu,htmi,fengyang,liuqun}@ict.ac.cn 8/10/2009
More informationTuning Machine Translation Parameters with SPSA
Tuning Machine Translation Parameters with SPSA Patrik Lambert, Rafael E. Banchs TALP Research Center, Jordi Girona Salgado 1 3. 0803 Barcelona, Spain lambert,rbanchs@gps.tsc.upc.edu Abstract Most of statistical
More informationITS 2.0-ENABLED STATISTICAL MACHINE TRANSLATION AND TRAINING WEB SERVICES
ITS 2.0-ENABLED STATISTICAL MACHINE TRANSLATION AND TRAINING WEB SERVICES Ankit Srivastava & Declan Groves Centre for Next Generation Localisation (CNGL), School of Computing, Dublin City University FEISGILTT
More informationProgramming Assignment #2
Programming Assignment #2 Due: 11:59pm, Wednesday, Feb. 13th Objective: This assignment will provide further practice with classes and objects, and deepen the understanding of basic OO programming. Task:
More informationThe Tenth Biennial Conference of the Association for Machine Translation in the Americas Practical Domain Adaptation
AMT 2012 20 Years A The Tenth Biennial Conference of the Association for Machine Translation in the Americas Practical Domain Adaptation Marcello Federico Fondazione Bruno Kessler Nicola Bertoldi Fondazione
More informationA Space-Efficient Phrase Table Implementation Using Minimal Perfect Hash Functions
A Space-Efficient Phrase Table Implementation Using Minimal Perfect Hash Functions Marcin Junczys-Dowmunt Faculty of Mathematics and Computer Science Adam Mickiewicz University ul. Umultowska 87, 61-614
More informationOCR and Automated Translation for the Navigation of non-english Handsets: A Feasibility Study with Arabic
OCR and Automated Translation for the Navigation of non-english Handsets: A Feasibility Study with Arabic Jennifer Biggs and Michael Broughton Defence Science and Technology Organisation Edinburgh, South
More informationverapdf Industry supported PDF/A validation
verapdf Industry supported PDF/A validation About this webinar What we ll be showing you: our current development status; the Consortium s development plans for 2016; how we ve been testing the software
More informationWork like a startup! TUT Software Startup Day. Harri Kiljander, Dr. Tech
TUT Software Startup Day Work like a startup! Harri Kiljander, Dr. Tech Product guy and UX designer, who also wrote real software a couple of decades ago Director of F-Secure Lokki 11 th of December 2013,
More informationStart downloading our playground for SMT: wget
Before we start... Start downloading Moses: wget http://ufal.mff.cuni.cz/~tamchyna/mosesgiza.64bit.tar.gz Start downloading our playground for SMT: wget http://ufal.mff.cuni.cz/eman/download/playground.tar
More informationA Weighted Finite State Transducer Implementation of the Alignment Template Model for Statistical Machine Translation.
A Weighted Finite State Transducer Implementation of the Alignment Template Model for Statistical Machine Translation May 29, 2003 Shankar Kumar and Bill Byrne Center for Language and Speech Processing
More informationMultimedia Information Retrieval
Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis Multimedia Information Retrieval 1. What are
More informationCACAO PROJECT AT THE 2009 TASK
CACAO PROJECT AT THE TEL@CLEF 2009 TASK Alessio Bosca, Luca Dini Celi s.r.l. - 10131 Torino - C. Moncalieri, 21 alessio.bosca, dini@celi.it Abstract This paper presents the participation of the CACAO prototype
More informationUELMA Preservation. Jason Judt & Daniel Kruse Office of the Revisor of Statutes, Minnesota
UELMA Preservation Jason Judt & Daniel Kruse Office of the Revisor of Statutes, Minnesota UELMA System Overview Server-side authentication Complete versioning system Tracking and auditing tools developed
More informationThe Continuous SEO Process. WebExpo 2016
The Continuous SEO Process WebExpo 2016 SEO needs TLC SEO needs TLC The nature of SEO SEO is continuous by nature. So why aren t we? SEO Authority & Trust Content Content Content Technical Foundation SEO
More informationNext Generation Platform (NGP) Marketing Plan
Next Generation Platform (NGP) Marketing Plan Digital Services Christine Ierano March 2015 Background & Rationale What's NGP? Next Generation Platform or NGP is a strategic solution to a business problem
More informationMaximising Search Engine Marketing. Search Marketing and Google Analytics
Maximising Search Engine Marketing Search Marketing and Google Analytics Why Digital Marketing? UK Business Digital Index 2016 52% state cost savings is one of the advantages to being online, a significant
More informationOutrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS
Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS Topics AGENDA Challenges with Big Data Analytics How SAS can help you to minimize time to value with
More informationOpen Document Format. It s Your Information Make the Choice to Ensure It Stays That Way ODF ALLIANCE. The OpenDocument Format: It's Your Information 1
Open Document Format It s Your Information Make the Choice to Ensure It Stays That Way ODF ALLIANCE The OpenDocument Format: It's Your Information 1 Imagine if: your government forced you to buy a word
More informationApplication for Summer of Code 2008: Jimmy Berry Usability Testing Suite 1
Application for Summer of Code 2008: Jimmy Berry Usability Testing Suite 1 Purpose My project will provide a method of measuring the user experience 2 which has been stated as a very important goal of
More informationEXPERIMENTAL SETUP: MOSES
EXPERIMENTAL SETUP: MOSES Moses is a statistical machine translation system that allows us to automatically train translation models for any language pair. All we need is a collection of translated texts
More informationWHITEPAPER WHAT TO CONSIDER TO SUCCESSFULLY LAUNCH A MOBILE APP! By RG Infotech WHITEPAPER. How to launch a MOBILE APP. Successfully!
WHITEPAPER How to launch a MOBILE APP Successfully! By RG Infotech Since the launch of smartphones, mobile industry embarks several milestones that attracts several industries and developers to enter with
More information1
1 2 3 6 7 8 9 10 Storage & IO Benchmarking Primer Running sysbench and preparing data Use the prepare option to generate the data. Experiments Run sysbench with different storage systems and instance
More informationRecurrent Neural Networks
Recurrent Neural Networks 11-785 / Fall 2018 / Recitation 7 Raphaël Olivier Recap : RNNs are magic They have infinite memory They handle all kinds of series They re the basis of recent NLP : Translation,
More informationSimile Tools Workshop Summary MacKenzie Smith, MIT Libraries
Simile Tools Workshop Summary MacKenzie Smith, MIT Libraries Intro On June 10 th and 11 th, 2010 a group of Simile Exhibit users, software developers and architects met in Washington D.C. to discuss the
More informationCASE STUDY. Inbound Marketing Engagement With Stepping Stones Centre, UAE. July Ongoing
CASE STUDY Inbound Marketing Engagement With Stepping Stones Centre, UAE July 216 - Ongoing Presentation Agenda! Deliverables Agreed To Activities Taken Up Results What s Next 3 About Stepping Stones Stepping
More informationA Guide to Using MailChimp
v 1.4 Updated October 11, 2018 Table of Contents 1. Introduction.......................................................................3 2. Account Set-Up...................................................................4
More informationBuild Better Business Models for Machine Translation
Build Better Business Models for Machine Translation Executive Summary KantanAnalytics is a newly developed, Machine Translation (MT) quality scoring technology that it is anticipated to have a considerable
More informationThe Prague Bulletin of Mathematical Linguistics NUMBER 91 JANUARY
The Prague Bulletin of Mathematical Linguistics NUMBER 9 JANUARY 009 79 88 Z-MERT: A Fully Configurable Open Source Tool for Minimum Error Rate Training of Machine Translation Systems Omar F. Zaidan Abstract
More informationGuide to Marketing
Guide to Email Marketing Why Email Marketing? Isn t Email Dead? Email is not dead. In fact, it s very much alive. More and more businesses are using email as an effective form of communication, and more
More informationShrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent
More informationIndustrial Machinery. Search Marketing Case Study
Industrial Machinery Search Marketing Case Study 2015-2016 A performance digital marketing consultancy & training academy. Our team comprises of digital researchers, strategists, consultants, usability
More informationLoonyBin: Making Empirical MT Reproduciple, Efficient, and Less Annoying
LoonyBin: Making Empirical MT Reproduciple, Efficient, and Less Annoying Jonathan Clark Alon Lavie CMU Machine Translation Lunch Tuesday, January 19, 2009 Outline A (Brief) Guilt Trip What goes into a
More informationDid you know that SEO increases traffic, leads and sales? SEO = More Website Visitors More Traffic = More Leads More Leads= More Sales
1 Did you know that SEO increases traffic, leads and sales? SEO = More Website Visitors More Traffic = More Leads More Leads= More Sales What is SEO? Search engine optimization is the process of improving
More informationDIGITAL SIGNAGE SOFTWARE
DIGITAL SIGNAGE SOFTWARE MODERN. POWERFUL. EFFECTIVE. The most complete solution for visual communication JADE GIVES LIFE TO YOUR DIGITAL SCREENS Jade is the modern and reliable digital signage content
More informationCS839: Probabilistic Graphical Models. Lecture 22: The Attention Mechanism. Theo Rekatsinas
CS839: Probabilistic Graphical Models Lecture 22: The Attention Mechanism Theo Rekatsinas 1 Why Attention? Consider machine translation: We need to pay attention to the word we are currently translating.
More information/ Cloud Computing. Recitation 7 October 10, 2017
15-319 / 15-619 Cloud Computing Recitation 7 October 10, 2017 Overview Last week s reflection Project 3.1 OLI Unit 3 - Module 10, 11, 12 Quiz 5 This week s schedule OLI Unit 3 - Module 13 Quiz 6 Project
More informationWordPress & Web Development. Student Curriculum
WordPress & Web Development Student Curriculum Intro to HTML Coding (Week 1) Class discussion on why HTML is important. Each Student will complete 13 lessons in basic HTML coding and submit them for a
More informationMobile Query Interfaces
Mobile Query Interfaces Matthew Krog Abstract There are numerous alternatives to the application-oriented mobile interfaces. Since users use their mobile devices to manage personal information, a PIM interface
More informationJU_CSE_TE: System Description 2010 ResPubliQA
JU_CSE_TE: System Description QA@CLEF 2010 ResPubliQA Partha Pakray 1, Pinaki Bhaskar 1, Santanu Pal 1, Dipankar Das 1, Sivaji Bandyopadhyay 1, Alexander Gelbukh 2 Department of Computer Science & Engineering
More informationModule 3.1. The Numbers Game - Call Tracking
Module 3.1 The Numbers Game - Why Local Consumers want something, and they want it now... If your toilet starts leaking and you need a plumber... Are you going to fill out a web form? NO! People Want to
More informationSpice UK. Susan Hallam. Susan Hallam Page 1. Spice UK. Agenda for Today
UK UK www.shcl.co.uk susan@shcl.co.uk Agenda for Today Getting Found in Google Social Media Marketing Adwords Pay Per Click Advertising Promotion Techniques Google Analytics susan@shcl.co.uk Page 1 UK
More informationA PRACTICE BUILDERS white paper. 8 Ways to Improve SEO Ranking of Your Healthcare Website
A PRACTICE BUILDERS white paper 8 Ways to Improve SEO Ranking of Your Healthcare Website More than 70 percent of patients find their healthcare providers through a search engine. This means appearing high
More informationOutline GIZA++ Moses. Demo. Steps Output files. Training pipeline Decoder
GIZA++ and Moses Outline GIZA++ Steps Output files Moses Training pipeline Decoder Demo GIZA++ A statistical machine translation toolkit used to train IBM Models 1-5 (moses only uses output of IBM Model-1)
More informationEnriching Lifelong User Modelling with the Social e- Networking and e-commerce Pieces of the Puzzle
Enriching Lifelong User Modelling with the Social e- Networking and e-commerce Pieces of the Puzzle Demetris Kyriacou Learning Societies Lab School of Electronics and Computer Science, University of Southampton
More informationTranslation Quality Estimation: Past, Present, and Future
Translation Quality Estimation: Past, Present, and Future André Martins MT Marathon, Lisbon, August 31st, 2017 André Martins (Unbabel) Quality Estimation MTM, 31/8/17 1 / 69 This Talk First part: largely
More informationThe GREYC Machine Translation System for the IWSLT 2008 campaign
The GREYC Machine Translation System for the IWSLT 2008 campaign Yves Lepage Adrien Lardilleux Julien Gosme Jean-Luc Manguin GREYC, University of Caen, France (GREYC@IWSLT 2008) 1 / 12 The system The system
More informationEngineering Data Intensive Scalable Systems
Engineering Data Intensive Scalable Systems Introduction Internet services companies such as Google, Yahoo!, Amazon, and Facebook, have pioneered systems that have achieved unprecedented scale while still
More informationThe Multi-perspective Process Explorer
The Multi-perspective Process Explorer Felix Mannhardt 1,2, Massimiliano de Leoni 1, Hajo A. Reijers 3,1 1 Eindhoven University of Technology, Eindhoven, The Netherlands 2 Lexmark Enterprise Software,
More information