Density-Driven Cross-Lingual Transfer of Dependency Parsers

Size: px
Start display at page:

Download "Density-Driven Cross-Lingual Transfer of Dependency Parsers"

Transcription

1 Density-Driven Cross-Lingual Transfer of Dependency Parsers Mohammad Sadegh Rasooli Michael Collins Presented by Owen Rambow EMNLP 2015

2 Motivation Availability of treebanks Accurate parsers use annotated treebanks. There are no gold-standard treebanks for many languages. Annotated treebanks are very expensive to create. Common approach: using universal linguistic information Without parallel data; e.g [Zhang and Barzilay, 2015] With parallel data; e.g [Ma and Xia, 2014] The best results but still lags behind supervised parsing

3 Motivation Availability of treebanks Accurate parsers use annotated treebanks. There are no gold-standard treebanks for many languages. Annotated treebanks are very expensive to create. Common approach: using universal linguistic information Without parallel data; e.g [Zhang and Barzilay, 2015] With parallel data; e.g [Ma and Xia, 2014] The best results but still lags behind supervised parsing

4 Projecting Dependencies from Parallel Data Bitext Prepare bitext The political priorities must be set by this House and the MEPs. ROOT Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

5 Projecting Dependencies from Parallel Data Align Align bitext (e.g. via Giza++) The political priorities must be set by this House and the MEPs. ROOT Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

6 Projecting Dependencies from Parallel Data Parse Parse source sentence with a supervised parser. The political priorities must be set by this House and the MEPs. ROOT Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

7 Projecting Dependencies from Parallel Data Project Project dependencies. The political priorities must be set by this House and the MEPs. ROOT Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

8 Projecting Dependencies from Parallel Data Train Train on the projected dependencies. Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

9 Practical Problems Most translations are not word-to-word. Alignment errors! Supervised parsers are not perfect. Difference in syntactic behavior across languages.

10 Previous Results 95 dependency accuracy (avg. over 6 EU languages) [McDonald et al., 2011] [Zhang and Barzilay, 2015] [Ma and Xia, 2014] MPH11 ZB15 Previous work MX14 1st-ord Sh-R

11 Previous Results 95 [McDonald et al., 2011] dependency accuracy (avg. over 6 EU languages) [Zhang and Barzilay, 2015] [Ma and Xia, 2014] [McDonald et al., 2005] [Rasooli and Tetreault, 2015] % lower than a first-order supervised model MPH11 ZB15 MX14 1st-ord Sh-R Previous work Supervised models

12 Our Approach We define different sets of dense structures Full trees Dense partial trees

13 Dense Structures A projected full tree t P 100 is: A projective dependency tree All words have one parent Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

14 Dense Structures A partial tree t P 80 is: A projective dependency tree (a collection of projective trees) At least 80% of words have one parent Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

15 Dense Structures A partial tree t P k is: A projective dependency tree (a collection of projective trees) There is at least one span of length k where all words in that span have one parent Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT k=7 in the above tree

16 Our Contributions We demonstrate the utility of dense projected structures. We describe a training algorithm that builds on dense structures.

17 Our Contributions 95 [McDonald et al., 2011] [Zhang and Barzilay, 2015] dependency accuracy (avg. over 6 EU languages) [Ma and Xia, 2014] [McDonald et al., 2005] [Rasooli and Tetreault, 2015] MPH11 ZB15 MX14 Our Model 1st-ord Sh-R Previous work Supervised models

18 Our Contributions 95 dependency accuracy (avg. over 6 EU languages) [McDonald et al., 2011] [Zhang and Barzilay, 2015] 5.5% absolute improvement [Ma and Xia, 2014] Our model 87.5 [McDonald et al., 2005] [Rasooli and Tetreault, 2015] MPH11 ZB15 MX14 Our Model 1st-ord Sh-R Previous work Supervised models

19 Overview The learning algorithm Results Analysis

20 Projecting Dependencies Languages from Google universal treebank: English (only as source), German, Spanish, French, Italian, Portuguese, and Swedish. English to German transfer data for developing our models. We use Giza++ intersected alignments on EuroParl data We use the Yara parser [Rasooli and Tetreault, 2015], a shift-reduce beam parser.

21 Functions Used in Our Algorithm We use the following function definitions: Train(D) CDECODE(P, θ) TOP(D, θ)

22 Train(D) Input D A set of dependency trees (full trees) Output θ A parsing model

23 CDECODE(P, θ) Input P Input θ A set of partial dependency structures Parsing model Output D A set of full trees that are completely consistent with the dependencies in P. Filling in partial trees with dynamic oracles [Goldberg and Nivre, 2013]. Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

24 CDECODE(P, θ) Input P Input θ A set of partial dependency structures Parsing model Output D A set of full trees that are completely consistent with the dependencies in P. Filling in partial trees with dynamic oracles [Goldberg and Nivre, 2013]. Die politischen Prioritäten müssen von diesem Parlament und den Europaabgeordneten abgesteckt werden. ROOT

25 TOP(D, θ) Input D Input θ A set of full dependency trees Parsing model Output A Top m highest scoring trees in D We use m=200,000 in our experiments. Score: Perceptron-based parse score normalized by sentence length

26 Definitions A 0 = P 100 A 1 = P 7 P 80 A 2 = P 5 P 80 A 3 = P 1 P 80 Note A 1 A 2 A 3

27 Learning Algorithm Train on full trees θ 0 = Train(A 0 ) for i = do D i = CDECODE(A i, θ i 1 ) A i = TOP(D i, θ i 1 ) θ i = Train(A 0 A i ) end for Return θ 3 Given definitions: A 0 = P 100 A 1 = P 7 P 80 A 2 = P 5 P 80 A 3 = P 1 P 80 Note A 1 A 2 A 3

28 Learning Algorithm Gradually decrease density θ 0 = Train(A 0 ) for i = do D i = CDECODE(A i, θ i 1 ) A i = TOP(D i, θ i 1 ) θ i = Train(A 0 A i ) end for Return θ 3 Given definitions: A 0 = P 100 A 1 = P 7 P 80 A 2 = P 5 P 80 A 3 = P 1 P 80 Note A 1 A 2 A 3

29 Learning Algorithm Fill in partial trees θ 0 = Train(A 0 ) for i = do D i = CDECODE(A i, θ i 1 ) A i = TOP(D i, θ i 1 ) θ i = Train(A 0 A i ) end for Return θ 3 Given definitions: A 0 = P 100 A 1 = P 7 P 80 A 2 = P 5 P 80 A 3 = P 1 P 80 Note A 1 A 2 A 3

30 Learning Algorithm Select high-scoring trees θ 0 = Train(A 0 ) for i = do D i = CDECODE(A i, θ i 1 ) A i = TOP(D i, θ i 1 ) θ i = Train(A 0 A i ) end for Return θ 3 Given definitions: A 0 = P 100 A 1 = P 7 P 80 A 2 = P 5 P 80 A 3 = P 1 P 80 Note A 1 A 2 A 3

31 Learning Algorithm Train on the new set θ 0 = Train(A 0 ) for i = do D i = CDECODE(A i, θ i 1 ) A i = TOP(D i, θ i 1 ) θ i = Train(A 0 A i ) end for Return θ 3 Given definitions: A 0 = P 100 A 1 = P 7 P 80 A 2 = P 5 P 80 A 3 = P 1 P 80 Note A 1 A 2 A 3

32 Learning Algorithm Return the final model θ 0 = Train(A 0 ) for i = do D i = CDECODE(A i, θ i 1 ) A i = TOP(D i, θ i 1 ) θ i = Train(A 0 A i ) end for Return θ 3 Given definitions: A 0 = P 100 A 1 = P 7 P 80 A 2 = P 5 P 80 A 3 = P 1 P 80 Note A 1 A 2 A 3

33 Overview The learning algorithm Results Analysis

34 Two Settings Scenario 1 Transfer from English. Scenario 2 (voting) The different languages vote on dependencies. This scenario is true for cases such as Europarl.

35 Results on European Languages UAS German Spanish French Italian Portuguese Swedish θ 0 (Full trees)

36 Results on European Languages UAS German Spanish French Italian Portuguese Swedish θ 0 (Full trees) θ 3 (Full+dense)

37 Results on European Languages UAS German Spanish French Italian Portuguese Swedish θ 0 (Full trees) θ 3 (Full+dense) θ 3 (voting)

38 Results on European Languages (Comparison) UAS German Spanish French Italian Portuguese Swedish [Ma and Xia, 2014] θ 3 (voting) Sup. MST-1st

39 Comparison to Previous Work 95 [McDonald et al., 2011] [Zhang and Barzilay, 2015] dependency accuracy (avg. over 6 EU languages) [Ma and Xia, 2014] [McDonald et al., 2005] [Rasooli and Tetreault, 2015] MPH11 ZB15 MX14 θ 0 θ 3 θ 3 (voting) 1st-ord Sh-R Previous work Supervised models

40 Comparison to Previous Work 95 dependency accuracy (avg. over 6 EU languages) [McDonald et al., 2011] [Zhang and Barzilay, 2015] [Ma and Xia, 2014] θ 0 (full trees) θ 3 (full+dense) [McDonald et al., 2005] [Rasooli and Tetreault, 2015] % absolute improvement MPH11 ZB15 MX14 θ 0 θ 3 θ 3 (voting) 1st-ord Sh-R Previous work Supervised models

41 Comparison to Previous Work 95 dependency accuracy (avg. over 6 EU languages) [McDonald et al., 2011] [Zhang and Barzilay, 2015] [Ma and Xia, 2014] θ 0 (full trees) θ 3 (full+dense) θ 3 (voting) [McDonald et al., 2005] [Rasooli and Tetreault, 2015] % absolute improvement MPH11 ZB15 MX14 θ 0 θ 3 θ 3 (voting) 1st-ord Sh-R Previous work Supervised models

42 Overview The learning algorithm Results Analysis

43 Accuracy of Full Trees The accuracy of full trees is high. Voting increases the number of words per sentence, number of sentences and accuracy of full trees. Setting English target Voting Sen# 17K 77K Word/sen Prec. vs supervised

44 Density of Partial Trees in Voting The length and number of sentences are increased in partial dense trees. The accuracy of partial trees are lower than full trees. Setting P 100 P 80 P 7 Sen# 77K 243K Deps# Words/sen Density 100% 50% Prec. vs supervised

45 Accuracy across Different Languages Language P 100 P 80 P 7 Sup. #sen words/sen #dep Prec. #sen words/sen #dep Prec. German 47K K Spanish 109K K French 78K K Italian 101K K Portuguese 39K K Swedish 86K K Average 77K K

46 Conclusion We showed the utility of dense structures in projected dependencies. We showed a simple and effective learning method to utilize dense structures. Our performance is very close to a supervised parser. Future work: Applying to a broader set of languages. Using this model to improve machine translation.

47 Thanks

48 References I Goldberg, Y. and Nivre, J. (2013). Training deterministic parsers with non-deterministic oracles. TACL, 1: Ma, X. and Xia, F. (2014). Unsupervised dependency parsing with transferring distribution via parallel guidance and entropy regularization. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages , Baltimore, Maryland. Association for Computational Linguistics. McDonald, R., Pereira, F., Ribarov, K., and Hajič, J. (2005). Non-projective dependency parsing using spanning tree algorithms. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, HLT 05, pages , Stroudsburg, PA, USA. Association for Computational Linguistics. McDonald, R., Petrov, S., and Hall, K. (2011). Multi-source transfer of delexicalized dependency parsers. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 62 72, Edinburgh, Scotland, UK. Association for Computational Linguistics.

49 References II Rasooli, M. S. and Tetreault, J. (2015). Yara parser: A fast and accurate dependency parser. arxiv preprint arxiv: Zhang, Y. and Barzilay, R. (2015). Hierarchical low-rank tensors for multilingual transfer parsing. In Conference on Empirical Methods in Natural Language Processing (EMNLP), Lisbon, Portugal.

Managing a Multilingual Treebank Project

Managing a Multilingual Treebank Project Managing a Multilingual Treebank Project Milan Souček Timo Järvinen Adam LaMontagne Lionbridge Finland {milan.soucek,timo.jarvinen,adam.lamontagne}@lionbridge.com Abstract This paper describes the work

More information

arxiv: v2 [cs.cl] 24 Mar 2015

arxiv: v2 [cs.cl] 24 Mar 2015 Yara Parser: A Fast and Accurate Dependency Parser Mohammad Sadegh Rasooli 1 and Joel Tetreault 2 1 Department of Computer Science, Columbia University, New York, NY, rasooli@cs.columbia.edu 2 Yahoo Labs,

More information

Statistical parsing. Fei Xia Feb 27, 2009 CSE 590A

Statistical parsing. Fei Xia Feb 27, 2009 CSE 590A Statistical parsing Fei Xia Feb 27, 2009 CSE 590A Statistical parsing History-based models (1995-2000) Recent development (2000-present): Supervised learning: reranking and label splitting Semi-supervised

More information

CS395T Project 2: Shift-Reduce Parsing

CS395T Project 2: Shift-Reduce Parsing CS395T Project 2: Shift-Reduce Parsing Due date: Tuesday, October 17 at 9:30am In this project you ll implement a shift-reduce parser. First you ll implement a greedy model, then you ll extend that model

More information

Agenda for today. Homework questions, issues? Non-projective dependencies Spanning tree algorithm for non-projective parsing

Agenda for today. Homework questions, issues? Non-projective dependencies Spanning tree algorithm for non-projective parsing Agenda for today Homework questions, issues? Non-projective dependencies Spanning tree algorithm for non-projective parsing 1 Projective vs non-projective dependencies If we extract dependencies from trees,

More information

Transition-Based Dependency Parsing with MaltParser

Transition-Based Dependency Parsing with MaltParser Transition-Based Dependency Parsing with MaltParser Joakim Nivre Uppsala University and Växjö University Transition-Based Dependency Parsing 1(13) Introduction Outline Goals of the workshop Transition-based

More information

Hybrid Combination of Constituency and Dependency Trees into an Ensemble Dependency Parser

Hybrid Combination of Constituency and Dependency Trees into an Ensemble Dependency Parser Hybrid Combination of Constituency and Dependency Trees into an Ensemble Dependency Parser Nathan David Green and Zdeněk Žabokrtský Charles University in Prague Institute of Formal and Applied Linguistics

More information

Dynamic Feature Selection for Dependency Parsing

Dynamic Feature Selection for Dependency Parsing Dynamic Feature Selection for Dependency Parsing He He, Hal Daumé III and Jason Eisner EMNLP 2013, Seattle Structured Prediction in NLP Part-of-Speech Tagging Parsing N N V Det N Fruit flies like a banana

More information

Projective Dependency Parsing with Perceptron

Projective Dependency Parsing with Perceptron Projective Dependency Parsing with Perceptron Xavier Carreras, Mihai Surdeanu, and Lluís Màrquez Technical University of Catalonia {carreras,surdeanu,lluism}@lsi.upc.edu 8th June 2006 Outline Introduction

More information

Dependency Parsing 2 CMSC 723 / LING 723 / INST 725. Marine Carpuat. Fig credits: Joakim Nivre, Dan Jurafsky & James Martin

Dependency Parsing 2 CMSC 723 / LING 723 / INST 725. Marine Carpuat. Fig credits: Joakim Nivre, Dan Jurafsky & James Martin Dependency Parsing 2 CMSC 723 / LING 723 / INST 725 Marine Carpuat Fig credits: Joakim Nivre, Dan Jurafsky & James Martin Dependency Parsing Formalizing dependency trees Transition-based dependency parsing

More information

Discriminative Training with Perceptron Algorithm for POS Tagging Task

Discriminative Training with Perceptron Algorithm for POS Tagging Task Discriminative Training with Perceptron Algorithm for POS Tagging Task Mahsa Yarmohammadi Center for Spoken Language Understanding Oregon Health & Science University Portland, Oregon yarmoham@ohsu.edu

More information

Dependency Parsing CMSC 723 / LING 723 / INST 725. Marine Carpuat. Fig credits: Joakim Nivre, Dan Jurafsky & James Martin

Dependency Parsing CMSC 723 / LING 723 / INST 725. Marine Carpuat. Fig credits: Joakim Nivre, Dan Jurafsky & James Martin Dependency Parsing CMSC 723 / LING 723 / INST 725 Marine Carpuat Fig credits: Joakim Nivre, Dan Jurafsky & James Martin Dependency Parsing Formalizing dependency trees Transition-based dependency parsing

More information

Incremental Integer Linear Programming for Non-projective Dependency Parsing

Incremental Integer Linear Programming for Non-projective Dependency Parsing Incremental Integer Linear Programming for Non-projective Dependency Parsing Sebastian Riedel James Clarke ICCS, University of Edinburgh 22. July 2006 EMNLP 2006 S. Riedel, J. Clarke (ICCS, Edinburgh)

More information

Easy-First POS Tagging and Dependency Parsing with Beam Search

Easy-First POS Tagging and Dependency Parsing with Beam Search Easy-First POS Tagging and Dependency Parsing with Beam Search Ji Ma JingboZhu Tong Xiao Nan Yang Natrual Language Processing Lab., Northeastern University, Shenyang, China MOE-MS Key Lab of MCC, University

More information

Fast(er) Exact Decoding and Global Training for Transition-Based Dependency Parsing via a Minimal Feature Set

Fast(er) Exact Decoding and Global Training for Transition-Based Dependency Parsing via a Minimal Feature Set Fast(er) Exact Decoding and Global Training for Transition-Based Dependency Parsing via a Minimal Feature Set Tianze Shi* Liang Huang Lillian Lee* * Cornell University Oregon State University O n 3 O n

More information

A Quick Guide to MaltParser Optimization

A Quick Guide to MaltParser Optimization A Quick Guide to MaltParser Optimization Joakim Nivre Johan Hall 1 Introduction MaltParser is a system for data-driven dependency parsing, which can be used to induce a parsing model from treebank data

More information

Learning Latent Linguistic Structure to Optimize End Tasks. David A. Smith with Jason Naradowsky and Xiaoye Tiger Wu

Learning Latent Linguistic Structure to Optimize End Tasks. David A. Smith with Jason Naradowsky and Xiaoye Tiger Wu Learning Latent Linguistic Structure to Optimize End Tasks David A. Smith with Jason Naradowsky and Xiaoye Tiger Wu 12 October 2012 Learning Latent Linguistic Structure to Optimize End Tasks David A. Smith

More information

Improving the quality of a customized SMT system using shared training data. August 28, 2009

Improving the quality of a customized SMT system using shared training data.  August 28, 2009 Improving the quality of a customized SMT system using shared training data Chris.Wendt@microsoft.com Will.Lewis@microsoft.com August 28, 2009 1 Overview Engine and Customization Basics Experiment Objective

More information

Online Service for Polish Dependency Parsing and Results Visualisation

Online Service for Polish Dependency Parsing and Results Visualisation Online Service for Polish Dependency Parsing and Results Visualisation Alina Wróblewska and Piotr Sikora Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland alina@ipipan.waw.pl,piotr.sikora@student.uw.edu.pl

More information

Introduction to Data-Driven Dependency Parsing

Introduction to Data-Driven Dependency Parsing Introduction to Data-Driven Dependency Parsing Introductory Course, ESSLLI 2007 Ryan McDonald 1 Joakim Nivre 2 1 Google Inc., New York, USA E-mail: ryanmcd@google.com 2 Uppsala University and Växjö University,

More information

Dependency grammar and dependency parsing

Dependency grammar and dependency parsing Dependency grammar and dependency parsing Syntactic analysis (5LN455) 2016-12-05 Sara Stymne Department of Linguistics and Philology Based on slides from Marco Kuhlmann Activities - dependency parsing

More information

Transition-based dependency parsing

Transition-based dependency parsing Transition-based dependency parsing Syntactic analysis (5LN455) 2014-12-18 Sara Stymne Department of Linguistics and Philology Based on slides from Marco Kuhlmann Overview Arc-factored dependency parsing

More information

Non-Projective Dependency Parsing in Expected Linear Time

Non-Projective Dependency Parsing in Expected Linear Time Non-Projective Dependency Parsing in Expected Linear Time Joakim Nivre Uppsala University, Department of Linguistics and Philology, SE-75126 Uppsala Växjö University, School of Mathematics and Systems

More information

Non-Projective Dependency Parsing with Non-Local Transitions

Non-Projective Dependency Parsing with Non-Local Transitions Non-Projective Dependency Parsing with Non-Local Transitions Daniel Fernández-González and Carlos Gómez-Rodríguez Universidade da Coruña FASTPARSE Lab, LyS Research Group, Departamento de Computación Campus

More information

AT&T: The Tag&Parse Approach to Semantic Parsing of Robot Spatial Commands

AT&T: The Tag&Parse Approach to Semantic Parsing of Robot Spatial Commands AT&T: The Tag&Parse Approach to Semantic Parsing of Robot Spatial Commands Svetlana Stoyanchev, Hyuckchul Jung, John Chen, Srinivas Bangalore AT&T Labs Research 1 AT&T Way Bedminster NJ 07921 {sveta,hjung,jchen,srini}@research.att.com

More information

An Empirical Study of Semi-supervised Structured Conditional Models for Dependency Parsing

An Empirical Study of Semi-supervised Structured Conditional Models for Dependency Parsing An Empirical Study of Semi-supervised Structured Conditional Models for Dependency Parsing Jun Suzuki, Hideki Isozaki NTT CS Lab., NTT Corp. Kyoto, 619-0237, Japan jun@cslab.kecl.ntt.co.jp isozaki@cslab.kecl.ntt.co.jp

More information

The Application of Constraint Rules to Data-driven Parsing

The Application of Constraint Rules to Data-driven Parsing The Application of Constraint Rules to Data-driven Parsing Sardar Jaf The University of Manchester jafs@cs.man.ac.uk Allan Ramsay The University of Manchester ramsaya@cs.man.ac.uk Abstract In this paper,

More information

Dependency grammar and dependency parsing

Dependency grammar and dependency parsing Dependency grammar and dependency parsing Syntactic analysis (5LN455) 2015-12-09 Sara Stymne Department of Linguistics and Philology Based on slides from Marco Kuhlmann Activities - dependency parsing

More information

Machine Translation Research in META-NET

Machine Translation Research in META-NET Machine Translation Research in META-NET Jan Hajič Institute of Formal and Applied Linguistics Charles University in Prague, CZ hajic@ufal.mff.cuni.cz Solutions for Multilingual Europe Budapest, Hungary,

More information

Online Learning of Approximate Dependency Parsing Algorithms

Online Learning of Approximate Dependency Parsing Algorithms Online Learning of Approximate Dependency Parsing Algorithms Ryan McDonald Fernando Pereira Department of Computer and Information Science University of Pennsylvania Philadelphia, PA 19104 {ryantm,pereira}@cis.upenn.edu

More information

Let s get parsing! Each component processes the Doc object, then passes it on. doc.is_parsed attribute checks whether a Doc object has been parsed

Let s get parsing! Each component processes the Doc object, then passes it on. doc.is_parsed attribute checks whether a Doc object has been parsed Let s get parsing! SpaCy default model includes tagger, parser and entity recognizer nlp = spacy.load('en ) tells spacy to use "en" with ["tagger", "parser", "ner"] Each component processes the Doc object,

More information

Langforia: Language Pipelines for Annotating Large Collections of Documents

Langforia: Language Pipelines for Annotating Large Collections of Documents Langforia: Language Pipelines for Annotating Large Collections of Documents Marcus Klang Lund University Department of Computer Science Lund, Sweden Marcus.Klang@cs.lth.se Pierre Nugues Lund University

More information

Discriminative Parse Reranking for Chinese with Homogeneous and Heterogeneous Annotations

Discriminative Parse Reranking for Chinese with Homogeneous and Heterogeneous Annotations Discriminative Parse Reranking for Chinese with Homogeneous and Heterogeneous Annotations Weiwei Sun and Rui Wang and Yi Zhang Department of Computational Linguistics, Saarland University German Research

More information

Dynamic Programming for Higher Order Parsing of Gap-Minding Trees

Dynamic Programming for Higher Order Parsing of Gap-Minding Trees Dynamic Programming for Higher Order Parsing of Gap-Minding Trees Emily Pitler, Sampath Kannan, Mitchell Marcus Computer and Information Science University of Pennsylvania Philadelphia, PA 19104 epitler,kannan,mitch@seas.upenn.edu

More information

Tekniker för storskalig parsning: Dependensparsning 2

Tekniker för storskalig parsning: Dependensparsning 2 Tekniker för storskalig parsning: Dependensparsning 2 Joakim Nivre Uppsala Universitet Institutionen för lingvistik och filologi joakim.nivre@lingfil.uu.se Dependensparsning 2 1(45) Data-Driven Dependency

More information

Dependency grammar and dependency parsing

Dependency grammar and dependency parsing Dependency grammar and dependency parsing Syntactic analysis (5LN455) 2014-12-10 Sara Stymne Department of Linguistics and Philology Based on slides from Marco Kuhlmann Mid-course evaluation Mostly positive

More information

Turning on the Turbo: Fast Third-Order Non- Projective Turbo Parsers

Turning on the Turbo: Fast Third-Order Non- Projective Turbo Parsers Carnegie Mellon University Research Showcase @ CMU Language Technologies Institute School of Computer Science 8-2013 Turning on the Turbo: Fast Third-Order Non- Projective Turbo Parsers Andre F.T. Martins

More information

Transition-Based Dependency Parsing with Stack Long Short-Term Memory

Transition-Based Dependency Parsing with Stack Long Short-Term Memory Transition-Based Dependency Parsing with Stack Long Short-Term Memory Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, Noah A. Smith Association for Computational Linguistics (ACL), 2015 Presented

More information

Refresher on Dependency Syntax and the Nivre Algorithm

Refresher on Dependency Syntax and the Nivre Algorithm Refresher on Dependency yntax and Nivre Algorithm Richard Johansson 1 Introduction This document gives more details about some important topics that re discussed very quickly during lecture: dependency

More information

Improving Transition-Based Dependency Parsing with Buffer Transitions

Improving Transition-Based Dependency Parsing with Buffer Transitions Improving Transition-Based Dependency Parsing with Buffer Transitions Daniel Fernández-González Departamento de Informática Universidade de Vigo Campus As Lagoas, 32004 Ourense, Spain danifg@uvigo.es Carlos

More information

SEMINAR: RECENT ADVANCES IN PARSING TECHNOLOGY. Parser Evaluation Approaches

SEMINAR: RECENT ADVANCES IN PARSING TECHNOLOGY. Parser Evaluation Approaches SEMINAR: RECENT ADVANCES IN PARSING TECHNOLOGY Parser Evaluation Approaches NATURE OF PARSER EVALUATION Return accurate syntactic structure of sentence. Which representation? Robustness of parsing. Quick

More information

Transition-based Dependency Parsing with Rich Non-local Features

Transition-based Dependency Parsing with Rich Non-local Features Transition-based Dependency Parsing with Rich Non-local Features Yue Zhang University of Cambridge Computer Laboratory yue.zhang@cl.cam.ac.uk Joakim Nivre Uppsala University Department of Linguistics and

More information

Easy-First Chinese POS Tagging and Dependency Parsing

Easy-First Chinese POS Tagging and Dependency Parsing Easy-First Chinese POS Tagging and Dependency Parsing ABSTRACT Ji Ma, Tong Xiao, Jingbo Zhu, Feiliang Ren Natural Language Processing Laboratory Northeastern University, China majineu@outlook.com, zhujingbo@mail.neu.edu.cn,

More information

A Transition-Based Dependency Parser Using a Dynamic Parsing Strategy

A Transition-Based Dependency Parser Using a Dynamic Parsing Strategy A Transition-Based Dependency Parser Using a Dynamic Parsing Strategy Francesco Sartorio Department of Information Engineering University of Padua, Italy sartorio@dei.unipd.it Giorgio Satta Department

More information

Basic Parsing with Context-Free Grammars. Some slides adapted from Karl Stratos and from Chris Manning

Basic Parsing with Context-Free Grammars. Some slides adapted from Karl Stratos and from Chris Manning Basic Parsing with Context-Free Grammars Some slides adapted from Karl Stratos and from Chris Manning 1 Announcements HW 2 out Midterm on 10/19 (see website). Sample ques>ons will be provided. Sign up

More information

Combining Discrete and Continuous Features for Deterministic Transition-based Dependency Parsing

Combining Discrete and Continuous Features for Deterministic Transition-based Dependency Parsing Combining Discrete and Continuous Features for Deterministic Transition-based Dependency Parsing Meishan Zhang and Yue Zhang Singapore University of Technology and Design {meishan zhang yue zhang}@sutd.edu.sg

More information

Stanford s System for Parsing the English Web

Stanford s System for Parsing the English Web Stanford s System for Parsing the English Web David McClosky s, Wanxiang Che h, Marta Recasens s, Mengqiu Wang s, Richard Socher s, and Christopher D. Manning s s Natural Language Processing Group, Stanford

More information

Large-Scale Syntactic Processing: Parsing the Web. JHU 2009 Summer Research Workshop

Large-Scale Syntactic Processing: Parsing the Web. JHU 2009 Summer Research Workshop Large-Scale Syntactic Processing: JHU 2009 Summer Research Workshop Intro CCG parser Tasks 2 The Team Stephen Clark (Cambridge, UK) Ann Copestake (Cambridge, UK) James Curran (Sydney, Australia) Byung-Gyu

More information

Parsing with Dynamic Programming

Parsing with Dynamic Programming CS11-747 Neural Networks for NLP Parsing with Dynamic Programming Graham Neubig Site https://phontron.com/class/nn4nlp2017/ Two Types of Linguistic Structure Dependency: focus on relations between words

More information

Sorting Out Dependency Parsing

Sorting Out Dependency Parsing Sorting Out Dependency Parsing Joakim Nivre Uppsala University and Växjö University Sorting Out Dependency Parsing 1(38) Introduction Introduction Syntactic parsing of natural language: Who does what to

More information

Fully Delexicalized Contexts for Syntax-Based Word Embeddings

Fully Delexicalized Contexts for Syntax-Based Word Embeddings Fully Delexicalized Contexts for Syntax-Based Word Embeddings Jenna Kanerva¹, Sampo Pyysalo² and Filip Ginter¹ ¹Dept of IT - University of Turku, Finland ²Lang. Tech. Lab - University of Cambridge turkunlp.github.io

More information

Transition-Based Parsing of the Chinese Treebank using a Global Discriminative Model

Transition-Based Parsing of the Chinese Treebank using a Global Discriminative Model Transition-Based Parsing of the Chinese Treebank using a Global Discriminative Model Yue Zhang Oxford University Computing Laboratory yue.zhang@comlab.ox.ac.uk Stephen Clark Cambridge University Computer

More information

Dependency Parsing domain adaptation using transductive SVM

Dependency Parsing domain adaptation using transductive SVM Dependency Parsing domain adaptation using transductive SVM Antonio Valerio Miceli-Barone University of Pisa, Italy / Largo B. Pontecorvo, 3, Pisa, Italy miceli@di.unipi.it Giuseppe Attardi University

More information

Tokenization and Sentence Segmentation. Yan Shao Department of Linguistics and Philology, Uppsala University 29 March 2017

Tokenization and Sentence Segmentation. Yan Shao Department of Linguistics and Philology, Uppsala University 29 March 2017 Tokenization and Sentence Segmentation Yan Shao Department of Linguistics and Philology, Uppsala University 29 March 2017 Outline 1 Tokenization Introduction Exercise Evaluation Summary 2 Sentence segmentation

More information

Testing parsing improvements with combination and translation in Evalita 2014

Testing parsing improvements with combination and translation in Evalita 2014 10.12871/clicit201424 Testing parsing improvements with combination and translation in Evalita 2014 Alessandro Mazzei Dipartimento di Informatica Università degli Studi di Torino Corso Svizzera 185, 10149

More information

Undirected Dependency Parsing

Undirected Dependency Parsing Computational Intelligence, Volume 59, Number 000, 2010 Undirected Dependency Parsing CARLOS GÓMEZ-RODRÍGUEZ cgomezr@udc.es Depto. de Computación, Universidade da Coruña. Facultad de Informática, Campus

More information

HC-search for Incremental Parsing

HC-search for Incremental Parsing HC-search for Incremental Parsing Yijia Liu, Wanxiang Che, Bing Qin, Ting Liu Research Center for Social Computing and Information Retrieval Harbin Institute of Technology, China {yjliu,car,qinb,tliu}@ir.hit.edu.cn

More information

Accurate Parsing and Beyond. Yoav Goldberg Bar Ilan University

Accurate Parsing and Beyond. Yoav Goldberg Bar Ilan University Accurate Parsing and Beyond Yoav Goldberg Bar Ilan University Syntactic Parsing subj root rcmod rel xcomp det subj aux acomp acomp The soup, which I expected to be good, was bad subj root rcmod rel xcomp

More information

Sorting Out Dependency Parsing

Sorting Out Dependency Parsing Sorting Out Dependency Parsing Joakim Nivre Uppsala University and Växjö University Sorting Out Dependency Parsing 1(38) Introduction Introduction Syntactic parsing of natural language: Who does what to

More information

Context Encoding LSTM CS224N Course Project

Context Encoding LSTM CS224N Course Project Context Encoding LSTM CS224N Course Project Abhinav Rastogi arastogi@stanford.edu Supervised by - Samuel R. Bowman December 7, 2015 Abstract This project uses ideas from greedy transition based parsing

More information

*North American Standard USB Keyboard

*North American Standard USB Keyboard USB Country Keyboard Types (Country Codes) Spanish Windows *North American Standard USB Keyboard Italian Windows French Windows UK English Windows French Canadian Windows 2000/XP Portuguese-Brazilian Windows

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

Greed is Good if Randomized: New Inference for Dependency Parsing

Greed is Good if Randomized: New Inference for Dependency Parsing Greed is Good if Randomized: New Inference for Dependency Parsing Yuan Zhang, Tao Lei, Regina Barzilay, and Tommi Jaakkola Computer Science and Artificial Intelligence Laboratory Massachusetts Institute

More information

Automatic Discovery of Feature Sets for Dependency Parsing

Automatic Discovery of Feature Sets for Dependency Parsing Automatic Discovery of Feature Sets for Dependency Parsing Peter Nilsson Pierre Nugues Department of Computer Science Lund University peter.nilsson.lund@telia.com Pierre.Nugues@cs.lth.se Abstract This

More information

FP SIMPLE4ALL deliverable D6.5. Deliverable D6.5. Initial Public Release of Open Source Tools

FP SIMPLE4ALL deliverable D6.5. Deliverable D6.5. Initial Public Release of Open Source Tools Deliverable D6.5 Initial Public Release of Open Source Tools The research leading to these results has received funding from the European Community s Seventh Framework Programme (FP7/2007-2013) under grant

More information

Non-projective Dependency Parsing using Spanning Tree Algorithms

Non-projective Dependency Parsing using Spanning Tree Algorithms Non-projective Dependency Parsing using Spanning Tree Algorithms Ryan McDonald Fernando Pereira Department of Computer and Information Science University of Pennsylvania {ryantm,pereira}@cis.upenn.edu

More information

Graph-Based Parsing. Miguel Ballesteros. Algorithms for NLP Course. 7-11

Graph-Based Parsing. Miguel Ballesteros. Algorithms for NLP Course. 7-11 Graph-Based Parsing Miguel Ballesteros Algorithms for NLP Course. 7-11 By using some Joakim Nivre's materials from Uppsala University and Jason Eisner's material from Johns Hopkins University. Outline

More information

Better translations with user collaboration - Integrated MT at Microsoft

Better translations with user collaboration - Integrated MT at Microsoft Better s with user collaboration - Integrated MT at Microsoft Chris Wendt Microsoft Research One Microsoft Way Redmond, WA 98052 christw@microsoft.com Abstract This paper outlines the methodologies Microsoft

More information

A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction

A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction Seokhwan Kim Human Language Technology Dept. Institute for Infocomm Research Singapore 138632 kims@i2r.a-star.edu.sg

More information

A Search Engine based on Query Logs, and Search Log Analysis at the University of Sunderland

A Search Engine based on Query Logs, and Search Log Analysis at the University of Sunderland A Search Engine based on Query Logs, and Search Log Analysis at the University of Sunderland Michael Oakes, Yan Xu University of Sunderland, Dept. of Computing, Engineering and Technology, DGIC, St. Peter

More information

Statistical Dependency Parsing

Statistical Dependency Parsing Statistical Dependency Parsing The State of the Art Joakim Nivre Uppsala University Department of Linguistics and Philology joakim.nivre@lingfil.uu.se Statistical Dependency Parsing 1(29) Introduction

More information

A Dynamic Confusion Score for Dependency Arc Labels

A Dynamic Confusion Score for Dependency Arc Labels A Dynamic Confusion Score for Dependency Arc Labels Sambhav Jain and Bhasha Agrawal Language Technologies Research Center IIIT-Hyderabad, India {sambhav.jain, bhasha.agrawal}@research.iiit.ac.in Abstract

More information

Dependency Parsing. Ganesh Bhosale Neelamadhav G Nilesh Bhosale Pranav Jawale under the guidance of

Dependency Parsing. Ganesh Bhosale Neelamadhav G Nilesh Bhosale Pranav Jawale under the guidance of Dependency Parsing Ganesh Bhosale - 09305034 Neelamadhav G. - 09305045 Nilesh Bhosale - 09305070 Pranav Jawale - 09307606 under the guidance of Prof. Pushpak Bhattacharyya Department of Computer Science

More information

Kofax Transformation Modules. Technical Specifications. Version: 6.2.0

Kofax Transformation Modules. Technical Specifications. Version: 6.2.0 Kofax Transformation Modules Technical s Version: 6.2.0 Written by: Product Management KTM Date: May 2017 2017 Lexmark. All rights reserved. Lexmark is a trademark of Lexmark International, Inc., or its

More information

Spanning Tree Methods for Discriminative Training of Dependency Parsers

Spanning Tree Methods for Discriminative Training of Dependency Parsers University of Pennsylvania ScholarlyCommons Technical Reports (CIS) Department of Computer & Information Science January 2006 Spanning Tree Methods for Discriminative Training of Dependency Parsers Ryan

More information

On maximum spanning DAG algorithms for semantic DAG parsing

On maximum spanning DAG algorithms for semantic DAG parsing On maximum spanning DAG algorithms for semantic DAG parsing Natalie Schluter Department of Computer Science School of Technology, Malmö University Malmö, Sweden natalie.schluter@mah.se Abstract Consideration

More information

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank text

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank text Philosophische Fakultät Seminar für Sprachwissenschaft Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank text 06 July 2017, Patricia Fischer & Neele Witte Overview Sentiment

More information

Collins and Eisner s algorithms

Collins and Eisner s algorithms Collins and Eisner s algorithms Syntactic analysis (5LN455) 2015-12-14 Sara Stymne Department of Linguistics and Philology Based on slides from Marco Kuhlmann Recap: Dependency trees dobj subj det pmod

More information

<Insert Picture Here> Oracle Policy Automation 10.0 Features and Benefits

<Insert Picture Here> Oracle Policy Automation 10.0 Features and Benefits Oracle Policy Automation 10.0 Features and Benefits December 2009 The following is intended to outline our general product direction. It is intended for information purposes only,

More information

Optimal Shift-Reduce Constituent Parsing with Structured Perceptron

Optimal Shift-Reduce Constituent Parsing with Structured Perceptron Optimal Shift-Reduce Constituent Parsing with Structured Perceptron Le Quang Thang Hanoi University of Science and Technology {lelightwin@gmail.com} Hiroshi Noji and Yusuke Miyao National Institute of

More information

Utilizing Dependency Language Models for Graph-based Dependency Parsing Models

Utilizing Dependency Language Models for Graph-based Dependency Parsing Models Utilizing Dependency Language Models for Graph-based Dependency Parsing Models Wenliang Chen, Min Zhang, and Haizhou Li Human Language Technology, Institute for Infocomm Research, Singapore {wechen, mzhang,

More information

MaltOptimizer: A System for MaltParser Optimization

MaltOptimizer: A System for MaltParser Optimization MaltOptimizer: A System for MaltParser Optimization Miguel Ballesteros Joakim Nivre Universidad Complutense de Madrid, Spain miballes@fdi.ucm.es Uppsala University, Sweden joakim.nivre@lingfil.uu.se Abstract

More information

System Combination Using Joint, Binarised Feature Vectors

System Combination Using Joint, Binarised Feature Vectors System Combination Using Joint, Binarised Feature Vectors Christian F EDERMAN N 1 (1) DFKI GmbH, Language Technology Lab, Stuhlsatzenhausweg 3, D-6613 Saarbrücken, GERMANY cfedermann@dfki.de Abstract We

More information

IAB European Mobile Optimised site study UK, France, Germany, Italy and Spain. February iabuk.net/contact

IAB European Mobile Optimised site study UK, France, Germany, Italy and Spain. February iabuk.net/contact IAB European Mobile Optimised site study UK, France, Germany, Italy and Spain February 2014 iabuk.net/contact 23 April 2014 Background 1. Study was first conducted in August 2012 (UK Only) 2. The first

More information

Generalized Higher-Order Dependency Parsing with Cube Pruning

Generalized Higher-Order Dependency Parsing with Cube Pruning Generalized Higher-Order Dependency Parsing with Cube Pruning Hao Zhang Ryan McDonald Google, Inc. {haozhang,ryanmcd}@google.com Abstract State-of-the-art graph-based parsers use features over higher-order

More information

Supplementary A. Built-in transition systems

Supplementary A. Built-in transition systems L. Aufrant, G. Wisniewski PanParser (Supplementary) Supplementary A. Built-in transition systems In the following, we document all transition systems built in PanParser, along with their cost. s and s

More information

Online Graph Planarisation for Synchronous Parsing of Semantic and Syntactic Dependencies

Online Graph Planarisation for Synchronous Parsing of Semantic and Syntactic Dependencies Online Graph Planarisation for Synchronous Parsing of Semantic and Syntactic Dependencies Ivan Titov University of Illinois at Urbana-Champaign James Henderson, Paola Merlo, Gabriele Musillo University

More information

Treex: Modular NLP Framework

Treex: Modular NLP Framework : Modular NLP Framework Martin Popel ÚFAL (Institute of Formal and Applied Linguistics) Charles University in Prague September 2015, Prague, MT Marathon Outline Motivation, vs. architecture internals Future

More information

Using Search-Logs to Improve Query Tagging

Using Search-Logs to Improve Query Tagging Using Search-Logs to Improve Query Tagging Kuzman Ganchev Keith Hall Ryan McDonald Slav Petrov Google, Inc. {kuzman kbhall ryanmcd slav}@google.com Abstract Syntactic analysis of search queries is important

More information

Alignment Link Projection Using Transformation-Based Learning

Alignment Link Projection Using Transformation-Based Learning Alignment Link Projection Using Transformation-Based Learning Necip Fazil Ayan, Bonnie J. Dorr and Christof Monz Department of Computer Science University of Maryland College Park, MD 20742 {nfa,bonnie,christof}@umiacs.umd.edu

More information

Task 3 Patient-Centred Information Retrieval: Team CUNI

Task 3 Patient-Centred Information Retrieval: Team CUNI Task 3 Patient-Centred Information Retrieval: Team CUNI Shadi Saleh and Pavel Pecina Charles University Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics, Czech Republic {saleh,pecina}@ufal.mff.cuni.cz

More information

The HDU Discriminative SMT System for Constrained Data PatentMT at NTCIR10

The HDU Discriminative SMT System for Constrained Data PatentMT at NTCIR10 The HDU Discriminative SMT System for Constrained Data PatentMT at NTCIR10 Patrick Simianer, Gesa Stupperich, Laura Jehl, Katharina Wäschle, Artem Sokolov, Stefan Riezler Institute for Computational Linguistics,

More information

Iterative CKY parsing for Probabilistic Context-Free Grammars

Iterative CKY parsing for Probabilistic Context-Free Grammars Iterative CKY parsing for Probabilistic Context-Free Grammars Yoshimasa Tsuruoka and Jun ichi Tsujii Department of Computer Science, University of Tokyo Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033 CREST, JST

More information

Bidirectional Transition-Based Dependency Parsing

Bidirectional Transition-Based Dependency Parsing Bidirectional Transition-Based Dependency Parsing Yunzhe Yuan, Yong Jiang, Kewei Tu School of Information Science and Technology ShanghaiTech University {yuanyzh,jiangyong,tukw}@shanghaitech.edu.cn Abstract

More information

Noisy Or-based model for Relation Extraction using Distant Supervision

Noisy Or-based model for Relation Extraction using Distant Supervision Noisy Or-based model for Relation Extraction using Distant Supervision Ajay Nagesh 1,2,3 1 IITB-Monash Research Academy ajaynagesh@cse.iitb.ac.in Gholamreza Haffari 2 Faculty of IT, Monash University gholamreza.haffari@monash.edu

More information

Dependency Parsing with Undirected Graphs

Dependency Parsing with Undirected Graphs Dependency Parsing with Undirected Graphs Carlos Gómez-Rodríguez Departamento de Computación Universidade da Coruña Campus de Elviña, 15071 A Coruña, Spain carlos.gomez@udc.es Daniel Fernández-González

More information

Optimal Incremental Parsing via Best-First Dynamic Programming

Optimal Incremental Parsing via Best-First Dynamic Programming Optimal Incremental Parsing via Best-First Dynamic Programming Kai Zhao 1 James Cross 1 1 Graduate Center City University of New York 365 Fifth Avenue, New York, NY 10016 {kzhao,jcross}@gc.cuny.edu Liang

More information

Bootstrapping via Graph Propagation

Bootstrapping via Graph Propagation Bootstrapping via Graph Propagation Max Whitney and Anoop Sarkar Simon Fraser University, School of Computing Science Burnaby, BC V5A 1S6, Canada {mwhitney,anoop}@sfu.ca Abstract Bootstrapping a classifier

More information

Professional, quality services

Professional, quality services Professional, quality services 001 Translations offers you the expertise that it has gathered in doing business internationally for more than 20 years. International coverage In-house translation of 10

More information

arxiv: v4 [cs.cl] 7 Jun 2016

arxiv: v4 [cs.cl] 7 Jun 2016 Edge-Linear First-Order Dependency Parsing with Undirected Minimum Spanning Tree Inference Effi Levi 1 Roi Reichart 2 Ari Rappoport 1 1 Institute of Computer Science, The Hebrew Univeristy 2 Faculty of

More information