The MetaGrammar Compiler: An NLP Application with a Multi-paradigm Architecture

1 The MetaGrammar Compiler: An NLP Application with a Multi-paradigm Architecture
Denys Duchier, Joseph Le Roux, Yannick Parmentier
LORIA, Campus Scientifique, BP 239, F Vandœuvre-lès-Nancy, France
MOZ Conference, 7-8 October 2004

4 Developing large and practical grammars
- Grammars to build realistic NLP applications
- Lexicalized tree grammars
  [Figure: alternative syntax trees anchored by "eats", e.g. "John eats the apple" and "(John) who eats the apple"]
- The lexicon maps each word to a set of alternative syntax trees

5 Structural redundancy affects maintenance and extensibility
[Figure: trees for "John eats the apple" and "The apple is eaten by John"]
- Share and modularize information in the lexicon
- Automatically generate the lexicon from descriptions

6 Sharing information
- We consider tree fragments
  [Figure: example tree fragments]
- Intuition: combination of these fragments to obtain trees
- The combination process must be based on linguistic observations:
  - Regularities among verb trees (e.g. number of arguments)
  - Realizations of verbal arguments (e.g. canonical, extracted, etc.)

7 Two axes of grammar description (part 1)
Tree structure sharing (fragment combinations)
1. John sleeps: CanonicalSubject + ActiveVerb = IntransitiveVerb
2. John eats the apple: CanonicalSubject + ActiveVerb + CanonicalObject = TransitiveVerb
[Figure: the tree fragments of each class and the trees obtained by combining them]

8 Two axes of grammar description (part 2)
Alternative choices (e.g. to express paraphrases)
- CanonicalObject: John eats the apple.
- WhObject: Which apple does John eat?
- ...
Object = CanonicalObject or WhObject or ...

9 Our approach
- Tree fragments are represented by logic formulæ of a tree description language
  [Figure: a tree fragment with root S and children V and N, described by $(S \to V) \land (S \to N) \land (V \prec N)$]
- Encapsulation of the tree fragments into classes:
  1. Structure sharing through multiple inheritance
  2. Alternative choices through explicit disjunctions
- We obtain a DAG of classes: the metagrammar (MG)

10 1. Structure sharing
[Figure: class hierarchy with VerbalMorphology, Active and Passive]

11 1. Structure sharing
[Figure: the same hierarchy extended with Canonical_Subject and Canonical_Act]

12 2. Alternative choice
[Figure: class hierarchy with Canonical_Subject, Extracted_Subject, Active, Canonical_Act and Extracted_Act]
Intransitive_Act: Canonical_Act OR Extracted_Act

14 Content of a class
- Several dimensions:
  - Syntactic dimension: tree descriptions
  - Semantic dimension: predicate logic
  - ...
- These dimensions share identifiers (logic variables)
- In the syntactic dimension, an identifier can refer to a node or a feature
- In the semantic dimension, an identifier refers to either a predicate or an argument

15 Identifier scope
- Default scope of an identifier: the class
- How to reuse information contained in a class?
  - We use a mechanism to import and export identifiers
  - Exported identifiers can be renamed
  - This mechanism prevents name conflicts
- An object-oriented way of handling identifier scope
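
The import/export mechanism described on this slide can be illustrated with a minimal Python sketch. Everything here is hypothetical and not taken from the compiler: the `MGClass` name, its fields, and the `import_from` method only model the idea that identifiers are local to a class unless exported, and that an importer may rename them to avoid clashes.

```python
class MGClass:
    """Hypothetical model of a metagrammar class and its identifier scope."""

    def __init__(self, name, identifiers, exports=()):
        self.name = name
        self.identifiers = dict(identifiers)  # identifier -> value (e.g. a node)
        self.exports = set(exports)           # identifiers visible to importers

    def import_from(self, other, rename=None):
        """Copy the identifiers exported by `other` into this class's scope.

        `rename` maps an exported name to the local name it takes here.
        A clash with an existing local identifier is reported as an error.
        """
        rename = rename or {}
        for ident in sorted(other.exports):
            local = rename.get(ident, ident)
            if local in self.identifiers:
                raise NameError(f"{self.name}: identifier {local!r} already defined")
            self.identifiers[local] = other.identifiers[ident]


# Two classes both export an identifier called N; Tmp stays private.
subject = MGClass("CanonicalSubject", {"N": "subject-node", "Tmp": "scratch"}, exports=["N"])
obj = MGClass("CanonicalObject", {"N": "object-node"}, exports=["N"])

verb = MGClass("TransitiveVerb", {})
verb.import_from(subject, rename={"N": "Subj"})
verb.import_from(obj, rename={"N": "Obj"})
```

Without the renaming, the second import would raise `NameError`, which is the name conflict the slide says the mechanism prevents.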

16 Tree descriptions
[Figure: the Intransitive Verb fragment, a tree with root S and children N and V, with semantics P(A)]

    class IntransitiveVerb
    <syn> {
      node S[cat=s];
      node V[cat=v,pred=P];
      node N[cat=n,arg0=A];
      S -> V; S -> N; N << V
    }
    <sem> { P(A) }

18 From MG to logic programs

    MG                                  Logic program
    Class = Name and Linguistic Info    Clause ::= Name → Goal
    Info  = Tree fragment               Goal   ::= Description
          | Class call                           | Name
          | Alternation                          | Goal ∨ Goal
          | Combination                          | Goal ∧ Goal
    Valuated classes                    Queries

Lexicon generation as the execution of the MG/program
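
To make the reading of a metagrammar as a logic program concrete, here is a small Python sketch of an evaluator for the four goal forms. It is an illustration under assumptions, not the compiler's implementation: goals are encoded as tuples, fragment labels are made up, and the clause bodies for Canonical_Act and Extracted_Act are guessed from the class names.

```python
def solve(goal, program):
    """Yield every list of descriptions that executing `goal` can accumulate."""
    kind = goal[0]
    if kind == "desc":      # Description: contribute one tree fragment
        yield [goal[1]]
    elif kind == "call":    # Name: run the clause of the named class
        yield from solve(program[goal[1]], program)
    elif kind == "or":      # Goal ∨ Goal: alternation, try both branches
        yield from solve(goal[1], program)
        yield from solve(goal[2], program)
    elif kind == "and":     # Goal ∧ Goal: combination, accumulate both sides
        for left in solve(goal[1], program):
            for right in solve(goal[2], program):
                yield left + right
    else:
        raise ValueError(f"unknown goal form: {kind!r}")


# Hypothetical program; the fragment labels stand in for tree descriptions.
PROGRAM = {
    "Canonical_Subject": ("desc", "canonical-subject-fragment"),
    "Extracted_Subject": ("desc", "extracted-subject-fragment"),
    "Active": ("desc", "active-verb-fragment"),
    "Canonical_Act": ("and", ("call", "Canonical_Subject"), ("call", "Active")),
    "Extracted_Act": ("and", ("call", "Extracted_Subject"), ("call", "Active")),
    "Intransitive_Act": ("or", ("call", "Canonical_Act"), ("call", "Extracted_Act")),
}

# A query on a valuated class enumerates one accumulated description per alternative.
alternatives = list(solve(("call", "Intransitive_Act"), PROGRAM))
```

Each element of `alternatives` is the set of fragments that one tree of the lexicon would be built from; a real system would then hand each accumulated description to the solver.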

19 DCGs
Starting with words as descriptions, i.e. with a CF-grammar:

    S  -> NP VP
    VP -> V NP
    NP -> the N
    V  -> chased
    N  -> rabbit | wolf

The following Prolog program recognizes the same language:

    s(In,Out) :- np(In,Mid), vp(Mid,Out).
    ...
    v(In,Out) :- term(chased,In,Out).
    term(X,[X|L],L).

(In-Out is called the accumulator)
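
The threading of the In-Out accumulator can also be sketched outside Prolog. The Python version below is only an analogy: each recognizer takes the remaining tokens (In) and yields every possible remainder (Out), and generators stand in for Prolog backtracking. The helper names `term`, `seq` and `alt` are chosen for this example.

```python
def term(word):
    """Recognize one word: consume it from the front of the token tuple."""
    def parse(tokens):
        if tokens and tokens[0] == word:
            yield tokens[1:]
    return parse


def seq(*parsers):
    """Run parsers one after another, threading the remainder through them."""
    def parse(tokens):
        if not parsers:
            yield tokens
            return
        first, rest = parsers[0], seq(*parsers[1:])
        for mid in first(tokens):
            yield from rest(mid)
    return parse


def alt(*parsers):
    """Try each alternative on the same input."""
    def parse(tokens):
        for parser in parsers:
            yield from parser(tokens)
    return parse


# The grammar of the slide.
n = alt(term("rabbit"), term("wolf"))
v = term("chased")
np = seq(term("the"), n)
vp = seq(v, np)
s = seq(np, vp)


def recognizes(sentence):
    """True if some derivation of S consumes the whole sentence."""
    tokens = tuple(sentence.split())
    return any(rest == () for rest in s(tokens))
```

For instance, `recognizes("the wolf chased the rabbit")` holds, while a sentence missing its object is rejected.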

20 EDCGs
Extended DCGs [Van Roy, 1990]:
- Generalized accumulation (logic or algebraic operation)
- Multiple accumulators
In our case:
- 2 dimensions: Syntax and Semantics
- Accumulation of tree descriptions and logic formulæ with unification

22 Step 1: Compilation
Translate the concrete syntax into symbolic code:
1. MG tokenization and parsing: GUMP
2. Intermediate syntax tree checking: pre-compilation warnings / failures
3. Compilation of the intermediate code into symbolic code (records)

23 Step 2: An object-oriented virtual machine
- Execution of the symbolic code by a specific virtual machine (VM)
- A standard logic programming kernel (inspired by the Warren Abstract Machine)
- An object-oriented VM: each record of the code corresponds to a method (easily extensible)
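
The idea that each record of the symbolic code corresponds to a method can be sketched as follows. This is a simplified Python illustration, not the Oz implementation: tuples stand in for records, the instruction names (`node`, `dom`, `prec`, `sem`, `polarity`) are invented for the example, and the logic kernel (choice points, unification) is left out entirely.

```python
class TinyVM:
    """Dispatch each instruction to the method named after its label."""

    def __init__(self):
        self.syn = []  # accumulator of the syntactic dimension
        self.sem = []  # accumulator of the semantic dimension

    def run(self, code):
        for label, *args in code:
            handler = getattr(self, "exec_" + label, None)
            if handler is None:
                raise ValueError(f"unknown instruction: {label}")
            handler(*args)
        # Snapshot of the accumulators once the code has been executed.
        return {"syn": list(self.syn), "sem": list(self.sem)}

    def exec_node(self, name, cat):
        self.syn.append(("node", name, cat))

    def exec_dom(self, parent, child):
        self.syn.append(("dom", parent, child))

    def exec_prec(self, left, right):
        self.syn.append(("prec", left, right))

    def exec_sem(self, pred, arg):
        self.sem.append((pred, arg))


class PolarityVM(TinyVM):
    """Extending the VM means adding a method, e.g. for a polarized feature."""

    def exec_polarity(self, name, feature, sign):
        self.syn.append(("polarity", name, feature, sign))


# Hypothetical symbolic code for the IntransitiveVerb class.
CODE = [
    ("node", "S", "s"), ("node", "V", "v"), ("node", "N", "n"),
    ("dom", "S", "V"), ("dom", "S", "N"), ("prec", "N", "V"),
    ("sem", "P", "A"),
]
snapshot = TinyVM().run(CODE)
```

Supporting a new instruction only requires a subclass with one more `exec_` method, which is the sense in which such a design is easy to extend.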

24 Why a new VM?
1. Easier to extend:
   - non-standard data types (e.g. open feature structures or nodes)
   - non-standard unification (e.g. polarities in Interaction Grammars)
2. Output of the VM: snapshot of the accumulators
   - semantic dimension: accumulated formula
   - syntactic dimension: accumulated tree description
Computing models of a tree description: constraint solver

25 Step 3: A constraint-based tree description solver
- Computing all minimal models of a tree description
- Dominance constraint solver based on a set constraint approach [Duchier - Niehren, 2000]
- In a model, the position of a node $x$ is given by the values of 5 set variables: $Eq_x$, $Up_x$, $Down_x$, $Left_x$, $Right_x$
  [Figure: the nodes of a model split around $x$ into the regions Up, Left, Eq, Right and Down]
  $x \triangleleft^{+} y \equiv [\, EqUp_x \subseteq Up_y \land Down_x \supseteq EqDown_y \land Left_x \subseteq Left_y \land Right_x \subseteq Right_y \,]$
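
The five set variables can be illustrated on a concrete tree. The Python sketch below does not solve constraints. It only computes the five sets for an already built ordered tree and checks the set inclusions given for dominance, assuming that EqUp and EqDown abbreviate the unions Eq ∪ Up and Eq ∪ Down. The tree and the function names are made up for the example.

```python
def relations(children, root):
    """Return, for each node x of an ordered tree, its five sets."""
    parent = {c: p for p, cs in children.items() for c in cs}
    nodes = {root} | set(parent)

    def down(x):
        # All proper descendants of x.
        result = set()
        for child in children.get(x, []):
            result |= {child} | down(child)
        return result

    rel = {}
    for x in nodes:
        up, left, right = set(), set(), set()
        node = x
        # Walk up to the root: earlier siblings (with their subtrees) are to
        # the left of x, later siblings are to the right, parents are above.
        while node in parent:
            siblings = children[parent[node]]
            i = siblings.index(node)
            for sib in siblings[:i]:
                left |= {sib} | down(sib)
            for sib in siblings[i + 1:]:
                right |= {sib} | down(sib)
            node = parent[node]
            up.add(node)
        rel[x] = {"Eq": {x}, "Up": up, "Down": down(x), "Left": left, "Right": right}
    return rel


def strictly_dominates(rel, x, y):
    """Check the four set inclusions that encode 'x strictly dominates y'."""
    rx, ry = rel[x], rel[y]
    return (
        (rx["Eq"] | rx["Up"]) <= ry["Up"]
        and rx["Down"] >= (ry["Eq"] | ry["Down"])
        and rx["Left"] <= ry["Left"]
        and rx["Right"] <= ry["Right"]
    )


# Hypothetical tree: S over N0 and VP, VP over V and N1.
TREE = {"S": ["N0", "VP"], "VP": ["V", "N1"]}
REL = relations(TREE, "S")
```

For every node, the five sets are disjoint and together cover all nodes of the tree, which is what lets the position of a node be described by set constraints alone.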

26 Colors
- Precise control on fragment combinations
- Each node is colored with a color in {Black, Red, White}
- Restrictions on how colored nodes can be combined:
  - a red node does not combine with another node
  - a black node combines only with 0 or more white nodes
  - a white node must combine with a black node

27 A node in the model = a set of nodes in the description:
- a singleton (red nodes)
- a set composed of 1 black node and 0 or more white nodes
This set contains only one non-white node; we introduce a variable $RB_x$.
Additional constraints:
  $x \in V_r \Rightarrow RB_x = x \land Eq_x = \{x\}$
  $x \in V_b \Rightarrow RB_x = x$
  $x \in V_w \Rightarrow RB_x \in V_b$
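
The combination rules for colors amount to a simple check on the group of description nodes that are identified in one model node. The Python function below is a sketch of that check only. The name `merged_color` and the string encoding of colors are assumptions, and the actual solver expresses the rules as set constraints rather than as a test after the fact.

```python
def merged_color(colors):
    """Return the color of a model node built from description nodes with the
    given colors, or None if the group violates the combination rules."""
    for color in colors:
        if color not in ("black", "red", "white"):
            raise ValueError(f"unknown color: {color!r}")
    reds = colors.count("red")
    blacks = colors.count("black")
    if reds == 1 and len(colors) == 1:
        return "red"    # a red node stands alone
    if reds == 0 and blacks == 1:
        return "black"  # one black node plus zero or more white nodes
    return None         # e.g. only white nodes, two black nodes, red with others
```

A group made only of white nodes is rejected, which reflects the rule that a white node must combine with a black one.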

28 Output of the solver
- trees printed in an XML file
- a QTk GUI

30 First results
- Device adaptable to grammars based on tree descriptions (TAG and IG for now)
- The compiler has been used by linguists to generate a large-scale Lexicalized TAG (more than 3,000 trees produced from 175 classes)
- Automatically generated lexicons used for parsing (TAG: LORIA LTAG Parser2, DyALog system; IG: LEOPAR)

31 To sum up
A multi-paradigm design at two levels:
- User level: mixing object-oriented (MG architecture) and logic programming (unification mechanisms)
- Internal level: VM implemented in an OO way + constraint programming
Current work:
- modularization of the constraint solver
- extension to other formalisms (e.g. XDG)


More information

Anatomy of a Compiler. Overview of Semantic Analysis. The Compiler So Far. Why a Separate Semantic Analysis?

Anatomy of a Compiler. Overview of Semantic Analysis. The Compiler So Far. Why a Separate Semantic Analysis? Anatomy of a Compiler Program (character stream) Lexical Analyzer (Scanner) Syntax Analyzer (Parser) Semantic Analysis Parse Tree Intermediate Code Generator Intermediate Code Optimizer Code Generator

More information

Language Translation, History. CS152. Chris Pollett. Sep. 3, 2008.

Language Translation, History. CS152. Chris Pollett. Sep. 3, 2008. Language Translation, History. CS152. Chris Pollett. Sep. 3, 2008. Outline. Language Definition, Translation. History of Programming Languages. Language Definition. There are several different ways one

More information

Formal Languages and Compilers Lecture I: Introduction to Compilers

Formal Languages and Compilers Lecture I: Introduction to Compilers Formal Languages and Compilers Lecture I: Introduction to Compilers Free University of Bozen-Bolzano Faculty of Computer Science POS Building, Room: 2.03 artale@inf.unibz.it http://www.inf.unibz.it/ artale/

More information

Question Bank. 10CS63:Compiler Design

Question Bank. 10CS63:Compiler Design Question Bank 10CS63:Compiler Design 1.Determine whether the following regular expressions define the same language? (ab)* and a*b* 2.List the properties of an operator grammar 3. Is macro processing a

More information

Introduction to Parsing. Lecture 8

Introduction to Parsing. Lecture 8 Introduction to Parsing Lecture 8 Adapted from slides by G. Necula Outline Limitations of regular languages Parser overview Context-free grammars (CFG s) Derivations Languages and Automata Formal languages

More information

Defining Program Syntax. Chapter Two Modern Programming Languages, 2nd ed. 1

Defining Program Syntax. Chapter Two Modern Programming Languages, 2nd ed. 1 Defining Program Syntax Chapter Two Modern Programming Languages, 2nd ed. 1 Syntax And Semantics Programming language syntax: how programs look, their form and structure Syntax is defined using a kind

More information

Compiler Construction

Compiler Construction Compiler Construction Thomas Noll Software Modeling and Verification Group RWTH Aachen University https://moves.rwth-aachen.de/teaching/ss-16/cc/ Conceptual Structure of a Compiler Source code x1 := y2

More information

10/4/18. Lexical and Syntactic Analysis. Lexical and Syntax Analysis. Tokenizing Source. Scanner. Reasons to Separate Lexical and Syntactic Analysis

10/4/18. Lexical and Syntactic Analysis. Lexical and Syntax Analysis. Tokenizing Source. Scanner. Reasons to Separate Lexical and Syntactic Analysis Lexical and Syntactic Analysis Lexical and Syntax Analysis In Text: Chapter 4 Two steps to discover the syntactic structure of a program Lexical analysis (Scanner): to read the input characters and output

More information

COMP 181. Agenda. Midterm topics. Today: type checking. Purpose of types. Type errors. Type checking

COMP 181. Agenda. Midterm topics. Today: type checking. Purpose of types. Type errors. Type checking Agenda COMP 181 Type checking October 21, 2009 Next week OOPSLA: Object-oriented Programming Systems Languages and Applications One of the top PL conferences Monday (Oct 26 th ) In-class midterm Review

More information

II. Language Processing ystem skeletal source program preprocessor source program compiler target object assembly program assembler relocatable machin

II. Language Processing ystem skeletal source program preprocessor source program compiler target object assembly program assembler relocatable machin CP 140 - Mathematical Foundations of C Dr.. Rodger ection: The tructure of a Compiler 1.1 What is a Compiler I. Translator Deænition: program in translator program in language! for! language X X Y Examples:

More information

Ortolang Tools : MarsaTag

Ortolang Tools : MarsaTag Ortolang Tools : MarsaTag Stéphane Rauzy, Philippe Blache, Grégoire de Montcheuil SECOND VARIAMU WORKSHOP LPL, Aix-en-Provence August 20th & 21st, 2014 ORTOLANG received a State aid under the «Investissements

More information

CSE6390E PROJECT REPORT HALUK MADENCIOGLU. April 20, 2010

CSE6390E PROJECT REPORT HALUK MADENCIOGLU. April 20, 2010 CSE6390E PROJECT REPORT HALUK MADENCIOGLU April 20, 2010 This report describes the design and implementation of a system of programs to deal with parsing natural language, building semantic structures

More information

Compiler Design. Dr. Chengwei Lei CEECS California State University, Bakersfield

Compiler Design. Dr. Chengwei Lei CEECS California State University, Bakersfield Compiler Design Dr. Chengwei Lei CEECS California State University, Bakersfield The course Instructor: Dr. Chengwei Lei Office: Science III 339 Office Hours: M/T/W 1:00-1:59 PM, or by appointment Phone:

More information

Science of Computer Programming. Aspect-oriented model-driven skeleton code generation: A graph-based transformation approach

Science of Computer Programming. Aspect-oriented model-driven skeleton code generation: A graph-based transformation approach Science of Computer Programming 75 (2010) 689 725 Contents lists available at ScienceDirect Science of Computer Programming journal homepage: www.elsevier.com/locate/scico Aspect-oriented model-driven

More information

Semantics as a Foreign Language. Gabriel Stanovsky and Ido Dagan EMNLP 2018

Semantics as a Foreign Language. Gabriel Stanovsky and Ido Dagan EMNLP 2018 Semantics as a Foreign Language Gabriel Stanovsky and Ido Dagan EMNLP 2018 Semantic Dependency Parsing (SDP) A collection of three semantic formalisms (Oepen et al., 2014;2015) Semantic Dependency Parsing

More information

Elementary Operations, Clausal Architecture, and Verb Movement

Elementary Operations, Clausal Architecture, and Verb Movement Introduction to Transformational Grammar, LINGUIST 601 October 3, 2006 Elementary Operations, Clausal Architecture, and Verb Movement 1 Elementary Operations This discussion is based on?:49-52. 1.1 Merge

More information

COMP4418 Knowledge Representation and Reasoning

COMP4418 Knowledge Representation and Reasoning COMP4418 Knowledge Representation and Reasoning Week 3 Practical Reasoning David Rajaratnam Click to edit Present s Name Practical Reasoning - My Interests Cognitive Robotics. Connect high level cognition

More information

Error Recovery during Top-Down Parsing: Acceptable-sets derived from continuation

Error Recovery during Top-Down Parsing: Acceptable-sets derived from continuation 2015 http://excel.fit.vutbr.cz Error Recovery during Top-Down Parsing: Acceptable-sets derived from continuation Alena Obluková* Abstract Parser is one of the most important parts of compiler. Syntax-Directed

More information

SYLLABUS UNIT - I UNIT - II UNIT - III UNIT - IV CHAPTER - 1 : INTRODUCTION CHAPTER - 4 : SYNTAX AX-DIRECTED TRANSLATION TION CHAPTER - 7 : STORA

SYLLABUS UNIT - I UNIT - II UNIT - III UNIT - IV CHAPTER - 1 : INTRODUCTION CHAPTER - 4 : SYNTAX AX-DIRECTED TRANSLATION TION CHAPTER - 7 : STORA Contents i SYLLABUS UNIT - I CHAPTER - 1 : INTRODUCTION Programs Related to Compilers. Translation Process, Major Data Structures, Other Issues in Compiler Structure, Boot Strapping and Porting. CHAPTER

More information

R13 SET Discuss how producer-consumer problem and Dining philosopher s problem are solved using concurrency in ADA.

R13 SET Discuss how producer-consumer problem and Dining philosopher s problem are solved using concurrency in ADA. R13 SET - 1 III B. Tech I Semester Regular Examinations, November - 2015 1 a) What constitutes a programming environment? [3M] b) What mixed-mode assignments are allowed in C and Java? [4M] c) What is

More information

Text Mining for Software Engineering

Text Mining for Software Engineering Text Mining for Software Engineering Faculty of Informatics Institute for Program Structures and Data Organization (IPD) Universität Karlsruhe (TH), Germany Department of Computer Science and Software

More information

A Test Environment for Natural Language Understanding Systems

A Test Environment for Natural Language Understanding Systems A Test Environment for Natural Language Understanding Systems Li Li, Deborah A. Dahl, Lewis M. Norton, Marcia C. Linebarger, Dongdong Chen Unisys Corporation 2476 Swedesford Road Malvern, PA 19355, U.S.A.

More information