SemTAG - a platform for Semantic Construction with Tree Adjoining Grammars

Similar documents
The MetaGrammar Compiler: An NLP Application with a Multi-paradigm Architecture

How to Build Argumental graphs Using TAG Shared Forest : a view from control verbs problematic

Scope and Situation Binding in LTAG using Semantic Unification

Practical aspects in compiling tabular TAG parsers

Practical aspects in compiling tabular TAG parsers

TuLiPA: A Syntax-Semantics Parsing Environment for Mildly Context-Sensitive Formalisms

Syntax-semantics interface and the non-trivial computation of meaning

2 Ambiguity in Analyses of Idiomatic Phrases

Proseminar on Semantic Theory Fall 2013 Ling 720 An Algebraic Perspective on the Syntax of First Order Logic (Without Quantification) 1

An Efficient Implementation of PATR for Categorial Unification Grammar

1. true / false By a compiler we mean a program that translates to code that will run natively on some machine.

COMPUTATIONAL SEMANTICS WITH FUNCTIONAL PROGRAMMING JAN VAN EIJCK AND CHRISTINA UNGER. lg Cambridge UNIVERSITY PRESS

11-682: Introduction to IR, NLP, MT and Speech. Parsing with Unification Grammars. Reading: Jurafsky and Martin, Speech and Language Processing

What needs to be kept track of ffl The current substitution ffl The current goal ffl The clause currently in use ffl Backtrack Information 2

More Theories, Formal semantics

Dependency Grammar as Graph Description

Time : 1 Hour Max Marks : 30

Lecture 5: Declarative Programming. The Declarative Kernel Language Machine. September 12th, 2011

LOGIC AND DISCRETE MATHEMATICS

About the Authors... iii Introduction... xvii. Chapter 1: System Software... 1

Compiling Techniques

Computational Linguistics: Feature Agreement

CSE450 Translation of Programming Languages. Lecture 4: Syntax Analysis

sbp: A Scannerless Boolean Parser

Towards Reusable NLP Components

Syntax and Grammars 1 / 21

Intermediate Code Generation

4. An interpreter is a program that

Lecture Outline. COOL operational semantics. Operational Semantics of Cool. Motivation. Lecture 13. Notation. The rules. Evaluation Rules So Far

10/18/18. Outline. Semantic Analysis. Two types of semantic rules. Syntax vs. Semantics. Static Semantics. Static Semantics.

Unification in Unification-based Grammar

LECTURE NOTES ON COMPILER DESIGN P a g e 2

COMP 181 Compilers. Administrative. Last time. Prelude. Compilation strategy. Translation strategy. Lecture 2 Overview

Typeful Ontologies with Direct Multilingual Verbalization

Torben./Egidius Mogensen. Introduction. to Compiler Design. ^ Springer

Context-Free Grammars

11. a b c d e. 12. a b c d e. 13. a b c d e. 14. a b c d e. 15. a b c d e

CMSC 330: Organization of Programming Languages. Formal Semantics of a Prog. Lang. Specifying Syntax, Semantics

Generating XTAG Parsers from Algebraic Specifications

Front End. Hwansoo Han

Operational Semantics of Cool

Compilers. Lecture 2 Overview. (original slides by Sam

Grammar Knowledge Transfer for Building RMRSs over Dependency Parses in Bulgarian

Compiler Theory. (Semantic Analysis and Run-Time Environments)

Lexical entries & clauses

COP 3402 Systems Software Syntax Analysis (Parser)

MIT Specifying Languages with Regular Expressions and Context-Free Grammars

MIT Specifying Languages with Regular Expressions and Context-Free Grammars. Martin Rinard Massachusetts Institute of Technology

CA Compiler Construction

COP5621 Exam 4 - Spring 2005

Computational Linguistics: Syntax-Semantics

Specifying Syntax. An English Grammar. Components of a Grammar. Language Specification. Types of Grammars. 1. Terminal symbols or terminals, Σ

Scannerless Boolean Parsing

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer. Compiler Design

Formally Certified Satisfiability Solving

5. Semantic Analysis!

A Solver-Independent Platform for Modeling Constrained Objects Involving Discrete and Continuous Domains

Semantics of programming languages

Operational Semantics. One-Slide Summary. Lecture Outline

Compiler Code Generation COMP360

Compiling Regular Expressions COMP360

A high-level approach to language description

Semantics and Pragmatics of NLP Propositional Logic, Predicates and Functions

Programming Language Concepts, cs2104 Lecture 04 ( )

The CKY algorithm part 1: Recognition

Context-Free Grammars. Carl Pollard Ohio State University. Linguistics 680 Formal Foundations Tuesday, November 10, 2009

A Simple Syntax-Directed Translator

LING 130: Quantified Noun Phrases

Ling 571: Deep Processing for Natural Language Processing

Principles of Programming Languages [PLP-2015] Detailed Syllabus

Solving Natural Language Math Problems

5. Semantic Analysis. Mircea Lungu Oscar Nierstrasz

Using an Enriched TAG Derivation Structure as Basis for Semantics

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING Subject Name: CS2352 Principles of Compiler Design Year/Sem : III/VI

COSE312: Compilers. Lecture 1 Overview of Compilers

CS 415 Midterm Exam Spring 2002

Context-free Grammars

INCORPORATING ADVANCED PROGRAMMING TECHNIQUES IN THE COMPUTER INFORMATION SYSTEMS CURRICULUM

Semantic analysis and intermediate representations. Which methods / formalisms are used in the various phases during the analysis?

Semantic Analysis and Type Checking

Course Overview. Introduction (Chapter 1) Compiler Frontend: Today. Compiler Backend:

A Development Environment for Large Natural Language Grammars

The Compiler So Far. CSC 4181 Compiler Construction. Semantic Analysis. Beyond Syntax. Goals of a Semantic Analyzer.

Evaluation of Predicate Calculus By Arve Meisingset, retired research scientist from Telenor Research Oslo Norway

Delimited Continuations, Applicative Functors and Natural Language Semantics

Lecture Outline. COOL operational semantics. Operational Semantics of Cool. Motivation. Notation. The rules. Evaluation Rules So Far.

CMSC 331 Final Exam Section 0201 December 18, 2000

CS606- compiler instruction Solved MCQS From Midterm Papers

Writing Evaluators MIF08. Laure Gonnord

Introduction. Compiler Design CSE Overview. 2 Syntax-Directed Translation. 3 Phases of Translation

1 Lexical Considerations

MIDTERM EXAM (Solutions)

Logic and Natural Language Semantics: Formal Semantics

Principles of Programming Languages COMP251: Syntax and Grammars

Boolean Representations and Combinatorial Equivalence

Outline. Programming Languages 1/16/18 PROGRAMMING LANGUAGE FOUNDATIONS AND HISTORY. Current

CS 415 Midterm Exam Spring SOLUTION

Anatomy of a Compiler. Overview of Semantic Analysis. The Compiler So Far. Why a Separate Semantic Analysis?

RYERSON POLYTECHNIC UNIVERSITY DEPARTMENT OF MATH, PHYSICS, AND COMPUTER SCIENCE CPS 710 FINAL EXAM FALL 96 INSTRUCTIONS

AUTOMATIC LFG GENERATION

Transcription:

SemTAG - a platform for Semantic Construction with Tree Adjoining Grammars Yannick Parmentier parmenti@loria.fr Langue Et Dialogue Project LORIA Nancy Universities France Emmy Noether Project SFB 441 Tübingen 18 January 2007 1/ 57

Introduction No clear consensus about a syntax / semantics interface for TAG No wide-coverage TAG for French including syntactic and semantic information Goals of the SemTAG system: 1. to provide with an environment to implement real-size TAGs equipped with a syntax / semantics interface, 2. to build underspecified semantic representations of sentences using such a grammar. 2/ 57

Outline Part 1. Grammar development Syntax: Lexicalised Tree Adjoining Grammars LTAG redundancy extensible MetaGrammar (XMG) the formalism Extension #1: different levels of description Extension #2: well-formedness constraints extensible MetaGrammar the implementation Some figures Part 2. Semantic Construction Syntax / Semantics interface in TAG Integration of the semantic interface within the metagrammar Semantic Construction An example Conclusion and Future work Acknowledgement 3/ 57

Part 1. Grammar development 4/ 57

Lexicalised Tree Adjoining Grammars syntactic support : Feature-Based Lexicalised TAG a set of elementary trees where nodes are labelled with feature structures (FS) each elementary tree is associated with at least one lexical anchor (Lexicalisation) two operations for combining trees: adjunction and substitution including unification on the FS 1) substitution (T1) N b t (T2) t N t U t N b 5/ 57

2) adjunction (T1) t N b (T2) t N N b * N t U t N bu b NB: at the end of the derivation, unification of top and bot FS at each node 6/ 57

LTAG redundancy TAG formalism is used in its lexicalised version for parsing. Remarks: many trees share common fragments + a given tree is associated with many lexical items (cf lexicalisation) N S N V N N N S S Jean mange une pomme N V La pomme que Jean mange 7/ 57

Metagrammars for LTAG Related problems: huge redundancy making the design and maintnance of the grammar difficult. e.g. what happens if the agreement representation is modified during grammar development? The MetaGrammar approach: describing the trees of a grammar as combinations of elementary tree fragments, capturing linguistic generalisations through these abstractions. 8/ 57

extensible MetaGrammar (1 / 2) Description of the grammar trees using an expressive and relatively intuitive language. MetaGrammar manipulation of elementary tree fragments using a control language. These elementary tree fragments are defined using a tree description logic. The control language can be compared with a Definite Clause Grammar, i.e. combination rules using disjunction and conjunction. Two methodological axes of description (Crabbé, 05): 1. structure sharing (i.e. reusable elementary tree fragments). 2. alternatives (i.e. disjunctions referring to alternative forms of a given grammatical function, etc). 9/ 57

extensible MetaGrammar (2 / 2) A language to describe tree fragments: Description ::= x y x + y x y x y x + y x y x = y x[f :E] x(p:e) (1) A language to combine tree fragments: Class ::= Name Content (2) Content ::= Description Name (3) Name Name Name Name 10/ 57

Example (1 / 2) Tree fragment #1: SubjectCan (X [cat : s] Y [cat : v] ) (X Z (mark : subst) [cat : n] ) (Z Y) SubjectCan Z [cat:n] X [cat:s] Y [cat:v] 11/ 57

Example (1 / 2) Tree fragment #1: SubjectCan (X [cat : s] Y [cat : v] ) (X Z (mark : subst) [cat : n] ) (Z Y) SubjectCan Z [cat:n] X [cat:s] Y [cat:v] Tree fragment #2: Active (X [cat : s] Y (mark : anchor) [cat : v] ) X Y ) Active X [cat:s] Y [cat:v] 12/ 57

Example (1 / 2) Tree fragment #1: SubjectCan (X [cat : s] Y [cat : v] ) (X Z (mark : subst) [cat : n] ) (Z Y) SubjectCan Z [cat:n] X [cat:s] Y [cat:v] Tree fragment #2: Active (X [cat : s] Y (mark : anchor) [cat : v] ) X Y ) Active X [cat:s] Y [cat:v] Combination rule: Intransitive SubjectCan Active ( ) 13/ 57

Example (2 / 2) Some trees for intransitive verbs (e.g., the lexical item sleeps) S S S N V V N V (Canonical Subject) (Active verb morph) (e.g. the boy sleeps) N S N N* S V N* S N (Extracted Subject) V (Active verb morph) N V (e.g. the boy who sleeps) Subject SubjectCan SubjectExt Intransitive Subject Active 14/ 57

Some features of the XMG formalism Flexible management of variables (local scope by default + export declarations) no name conflicts Class A ::= X,Y A {... } Class B ::= B { Z = A... Z.X } Possibility to factorise class contents via inheritence. Each class of the metagrammar may be equipped with a interface (a feature structure used to share information between classes, e.g. coindexation of semantic indices) The tree description language has been extended to support Interaction Grammars (Perrier, 03). 15/ 57

First extension of the formalism: different levels of description Possibility to describe not only tree fragments (i.e. syntactic information), but also flat semantic formulas. Semantic representation based on the Predicate Logic Unplugged of (Bos, 95). Semantic description language : Description ::= l:p(e 1,...,E n ) l:p(e 1,...,E n ) E i E j (4) Each level of description is processed in a specific dimension. The control language is then an Extended Definite Clause Grammar (Van Roy, 90). 16/ 57

Second extension of the formalism: well-formedness constraints Constraints on the structures produced from the metagrammar. Interests: to guaranty the validity of the structures (and avoid manual checking). to complete the structures according to linguistic criteria. Classification of these constraints into 4 categories: 1. Formal constraints 2. Operational constraints 3. Language-dependent constraints 4. Theoretical constraints 17/ 57

Formal constraints Constraints assuring that the trees generated by the model builder are regular TAG trees. On top of being trees, the output structures must respect some specific criteria: each node has a category label, leaf nodes are either marked as subst, foot or anchor, the category of the foot node is identical to that of the root node, etc. 18/ 57

Operational constraint (1 / 3) Constraints controlling the combinations of tree fragments (closely linked to the concept of Resources / Needs). Constraints based on a colouring of the nodes. Each node of the description is labelled either Black, Red or White. During minimal model computation, nodes are identified according to the following rules: w + w = w b + w = b b + b = r + { w ; b ; r } = 19/ 57

Operational constraint (2 / 3) Benefits: Avoids node naming issues (no global names). Allows to reduce the metagrammatical description (node equations are replaced with implicit coloured node identifications). Facilitates the reuse of a same tree fragment several times. 20/ 57

Operational constraint (3 / 3) Example: S w N r V w (SubjectCan) N r S b V b S w V w N r (Active) (ObjectCan) N r S w N r V w (SubjectRel) 21/ 57

Language-dependent constraints (1 / 2) For French, the ordering and uniqueness of clitics. (Perlmutter, 70): first they appear in front of the verb in a fixed order according to their rank (a-b) and second two different clitics in front of the verb cannot have the same rank (c). For instance the clitics le, la have the rank 3 and lui the rank 4 (rank is a node property). (a) Jean le 3 lui 4 donne John gives it to him (b) *Jean lui 4 le 3 donne *John gives to him it (c) *Jean le 3 la 3 donne *John gives it it 22/ 57

Language-dependent constraints (2 / 2) S S V V V N + V Cl 3 + V Cl 4 + V V (Jean) (le) (lui) (donne) S N V Cl 3 Cl 4 V (Jean le lui donne) S N V Cl 4 Cl 3 V (Jean lui le donne) 23/ 57

Theoretical principles Language-independent principles related to the grammatical formalism described. For TAG, such a principle may be the Principle of Predicate-Argument Coocurrency. NB: such principles are not yet implemented within the XMG system. 24/ 57

XMG the implementation A 3-step metagrammar compilation: 1. translation of the metagrammar into intermediate code for a specific virtual machine. 2. execution of this code and accumulation of partial tree descriptions. 3. solving of the tree descriptions accumulated in step 2. 25/ 57

The metagrammar is written using an object-oriented concrete syntax: class C1 declare?x export X { <syn> {node X[cat=s]} } class C2 declare?y export Y { <syn> {node Y[cat=v]} } Compilation into specific assembler code: class C import C1 C2 { <syn> {X->Y} } call(c1) % memory allocation + execution of code node(x) % unification of X with a node structure feat(x [(cat s)]) call(c2) node(y) feat(x [(cat v)]) dom(c1.x C2.Y) % accumulation of a dominance relation 26/ 57

A Virtual Machine with a logic kernel Architecture based on the Warren s Abstract Machine (Stack, Heap, Trail and Choice Point). Performs unification with chronological backtracking and multiple accumulation (to distinguish between dimensions). NB: Unification may concern specific data structures (e.g. polarised features for Interaction Grammars). 27/ 57

A Constraint Satisfaction Tree Description Solver (1 / 3) 1. Setting the constraint framework: Each node in the input description is associated with an integer. Then, we use an asbtract data type to refer to a node of a valid model in terms of the nodes being equals, above, below, or on its side: Up Eq Right Left Down N i x ::= node( Eq: { ints } Up: { ints } Down: { ints } Left: { ints } Right: { ints }) 28/ 57

A Constraint Satisfaction Tree Description Solver (2 / 3) The input description is converted into relation constraints on node sets. For instance, the dominance relation x y can be translated as: N i x Nj y [Ni x.equp Nj y.up Ni x.down Nj y.eqdown N i x.left N j y.left Ni x.right N j y.right ] 29/ 57

A Constraint Satisfaction Tree Description Solver (2 / 3) The input description is converted into relation constraints on node sets. For instance, the dominance relation x y can be translated as: N i x Nj y [Ni x.equp Nj y.up Ni x.down Nj y.eqdown N i x.left N j y.left Ni x.right N j y.right ] N i x.down Nj y.eqdown 30/ 57

A Constraint Satisfaction Tree Description Solver (3 / 3) 2. Searching the solutions to the problem: The solutions are the assignments for each of the node sets associated with the nodes of the input description. A distribution strategy is used to explore the consistent assignments for these node sets. The implementation of the solver follows the ideas of (Duchier and Niehren, 2000) and uses the constraint programming support of the Oz/Mozart environment. 31/ 57

Extension to specific constraints (1 / 2) This constraint framework can relatively easily be extended to solve specific constraints, such as those introduced previously. The idea: 1. extension of the node representation (tuples whose fields contain sets of nodes), 2. definition of additional constraints on these fields, reflecting the syntactic constraints we want to express. 32/ 57

Extension to specific constraints (2 / 2) example: Clitic uniqueness. In a valid model φ, there is only one node having a given property p. For each node x in the description, we add to its representation a field containing a boolean variable p x indicating whether the node denoting x in the model has this property or not: p x (N i x.eq Vφ p ) Finally, if true value 1 and false 0: p x 1 x φ 33/ 57

Some figures The XMG system has been used successfully to compute a core TAG for French: 6,000+ trees computed from a description containing 293 classes (Crabbé, 05). The compilation of this TAG takes about 15 min with a P4 processor 2.6 GHz and 1 GB RAM. This core TAG has been evaluated on the TSNLP and has a coverage rate of 75 % (recognition of grammatical sentences). This core TAG has been extended with flat semantics (Gardent, 06) so that it can be used both for parsing and generation (Kow et al, 06). 34/ 57

Outline Part 1. Grammar development Syntax: Lexicalised Tree Adjoining Grammars LTAG redundancy extensible MetaGrammar (XMG) the formalism Extension #1: different levels of description Extension #2: well-formedness constraints extensible MetaGrammar the implementation Some figures Part 2. Semantic Construction Syntax / Semantics interface in TAG Integration of the semantic interface within the metagrammar Semantic Construction An example Conclusion and Future work Acknowledgement 35/ 57

Part 2. Semantic Construction 36/ 57

Syntax / Semantics interface in TAG (1 / 2) Syntax / Semantics interface based on (Gardent and Kallmeyer, 03) Semantic representation language: flat semantic formulas (Bos, 95) using unification variables John l 0 : john(j) Mary l 1 : mary(m) loves l 2 : loves(x,y ) semantic operation: union modulo unification John loves Mary l 2 : love(j,m),l 0 : john(j),l 1 : mary(m) 37/ 57

Syntax / Semantics interface in TAG (2 / 2) principle: (i) each elementary tree is associated with a semantic formula (ii) the FS decorating the nodes of the trees share the unification variables occuring in the semantic formulas (iii) the association between tree nodes and semantic variables specify the syntax / semantics interface S NP X VP NP j V NP Y NP m John loves Mary l 2 : love(j,m), l 0 : john(j), l 1 : mary(m) l 0 :john(j) l 2 :love(x,y) l 1 :mary(m) 38/ 57

Integration of the semantic interface within the metagrammar (1 / 2) Target: share semantic indices with the corresponding node features within tree fragments. How: use the class interface to extend the scope of a given node feature (and give it a global name), the same for semantic arguments. During class combinations, the interfaces are unified, resulting in the unification of node features and corresponding semantic arguments. 39/ 57

Integration of the semantic interface within the metagrammar (2 / 2) Example: Intransitive: Subject: Active: UnaryRel: S N [idx=x] l 0 :Rel(X) VP = S N [idx=i] VP S VP l 0:Rel(I1) arg1=x subjidx=x subjidx=i arg1=i1 (Result) (Grammatical functions) (Verb morph) (Semantics) 40/ 57

Semantic Construction Based on the result of the syntactic parsing, in our case from a derivation forest (De La Clergerie, 05) Advantage: using the factorisation of the parses for computing all the semantic representations A 3 steps process: (1) extraction of the semantic information included in the grammar semantic lexicon (2) pure syntactic parsing on the basis of the grammar without any semantic feature derivation forest (3) rebuilding of the semantic representation from (a) the semantic lexicon and (b) the derivation operations 41/ 57

1. Extraction of the semantic lexicon (1 / 2) to be able to rebuild the derivations done during parsing, we have to first store the semantic information contained in the FB-TAG trees Automatic extraction process: 1. for each tree in the grammar G, numbering of the nodes with their gorn address resulting grammar GC 2. for each tree in the grammar GC: (a) create a purely syntactic FB-TAG tree (b) create an entry in the semantic lexicon composed of the tree name, the lemma, the semantic representation, and the semantic features associated with tree nodes (link kept via the gorn address) 42/ 57

1. Extraction of the semantic lexicon (2 / 2) NP 2 2 gen 6 6 4top 4num idx 3 1 A 7 2 B 5 4 X S [ VP 2 6top 4 bot " # 3 gen 1 A num 2 B 7 i 5 hidx 3 l r runs ] 3 l r : runs( 4 X) 43/ 57

1. Extraction of the semantic lexicon (2 / 2) NP 2 2 gen 6 6 4top 4num idx 3 1 A 7 2 B 5 4 X S [ VP 2 6top 4 bot " # 3 gen 1 A num 2 B 7 i 5 hidx 3 l r runs ] 3 l r : runs( 4 X) Syntactic tree S(0) NP(1) 2 4top " gen num # 3 1 A 5 2 B VP(2) 2 4top " gen num # 3 1 A 5 2 B runs τ n0v runs 44/ 57

1. Extraction of the semantic lexicon (2 / 2) NP 2 2 gen 6 6 4top 4num idx 3 1 A 7 2 B 5 4 X S [ VP 2 6top 4 bot " # 3 gen 1 A num 2 B 7 i 5 hidx 3 l r runs ] 3 l r : runs( 4 X) Syntactic tree S(0) Semantic lexicon 0 NP(1) 2 4top " gen num # 3 1 A 5 2 B τ n0v runs VP(2) 2 4top " gen num runs # 3 1 A 5 2 B 1» top h idx 2 Xi 2» bot [ ] 1 l r : runs( 2 X) τ n0v runs hidx 1 l r i 45/ 57

2. Pure syntactic parsing Parsing using the automatically extracted syntactic grammar: after extraction of the semantic lexicon, the resulting grammar consists of a purely syntactic FB-TAG we thus use it to parse sentences with a FB-TAG parser produced automatically by the DyALog system of (De La Clergerie, 05) as a result, we obtain a derivation forest (i.e. a shared representation of all possible derivation trees of the given sentence) 46/ 57

3. Computing semantic representations top-down traversal of the shared derivation forest for each node of the derivation forest (corresponding to an elementary tree), we retrieve its entry from the semantic lexicon for each derivation edge, we recompute the unification operations done on the derived tree: unification on node FS, i.e. on semantic features recorded in the semantic lexicon we collect the semantic representations of each entry of the semantic lexicon N.B. these representations contain unified variables 47/ 57

An example (1 / 2) Sentence to parse: John runs Derivation forest τ n0v runs 1 τ propern john Content of the semantic lexicon: τ propern John 0» h i bot idx 1 j [ ] l j : john( 1 j) 1» top τ n0v runs 0 h idx 3 Xi 2» bot [ ] 2 l r : runs( 3 X) hidx 2 l r i 48/ 57

An example (2 / 2) 0 0 2 4 top bot h idx 3 i5 1 j 1» top h idx 3 Xi 2 2 4 top bot 3 i5 hidx 2 l r John [ ] [ ] l j : john( 1 j) 2 l r : runs( 3 X) runs 49/ 57

An example (2 / 2) 0 0 2 4 top bot h idx 3 i5 1 j 1» top h idx 3 Xi 2 2 4 top bot 3 i5 hidx 2 l r John [ ] [ ] l j : john( 1 j) 2 l r : runs( 3 X) runs 50/ 57

An example (2 / 2) 0 1 2 6 4 top bot h idx h idx i3 3 X i 7 5 1 j 2 2 4 top bot 3 i5 hidx 2 l r John [ ] [ ] l j : john( 1 j) 2 l r : runs( 3 X) runs 51/ 57

An example (2 / 2) 0 1 2 6 4 top bot h idx h idx i3 3 X i 7 5 1 j 2 2 4 top bot 3 i5 hidx 2 l r John [ ] [ ] l j : john( 1 j) 2 l r : runs( 3 X) runs 52/ 57

An example (2 / 2) 0 1 h i idx 1 j 2 h idx 2 l r i John runs [ ] [ ] l j : john( 1 j) 2 l r : runs( 1 j) 53/ 57

SemTAG from the inside 54/ 57

Conclusion (1 / 2) The XMG formalism includes a fully declarative language to handle tree fragments. On top of describing tree-based grammar, XMG allows (i) to describe other linguistic levels of description (i.e. semantics) and (ii) to express constraints on the computed structures. The XMG system has been used to produce a core TAG for French (one for syntax and one equipped with semantics). Both of these grammars are (being) evaluated using the TSNLP. 55/ 57

Conclusion (2 / 2) The Semantic Construction program allows high information sharing by using a metagrammar and a tabular parser. The Syntax / Semantic interface chosen presents some interesting features : homogeneous interface using feature structures, unification as a semantic composition operation. Future work concerns : the extension of the control language to facilitate metagrammar development: node typing? the extension of the metagrammatical framework to describe different target formalisms (from a given metagrammar), development of a library of constraints? the extension of the semantic interface and the semantic construction process to deal with different semantic representations, tag semantic with λ-terms? 56/ 57

Acknowledgement extensible MetaGrammar Benoît Crabbé, Denys Duchier and Joseph Le Roux. Semantic Constructor Claire Gardent, Eric Kow. For further information about the SemTAG platform : http://trac.loria.fr/~semtag http://trac.loria.fr/~semconst http://trac.loria.fr/~geni http://sourcesup.cru.fr/xmg 57/ 57