Meaning Banking and Beyond

Meaning Banking and Beyond Valerio Basile Wimmics, Inria November 18, 2015

"Semantics is a well-kept secret in texts, accessible only to humans." (Anonymous)
I BEG TO DIFFER

Surface Meaning

Step by step analysis: dividing text into words

Step by step analysis: dividing text into words, labeling words

Emotions/noun play/verb an/det important/adjective role/noun in/prep decision/noun making/noun

Step by step analysis: dividing text into words, labeling words, finding links between words

Emotions play an important role in decision making
Dependency labels: subj, obj, attr, nn

Step by step analysis: dividing text into words, labeling words, finding links between words, extracting the meaning

Emotions play an important role in decision making
play(emotion, role)
important(role)
emotion >1...

Step by step analysis
Dividing text into words: Tokenization
Labeling words: Tagging
Finding links between words: Syntax
Extracting the meaning: Semantics
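The four steps can be sketched end to end on the example sentence. This is only a toy illustration: the lookup tables (`TAGS`, `LEMMAS`) and the hand-written linking rules are assumptions made for this one sentence, and they stand in for the trained tools named later in the talk rather than reproducing them.

```python
import re

# Hand-written toy tables (assumptions); a real pipeline learns these from data.
TAGS = {"emotions": "noun", "play": "verb", "an": "det", "important": "adjective",
        "role": "noun", "in": "prep", "decision": "noun", "making": "noun"}
LEMMAS = {"emotions": "emotion"}


def tokenize(text):
    """Step 1, tokenization: divide the text into words."""
    return re.findall(r"\w+", text)


def tag(tokens):
    """Step 2, tagging: label each word with a part of speech."""
    return [(tok, TAGS[tok.lower()]) for tok in tokens]


def parse(tagged):
    """Step 3, syntax: find links between words with three toy rules."""
    words = [w.lower() for w, _ in tagged]
    tags = [t for _, t in tagged]
    verb = tags.index("verb")
    links = []
    # Subject: the closest noun before the verb.
    for i in range(verb - 1, -1, -1):
        if tags[i] == "noun":
            links.append(("subj", words[verb], words[i]))
            break
    # Object: the first noun after the verb.
    for i in range(verb + 1, len(tags)):
        if tags[i] == "noun":
            links.append(("obj", words[verb], words[i]))
            break
    # Attribute: an adjective directly in front of a noun.
    for i in range(len(tags) - 1):
        if tags[i] == "adjective" and tags[i + 1] == "noun":
            links.append(("attr", words[i + 1], words[i]))
    return links


def semantics(links):
    """Step 4, semantics: turn the links into simple predicates."""
    def lemma(word):
        return LEMMAS.get(word, word)

    args = {label: dep for label, _, dep in links if label in ("subj", "obj")}
    verb = next(head for label, head, _ in links if label == "subj")
    facts = [f"{verb}({lemma(args['subj'])}, {lemma(args['obj'])})"]
    for label, head, dep in links:
        if label == "attr":
            facts.append(f"{dep}({lemma(head)})")
    return facts


sentence = "Emotions play an important role in decision making"
facts = semantics(parse(tag(tokenize(sentence))))
print(facts)  # ['play(emotion, role)', 'important(role)']
```

Each function consumes the output of the previous one, which is the sense in which the analysis is "step by step".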

Software pipeline
Tokenization: Elephant (Evang et al. 2013), supervised
Tagging, Syntax: C&C tools (Curran et al. 2007), supervised
Semantics: Boxer (Bos, 2008), rule-based

Emotions play an important role in decision making http://gmb.let.rug.nl/webdemo

Supervised learning Show enough examples to the machine and it will learn and generalize. As opposed to unsupervised learning, e.g. clustering

Supervised learning
enough examples = millions of words + annotation, that is, an annotated text corpus
Examples: Penn Treebank (syntax), SemCor (word senses), TWITA (sentiment), ...
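To make "learning from an annotated corpus" concrete, here is a minimal sketch of a most-frequent-tag baseline. The three-sentence corpus is invented for illustration, and this baseline is far simpler than the supervised tools used in the pipeline; it only shows how annotation turns into predictions.

```python
from collections import Counter, defaultdict


def train(annotated_sentences):
    """Count how often each word carries each tag, then keep the most frequent tag."""
    counts = defaultdict(Counter)
    for sentence in annotated_sentences:
        for word, tag in sentence:
            counts[word.lower()][tag] += 1
    return {word: tags.most_common(1)[0][0] for word, tags in counts.items()}


def predict(model, words, default="noun"):
    """Tag new words; unseen words fall back to a default tag (an assumption)."""
    return [model.get(word.lower(), default) for word in words]


# A tiny invented annotated corpus; a real one has millions of words.
corpus = [
    [("Emotions", "noun"), ("play", "verb"), ("an", "det"),
     ("important", "adjective"), ("role", "noun")],
    [("Children", "noun"), ("play", "verb"), ("in", "prep"),
     ("the", "det"), ("garden", "noun")],
    [("The", "det"), ("play", "noun"), ("was", "verb"), ("long", "adjective")],
]

model = train(corpus)
predicted = predict(model, ["The", "children", "play"])
print(predicted)  # ['det', 'noun', 'verb']
```

"play" is annotated twice as a verb and once as a noun, so the model generalizes to the majority reading. With more annotated examples the counts become more reliable, which is why corpus size matters.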

What about semantics? Meaning Bank (a treebank for semantics)

How to build a meaning bank
Manually: collection of texts + expert annotation
DO NOT TRY THIS AT HOME

How to build a meaning bank
By bootstrapping: collection of texts + analysis software + manual correction

Collection of texts: English, public domain, whole documents, short, open domain

Collection of texts: 73,352 documents, 10,103 accepted; 6.3 sentences per document; ~90% from VoA (newswire); jokes, legal text, fables, ...

Analysis software: GNU Make, daemon process
Elephant: tokenization
C&C tools: tagging & parsing
Boxer: semantic analysis

Manual correction: silver standard, experts and the crowd
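One way to picture the bootstrapping step is as a layer of corrections laid over the automatic output. The sketch below is an assumption about how such a merge could work, not a description of the actual system: the source names (`tool`, `crowd`, `expert`) and the rule that an expert outranks the crowd are hypothetical.

```python
# Hypothetical priority order: a higher number wins when sources disagree.
PRIORITY = {"tool": 0, "crowd": 1, "expert": 2}


def merge(automatic, corrections):
    """Overlay manual corrections on automatically produced tags.

    automatic:   list of (token, tag) pairs produced by the software
    corrections: list of (token_index, tag, source) triples
    """
    best = {i: (tag, "tool") for i, (_, tag) in enumerate(automatic)}
    for index, tag, source in corrections:
        if PRIORITY[source] >= PRIORITY[best[index][1]]:
            best[index] = (tag, source)
    return [(token, best[i][0]) for i, (token, _) in enumerate(automatic)]


# The tool mislabels "play"; the crowd and an expert each submit a correction.
automatic = [("Emotions", "noun"), ("play", "noun"), ("an", "det")]
corrections = [(1, "adjective", "crowd"), (1, "verb", "expert")]
silver = merge(automatic, corrections)
print(silver)  # [('Emotions', 'noun'), ('play', 'verb'), ('an', 'det')]
```

Tokens that nobody corrects keep their automatic label, which is what makes the result a silver rather than a gold standard.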

The GMB Explorer http://gmb.let.rug.nl/explorer
33 users, 173,173 annotations (including automatically generated)

Gamification http://www.wordrobe.org
Leaderboards, badges, agreement-based score
59,413 answers, 13 games, 1,732 players, 2 datasets
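An agreement-based score rewards a player for giving the same answer as other players. The slides do not give the formula, so the rule below (the fraction of other players who chose the same answer) is an assumption used only to illustrate the idea.

```python
from collections import Counter


def agreement_scores(answers):
    """Score each player by the share of other players who gave the same answer.

    answers: dict mapping player name to the answer chosen for one question
    """
    counts = Counter(answers.values())
    others = len(answers) - 1
    if others == 0:
        return {player: 0.0 for player in answers}
    return {player: (counts[choice] - 1) / others
            for player, choice in answers.items()}


# Three hypothetical players label the word "play" in the example sentence.
scores = agreement_scores({"ann": "verb", "bob": "verb", "cy": "noun"})
print(scores)  # {'ann': 0.5, 'bob': 0.5, 'cy': 0.0}
```

Scoring by agreement needs no gold answer, so it can be applied to questions whose correct label is not yet known.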

A semantically annotated resource that anyone can edit.
Johan Bos, Valerio Basile, Kilian Evang, Noortje Venhuizen, Johannes Bjerva (forthcoming): The Groningen Meaning Bank. In Handbook of Linguistic Annotation. Berlin: Springer.
Valerio Basile, Johan Bos, Kilian Evang, Noortje Venhuizen: Developing a large semantically annotated corpus. LREC 2012.
Valerio Basile, Johan Bos, Kilian Evang, Noortje Venhuizen: A platform for collaborative semantic annotation. EACL 2012.
http://gmb.let.rug.nl

Beyond Meaning Banking

Beyond Meaning Banking Autonomous learning of the meaning of objects

Beyond Meaning Banking Bridging semantic analysis with entity linking to build a knowledge base by reading the Web

Beyond Meaning Banking
"When a glass or cup is emptied, the robot will ask if it should serve more."
serving (FrameNet); Agent: Robot (DBpedia); Patient: Glass (DBpedia)
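The analysis on this slide pairs a frame with linked entities, and a structure like that can be stored as knowledge-base triples. The sketch below is illustrative only: the identifiers `framenet:serving`, `dbpedia:Robot`, `dbpedia:Glass` and `event1` are placeholder strings, not verified resource names, and the triple layout is an assumption.

```python
from dataclasses import dataclass, field


@dataclass
class FrameInstance:
    """A frame evoked by a sentence, with its roles filled by linked entities."""
    frame: str
    roles: dict = field(default_factory=dict)


def to_triples(instance_id, instance):
    """Flatten a frame instance into (subject, predicate, object) triples."""
    triples = [(instance_id, "type", instance.frame)]
    for role, filler in instance.roles.items():
        triples.append((instance_id, role, filler))
    return triples


# Placeholder identifiers standing in for FrameNet and DBpedia entries.
serving = FrameInstance(
    frame="framenet:serving",
    roles={"Agent": "dbpedia:Robot", "Patient": "dbpedia:Glass"},
)

triples = to_triples("event1", serving)
for triple in triples:
    print(triple)
```

Keeping the frame and the role fillers as separate triples lets facts extracted from different sentences be merged around the same entity.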

Towards a Web Meaning Bank
Better NLP tools extract better Linked Open Data; better Linked Open Data trains better NLP tools.

Fin Meaning Banking and Beyond Valerio Basile November 18, 2015