Lecture 24: NER & Entity Linking
|
|
- Egbert Young
- 5 years ago
- Views:
Transcription
1 Lecture 24: NER & Entity Linking Kai-Wei Chang University of Virginia kw@kwchang.net Couse webpage: 1
2 Organizing knowledge It s a version of Chicago the standard classic Macintosh menu font, with that distinctive thick diagonal in the N. Chicago was used by default for Mac menus through MacOS 7.6, and OS 8 was released mid Chicago VIII was one of the early 70s-era Chicago albums to catch my ear, along with Chicago II. Slides are adapted from Dan Roth 2
3 Cross-document co-reference resolution It s a version of Chicago the standard classic Macintosh menu font, with that distinctive thick diagonal in the N. Chicago was used by default for Mac menus through MacOS 7.6, and OS 8 was released mid Chicago VIII was one of the early 70s-era Chicago albums to catch my ear, along with Chicago II. 3
4 Reference resolution: (disambiguation to Wikipedia) It s a version of Chicago the standard classic Macintosh menu font, with that distinctive thick diagonal in the N. Chicago was used by default for Mac menus through MacOS 7.6, and OS 8 was released mid Chicago VIII was one of the early 70s-era Chicago albums to catch my ear, along with Chicago II. 4
5 The Reference Collection has Structure It s a version of Chicago the standard classic Macintosh menu font, with that distinctive thick diagonal in the N. Chicago was used by default for Mac menus through MacOS 7.6, and OS 8 was released mid Chicago VIII was one of the early 70s-era Chicago albums to catch my ear, along with Chicago II. Is_a Is_a Used_In Released Succeeded 5
6 Analysis of Information Networks It s a version of Chicago the standard classic Macintosh menu font, with that distinctive thick diagonal in the N. Chicago was used by default for Mac menus through MacOS 7.6, and OS 8 was released mid Chicago VIII was one of the early 70s-era Chicago albums to catch my ear, along with Chicago II. 6
7 Wikipedia as a knowledge resource. Is_a Is_a Used_In Released Succeeded 7
8 Wikification: The Reference Problem Blumenthal (D) is a candidate for the U.S. Senate seat now held by Christopher Dodd (D), and he has held a commanding lead in the race since he entered it. But the Times report has the potential to fundamentally reshape the contest in the Nutmeg State. Cycles of Knowledge: Grounding for/using Knowledge Blumenthal (D) is a candidate for the U.S. Senate seat now held by Christopher Dodd (D), and he has held a commanding lead in the race since he entered it. But the Times report has the potential to fundamentally reshape the contest in the Nutmeg State. 8
9 Challenging v Dealing with Ambiguity of Natural Language v Mentions of entities and concepts could have multiple meanings v Dealing with Variability of Natural Language v A given concept could be expressed in many ways v Wikification addresses these two issues in a specific way: v The Reference Problem v What is meant by this concept? (WSD + Grounding) v More than just co-reference (within and across documents) 9
10 General Challenges Blumenthal (D) is a candidate for the U.S. Senate seat now held by Christopher Dodd (D), and he has held a commanding lead in the race since he entered it. But the Times report has the potential to fundamentally reshape the contest in the Nutmeg State. Ambiguity Variability Times The New York Times The Times CT The Nutmeg State Connecticut Concepts outside of Wikipedia (NIL) Blumenthal? Scale Millions of labels 10
11 Wikification: Subtasks v Wikification and Entity Linking requires addressing several sub-tasks: v Identifying Target Mentions v Mentions in the input text that should be Wikified v Identifying Candidate Titles v Candidate Wikipedia titles that could correspond to each mention v Candidate Title Ranking v Rank the candidate titles for a given mention v NIL Detection and Clustering v Identify mentions that do not correspond to a Wikipedia title v Entity Linking: cluster NIL mentions that represent the same entity. 11
12 High-level Algorithmic Approach. v Input: A text document d; Output: a set of pairs (m i,t i ) v m i are mentions in d; t j (m i ) are corresponding Wikipedia titles, or NIL. v v v (1) Identify mentions m i in d (2) Local Inference v For each m i in d: v Identify a set of relevant titles T(m i ) v Rank titles t i T(m i ) [E.g., consider local statistics of edges [(m i,t i ), (m i,*), and (*, t i )] occurrences in the Wikipedia graph] (3) Global Inference v For each document d: v Consider all m i d; and all t i T(m i ) v Re-rank titles t i T(m i ) [E.g., if m, m are related by virtue of being in d, their corresponding titles t, t may also be related] 12
13 Local approach A text Document Identified mentions Wikipedia Articles Γ is a solution to the problem A set of pairs (m,t) m: a mention in the document t: the matched Wikipedia Title Local score of matching the mention to the title (decomposed by m i ) 13
14 Global Approach: Using Additional Structure Text Document(s) News, Blogs, Wikipedia Articles 14 Adding a global term to evaluate how good the structure of the solution is. Use the local solutions Γ (each mention considered independently. Evaluate the structure based on pairwise coherence scores Ψ(t i,t j ) Choose those that satisfy document coherence conditions.
Jianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationStanford-UBC at TAC-KBP
Stanford-UBC at TAC-KBP Eneko Agirre, Angel Chang, Dan Jurafsky, Christopher Manning, Valentin Spitkovsky, Eric Yeh Ixa NLP group, University of the Basque Country NLP group, Stanford University Outline
More informationUBC Entity Discovery and Linking & Diagnostic Entity Linking at TAC-KBP 2014
UBC Entity Discovery and Linking & Diagnostic Entity Linking at TAC-KBP 2014 Ander Barrena, Eneko Agirre, Aitor Soroa IXA NLP Group / University of the Basque Country, Donostia, Basque Country ander.barrena@ehu.es,
More informationWikification and Beyond: The Challenges of Entity and Concept Grounding
Wikification and Beyond: The Challenges of Entity and Concept Grounding Dan Roth (UIUC), Heng Ji (RPI) Ming Wei Chang (MSR), Taylor Cassidy (ARL&IBM) http://nlp.cs.rpi.edu/paper/wikificationtutorial.pdf
More informationCoreference Resolution with Knowledge. Haoruo Peng March 20, 2015
Coreference Resolution with Knowledge Haoruo Peng March 20, 2015 1 Coreference Problem 2 Coreference Problem 3 Online Demo Please check outhttp://cogcomp.cs.illinois.edu/page/demo_view/coref 4 General
More informationBc. Pavel Taufer. Named Entity Recognition and Linking
MASTER THESIS Bc. Pavel Taufer Named Entity Recognition and Linking Institute of Formal and Applied Linguistics Supervisor of the master thesis: Study programme: Study branch: RNDr. Milan Straka, Ph.D.
More informationCMU System for Entity Discovery and Linking at TAC-KBP 2017
CMU System for Entity Discovery and Linking at TAC-KBP 2017 Xuezhe Ma, Nicolas Fauceglia, Yiu-chang Lin, and Eduard Hovy Language Technologies Institute Carnegie Mellon University 5000 Forbes Ave, Pittsburgh,
More informationTesting Process and Methods. CS 490MT/5555, Spring 2017, Yongjie Zheng
Testing Process and Methods CS 490MT/5555, Spring 2017, Yongjie Zheng Context of Software Testing Verification & Validation Verification: checking that the software conforms to its specification. Validation:
More informationEntity Linking. David Soares Batista. November 11, Disciplina de Recuperação de Informação, Instituto Superior Técnico
David Soares Batista Disciplina de Recuperação de Informação, Instituto Superior Técnico November 11, 2011 Motivation Entity-Linking is the process of associating an entity mentioned in a text to an entry,
More informationEvent Registry. Gregor Leban. Jozef Stefan Institute
Event Registry Gregor Leban Jozef Stefan Institute Event registry http://eventregistry.org/ Event registry is a system for collecting news content published globally automatic identification of world events
More informationslide courtesy of D. Yarowsky Splitting Words a.k.a. Word Sense Disambiguation Intro to NLP - J. Eisner 1
Splitting Words a.k.a. Word Sense Disambiguation 600.465 - Intro to NLP - J. Eisner Representing Word as Vector Could average over many occurrences of the word... Each word type has a different vector
More informationWebSAIL Wikifier at ERD 2014
WebSAIL Wikifier at ERD 2014 Thanapon Noraset, Chandra Sekhar Bhagavatula, Doug Downey Department of Electrical Engineering & Computer Science, Northwestern University {nor.thanapon, csbhagav}@u.northwestern.edu,ddowney@eecs.northwestern.edu
More informationPapers for comprehensive viva-voce
Papers for comprehensive viva-voce Priya Radhakrishnan Advisor : Dr. Vasudeva Varma Search and Information Extraction Lab, International Institute of Information Technology, Gachibowli, Hyderabad, India
More informationRelevance Feedback and Query Reformulation. Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price. Outline
Relevance Feedback and Query Reformulation Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price IR on the Internet, Spring 2010 1 Outline Query reformulation Sources of relevance
More informationHandling Place References in Text
Handling Place References in Text Introduction Most (geographic) information is available in the form of textual documents Place reference resolution involves two-subtasks: Recognition : Delimiting occurrences
More informationLinking Entities in Chinese Queries to Knowledge Graph
Linking Entities in Chinese Queries to Knowledge Graph Jun Li 1, Jinxian Pan 2, Chen Ye 1, Yong Huang 1, Danlu Wen 1, and Zhichun Wang 1(B) 1 Beijing Normal University, Beijing, China zcwang@bnu.edu.cn
More informationCMU System for Entity Discovery and Linking at TAC-KBP 2016
CMU System for Entity Discovery and Linking at TAC-KBP 2016 Xuezhe Ma, Nicolas Fauceglia, Yiu-chang Lin, and Eduard Hovy Language Technologies Institute Carnegie Mellon University 5000 Forbes Ave, Pittsburgh,
More informationSemantic Multimedia Information Retrieval Based on Contextual Descriptions
Semantic Multimedia Information Retrieval Based on Contextual Descriptions Nadine Steinmetz and Harald Sack Hasso Plattner Institute for Software Systems Engineering, Potsdam, Germany, nadine.steinmetz@hpi.uni-potsdam.de,
More informationExploiting DBpedia for Graph-based Entity Linking to Wikipedia
Exploiting DBpedia for Graph-based Entity Linking to Wikipedia Master Thesis presented by Bernhard Schäfer submitted to the Research Group Data and Web Science Prof. Dr. Christian Bizer University Mannheim
More informationGraph-Based Entity Linking
Graph-Based Entity Linking A. Naderi, J. Turmo, H. Rodríguez TALP Research Center Universitat Politecnica de Catalunya February 20, 2014 1 / 29 Table of contents 1 Introduction Entity Linking EL applications
More informationTIB AV-Portal. Margret Plank 19th of January 2015 TACC Meeting
TIB AV-Portal Margret Plank 19th of January 2015 TACC Meeting German National Library of Science and Technology (TIB) German National Library of Science and Technology for all areas of engineering as well
More informationTowards Summarizing the Web of Entities
Towards Summarizing the Web of Entities contributors: August 15, 2012 Thomas Hofmann Director of Engineering Search Ads Quality Zurich, Google Switzerland thofmann@google.com Enrique Alfonseca Yasemin
More informationA Joint Model for Discovering and Linking Entities
A Joint Model for Discovering and Linking Entities Michael Wick Sameer Singh Harshal Pandya Andrew McCallum School of Computer Science University of Massachusetts Amherst MA {mwick, sameer, harshal, mccallum}@cs.umass.edu
More informationInternational Journal of Video& Image Processing and Network Security IJVIPNS-IJENS Vol:10 No:02 7
International Journal of Video& Image Processing and Network Security IJVIPNS-IJENS Vol:10 No:02 7 A Hybrid Method for Extracting Key Terms of Text Documents Ahmad Ali Al-Zubi Computer Science Department
More informationEntity Linking at TAC Task Description
Entity Linking at TAC 2013 Task Description Version 1.0 of April 9, 2013 1 Introduction The main goal of the Knowledge Base Population (KBP) track at TAC 2013 is to promote research in and to evaluate
More informationSoftware Modelling. UML Class Diagram Notation. Unified Modeling Language (UML) UML Modelling. CS 247: Software Engineering Principles
CS 247: Software Engineering Principles UML Modelling InStacks Software Modelling Publication[p] borrow(m,p) / BorrowItem(m, p, today) return(p) / ReturnItem(p, today) OnLoan lost A software model is an
More informationAnnotating Spatio-Temporal Information in Documents
Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de
More informationCS 247: Software Engineering Principles. UML Modelling
CS 247: Software Engineering Principles UML Modelling Agenda: UML class diagrams UML object diagrams UML sequence diagrams Reading: Martin Fowler, UML Distilled, 3rd ed, Addison-Wesley Professional, 2004.
More informationOutline. Possible solutions. The basic problem. How? How? Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity
Outline Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity Lecture 10 CS 410/510 Information Retrieval on the Internet Query reformulation Sources of relevance for feedback Using
More informationThe Pairwise-Comparison Method
The Pairwise-Comparison Method Lecture 10 Section 1.5 Robb T. Koether Hampden-Sydney College Mon, Sep 11, 2017 Robb T. Koether (Hampden-Sydney College) The Pairwise-Comparison Method Mon, Sep 11, 2017
More informationTulip: Lightweight Entity Recognition and Disambiguation Using Wikipedia-Based Topic Centroids. Marek Lipczak Arash Koushkestani Evangelos Milios
Tulip: Lightweight Entity Recognition and Disambiguation Using Wikipedia-Based Topic Centroids Marek Lipczak Arash Koushkestani Evangelos Milios Problem definition The goal of Entity Recognition and Disambiguation
More informationMidterm Exam #1 Version A CS 122A Winter 2017
NAME: SEAT NO.: STUDENT ID: Midterm Exam #1 Version A CS 122A Winter 2017 Max. Points: 100 (Please read the instructions carefully) Instructions: - The total time for the exam is 50 minutes; be sure to
More informationWeb-based Question Answering: Revisiting AskMSR
Web-based Question Answering: Revisiting AskMSR Chen-Tse Tsai University of Illinois Urbana, IL 61801, USA ctsai12@illinois.edu Wen-tau Yih Christopher J.C. Burges Microsoft Research Redmond, WA 98052,
More informationPersonalized Page Rank for Named Entity Disambiguation
Personalized Page Rank for Named Entity Disambiguation Maria Pershina Yifan He Ralph Grishman Computer Science Department New York University New York, NY 10003, USA {pershina,yhe,grishman}@cs.nyu.edu
More informationA Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition
A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es
More informationExtracting Entities from Microblogs and Populating them to Knowledge Bases; the Problems of Big Data
Extracting Entities from Microblogs and Populating them to Knowledge Bases; the Problems of Big Data Robert Neumayer neumayer@idi.ntnu.no Norwegian University of Science and Technology Trondheim, Norway
More informationA Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition
A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es
More informationA cocktail approach to the VideoCLEF 09 linking task
A cocktail approach to the VideoCLEF 09 linking task Stephan Raaijmakers Corné Versloot Joost de Wit TNO Information and Communication Technology Delft, The Netherlands {stephan.raaijmakers,corne.versloot,
More informationIntroduction to System Programming
Introduction to System Programming Introduction to System Programming Why system programming? Basic operation of a computer system Summary Acknowledgement: slides based on the cs:app2e material 2 Why System
More informationWeb Search: Techniques, algorithms and Aplications. Basic Techniques for Web Search
Web Search: Techniques, algorithms and Aplications Basic Techniques for Web Search German Rigau [Based on slides by Eneko Agirre and Christopher Manning and Prabhakar Raghavan] 1
More informationPreference Elicitation for Single Crossing Domain
Preference Elicitation for Single Crossing Domain joint work with Neeldhara Misra (IIT Gandhinagar) March 6, 2017 Appeared in IJCAI 2016 Motivation for Preference Elicitation One often wants to learn how
More informationAnnouncements. CS 188: Artificial Intelligence Spring Generative vs. Discriminative. Classification: Feature Vectors. Project 4: due Friday.
CS 188: Artificial Intelligence Spring 2011 Lecture 21: Perceptrons 4/13/2010 Announcements Project 4: due Friday. Final Contest: up and running! Project 5 out! Pieter Abbeel UC Berkeley Many slides adapted
More informationParsing tree matching based question answering
Parsing tree matching based question answering Ping Chen Dept. of Computer and Math Sciences University of Houston-Downtown chenp@uhd.edu Wei Ding Dept. of Computer Science University of Massachusetts
More informationPig for Natural Language Processing. Max Jakob
Pig for Natural Language Processing Max Jakob Agenda 1 2 3 Introduction (speaker, affiliation, project) Named Entities pignlproc Speaker: Max Jakob MSc in Computational Linguistics Software Developer at
More informationCollective Entity Linking on Relational Graph Model with Mentions
Collective Entity Linking on Relational Graph Model with Mentions Jing Gong 1[00],Chong Feng 1[00],Yong Liu 2[11], Ge Shi 1[22] and Heyan Huang 1[00] 1 Beijing Institute of Technology University, Beijing
More informationCS 200. Lecture 06. Database Introduction. Database Introduction. CS 200 Winter 2018
CS 200 Lecture 06 1 Miscellaneous Notes Abbreviations aka also known as DBMS DataBase Management System mutatis mutantis with the necessary changes having been made 2 Please read and highlight BEFORE lab
More informationCS Part III 1 Dr. Rajesh Subramanyan, 2005
CS250 -- Part III 1 Dr. Rajesh Subramanyan, 2005 Memories Chapter 12 Caches And Caching CS250 -- Part III 2 Dr. Rajesh Subramanyan, 2005 Topics Introduction Definition Key idea Characteristics of a cache
More informationExam Marco Kuhlmann. This exam consists of three parts:
TDDE09, 729A27 Natural Language Processing (2017) Exam 2017-03-13 Marco Kuhlmann This exam consists of three parts: 1. Part A consists of 5 items, each worth 3 points. These items test your understanding
More informationHigh-Throughput and Language-Agnostic Entity Disambiguation and Linking on User Generated Data
High-Throughput and Language-Agnostic Entity Disambiguation and Linking on User Generated Data Preeti Bhargava Lithium Technologies Klout San Francisco, CA preeti.bhargava@lithium.com ABSTRACT The Entity
More informationLINDEN: Linking Named Entities with Knowledge Base via Semantic Knowledge
LINDEN: Linking Named Entities with Knowledge Base via Semantic Knowledge Wei Shen 1, Jianyong Wang 1, Ping Luo 2, Min Wang 2 1 Tsinghua National Laboratory for Information Science and Technology, Department
More informationNUS-I2R: Learning a Combined System for Entity Linking
NUS-I2R: Learning a Combined System for Entity Linking Wei Zhang Yan Chuan Sim Jian Su Chew Lim Tan School of Computing National University of Singapore {z-wei, tancl} @comp.nus.edu.sg Institute for Infocomm
More informationPRIS at TAC2012 KBP Track
PRIS at TAC2012 KBP Track Yan Li, Sijia Chen, Zhihua Zhou, Jie Yin, Hao Luo, Liyin Hong, Weiran Xu, Guang Chen, Jun Guo School of Information and Communication Engineering Beijing University of Posts and
More informationCS 534: Computer Vision Segmentation and Perceptual Grouping
CS 534: Computer Vision Segmentation and Perceptual Grouping Spring 2005 Ahmed Elgammal Dept of Computer Science CS 534 Segmentation - 1 Where are we? Image Formation Human vision Cameras Geometric Camera
More informationJoseph Casamento. Christopher Struttmann. Faculty Advisor: Phillip Chan
Joseph Casamento Christopher Struttmann Faculty Advisor: Phillip Chan About Us Chris Struttmann Florida Tech Senior (B.S. Software
More informationAdding High-Precision Links to Wikipedia
Adding High-Precision Links to Wikipedia Thanapon Noraset Chandra Bhagavatula Doug Downey Department of Electrical Engineering & Computer Science Northwestern University Evanston, IL 60208 {nor csbhagav}@u.northwestern.edu,
More informationS-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking
S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking Yi Yang * and Ming-Wei Chang # * Georgia Institute of Technology, Atlanta # Microsoft Research, Redmond Traditional
More informationArtificial Intelligence
Artificial Intelligence 194 (2013) 130 150 Contents lists available at SciVerse ScienceDirect Artificial Intelligence www.elsevier.com/locate/artint Evaluating Entity Linking with Wikipedia Ben Hachey
More informationGIR experiements with Forostar at GeoCLEF 2007
GIR experiements with Forostar at GeoCLEF 2007 Simon Overell 1, João Magalhães 1 and Stefan Rüger 2,1 1 Multimedia & Information Systems Department of Computing, Imperial College London, SW7 2AZ, UK 2
More informationSketchUp Help. 1 of 15 8/29/09 10:00 AM
SketchUp Help Creating your first model It will take some time to master SketchUp as with any new software. However, following is a project to help you better learn the tools and concepts covered in this
More informationUnsupervised Rank Aggregation with Distance-Based Models
Unsupervised Rank Aggregation with Distance-Based Models Alexandre Klementiev, Dan Roth, and Kevin Small University of Illinois at Urbana-Champaign Motivation Consider a panel of judges Each (independently)
More informationIntroduction to Virtual Machines. Michael Jantz
Introduction to Virtual Machines Michael Jantz Acknowledgements Slides adapted from Chapter 1 in Virtual Machines: Versatile Platforms for Systems and Processes by James E. Smith and Ravi Nair Credit to
More informationText, Knowledge, and Information Extraction. Lizhen Qu
Text, Knowledge, and Information Extraction Lizhen Qu A bit about Myself PhD: Databases and Information Systems Group (MPII) Advisors: Prof. Gerhard Weikum and Prof. Rainer Gemulla Thesis: Sentiment Analysis
More informationWikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population
Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population Heather Simpson 1, Stephanie Strassel 1, Robert Parker 1, Paul McNamee
More informationCMU System for Entity Discovery and Linking at TAC-KBP 2015
CMU System for Entity Discovery and Linking at TAC-KBP 2015 Nicolas Fauceglia, Yiu-Chang Lin, Xuezhe Ma, and Eduard Hovy Language Technologies Institute Carnegie Mellon University 5000 Forbes Ave, Pittsburgh,
More informationHyperlink-Induced Topic Search (HITS) over Wikipedia Articles using Apache Spark
Hyperlink-Induced Topic Search (HITS) over Wikipedia Articles using Apache Spark Due: Sept. 27 Wednesday 5:00PM Submission: via Canvas, individual submission Instructor: Sangmi Lee Pallickara Web page:
More informationBenchmarks for Enterprise Linking: Thomson Reuters R&D at TAC 2013
Benchmarks for Enterprise Linking: Thomson Reuters R&D at TAC 2013 Tom Vacek Hiroko Bretz Frank Schilder Ben Hachey Thomson Reuters R&D School of Information Technologies 610 Opperman Drive University
More informationFinal report - CS224W: Using Wikipedia pages traffic data to infer pertinence of links
Final report - CS224W: Using Wikipedia pages traffic data to infer pertinence of links Marc Thibault; SUNetID number: 06227968 Mael Trean; SUNetIDnumber: 06228438 I - Introduction/Motivation/Problem Definition
More informationAnnouncements. CS 188: Artificial Intelligence Spring Classification: Feature Vectors. Classification: Weights. Learning: Binary Perceptron
CS 188: Artificial Intelligence Spring 2010 Lecture 24: Perceptrons and More! 4/20/2010 Announcements W7 due Thursday [that s your last written for the semester!] Project 5 out Thursday Contest running
More informationarxiv: v1 [cs.ir] 2 Mar 2017
DAWT: Densely Annotated Wikipedia Texts across multiple languages Nemanja Spasojevic, Preeti Bhargava, Guoning Hu Lithium Technologies Klout San Francisco, CA {nemanja.spasojevic, preeti.bhargava, guoning.hu}@lithium.com
More informationGeometric Modeling Lecture Series. Prof. G. Wang Department of Mechanical and Industrial Engineering University of Manitoba
Geometric Modeling 25.353 Lecture Series Prof. G. Wang Department of Mechanical and Industrial Engineering University of Manitoba Introduction Geometric modeling is as important to CAD as governing equilibrium
More informationCS-XXX: Graduate Programming Languages. Lecture 9 Simply Typed Lambda Calculus. Dan Grossman 2012
CS-XXX: Graduate Programming Languages Lecture 9 Simply Typed Lambda Calculus Dan Grossman 2012 Types Major new topic worthy of several lectures: Type systems Continue to use (CBV) Lambda Caluclus as our
More informationAutomatically Geotagging Unstructured Text with Open Source Tools. Charlie Greenbacker, Principal Data Scientist
Automatically Geotagging Unstructured Text with Open Source Tools Charlie Greenbacker, Principal Data Scientist Background! About Me:! Data Scientist! Natural Language Processing! Unstructured Data è Information!
More informationRobust and Collective Entity Disambiguation through Semantic Embeddings
Robust and Collective Entity Disambiguation through Semantic Embeddings Stefan Zwicklbauer University of Passau Passau, Germany szwicklbauer@acm.org Christin Seifert University of Passau Passau, Germany
More informationEvaluating Entity Linking with Wikipedia
Evaluating Entity Linking with Wikipedia Ben Hachey a Will Radford b,c Joel Nothman b,c Matthew Honnibal d James R. Curran b,c a Research & Development Thomson Reuters Corporation St. Paul, MN 55123, USA
More informationInformation Retrieval
Introduction to Information Retrieval CS276 Information Retrieval and Web Search Christopher Manning and Prabhakar Raghavan Lecture 1: Boolean retrieval Information Retrieval Information Retrieval (IR)
More informationMulticore and Multiprocessor Systems: Part I
Chapter 3 Multicore and Multiprocessor Systems: Part I Max Planck Institute Magdeburg Jens Saak, Scientific Computing II 44/342 Symmetric Multiprocessing Definition (Symmetric Multiprocessing (SMP)) he
More informationThe Internet. History & Current Applications
The Internet History & Current Applications Popescu 2012 1 Connecting computers to other computers Share data Join computing forces Ensure resiliency 2 Types of Communication Synchronous: sender and receiver
More informationStanford-UBC Entity Linking at TAC-KBP
Stanford-UBC Entity Linking at TAC-KBP Angel X. Chang, Valentin I. Spitkovsky, Eric Yeh, Eneko Agirre, Christopher D. Manning Computer Science Department, Stanford University, Stanford, CA, USA IXA NLP
More informationOntology based Web Page Topic Identification
Ontology based Web Page Topic Identification Abhishek Singh Rathore Department of Computer Science & Engineering Maulana Azad National Institute of Technology Bhopal, India Devshri Roy Department of Computer
More informationCITESEERX DATA: SEMANTICIZING SCHOLARLY PAPERS
CITESEERX DATA: SEMANTICIZING SCHOLARLY PAPERS Jian Wu, IST, Pennsylvania State University Chen Liang, IST, Pennsylvania State University Huaiyu Yang, EECS, Vanderbilt University C. Lee Giles, IST & CSE
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY Computer Systems Engineering: Spring Quiz I
Department of Electrical Engineering and Computer Science MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.033 Computer Systems Engineering: Spring 2016 Quiz I There are 15 questions and 13 pages in this quiz booklet.
More informationSupervised Models for Coreference Resolution [Rahman & Ng, EMNLP09] Running Example. Mention Pair Model. Mention Pair Example
Supervised Models for Coreference Resolution [Rahman & Ng, EMNLP09] Many machine learning models for coreference resolution have been created, using not only different feature sets but also fundamentally
More informationAugust 2012 Daejeon, South Korea
Building a Web of Linked Entities (Part I: Overview) Pablo N. Mendes Free University of Berlin August 2012 Daejeon, South Korea Outline Part I A Web of Linked Entities Challenges Progress towards solutions
More informationParsing Part II. (Ambiguity, Top-down parsing, Left-recursion Removal)
Parsing Part II (Ambiguity, Top-down parsing, Left-recursion Removal) Ambiguous Grammars Definitions If a grammar has more than one leftmost derivation for a single sentential form, the grammar is ambiguous
More informationLSI UNED at M-WePNaD: Embeddings for Person Name Disambiguation
LSI UNED at M-WePNaD: Embeddings for Person Name Disambiguation Andres Duque, Lourdes Araujo, and Juan Martinez-Romo Dpto. Lenguajes y Sistemas Informáticos Universidad Nacional de Educación a Distancia
More informationWikulu: Information Management in Wikis Enhanced by Language Technologies
Wikulu: Information Management in Wikis Enhanced by Language Technologies Iryna Gurevych (this is joint work with Dr. Torsten Zesch, Daniel Bär and Nico Erbs) 1 UKP Lab: Projects UKP Lab Educational Natural
More informationBi-directional Linkability From Wikipedia to Documents and Back Again: UMass at TREC 2012 Knowledge Base Acceleration Track
Bi-directional Linkability From Wikipedia to Documents and Back Again: UMass at TREC 2012 Knowledge Base Acceleration Track Jeffrey Dalton University of Massachusetts, Amherst jdalton@cs.umass.edu Laura
More informationINFO 1103 Homework Project 2
INFO 1103 Homework Project 2 February 15, 2019 Due March 13, 2019, at the end of the lecture period. 1 Introduction In this project, you will design and create the appropriate tables for a version of the
More informationCOS 116 The Computational Universe Laboratory 1: Web 2.0
COS 116 The Computational Universe Laboratory 1: Web 2.0 Must be completed by the noon Tuesday, February 9, 2010. In this week s lab, you ll explore some web sites that encourage collaboration among their
More information0. Database Systems 1.1 Introduction to DBMS Information is one of the most valuable resources in this information age! How do we effectively and efficiently manage this information? - How does Wal-Mart
More informationTowards open-domain QA. Question answering. TReC QA framework. TReC QA: evaluation
Question ing Overview and task definition History Open-domain question ing Basic system architecture Watson s architecture Techniques Predictive indexing methods Pattern-matching methods Advanced techniques
More informationBUPT at TREC 2009: Entity Track
BUPT at TREC 2009: Entity Track Zhanyi Wang, Dongxin Liu, Weiran Xu, Guang Chen, Jun Guo Pattern Recognition and Intelligent System Lab, Beijing University of Posts and Telecommunications, Beijing, China,
More informationCS674 Natural Language Processing
CS674 Natural Language Processing A question What was the name of the enchanter played by John Cleese in the movie Monty Python and the Holy Grail? Question Answering Eric Breck Cornell University Slides
More informationReddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011
Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011 1. Introduction Reddit is one of the most popular online social news websites with millions
More informationДинамичното семантично публикуване в Би Би Си (Empowering Dynamic Semantic Publishing at the BBC) CESAR, META-NET Meeting, Sofia
Динамичното семантично публикуване в Би Би Си (Empowering Dynamic Semantic Publishing at the BBC) CESAR, META-NET Meeting, Sofia May 2012 Presentation Outline Ontotext Linked data BBC s Business case The
More informationChinese Microblog Entity Linking System Combining Wikipedia and Search Engine Retrieval Results
Chinese Microblog Entity Linking System Combining Wikipedia and Search Engine Retrieval Results Zeyu Meng, Dong Yu, and Endong Xun Inter. R&D center for Chinese Education, Beijing Language and Culture
More informationIt s time for a semantic engine!
It s time for a semantic engine! Ido Dagan Bar-Ilan University, Israel 1 Semantic Knowledge is not the goal it s a primary mean to achieve semantic inference! Knowledge design should be derived from its
More informationShort Text Categorization Exploiting Contextual Enrichment and External Knowledge
Short Text Categorization Exploiting Contextual Enrichment and External Knowledge Stefano Mizzaro, Marco Pavan, Ivan Scagnetto, Martino Valenti!! University of Udine, Italy 1 Disclaimer Keep it simple,
More informationWebTLab: A cooccurrence-based approach to KBP 2010 Entity-Linking task
WebTLab: A cooccurrence-based approach to KBP 2010 Entity-Linking task Norberto Fernández, Jesus A. Fisteus, Luis Sánchez, Eduardo Martín {berto,jaf,luiss,emartin}@it.uc3m.es Web Technologies Laboratory
More informationWhat s in a name? Rebecca Dridan -
Rebecca Dridan The Data Text taken from Linux blogs Automatic markup normalisation and sentence segmentation, manually corrected Parsed with off-the-shelf English Resource Grammar 452 manually annotated
More information