Querying unnormalized and Inc mpl te Knowledge Bases
|
|
- Bernadette Fletcher
- 5 years ago
- Views:
Transcription
1 Querying unnormalized and Inc mpl te Knowledge Bases Percy Liang Stanford University Automated Knowledge Base Construction (AKBC) 2016 June 17, 2016
2 Computing the answer What is the second most populous city in California? 1
3 Computing the answer What is the second most populous city in California? semantic parsing argmax(type.city ContainedBy.CA, Population, 2) 1
4 Computing the answer What is the second most populous city in California? semantic parsing argmax(type.city ContainedBy.CA, Population, 2) execute San Diego 1
5 Computing the answer Which states capitals are also their largest cities by area? 2
6 Computing the answer Which states capitals are also their largest cities by area? semantic parsing µx.type.usstate Capital.argmax(Type.City ContainedBy.x, Area) 2
7 Computing the answer Which states capitals are also their largest cities by area? semantic parsing µx.type.usstate Capital.argmax(Type.City ContainedBy.x, Area) execute Arizona,Hawaii,Idaho,Indiana,Iowa,Oklahoma,Utah 2
8 Computing the answer Which states capitals are also their largest cities by area? semantic parsing µx.type.usstate Capital.argmax(Type.City ContainedBy.x, Area) execute Arizona,Hawaii,Idaho,Indiana,Iowa,Oklahoma,Utah Strongly leverages KB structure! 2
9 Freebase [Bollacker, 2008; Google, 2013] 100M entities (nodes) 1B assertions (edges) MichelleObama PlacesLived Spouse Gender Female StartDate USState Type Event21 Event8 Hawaii ContainedBy Location Type Marriage UnitedStates ContainedBy ContainedBy Chicago BarackObama PlaceOfBirth Honolulu Location Event3 PlacesLived Type DateOfBirth Profession Type Person Politician City 3
10 Freebase [Bollacker, 2008; Google, 2013] 100M entities (nodes) 1B assertions (edges) MichelleObama PlacesLived Spouse Gender Female StartDate USState Type Event21 Event8 Hawaii ContainedBy Location Type Marriage UnitedStates ContainedBy ContainedBy Chicago BarackObama PlaceOfBirth Honolulu Location Event3 PlacesLived Type DateOfBirth Profession Type Person Politician City 3
11 hiking trails near Palo Alto dishes at Oren s Hummus ACL 2014 papers MichelleObama Gender PlacesLived Spouse Event21 Event8 Female StartDate USState Type Hawaii ContainedBy Location Type Marriage UnitedStates ContainedBy ContainedBy Chicago BarackObama PlaceOfBirth Honolulu Location Event3 PlacesLived Type DateOfBirth Profession Type Person Politician City 4
12 PlacesLived Location Type Location Spouse Gender ContainedBy PlacesLived Type StartDate Marriage DateOfBirth Profession ContainedBy PlaceOfBirth Type ContainedBy Type hiking trails near Palo Alto dishes at Oren s Hummus ACL 2014 papers MichelleObama Event21 Event8 Female USState Hawaii UnitedStates Chicago Event3 BarackObama Honolulu Person Politician City Fewer than 10% of WebQuestions answerable via Freebase 4
13 Obtaining better KBs 5
14 Obtaining better KBs KBC 5
15 Obtaining better KBs KBC QA 5
16 Obtaining better KBs KBC QA 5
17 Philosophy Focus on the end-to-end task of question answering. Let that end goal drive learning and construction of intermediate knowledge representations. 6
18 Philosophy Focus on the end-to-end task of question answering. Let that end goal drive learning and construction of intermediate knowledge representations. KBC/QA 6
19 Outline On web pages On tables In vector space 7
20 Simple semantic parsing on web pages Panupong (Ice) Pasupat ACL
21 Semantic parsing on the web Input: query x hiking trails near Baltimore web page w 9
22 Semantic parsing on the web Input: query x hiking trails near Baltimore web page w 9
23 Semantic parsing on the web Input: query x hiking trails near Baltimore web page w 9
24 Semantic parsing on the web Input: query x hiking trails near Baltimore web page w Output: list of entities y [Avalon Super Loop, Patapsco Valley State Park,...] 9
25 [Sahuguet and Azavant, 1999; Liu et al., 2000; Crescenzi et al., 2001] Logical forms: XPath expressions html head body table h1 table tr tr tr... tr td td td td th th td td td td z = /html[1]/body[1]/table[2]/tr/td[1] 10
26 [Sahuguet and Azavant, 1999; Liu et al., 2000; Crescenzi et al., 2001] Logical forms: XPath expressions html head body table h1 table tr tr tr... tr td td td td th th td td td td z = /html[1]/body[1]/table[2]/tr/td[1] A low-level KB 10
27 Framework hiking trails near Baltimore x w html head body
28 Framework hiking trails near Baltimore x w html head body Generation ( Z 8500) Z 11
29 Framework hiking trails near Baltimore x w html head body Generation ( Z 8500) Z Model /html[1]/body[1]/table[2]/tr/td[1] z 11
30 Framework hiking trails near Baltimore x w html head body Generation ( Z 8500) Z Model /html[1]/body[1]/table[2]/tr/td[1] z Execution [Avalon Super Loop, Patapsco Valley State Park,...] y 11
31 Dataset airlines of italy natural causes of global warming lsu football coaches bf3 submachine guns badminton tournaments foods high in dha technical colleges in south carolina songs on glee season 5 singers who use auto tune san francisco radio stations 12
32 Dataset airlines of italy natural causes of global warming lsu football coaches 12
33 Dataset statistics 2773 examples 2269 unique queries 894 unique headwords long tail! 1483 unique web domains long tail! ( wrapper induction) 13
34 Results Baseline (Most frequent extraction predicates) Accuracy 5 14
35 Correct prediction Query: disney channel movies /html[1]/body/div[2]/div/div/div[3]/div[1]/div/div/div/div/b 15
36 Ranking error Query: doctors at emory /html/body/div[3]/div[4]/table/tbody/tr/td[2] Need better understanding of entities/categories 16
37 Coverage error Query: hedge funds in new york /html/body/div[3]/div[3]/div[4]/.../table/tbody/tr/td[2]/a Need compositionality 17
38 Outline On web pages On tables In vector space 18
39 Semantic parsing on tables Panupong (Ice) Pasupat ACL 2015, ACL
40 In what city did Piotr s last 1st place finish occur? 20
41 How long did it take this competitor to finish the 4x400 meter relay at Universiade in 2005? 20
42 Where was the competition held immediately before the one in Turkey? 20
43 How many times has this competitor placed 5th or better in competition? 20
44 Dataset Statistics: question/answers 2100 tables 6.3 columns and 27.5 rows per table 21
45 Dataset Statistics: question/answers 2100 tables 6.3 columns and 27.5 rows per table Challenges: High logical complexity (conjunction, disjunction, superlatives, comparatives, aggregation, arithmetic) Tables are unnormalized Train and test tables are distinct; need to generalize! 21
46 Knowledge representation Add normalization / auxiliary edges (custom functions), push resolution to semantic parsing 22
47 Model Greece held its last Summer Olympics in which year?
48 Model Greece held its last Summer Olympics in which year? R[Date].R[Year].argmax(Country.Greece, Index)
49 Results IR: Train classifer to pick answer directly from table. WQ: Use logical complexity of our previous Freebase work. 50 answer accuracy IR WQ Full 24
50 Oracle accuracy Can the system even generate a set of candidates containing the answer? How many times did Greece hold the summer olympics? 2 25
51 Oracle accuracy Can the system even generate a set of candidates containing the answer? How many times did Greece hold the summer olympics? 2 Method LF accuracy ACL % ACL 2016 (dynamic prog. on denotations) 76.0% 25
52 Error analysis Unhandled operations (19%): Was there more gold medals won than silver? Which movies were number 1 for at least two consecutive weeks? How many titles had the same author listed as the illustrator? 26
53 Error analysis Unhandled operations (19%): Was there more gold medals won than silver? Which movies were number 1 for at least two consecutive weeks? How many titles had the same author listed as the illustrator? Table normalization: In what city did Piotr s last 1st place finish occur?...[bangkok, Thailand]... How long does the show defcon 3 last?...[2pm-3pm]... 26
54 Error analysis Unhandled operations (19%): Was there more gold medals won than silver? Which movies were number 1 for at least two consecutive weeks? How many titles had the same author listed as the illustrator? Table normalization: In what city did Piotr s last 1st place finish occur?...[bangkok, Thailand]... How long does the show defcon 3 last?...[2pm-3pm]... Lexical mismatch: Mexican Mexico, airplane Model 26
55 Outline On web pages On tables In vector space 27
56 Compositional Queries in Vector Space EMNLP 2015 Kelvin Guu John Miller 28
57 Focus on path queries TadLincoln/Parents/Location 29
58 Focus on path queries TadLincoln/Parents/Location Strength: compositionality 29
59 Focus on path queries TadLincoln/Parents/Location Strength: compositionality Weaknesses: can t handle fact incompleteness, hypotheticals abraham lincoln/daughter/ethnicity 29
60 Vector space relation modeling Score(entity1, relation, entity2) 30
61 Vector space relation modeling Score(entity1, relation, entity2) Prior work: Tensor factorization [Nickel et al., 2011] Neural Tensor Network [Socher et al., 2013] TransE [Bordes et al., 2013] Universal schema [Riedel et al., 2013] General framework + comparison [Yang et al., 2015] Compositional embedding of paths [Neelakantan et al., 2015] 30
62 Vector space relation modeling Score(entity1, relation, entity2) Prior work: Tensor factorization [Nickel et al., 2011] Neural Tensor Network [Socher et al., 2013] TransE [Bordes et al., 2013] Universal schema [Riedel et al., 2013] General framework + comparison [Yang et al., 2015] Compositional embedding of paths [Neelakantan et al., 2015] Strength: handles incompleteness Weakness: no compositionality 30
63 Goal Compositionality + Handle Incompleteness 31
64 Path queries in vector space 32
65 Path queries in vector space 32
66 Path queries in vector space: example x tad lincoln W location W parents xspringfield 33
67 Experimental setup: models Bilinear [Nickel et al., 2012]: T r (v) = v W r 34
68 Experimental setup: models Bilinear [Nickel et al., 2012]: T r (v) = v W r Bilinear-Diag [Yang et al., 2015]: T r (v) = v diag(w r ) TransE [Bordes et al., 2013]: T r (v) = v + w r 34
69 Experimental setup: datasets (paths length 1-5 generated randomly) 35
70 Experimental setup: training SINGLE: standard training abraham lincoln/location springfield stephen curry/birthdate
71 Experimental setup: training SINGLE: standard training abraham lincoln/location springfield stephen curry/birthdate COMP: compositional training abraham lincoln/location springfield stephen curry/birthdate
72 Experimental setup: training SINGLE: standard training abraham lincoln/location springfield stephen curry/birthdate COMP: compositional training abraham lincoln/location springfield stephen curry/birthdate tad lincoln/parents/location springfield stephen curry/wife/birthdate
73 Experiments: knowledge base completion abraham lincoln/location springfield 37
74 Experiments: knowledge base completion abraham lincoln/location springfield Compositional training improves KBC! Surprising it works: train on paths, test on single edges... 37
75 Experiments: knowledge base completion abraham lincoln/location springfield Compositional training improves KBC! Surprising it works: train on paths, test on single edges... Think of it as a form of path regularization 37
76 Final remarks 38
77 KBC + QA Knowledge bases provide structure for difficult aggregation questions (computing the answer) 39
78 KBC + QA Knowledge bases provide structure for difficult aggregation questions (computing the answer) Focus on end task of question answering; knowledge bases are just caching 39
79 KBC + QA Knowledge bases provide structure for difficult aggregation questions (computing the answer) Focus on end task of question answering; knowledge bases are just caching Question-dependent KB 39
80 Question-dependent KB Multiple levels of compositionality (think n-gram backoff) dog-friendly hiking trails near Palo Alto 40
81 Question-dependent KB Multiple levels of compositionality (think n-gram backoff) [dog-friendly] [hiking trails near Palo Alto] 40
82 Question-dependent KB Multiple levels of compositionality (think n-gram backoff) [dog-friendly] [hiking trails] [near Palo Alto] 40
83 Question-dependent KB Multiple levels of compositionality (think n-gram backoff) [dog-friendly] [hiking trails] [near Palo Alto] /body/div[2]/(click)/body/div/(search)/body/div[3] Query a low-level KB 40
84 What about unstructured text? 41
85 Reading comprehension dataset 42
86 Reading comprehension dataset - 107,785 question/answer pairs articles - Answer is span of paragraph 42
87 Reading comprehension dataset - 107,785 question/answer pairs articles - Answer is span of paragraph Method F1 Sliding window 20.0% Log. reg 51.0% Human 86.8% 42
88 Reading comprehension dataset - 107,785 question/answer pairs articles - Answer is span of paragraph Method F1 Sliding window 20.0% Log. reg 51.0% Human 86.8% stanford-qa.com 42
89 Code and data worksheets.codalab.org Collaborators Panupong Pasupat Kelvin Guu John Miller Funding Google Microsoft DARPA Thank you! 43
arxiv: v1 [cs.cl] 3 Jun 2015
Traversing Knowledge Graphs in Vector Space Kelvin Gu Stanford University kgu@stanford.edu John Miller Stanford University millerjp@stanford.edu Percy Liang Stanford University pliang@cs.stanford.edu arxiv:1506.01094v1
More informationarxiv: v2 [cs.cl] 19 Aug 2015
Traversing Knowledge Graphs in Vector Space Kelvin Guu Stanford University kguu@stanford.edu John Miller Stanford University millerjp@stanford.edu Percy Liang Stanford University pliang@cs.stanford.edu
More informationarxiv: v2 [cs.ai] 18 Sep 2013
Lambda Dependency-Based Compositional Semantics Percy Liang September 19, 2013 arxiv:1309.4408v2 [cs.ai] 18 Sep 2013 Abstract This short note presents a new formal language, lambda dependency-based compositional
More informationNeural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision Anonymized for review Abstract Extending the success of deep neural networks to high level tasks like natural language
More informationConstruction and Querying of Large-scale Knowledge Bases. Part II: Schema-agnostic Knowledge Base Querying
Construction and Querying of Large-scale Knowledge Bases Part II: Schema-agnostic Knowledge Base Querying Transformation in Information Search Desktop search Mobile search Which hotel has a roller coaster
More informationarxiv: v1 [cs.cl] 3 Aug 2015
Compositional Semantic Parsing on Semi-Structured Tables Panupong Pasupat Computer Science Department Stanford University ppasupat@cs.stanford.edu Percy Liang Computer Science Department Stanford University
More informationLeveraging Knowledge Graphs for Web-Scale Unsupervised Semantic Parsing. Interspeech 2013
Leveraging Knowledge Graphs for Web-Scale Unsupervised Semantic Parsing LARRY HECK, DILEK HAKKANI-TÜR, GOKHAN TUR Focus of This Paper SLU and Entity Extraction (Slot Filling) Spoken Language Understanding
More informationQuestion Answering over Knowledge Bases: Entity, Text, and System Perspectives. Wanyun Cui Fudan University
Question Answering over Knowledge Bases: Entity, Text, and System Perspectives Wanyun Cui Fudan University Backgrounds Question Answering (QA) systems answer questions posed by humans in a natural language.
More informationThe Latest Progress of KB-QA. Ming Zhou Microsoft Research Asia NLPCC 2018 Workshop Hohhot, 2018
The Latest Progress of KB-QA Ming Zhou Microsoft Research Asia NLPCC 2018 Workshop Hohhot, 2018 Query CommunityQA KBQA TableQA PassageQA VQA Edited Response Question Generation Knowledge Table Doc Image
More informationInferring Logical Forms From Denotations
Inferring Logical Forms From Denotations Panupong Pasupat Computer Science Department Stanford University ppasupat@csstanfordedu Percy Liang Computer Science Department Stanford University pliang@csstanfordedu
More informationOutline. Morning program Preliminaries Semantic matching Learning to rank Entities
112 Outline Morning program Preliminaries Semantic matching Learning to rank Afternoon program Modeling user behavior Generating responses Recommender systems Industry insights Q&A 113 are polysemic Finding
More informationStructured Data on the Web
Structured Data on the Web Alon Halevy Google Australasian Computer Science Week January, 2010 Structured Data & The Web Andree Hudson, 4 th of July Hard to find structured data via search engines
More informationarxiv: v1 [cs.cl] 31 Oct 2016
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision Chen Liang, Jonathan Berant, Quoc Le, Kenneth D. Forbus, Ni Lao arxiv:1611.00020v1 [cs.cl] 31 Oct 2016 Northwestern
More informationWeb Data Extraction Using Tree Structure Algorithms A Comparison
Web Data Extraction Using Tree Structure Algorithms A Comparison Seema Kolkur, K.Jayamalini Abstract Nowadays, Web pages provide a large amount of structured data, which is required by many advanced applications.
More informationAnswering Natural Language Questions via Phrasal Semantic Parsing
Answering Natural Language Questions via Phrasal Semantic Parsing Kun Xu Sheng Zhang Yansong Feng Dongyan Zhao Peking University December 9, 2014 Movie Movie 1 What else movies did the director of the
More informationS-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking
S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking Yi Yang * and Ming-Wei Chang # * Georgia Institute of Technology, Atlanta # Microsoft Research, Redmond Traditional
More informationOn Building Natural Language Interface to Data and Services
On Building Natural Language Interface to Data and Services Department of Computer Science Human-machine interface for digitalized world 2 One interface for all 3 Research on natural language interface
More informationLearning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li
Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,
More informationKnowledge Graph Embedding with Numeric Attributes of Entities
Knowledge Graph Embedding with Numeric Attributes of Entities Yanrong Wu, Zhichun Wang College of Information Science and Technology Beijing Normal University, Beijing 100875, PR. China yrwu@mail.bnu.edu.cn,
More informationJianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationarxiv: v2 [cs.ai] 21 Jan 2016
Neural Enquirer: Learning to Query Tables with Natural Language arxiv:1512.00965v2 [cs.ai] 21 Jan 2016 Pengcheng Yin Zhengdong Lu Hang Li Ben Kao Dept. of Computer Science The University of Hong Kong {pcyin,
More informationManual Wrapper Generation. Automatic Wrapper Generation. Grammar Induction Approach. Overview. Limitations. Website Structure-based Approach
Automatic Wrapper Generation Kristina Lerman University of Southern California Manual Wrapper Generation Manual wrapper generation requires user to Specify the schema of the information source Single tuple
More informationSeq2SQL: Generating Structured Queries from Natural Language Using Reinforcement Learning
Seq2SQL: Generating Structured Queries from Natural Language Using Reinforcement Learning V. Zhong, C. Xiong, R. Socher Salesforce Research arxiv: 1709.00103 Reviewed by : Bill Zhang University of Virginia
More informationNgram Search Engine with Patterns Combining Token, POS, Chunk and NE Information
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Satoshi Sekine Computer Science Department New York University sekine@cs.nyu.edu Kapil Dalwani Computer Science Department
More informationSelf-tuning ongoing terminology extraction retrained on terminology validation decisions
Self-tuning ongoing terminology extraction retrained on terminology validation decisions Alfredo Maldonado and David Lewis ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin
More informationCOLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA
COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA Vincent W. Zheng, Yu Zheng, Xing Xie, Qiang Yang Hong Kong University of Science and Technology Microsoft Research Asia WWW 2010
More informationKnowledge graph embeddings
Knowledge graph embeddings Paolo Rosso, Dingqi Yang, Philippe Cudré-Mauroux Synonyms Knowledge base embeddings RDF graph embeddings Definitions Knowledge graph embeddings: a vector representation of entities
More informationSocial Data. Toby Segaran Author, Programming Collective Intelligence Data Magnate, Metaweb Technologies
Social Data Toby Segaran Author, Programming Collective Intelligence Data Magnate, Metaweb Technologies Data mining? Sorting through data* to identify patterns and establish relationships * usually a lot
More informationSearching the Deep Web
Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index
More informationLinking Entities in Chinese Queries to Knowledge Graph
Linking Entities in Chinese Queries to Knowledge Graph Jun Li 1, Jinxian Pan 2, Chen Ye 1, Yong Huang 1, Danlu Wen 1, and Zhichun Wang 1(B) 1 Beijing Normal University, Beijing, China zcwang@bnu.edu.cn
More informationWeb Data Extraction. Craig Knoblock University of Southern California. This presentation is based on slides prepared by Ion Muslea and Kristina Lerman
Web Data Extraction Craig Knoblock University of Southern California This presentation is based on slides prepared by Ion Muslea and Kristina Lerman Extracting Data from Semistructured Sources NAME Casablanca
More informationAnnotating Spatio-Temporal Information in Documents
Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de
More informationModeling System Calls for Intrusion Detection with Dynamic Window Sizes
Modeling System Calls for Intrusion Detection with Dynamic Window Sizes Eleazar Eskin Computer Science Department Columbia University 5 West 2th Street, New York, NY 27 eeskin@cs.columbia.edu Salvatore
More informationINFORMATION TECHNOLOGY: PAPER II. 1. This question paper consists of 9 pages. Please check that your question paper is complete.
NATIONAL SENIOR CERTIFICATE EXAMINATION NOVEMBER 2011 INFORMATION TECHNOLOGY: PAPER II Time: 3 hours 120 marks PLEASE READ THE FOLLOWING INSTRUCTIONS CAREFULLY 1. This question paper consists of 9 pages.
More informationCompositional Learning of Embeddings for Relation Paths in Knowledge Bases and Text
Compositional Learning of Embeddings for Relation Paths in Knowledge Bases and Text Kristina Toutanova Microsoft Research Redmond, WA, USA Xi Victoria Lin Computer Science & Engineering University of Washington
More informationLearning Meanings for Sentences with Recursive Autoencoders
Learning Meanings for Sentences with Recursive Autoencoders Tsung-Yi Lin and Chen-Yu Lee Department of Electrical and Computer Engineering University of California, San Diego {tsl008, chl260}@ucsd.edu
More informationQuery-Free News Search
Query-Free News Search by Monika Henzinger, Bay-Wei Chang, Sergey Brin - Google Inc. Brian Milch - UC Berkeley presented by Martin Klein, Santosh Vuppala {mklein, svuppala}@cs.odu.edu ODU, Norfolk, 03/21/2007
More informationRepresentation Learning of Knowledge Graphs with Hierarchical Types
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16) Representation Learning of Knowledge Graphs with Hierarchical Types Ruobing Xie, 1 Zhiyuan Liu, 1,2
More informationGhent University-IBCN Participation in TAC-KBP 2015 Cold Start Slot Filling task
Ghent University-IBCN Participation in TAC-KBP 2015 Cold Start Slot Filling task Lucas Sterckx, Thomas Demeester, Johannes Deleu, Chris Develder Ghent University - iminds Gaston Crommenlaan 8 Ghent, Belgium
More informationPart I: Data Mining Foundations
Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?
More informationKnowledge Transfer for Out-of-Knowledge-Base Entities: A Graph Neural Network Approach
Knowledge Transfer for Out-of-Knowledge-Base Entities: A Graph Neural Network Approach Takuo Hamaguchi, 1 Hidekazu Oiwa, 2 Masashi Shimbo, 1 and Yuji Matsumoto 1 1 Nara Institute of Science and Technology,
More informationIntelligent ranking for photo galleries using sharing intent
Technical Disclosure Commons Defensive Publications Series June 20, 2018 Intelligent ranking for photo galleries using sharing intent John Oberbeck Follow this and additional works at: https://www.tdcommons.org/dpubs_series
More informationDecember 4, BigData 2017 Enterprise Knowledge Graphs for Large Scale Analytics 1/47
December 4, 2017 BigData 2017 Enterprise Knowledge Graphs for Large Scale Analytics 1/47 Knowledge Graphs Analytics Knowledge Graph Analytics Finding Entities of Interest Entity Search and Recommendation
More informationFinding Relevant Relations in Relevant Documents
Finding Relevant Relations in Relevant Documents Michael Schuhmacher 1, Benjamin Roth 2, Simone Paolo Ponzetto 1, and Laura Dietz 1 1 Data and Web Science Group, University of Mannheim, Germany firstname@informatik.uni-mannheim.de
More informationPredicting answer types for question-answering
Predicting answer types for question-answering Ivan Bogatyy Google Inc. 1600 Amphitheatre pkwy Mountain View, CA 94043 bogatyy@google.com Abstract We tackle the problem of answer type prediction, applying
More informatione e Conceptual design begins with the collection of requirements and results needed from the database (ER Diag.)
Instructor: Jinze Liu Fall 2008 Phases of Database Design Data Requirements e e Conceptual design begins with the collection of requirements and results needed from the database (ER Diag.) Conceptual Design
More informationDatabases. Jörg Endrullis. VU University Amsterdam
Databases Jörg Endrullis VU University Amsterdam The Relational Model Overview 1. Relational Model Concepts: Schema, State 2. Null Values 3. Constraints: General Remarks 4. Key Constraints 5. Foreign Key
More informationCTL.SC4x Technology and Systems
in Supply Chain Management CTL.SC4x Technology and Systems Key Concepts Document This document contains the Key Concepts for the SC4x course, Weeks 1 and 2. These are meant to complement, not replace,
More informationCS674 Natural Language Processing
CS674 Natural Language Processing A question What was the name of the enchanter played by John Cleese in the movie Monty Python and the Holy Grail? Question Answering Eric Breck Cornell University Slides
More informationRecursive Deep Models for Semantic Compositionality Over a Sentiment Treebank text
Philosophische Fakultät Seminar für Sprachwissenschaft Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank text 06 July 2017, Patricia Fischer & Neele Witte Overview Sentiment
More informationWeb Information Retrieval using WordNet
Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT
More informationBirkbeck. (University of London) BSc/FD EXAMINATION. Department of Computer Science and Information Systems. Database Management (COIY028H6)
Birkbeck (University of London) BSc/FD EXAMINATION Department of Computer Science and Information Systems Database Management (COIY028H6) CREDIT VALUE: 15 credits Date of examination: 9 June 2016 Duration
More informationA Hybrid Neural Model for Type Classification of Entity Mentions
A Hybrid Neural Model for Type Classification of Entity Mentions Motivation Types group entities to categories Entity types are important for various NLP tasks Our task: predict an entity mention s type
More informationPresented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu
Presented by: Dimitri Galmanovich Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu 1 When looking for Unstructured data 2 Millions of such queries every day
More informationKnowledge Graphs: In Theory and Practice
Knowledge Graphs: In Theory and Practice Sumit Bhatia 1 and Nitish Aggarwal 2 1 IBM Research, New Delhi, India 2 IBM Watson, San Jose, CA sumitbhatia@in.ibm.com, nitish.aggarwal@ibm.com November 10, 2017
More informationDesign Process Modeling Constraints E-R Diagram Design Issues Weak Entity Sets Extended E-R Features Design of the Bank Database Reduction to
Design Process Modeling Constraints E-R Diagram Design Issues Weak Entity Sets Extended E-R Features Design of the Bank Database Reduction to Relation Schemas Database Design UML A database can be modeled
More informationIntroduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.
Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How
More informationAAAI 2018 Tutorial Building Knowledge Graphs. Craig Knoblock University of Southern California
AAAI 2018 Tutorial Building Knowledge Graphs Craig Knoblock University of Southern California Wrappers for Web Data Extraction Extracting Data from Semistructured Sources NAME Casablanca Restaurant STREET
More informationBing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer
Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures Springer Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web
More informationWEB DATA EXTRACTION METHOD BASED ON FEATURED TERNARY TREE
WEB DATA EXTRACTION METHOD BASED ON FEATURED TERNARY TREE *Vidya.V.L, **Aarathy Gandhi *PG Scholar, Department of Computer Science, Mohandas College of Engineering and Technology, Anad **Assistant Professor,
More informationThe Basic (Flat) Relational Model. Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley
The Basic (Flat) Relational Model Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Outline The Relational Data Model and Relational Database Constraints Relational
More informationSearching the Deep Web
Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index
More informationNews Filtering and Summarization System Architecture for Recognition and Summarization of News Pages
Bonfring International Journal of Data Mining, Vol. 7, No. 2, May 2017 11 News Filtering and Summarization System Architecture for Recognition and Summarization of News Pages Bamber and Micah Jason Abstract---
More informationSome Interesting Applications of Theory. PageRank Minhashing Locality-Sensitive Hashing
Some Interesting Applications of Theory PageRank Minhashing Locality-Sensitive Hashing 1 PageRank The thing that makes Google work. Intuition: solve the recursive equation: a page is important if important
More informationCreating a resume with Microsoft Word
Resume for Word, Page 1 Creating a resume with Microsoft Word, 1995 In this lesson you will learn some basic word processing skills using Microsoft Word Loading Word Click the Start Button on the bottom
More informationMOBILE NETVIEW 3.0 REPORT GUIDE 2013
MOBILE NETVIEW 3.0 REPORT GUIDE 2013 MOBILE NETVIEW 3.0 Mobile NetView 3.0 provides measurement of both internet websites and apps accessed on ios and Android devices collected by the On Device Meter (ODM).
More informationQuery Processing & Optimization
Query Processing & Optimization 1 Roadmap of This Lecture Overview of query processing Measures of Query Cost Selection Operation Sorting Join Operation Other Operations Evaluation of Expressions Introduction
More informationThe Entity-Relationship (ER) Model
The Entity-Relationship (ER) Model Week 1-2 Professor Jessica Lin The E-R Model 2 The E-R Model The Entity-Relationship Model The E-R (entity-relationship) data model views the real world as a set of basic
More informationA Keyword-based Structured Query Language
Expressive and Flexible Access to Web-Extracted Data : A Keyword-based Structured Query Language Department of Computer Science and Engineering Indian Institute of Technology Delhi 22th September 2011
More informationDOT-K: Distributed Online Top-K Elements Algorithm with Extreme Value Statistics
DOT-K: Distributed Online Top-K Elements Algorithm with Extreme Value Statistics Nick Carey, Tamás Budavári, Yanif Ahmad, Alexander Szalay Johns Hopkins University Department of Computer Science ncarey4@jhu.edu
More informationCollaborative Framework for Testing Web Application Vulnerabilities Using STOWS
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,
More informationSEO for Local Pet Businesses. John Curtis SEO Expert
SEO for Local Pet Businesses John Curtis SEO Expert SEO 101 What small business owners need to know about SEO 1. Defining SEO 2. Brand vs Non-brand queries 3. SEO Myths and Misconceptions 4. Local SEO
More informationImproving Difficult Queries by Leveraging Clusters in Term Graph
Improving Difficult Queries by Leveraging Clusters in Term Graph Rajul Anand and Alexander Kotov Department of Computer Science, Wayne State University, Detroit MI 48226, USA {rajulanand,kotov}@wayne.edu
More informationQuery Evaluation in Wireless Sensor Networks
Query Evaluation in Wireless Sensor Networks Project Presentation for Comp 8790 Student: Yongxuan Fu Supervised by: Prof. Weifa Liang Presented on: 07/11/13 Outline Background Preliminary Algorithm Design
More informationOpen-domain Factoid Question Answering via Knowledge Graph Search
Open-domain Factoid Question Answering via Knowledge Graph Search Ahmad Aghaebrahimian, Filip Jurčíček Charles University in Prague, Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics
More informationDatabase Group Research Overview. Immanuel Trummer
Database Group Research Overview Immanuel Trummer Talk Overview User Query Data Analysis Result Processing Talk Overview Fact Checking Query User Data Vocalization Data Analysis Result Processing Query
More informationReal-time population of Knowledge Bases: Opportunities and Challenges. Ndapa Nakashole Gerhard Weikum
Real-time population of Knowledge Bases: Opportunities and Challenges Ndapa Nakashole Gerhard Weikum AKBC Workshop at NAACL 2012 Real-time Data Sources In news and social media, the implicit query is:
More informationPerformance Optimization for Informatica Data Services ( Hotfix 3)
Performance Optimization for Informatica Data Services (9.5.0-9.6.1 Hotfix 3) 1993-2015 Informatica Corporation. No part of this document may be reproduced or transmitted in any form, by any means (electronic,
More informationGenerating EML from a Relational Database Management System (RDBMS)
Page 13 of 34 Generating EML from a Relational Database Management System (RDBMS) - Zhiqiang Yang and Don Henshaw (AND) The Ecological Metadata Language (EML) is an open metadata specification and provides
More informationEmpirical Evaluation of RNN Architectures on Sentence Classification Task
Empirical Evaluation of RNN Architectures on Sentence Classification Task Lei Shen, Junlin Zhang Chanjet Information Technology lorashen@126.com, zhangjlh@chanjet.com Abstract. Recurrent Neural Networks
More informationCS377: Database Systems Text data and information. Li Xiong Department of Mathematics and Computer Science Emory University
CS377: Database Systems Text data and information retrieval Li Xiong Department of Mathematics and Computer Science Emory University Outline Information Retrieval (IR) Concepts Text Preprocessing Inverted
More informationEnd-User Evaluations of Semantic Web Technologies
End-User Evaluations of Semantic Web Technologies Rob McCool 1, Andrew J. Cowell 2 and David A. Thurman 3 1 Knowledge Systems Lab, Stanford University robm@ksl.stanford.edu 2 Rich Interaction Environments,
More informationAnnouncements. JSon Data Structures. JSon Syntax. JSon Semantics: a Tree! JSon Primitive Datatypes. Introduction to Database Systems CSE 414
Introduction to Database Systems CSE 414 Lecture 13: Json and SQL++ Announcements HW5 + WQ5 will be out tomorrow Both due in 1 week Midterm in class on Friday, 5/4 Covers everything (HW, WQ, lectures,
More informationOntology-based Service Discovery Front-end Interface for GloServ
Ontology-based Service Discovery Front-end Interface for GloServ Knarig Arabshian, Christian Dickmann and Henning Schulzrinne Department of Computer Science Columbia University, New York NY 10027, USA
More informationCombining Text Embedding and Knowledge Graph Embedding Techniques for Academic Search Engines
Combining Text Embedding and Knowledge Graph Embedding Techniques for Academic Search Engines SemDeep-4, Oct. 2018 Gengchen Mai Krzysztof Janowicz Bo Yan STKO Lab, University of California, Santa Barbara
More informationKeywords Data alignment, Data annotation, Web database, Search Result Record
Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web
More informationUnstructured Data. CS102 Winter 2019
Winter 2019 Big Data Tools and Techniques Basic Data Manipulation and Analysis Performing well-defined computations or asking well-defined questions ( queries ) Data Mining Looking for patterns in data
More informationCOUNT Function. The COUNT function returns the number of rows in a query.
Created by Ahsan Arif COUNT Function The COUNT function returns the number of rows in a query. The syntax for the COUNT function is: SELECT COUNT(expression) FROM tables WHERE predicates; Note: The COUNT
More informationIntroduction to Relational Databases. Introduction to Relational Databases cont: Introduction to Relational Databases cont: Relational Data structure
Databases databases Terminology of relational model Properties of database relations. Relational Keys. Meaning of entity integrity and referential integrity. Purpose and advantages of views. The relational
More informationWikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population
Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population Heather Simpson 1, Stephanie Strassel 1, Robert Parker 1, Paul McNamee
More informationBig Data Management and NoSQL Databases
NDBI040 Big Data Management and NoSQL Databases Lecture 10. Graph databases Doc. RNDr. Irena Holubova, Ph.D. holubova@ksi.mff.cuni.cz http://www.ksi.mff.cuni.cz/~holubova/ndbi040/ Graph Databases Basic
More informationCombining Neural Networks and Log-linear Models to Improve Relation Extraction
Combining Neural Networks and Log-linear Models to Improve Relation Extraction Thien Huu Nguyen and Ralph Grishman Computer Science Department, New York University {thien,grishman}@cs.nyu.edu Outline Relation
More informationOutline. Part I. Introduction Part II. ML for DI. Part III. DI for ML Part IV. Conclusions and research direction
Outline Part I. Introduction Part II. ML for DI ML for entity linkage ML for data extraction ML for data fusion ML for schema alignment Part III. DI for ML Part IV. Conclusions and research direction Data
More informationIntelligent Transportation Traffic Data Management
Intelligent Transportation Traffic Data Management Ugur Demiryurek Asscociate Director, IMSC Viterbi School of Engineering University of Southern California Los Angeles, CA 900890781 demiryur@usc.edu 1
More informationChameleon Metadata s Data Science Basics Tutorial Series. DSB-2: Information Gain (IG) By Eric Thornton,
Chameleon Metadata s Data Science Basics Tutorial Series Data Science Basics Syllabus for DSB-2 (DSB-2-Infographic-1) Download PDF version here: DSB-2-Information-Gain-V10.pdf DSB-2: Information Gain (IG)
More informationSearch Engines. Charles Severance
Search Engines Charles Severance Google Architecture Web Crawling Index Building Searching http://infolab.stanford.edu/~backrub/google.html Google Search Google I/O '08 Keynote by Marissa Mayer Usablity
More informationModeling System Calls for Intrusion Detection with Dynamic Window Sizes
Modeling System Calls for Intrusion Detection with Dynamic Window Sizes Eleazar Eskin Computer Science Department Columbia University 5 West 2th Street, New York, NY 27 eeskin@cs.columbia.edu Salvatore
More informationSentence selection with neural networks over string kernels
Sentence selection with neural networks over string kernels Mihai Dan Mașala, Ștefan Rușeți, Traian Rebedea KES 2017 University POLITEHNICA of Bucharest Introduction Sentence selection: given a question,
More informationANNUAL REPORT Visit us at project.eu Supported by. Mission
Mission ANNUAL REPORT 2011 The Web has proved to be an unprecedented success for facilitating the publication, use and exchange of information, at planetary scale, on virtually every topic, and representing
More informationCoreference Resolution with Knowledge. Haoruo Peng March 20, 2015
Coreference Resolution with Knowledge Haoruo Peng March 20, 2015 1 Coreference Problem 2 Coreference Problem 3 Online Demo Please check outhttp://cogcomp.cs.illinois.edu/page/demo_view/coref 4 General
More information