KNOWLEDGE GRAPH CONSTRUCTION

Size: px
Start display at page:

Download "KNOWLEDGE GRAPH CONSTRUCTION"

Transcription

1 KNOWLEDGE GRAPH CONSTRUCTION Jay Pujara Karlsruhe Institute of Technology 7/7/2015

2 Can Computers Create Knowledge? Internet Massive source of publicly available information Knowledge

3 Computers + Knowledge =

4 Motivating Problem: New Opportunities Internet Extraction Knowledge Graph (KG) Massive source of publicly available information Cutting-edge IE methods Structured representation of entities, their labels and the relationships between them

5 Motivating Problem: Real Challenges Internet Extraction Knowledge Graph Difficult! Noisy! Contains many errors and inconsistencies

6 Knowledge Graphs in the wild

7 Overview Problem: Build a Knowledge Graph from millions of noisy extractions Approach: Knowledge Graph Identification reasons jointly over all facts in the knowledge graph Method: Use probabilistic soft logic to easily specify models and efficiently optimize them Results: State-of-the-art performance on real-world datasets producing knowledge graphs with millions of facts

8 NELL: The Never-Ending Language Learner Large-scale IE project (Carlson et al., 2010) Lifelong learning: aims to read the web Ontology of known labels and relations Knowledge base contains millions of facts

9 Examples of NELL errors

10 Entity co-reference errors Kyrgyzstan has many variants: Kyrgystan Kyrgistan Kyrghyzstan Kyrgzstan Kyrgyz Republic

11 Missing and spurious labels Kyrgyzstan is labeled a bird and a country

12 Missing and spurious relations Kyrgyzstan s location is ambiguous Kazakhstan, Russia and US are included in possible locations

13 Violations of ontological knowledge Equivalence of co-referent entities (sameas) SameAs(Kyrgyzstan, Kyrgyz Republic) Mutual exclusion (disjointwith) of labels MUT(bird, country) Selectional preferences (domain/range) of relations RNG(countryLocation, continent) Enforcing these constraints require jointly considering multiple extractions

14 KNOWLEDGE GRAPH IDENTIFICATION

15 Motivating Problem (revised) (noisy) Extraction Graph Knowledge Graph Internet Large-scale IE Joint Reasoning =

16 Knowledge Graph Identification Problem: Knowledge Graph Extraction Graph Performs graph identification: entity resolution collective classification link prediction Knowledge Graph Identification Solution: Knowledge Graph Identification (KGI) Enforces ontological constraints Incorporates multiple uncertain sources =

17 Illustration of KGI: Extractions Uncertain Extractions:.5: Lbl(Kyrgyzstan, bird).7: Lbl(Kyrgyzstan, country).9: Lbl(Kyrgyz Republic, country).8: Rel(Kyrgyz Republic, Bishkek, hascapital)

18 Illustration of KGI: Extraction Graph Uncertain Extractions:.5: Lbl(Kyrgyzstan, bird).7: Lbl(Kyrgyzstan, country).9: Lbl(Kyrgyz Republic, country).8: Rel(Kyrgyz Republic, Bishkek, hascapital) Extraction Graph Kyrgyzstan Lbl bird country Kyrgyz Republic Rel(hasCapital) Bishkek

19 Illustration of KGI: Ontology + ER Uncertain Extractions:.5: Lbl(Kyrgyzstan, bird).7: Lbl(Kyrgyzstan, country).9: Lbl(Kyrgyz Republic, country).8: Rel(Kyrgyz Republic, Bishkek, hascapital) Ontology: Dom(hasCapital, country) Mut(country, bird) Entity Resolution: SameEnt(Kyrgyz Republic, Kyrgyzstan) (Annotated) Extraction Graph SameEnt Kyrgyzstan Lbl country bird Kyrgyz Republic Dom Rel(hasCapital) Bishkek

20 Illustration of KGI Uncertain Extractions:.5: Lbl(Kyrgyzstan, bird).7: Lbl(Kyrgyzstan, country).9: Lbl(Kyrgyz Republic, country).8: Rel(Kyrgyz Republic, Bishkek, hascapital) Ontology: Dom(hasCapital, country) Mut(country, bird) Entity Resolution: SameEnt(Kyrgyz Republic, Kyrgyzstan) (Annotated) Extraction Graph SameEnt Kyrgyzstan Lbl country bird Kyrgyz Republic Dom Rel(hasCapital) Bishkek After Knowledge Graph Identification Kyrgyzstan Lbl country Kyrgyz Republic Rel(hasCapital) Bishkek

21 (Pujara et al., ISWC13) Viewing KGI as a probabilistic graphical model Lbl(Kyrgyzstan, bird) Rel(hasCapital, Kyrgyzstan, Bishkek) Lbl(Kyrgyzstan, country) Lbl(Kyrgyz Republic, country) Lbl(Kyrgyz Republic, bird) Rel(hasCapital, Kyrgyz Republic, Bishkek)

22 (Pujara et al., ISWC13) Background: Probabilistic Soft Logic (PSL) (Broecheler et al., UAI10; Kimming et al., NIPS-ProbProg12) Templating language for hinge-loss MRFs, very scalable! Model specified as a collection of logical formulas SameEnt(E 1,E 2 ) ^ Lbl(E 1,L) ) Lbl(E 2,L) Uses soft-logic formulation Truth values of atoms relaxed to [0,1] interval Truth values of formulas derived from Lukasiewicz t-norm

23 (Pujara et al., ISWC13) Background: PSL Rules to Distributions Rules are grounded by substituting literals into formulas w EL : SameEnt(Kyrgyzstan, Kyrygyz Republic) ^ Lbl(Kyrgyzstan, country) ) Lbl(Kyrygyz Republic, country) Each ground rule has a weighted distance to satisfaction derived from the formula s truth value P(G E) = 1 Z exp $ % r R w r ϕ (G)& r ' The PSL program can be interpreted as a joint probability distribution over all variables in knowledge graph, conditioned on the extractions

24 (Pujara et al., ISWC13) Background: Finding the best knowledge graph MPE inference solves max G P(G) to find the best KG In PSL, inference solved by convex optimization Efficient: running time empirically scales with O( R ) (Bach et al., NIPS12)

25 (Pujara et al., ISWC13) PSL Rules for KGI Model

26 (Pujara et al., ISWC13) PSL Rules: Uncertain Extractions Weight for source T (relations) Predicate representing uncertain relation extraction from extractor T Relation in Knowledge Graph w CR-T : CandRel T (E 1,E 2,R) w CL-T : CandLbl T (E,L) ) Rel(E 1,E 2,R) ) Lbl(E,L) Weight for source T (labels) Predicate representing uncertain label extraction from extractor T Label in Knowledge Graph

27 (Pujara et al., ISWC13) PSL Rules: Entity Resolution SameEnt predicate captures confidence that entities are co-referent Rules require co-referent entities to have the same labels and relations Creates an equivalence class of co-referent entities

28 (Pujara et al., ISWC13) PSL Rules: Ontology Inverse: w O : Inv(R, S) ^ Rel(E 1,E 2,R) ) Rel(E 2,E 1,S) Selectional Preference: w O : Dom(R, L) ^ Rel(E 1,E 2,R) ) Lbl(E 1,L) w O : Rng(R, L) ^ Rel(E 1,E 2,R) ) Lbl(E 2,L) Subsumption: w O : Sub(L, P ) ^ Lbl(E,L) ) Lbl(E,P) w O : RSub(R, S) ^ Rel(E 1,E 2,R) ) Rel(E 1,E 2,S) Mutual Exclusion: w O : Mut(L 1,L 2 ) ^ Lbl(E,L 1 ) ) Lbl(E,L 2 ) w O : RMut(R, S) ^ Rel(E 1,E 2,R) ) Rel(E 1,E 2,S) Adapted from Jiang et al., ICDM 2012

29 [ 1] CandLbl struct (Kyrgyzstan, bird) ) Lbl(Kyrgyzstan, bird) φ 1 φ φ [ 2] CandRel pat (Kyrgyz Rep., Asia, locatedin) ) Rel(Kyrgyz Rep., Asia, locatedin) Lbl(Kyrgyzstan, bird) Lbl(Kyrgyz Rep., bird) [ 3] SameEnt(Kyrgyz Rep., Kyrgyzstan) ^ Lbl(Kyrgyz Rep., country) ) Lbl(Kyrgyzstan, country) φ 5 φ [ 4] Dom(locatedIn, country) ^ Rel(Kyrgyz Rep., Asia, locatedin) ) Lbl(Kyrgyz Rep., country) Lbl(Kyrgyzstan, country) Lbl(Kyrgyz Rep., country) [ 5] Mut(country, bird) ^ Lbl(Kyrgyzstan, country) ) Lbl(Kyrgyzstan, bird) φ φ 2 φ 3 φ 4 Rel(Kyrgyz Rep., Asia, locatedin) φ

30 Probability Distribution over KGs P(G E) = 1 Z exp $ % r R w r ϕ (G)& r ' CandLbl T (kyrgyzstan, bird) ) Lbl(kyrgyzstan, bird) Mut(bird, country) ^ Lbl(kyrgyzstan, bird) ) Lbl(kyrgyzstan, country) SameEnt(kyrgz republic, kyrgyzstan) ^ Lbl(kyrgz republic, country) ) Lbl(kyrgyzstan, country)

31 EVALUATION

32 (Pujara et al., ISWC13) Two Evaluation Datasets Description LinkedBrainz Community-supplied data about musical artists, labels, and creative works NELL Real-world IE system extracting general facts from the WWW Noise Realistic synthetic noise Imperfect extractors and ambiguous web pages Candidate Facts Unique Labels and Relations Ontological Constraints 810K 1.3M K

33 (Pujara et al., ISWC13) LinkedBrainz Open source communitydriven structured database of music metadata Uses proprietary schema to represent data Built on popular ontologies such as FOAF and FRBR Widely used for music data (e.g. BBC Music Site) LinkedBrainz project provides an RDF mapping from MusicBrainz data to Music Ontology using the D2RQ tool

34 (Pujara et al., ISWC13) LinkedBrainz dataset for KGI mo:release mo:record mo:record mo:track mo:track mo:published_as mo:signal mo:label mo:label foaf:maker mo:musicalartist inverseof subclassof subclassof foaf:made mo:solomusicartist mo:musicgroup Mapping to FRBR/FOAF ontology DOM RNG INV SUB RSUB MUT rdfs:domain rdfs:range owl:inverseof rdfs:subclassof rdfs:subpropertyof owl:disjointwith

35 (Pujara et al., ISWC13) LinkedBrainz experiments Comparisons: Baseline PSL-EROnly PSL-OntOnly PSL-KGI model Use noisy truth values as fact scores Only apply rules for Entity Resolution Only apply rules for Ontological reasoning Apply Knowledge Graph Identification AUC Precision Recall F1 at.5 Max F1 Baseline PSL-EROnly PSL-OntOnly PSL-KGI

36 (Pujara et al., ISWC13) NELL Evaluation: two settings Target Set: restrict to a subset of KG (Jiang, ICDM12) Complete: Infer full knowledge graph?? Closed-world model Uses a target set: subset of KG Derived from 2-hop neighborhood Excludes trivially satisfied variables Open-world model All possible entities, relations, labels Inference assigns truth value to each variable

37 (Pujara et al., ISWC13) NELL experiments: Target Set Task: Compute truth values of a target set derived from the evaluation data Comparisons: Baseline Average confidences of extractors for each fact in the NELL candidates NELL Evaluate NELL s promotions (on the full knowledge graph) MLN Method of (Jiang, ICDM12) estimates marginal probabilities with MC-SAT PSL-KGI Apply full Knowledge Graph Identification model Running Time: Inference completes in 10 seconds, values for 25K facts AUC F1 Baseline NELL MLN (Jiang, 12) PSL-KGI

38 (Pujara et al., ISWC13) NELL experiments: Complete knowledge graph Task: Compute a full knowledge graph from uncertain extractions Comparisons: NELL NELL s strategy: ensure ontological consistency with existing KB PSL-KGI Apply full Knowledge Graph Identification model Running Time: Inference completes in 130 minutes, producing 4.3M facts AUC Precision Recall F1 NELL PSL-KGI

39 Knowledge Graph Identification. Pujara, Miao, Getoor, Cohen Conclusion Knowledge Graph Identification is a powerful technique for producing knowledge graphs from noisy IE system output Using PSL we are able to enforce global ontological constraints and capture uncertainty in our model Unlike previous work, our approach infers complete knowledge graphs for datasets with millions of extractions Code available on GitHub: Questions?

40 Problem: Incremental Updates to KG? How do we add new extractions to the Knowledge Graph?

41 Naïve Approach: Full KGI over extractions

42 Improving the naïve approach Intuition: Much of previous KG does not change Online collective inference: Selectively update the MAP state Bound the regret of partial updates Efficiently determine which variables to infer

43 Key Idea: fix some variables, infer others

44 Key Collaborators

45 Approximation: KGI over subset of graph

46 Theory: Bounding Inference Regret Regret = kfull inference partial updatek

47 Theory: Bounding Inference Regret Regret = kfull inference partial updatek R n (x, y S ; ẇ), 1 n kh(x; ẇ) h(x, y S; ẇ)k 1

48 Theory: Bounding Inference Regret R n (x, y S ; ẇ), 1 n kh(x; ẇ) h(x, y S; ẇ)k 1 R n (x, y S ; ẇ) apple O s Bkwk 2 n w p ky S ŷ S k 1!

KNOWLEDGE GRAPH IDENTIFICATION

KNOWLEDGE GRAPH IDENTIFICATION KNOWLEDGE GRAPH IDENTIFICATION Jay Pujara 1, Hui Miao 1, Lise Getoor 1, William Cohen 2 1 University of Maryland, College Park, US 2 Carnegie Mellon University International Semantic Web Conference 10/25/2013

More information

KNOWLEDGE GRAPH CONSTRUCTION

KNOWLEDGE GRAPH CONSTRUCTION KNOWLEDGE GRAPH CONSTRUCTION Jay Pujara University of Maryland, College Park Max Planck Institute 7/9/2015 Can Computers Create Knowledge? Internet Massive source of publicly available information Knowledge

More information

Using Semantics & Statistics to Turn Data into Knowledge

Using Semantics & Statistics to Turn Data into Knowledge Using Semantics & Statistics to Turn Data into Knowledge Jay Pujara and Hui Miao Computer Science Dept. University of Maryland College Park, MD 20742 {jay,hui}@cs.umd.edu Lise Getoor Computer Science Dept.

More information

Ontology-Aware Partitioning for Knowledge Graph Identification

Ontology-Aware Partitioning for Knowledge Graph Identification Ontology-Aware Partitioning for Knowledge Graph Identification Jay Pujara, Hui Miao, Lise Getoor Dept of Computer Science University of Maryland College Park, MD 20742 {jay,hui,getoor}@cs.umd.edu ABSTRACT

More information

Knowledge Graph Completion. Mayank Kejriwal (USC/ISI)

Knowledge Graph Completion. Mayank Kejriwal (USC/ISI) Knowledge Graph Completion Mayank Kejriwal (USC/ISI) What is knowledge graph completion? An intelligent way of doing data cleaning Deduplicating entity nodes (entity resolution) Collective reasoning (probabilistic

More information

Agrowing body of research focuses on extracting knowledge

Agrowing body of research focuses on extracting knowledge Using Semantics and Statistics to Turn Data into Knowledge Jay Pujara, Hui Miao, Lise Getoor, William W. Cohen n Many information-extraction and knowledge base construction systems are addressing the challenge

More information

Building Dynamic Knowledge Graphs

Building Dynamic Knowledge Graphs Building Dynamic Knowledge Graphs Jay Pujara Department of Computer Science University of Maryland College Park, MD 20742 jay@cs.umd.edu Lise Getoor Department of Computer Science University of California

More information

foxpsl: A Fast, Optimized and extended PSL Implementation

foxpsl: A Fast, Optimized and extended PSL Implementation International Journal of Approximate Reasoning 00 (2015) 1 12 Journal Logo foxpsl: A Fast, Optimized and extended PSL Implementation Sara Magliacane a, Philip Stutz c, Paul Groth b, Abraham Bernstein c

More information

arxiv: v1 [cs.ai] 4 Jul 2016

arxiv: v1 [cs.ai] 4 Jul 2016 Generic Statistical Relational Entity Resolution in Knowledge Graphs Jay Pujara University of California, Santa Cruz Santa Cruz, CA 95064 jay@cs.umd.edu Lise Getoor University of California, Santa Cruz

More information

Generic Statistical Relational Entity Resolution in Knowledge Graphs

Generic Statistical Relational Entity Resolution in Knowledge Graphs Generic Statistical Relational Entity Resolution in Knowledge Graphs Jay Pujara University of California, Santa Cruz Santa Cruz, CA 95064 jay@cs.umd.edu Lise Getoor University of California, Santa Cruz

More information

Forward Chaining Reasoning Tool for Rya

Forward Chaining Reasoning Tool for Rya Forward Chaining Reasoning Tool for Rya Rya Working Group, 6/29/2016 Forward Chaining Reasoning Tool for Rya 6/29/2016 1 / 11 OWL Reasoning OWL (the Web Ontology Language) facilitates rich ontology definition

More information

Constraint Propagation for Efficient Inference in Markov Logic

Constraint Propagation for Efficient Inference in Markov Logic Constraint Propagation for Efficient Inference in Tivadar Papai 1 Parag Singla 2 Henry Kautz 1 1 University of Rochester, Rochester NY 14627, USA 2 University of Texas, Austin TX 78701, USA September 13,

More information

A Collective, Probabilistic Approach to Schema Mapping

A Collective, Probabilistic Approach to Schema Mapping A Collective, Probabilistic Approach to Schema Mapping Angelika Kimmig KU Leuven angelika.kimmig@cs.kuleuven.be Alex Memory University of Maryland memory@cs.umd.edu Renée J. Miller University of Toronto

More information

Collective Entity Resolution in Relational Data

Collective Entity Resolution in Relational Data Collective Entity Resolution in Relational Data I. Bhattacharya, L. Getoor University of Maryland Presented by: Srikar Pyda, Brett Walenz CS590.01 - Duke University Parts of this presentation from: http://www.norc.org/pdfs/may%202011%20personal%20validation%20and%20entity%20resolution%20conference/getoorcollectiveentityresolution

More information

Probabilistic Visitor Stitching on Cross-Device Web Logs

Probabilistic Visitor Stitching on Cross-Device Web Logs Probabilistic Visitor Stitching on Cross-Device Web Logs Sungchul Kim Adobe Research San Jose, CA 95110 sukim@adobe.com Eunyee Koh Adobe Research San Jose, CA 95110 eunyee@adobe.com Nikhil Kini UC Santa

More information

Network Lasso: Clustering and Optimization in Large Graphs

Network Lasso: Clustering and Optimization in Large Graphs Network Lasso: Clustering and Optimization in Large Graphs David Hallac, Jure Leskovec, Stephen Boyd Stanford University September 28, 2015 Convex optimization Convex optimization is everywhere Introduction

More information

Matthew Richardson Microsoft Research One Microsoft Way Redmond, WA

Matthew Richardson Microsoft Research One Microsoft Way Redmond, WA Speeding up Inference in Statistical Relational Learning by Clustering Similar Query Literals Lilyana Mihalkova 1 Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 lilyanam@cs.utexas.edu

More information

Structured Learning. Jun Zhu

Structured Learning. Jun Zhu Structured Learning Jun Zhu Supervised learning Given a set of I.I.D. training samples Learn a prediction function b r a c e Supervised learning (cont d) Many different choices Logistic Regression Maximum

More information

Separating Objects and Clutter in Indoor Scenes

Separating Objects and Clutter in Indoor Scenes Separating Objects and Clutter in Indoor Scenes Salman H. Khan School of Computer Science & Software Engineering, The University of Western Australia Co-authors: Xuming He, Mohammed Bennamoun, Ferdous

More information

Leveraging Data and Structure in Ontology Integration

Leveraging Data and Structure in Ontology Integration Leveraging Data and Structure in Ontology Integration O. Udrea L. Getoor R.J. Miller Group 15 Enrico Savioli Andrea Reale Andrea Sorbini DEIS University of Bologna Searching Information in Large Spaces

More information

Semantic Web. Tahani Aljehani

Semantic Web. Tahani Aljehani Semantic Web Tahani Aljehani Motivation: Example 1 You are interested in SOAP Web architecture Use your favorite search engine to find the articles about SOAP Keywords-based search You'll get lots of information,

More information

arxiv: v1 [cs.si] 2 Jul 2016

arxiv: v1 [cs.si] 2 Jul 2016 Adaptive Neighborhood Graph Construction for Inference in Multi-Relational Networks Shobeir Fakhraei,2 Dhanya Sridhar 2 Jay Pujara 2 Lise Getoor 2 shobeir@cs.umd.edu dsridhar@ucsc.edu jay@cs.umd.edu getoor@soe.ucsc.edu

More information

Machine Learning. Support Vector Machines. Fabio Vandin November 20, 2017

Machine Learning. Support Vector Machines. Fabio Vandin November 20, 2017 Machine Learning Support Vector Machines Fabio Vandin November 20, 2017 1 Classification and Margin Consider a classification problem with two classes: instance set X = R d label set Y = { 1, 1}. Training

More information

Logic and Reasoning in the Semantic Web (part I RDF/RDFS)

Logic and Reasoning in the Semantic Web (part I RDF/RDFS) Logic and Reasoning in the Semantic Web (part I RDF/RDFS) Fulvio Corno, Laura Farinetti Politecnico di Torino Dipartimento di Automatica e Informatica e-lite Research Group http://elite.polito.it Outline

More information

Formalising the Semantic Web. (These slides have been written by Axel Polleres, WU Vienna)

Formalising the Semantic Web. (These slides have been written by Axel Polleres, WU Vienna) Formalising the Semantic Web (These slides have been written by Axel Polleres, WU Vienna) The Semantics of RDF graphs Consider the following RDF data (written in Turtle): @prefix rdfs: .

More information

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Jianyong Wang Department of Computer Science and Technology Tsinghua University Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity

More information

Naïve Bayes for text classification

Naïve Bayes for text classification Road Map Basic concepts Decision tree induction Evaluation of classifiers Rule induction Classification using association rules Naïve Bayesian classification Naïve Bayes for text classification Support

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

GenTax: A Generic Methodology for Deriving OWL and RDF-S Ontologies from Hierarchical Classifications, Thesauri, and Inconsistent Taxonomies

GenTax: A Generic Methodology for Deriving OWL and RDF-S Ontologies from Hierarchical Classifications, Thesauri, and Inconsistent Taxonomies Leopold Franzens Universität Innsbruck GenTax: A Generic Methodology for Deriving OWL and RDF-S Ontologies from Hierarchical Classifications, Thesauri, and Inconsistent Taxonomies Martin HEPP DERI Innsbruck

More information

Development in Object Detection. Junyuan Lin May 4th

Development in Object Detection. Junyuan Lin May 4th Development in Object Detection Junyuan Lin May 4th Line of Research [1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection, CVPR 2005. HOG Feature template [2] P. Felzenszwalb,

More information

Semantic Annotation, Search and Analysis

Semantic Annotation, Search and Analysis Semantic Annotation, Search and Analysis Borislav Popov, Ontotext Ontology A machine readable conceptual model a common vocabulary for sharing information machine-interpretable definitions of concepts in

More information

OWL 2 Profiles. An Introduction to Lightweight Ontology Languages. Маркус Крёч (Markus Krötzsch) University of Oxford. KESW Summer School 2012

OWL 2 Profiles. An Introduction to Lightweight Ontology Languages. Маркус Крёч (Markus Krötzsch) University of Oxford. KESW Summer School 2012 University of Oxford Department of Computer Science OWL 2 Profiles An Introduction to Lightweight Ontology Languages Маркус Крёч (Markus Krötzsch) University of Oxford KESW Summer School 2012 Remark for

More information

Markov Logic: Representation

Markov Logic: Representation Markov Logic: Representation Overview Statistical relational learning Markov logic Basic inference Basic learning Statistical Relational Learning Goals: Combine (subsets of) logic and probability into

More information

Structuring Linked Data Search Results Using Probabilistic Soft Logic

Structuring Linked Data Search Results Using Probabilistic Soft Logic Structuring Linked Data Search Results Using Probabilistic Soft Logic Duhai Alshukaili, Alvaro A.A. Fernandes, and Norman W. Paton School of Computer Science, University of Manchester Oxford Road, Manchester

More information

Presented By Aditya R Joshi Neha Purohit

Presented By Aditya R Joshi Neha Purohit Presented By Aditya R Joshi Neha Purohit Pellet What is Pellet? Pellet is an OWL- DL reasoner Supports nearly all of OWL 1 and OWL 2 Sound and complete reasoner Written in Java and available from http://

More information

A Collective, Probabilistic Approach to Schema Mapping

A Collective, Probabilistic Approach to Schema Mapping A Collective, Probabilistic Approach to Schema Mapping Angelika Kimmig, Alex Memory, Renée Miller, Lise Getoor ILP 2017 (published at ICDE 2017) 1 Context: Data Exchange & source emp id company 1 Alice

More information

ACM MM Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang

ACM MM Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang ACM MM 2010 Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang Harbin Institute of Technology National University of Singapore Microsoft Corporation Proliferation of images and videos on the Internet

More information

Lessons Learned from a Greenhorn Ontologist

Lessons Learned from a Greenhorn Ontologist Lessons Learned from a Greenhorn Ontologist Image credit: https://emmatrinidad.wordpress.com/2014/04/24/yoko-ono/ Steven Folsom, Metadata Strategist and Standards Advocate, Cornell University Library Beyond

More information

Semantic Web. MPRI : Web Data Management. Antoine Amarilli Friday, January 11th 1/29

Semantic Web. MPRI : Web Data Management. Antoine Amarilli Friday, January 11th 1/29 Semantic Web MPRI 2.26.2: Web Data Management Antoine Amarilli Friday, January 11th 1/29 Motivation Information on the Web is not structured 2/29 Motivation Information on the Web is not structured This

More information

Machine Learning for Annotating Semantic Web Services

Machine Learning for Annotating Semantic Web Services Machine Learning for Annotating Semantic Web Services Machine Learning for Annotating Semantic Web Services Andreas Heß, Nicholas Kushmerick University College Dublin, Ireland {andreas.hess, nick}@ucd.ie

More information

A Scalable Approach to Incrementally Building Knowledge Graphs

A Scalable Approach to Incrementally Building Knowledge Graphs A Scalable Approach to Incrementally Building Knowledge Graphs Gleb Gawriljuk 1, Andreas Harth 1, Craig A. Knoblock 2, and Pedro Szekely 2 1 Institute of Applied Informatics and Formal Description Methods

More information

Ontological Modeling: Part 13

Ontological Modeling: Part 13 Ontological Modeling: Part 13 Terry Halpin INTI International University This is the thirteenth in a series of articles on ontology-based approaches to modeling. The main focus is on popular ontology languages

More information

CSE 573: Artificial Intelligence Autumn 2010

CSE 573: Artificial Intelligence Autumn 2010 CSE 573: Artificial Intelligence Autumn 2010 Lecture 16: Machine Learning Topics 12/7/2010 Luke Zettlemoyer Most slides over the course adapted from Dan Klein. 1 Announcements Syllabus revised Machine

More information

Unit 2 RDF Formal Semantics in Detail

Unit 2 RDF Formal Semantics in Detail Unit 2 RDF Formal Semantics in Detail Axel Polleres Siemens AG Österreich VU 184.729 Semantic Web Technologies A. Polleres VU 184.729 1/41 Where are we? Last time we learnt: Basic ideas about RDF and how

More information

Interacting with Linked Data Part I: General Introduction

Interacting with Linked Data Part I: General Introduction Interacting with Linked Data Part I: General Introduction Agenda Part 0: Welcome Part I: General Introduction to Semantic Technologies Part II: Advanced Concepts Part III: OWLIM Part IV: Information Workbench-

More information

Towards Incremental Grounding in Tuffy

Towards Incremental Grounding in Tuffy Towards Incremental Grounding in Tuffy Wentao Wu, Junming Sui, Ye Liu University of Wisconsin-Madison ABSTRACT Markov Logic Networks (MLN) have become a powerful framework in logical and statistical modeling.

More information

Week 4. COMP62342 Sean Bechhofer, Uli Sattler

Week 4. COMP62342 Sean Bechhofer, Uli Sattler Week 4 COMP62342 Sean Bechhofer, Uli Sattler sean.bechhofer@manchester.ac.uk, uli.sattler@manchester.ac.uk Today Some clarifications from last week s coursework More on reasoning: extension of the tableau

More information

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213) Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding

More information

Usage of Linked Data Introduction and Application Scenarios. Presented by: Barry Norton

Usage of Linked Data Introduction and Application Scenarios. Presented by: Barry Norton Usage of Linked Data Introduction and Application Scenarios Presented by: Barry Norton Agenda 1. Motivation Scenario 2. Linked Data Foundations 3. Introduction to Linked Data 4. Linked Data use case scenarios

More information

Hierarchical Image-Region Labeling via Structured Learning

Hierarchical Image-Region Labeling via Structured Learning Hierarchical Image-Region Labeling via Structured Learning Julian McAuley, Teo de Campos, Gabriela Csurka, Florent Perronin XRCE September 14, 2009 McAuley et al (XRCE) Hierarchical Image-Region Labeling

More information

Local higher-order graph clustering

Local higher-order graph clustering Local higher-order graph clustering Hao Yin Stanford University yinh@stanford.edu Austin R. Benson Cornell University arb@cornell.edu Jure Leskovec Stanford University jure@cs.stanford.edu David F. Gleich

More information

An Algebra of Lightweight Ontologies: Implementation and Applications

An Algebra of Lightweight Ontologies: Implementation and Applications An Algebra of Lightweight Ontologies: Implementation and Applications Marco A. Casanova Departamento de Informática, PUC-Rio NII Shonan Meeting: Implicit and explicit semantics integration in proof based

More information

The Semantic Web. Mansooreh Jalalyazdi

The Semantic Web. Mansooreh Jalalyazdi 1 هو العليم 2 The Semantic Web Mansooreh Jalalyazdi 3 Content Syntactic web XML Add semantics Representation Language RDF, RDFS OWL Query languages 4 History of the Semantic Web Tim Berners-Lee vision

More information

A Deep Learning Framework for Authorship Classification of Paintings

A Deep Learning Framework for Authorship Classification of Paintings A Deep Learning Framework for Authorship Classification of Paintings Kai-Lung Hua ( 花凱龍 ) Dept. of Computer Science and Information Engineering National Taiwan University of Science and Technology Taipei,

More information

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009 Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context

More information

Semantic MediaWiki A Tool for Collaborative Vocabulary Development Harold Solbrig Division of Biomedical Informatics Mayo Clinic

Semantic MediaWiki A Tool for Collaborative Vocabulary Development Harold Solbrig Division of Biomedical Informatics Mayo Clinic Semantic MediaWiki A Tool for Collaborative Vocabulary Development Harold Solbrig Division of Biomedical Informatics Mayo Clinic Outline MediaWiki what it is, how it works Semantic MediaWiki MediaWiki

More information

Link Mining & Entity Resolution. Lise Getoor University of Maryland, College Park

Link Mining & Entity Resolution. Lise Getoor University of Maryland, College Park Link Mining & Entity Resolution Lise Getoor University of Maryland, College Park Learning in Structured Domains Traditional machine learning and data mining approaches assume: A random sample of homogeneous

More information

Ontological Modeling: Part 11

Ontological Modeling: Part 11 Ontological Modeling: Part 11 Terry Halpin LogicBlox and INTI International University This is the eleventh in a series of articles on ontology-based approaches to modeling. The main focus is on popular

More information

Syllabus. 1. Visual classification Intro 2. SVM 3. Datasets and evaluation 4. Shallow / Deep architectures

Syllabus. 1. Visual classification Intro 2. SVM 3. Datasets and evaluation 4. Shallow / Deep architectures Syllabus 1. Visual classification Intro 2. SVM 3. Datasets and evaluation 4. Shallow / Deep architectures Image classification How to define a category? Bicycle Paintings with women Portraits Concepts,

More information

Link Prediction in Relational Data

Link Prediction in Relational Data Link Prediction in Relational Data Alexandra Chouldechova STATS 319, March 1, 2011 Motivation for Relational Models Quantifying trends in social interaction Improving document classification Inferring

More information

Joint Steering Committee for Development of RDA. Mapping ISBD and RDA element sets; briefing/ discussion paper

Joint Steering Committee for Development of RDA. Mapping ISBD and RDA element sets; briefing/ discussion paper To: From: Subject: Joint Steering Committee for Development of RDA Alan Danskin, Chair, JSC Mapping ISBD and RDA element sets; briefing/ discussion paper The following paper has been received from Gordon

More information

Complex Prediction Problems

Complex Prediction Problems Problems A novel approach to multiple Structured Output Prediction Max-Planck Institute ECML HLIE08 Information Extraction Extract structured information from unstructured data Typical subtasks Named Entity

More information

Constructing Popular Routes from Uncertain Trajectories

Constructing Popular Routes from Uncertain Trajectories Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei, Yu Zheng, Wen-Chih Peng presented by Slawek Goryczka Scenarios A trajectory is a sequence of data points recording location information

More information

Bryan Smith May 2010

Bryan Smith May 2010 Bryan Smith May 2010 Tool (Onto2SMem) to generate declarative knowledge base in SMem from ontology Sound (if incomplete) inference Proof of concept Baseline implementation Semantic memory (SMem) Store

More information

Learning Tractable Probabilistic Models Pedro Domingos

Learning Tractable Probabilistic Models Pedro Domingos Learning Tractable Probabilistic Models Pedro Domingos Dept. Computer Science & Eng. University of Washington 1 Outline Motivation Probabilistic models Standard tractable models The sum-product theorem

More information

Logic and Databases. Lecture 4 - Part 2. Phokion G. Kolaitis. UC Santa Cruz & IBM Research - Almaden

Logic and Databases. Lecture 4 - Part 2. Phokion G. Kolaitis. UC Santa Cruz & IBM Research - Almaden Logic and Databases Phokion G. Kolaitis UC Santa Cruz & IBM Research - Almaden Lecture 4 - Part 2 2 / 17 Alternative Semantics of Queries Bag Semantics We focused on the containment problem for conjunctive

More information

Feature Extractors. CS 188: Artificial Intelligence Fall Some (Vague) Biology. The Binary Perceptron. Binary Decision Rule.

Feature Extractors. CS 188: Artificial Intelligence Fall Some (Vague) Biology. The Binary Perceptron. Binary Decision Rule. CS 188: Artificial Intelligence Fall 2008 Lecture 24: Perceptrons II 11/24/2008 Dan Klein UC Berkeley Feature Extractors A feature extractor maps inputs to feature vectors Dear Sir. First, I must solicit

More information

Learning decomposable models with a bounded clique size

Learning decomposable models with a bounded clique size Learning decomposable models with a bounded clique size Achievements 2014-2016 Aritz Pérez Basque Center for Applied Mathematics Bilbao, March, 2016 Outline 1 Motivation and background 2 The problem 3

More information

WISE: Large Scale Content Based Web Image Search. Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley

WISE: Large Scale Content Based Web Image Search. Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley WISE: Large Scale Content Based Web Image Search Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley 1 A picture is worth a thousand words. Query by Images What leaf?

More information

INF3580/4580 Semantic Technologies Spring 2017

INF3580/4580 Semantic Technologies Spring 2017 INF3580/4580 Semantic Technologies Spring 2017 Lecture 9: Model Semantics & Reasoning Martin Giese 13th March 2017 Department of Informatics University of Oslo Today s Plan 1 Repetition: RDF semantics

More information

Generative and discriminative classification techniques

Generative and discriminative classification techniques Generative and discriminative classification techniques Machine Learning and Category Representation 013-014 Jakob Verbeek, December 13+0, 013 Course website: http://lear.inrialpes.fr/~verbeek/mlcr.13.14

More information

Detecting and Parsing of Visual Objects: Humans and Animals. Alan Yuille (UCLA)

Detecting and Parsing of Visual Objects: Humans and Animals. Alan Yuille (UCLA) Detecting and Parsing of Visual Objects: Humans and Animals Alan Yuille (UCLA) Summary This talk describes recent work on detection and parsing visual objects. The methods represent objects in terms of

More information

Table of Contents. iii

Table of Contents. iii Current Web 1 1.1 Current Web History 1 1.2 Current Web Characteristics 2 1.2.1 Current Web Features 2 1.2.2 Current Web Benefits 3 1.2.3. Current Web Applications 3 1.3 Why the Current Web is not Enough

More information

A Constraint Satisfaction Approach to Geospatial Reasoning

A Constraint Satisfaction Approach to Geospatial Reasoning A Constraint Satisfaction Approach to Geospatial Reasoning Martin Michalowski and Craig A. Knoblock Information Sciences Institute, Department of Computer Science, University of Southern California Outline

More information

Automatic Domain Partitioning for Multi-Domain Learning

Automatic Domain Partitioning for Multi-Domain Learning Automatic Domain Partitioning for Multi-Domain Learning Di Wang diwang@cs.cmu.edu Chenyan Xiong cx@cs.cmu.edu William Yang Wang ww@cmu.edu Abstract Multi-Domain learning (MDL) assumes that the domain labels

More information

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

DEEP STRUCTURED OUTPUT LEARNING FOR UNCONSTRAINED TEXT RECOGNITION

DEEP STRUCTURED OUTPUT LEARNING FOR UNCONSTRAINED TEXT RECOGNITION DEEP STRUCTURED OUTPUT LEARNING FOR UNCONSTRAINED TEXT RECOGNITION Max Jaderberg, Karen Simonyan, Andrea Vedaldi, Andrew Zisserman Visual Geometry Group, Department Engineering Science, University of Oxford,

More information

RDF /RDF-S Providing Framework Support to OWL Ontologies

RDF /RDF-S Providing Framework Support to OWL Ontologies RDF /RDF-S Providing Framework Support to OWL Ontologies Rajiv Pandey #, Dr.Sanjay Dwivedi * # Amity Institute of information Technology, Amity University Lucknow,India * Dept.Of Computer Science,BBA University

More information

That Doesn't Make Sense! A Case Study in Actively Annotating Model Explanations

That Doesn't Make Sense! A Case Study in Actively Annotating Model Explanations That Doesn't Make Sense! A Case Study in Actively Annotating Model Explanations Sameer Singh University of California, Irvine NIPS 2017 Workshop on Learning with Limited Labeled Data Relation Extraction

More information

Linguaggi Logiche e Tecnologie per la Gestione Semantica dei testi

Linguaggi Logiche e Tecnologie per la Gestione Semantica dei testi Linguaggi Logiche e Tecnologie per la Gestione Semantica dei testi Outline Brief recap on RDFS+ Using RDFS+ SKOS FOAF Recap RDFS+ includes a subset of the constructs in OWL. It offers more expressive power

More information

Scalable Web Service Composition with Partial Matches

Scalable Web Service Composition with Partial Matches Scalable Web Service Composition with Partial Matches Adina Sirbu and Jörg Hoffmann Digital Enterprise Research Institute (DERI) University of Innsbruck, Austria firstname.lastname@deri.org Abstract. We

More information

Combining Text Embedding and Knowledge Graph Embedding Techniques for Academic Search Engines

Combining Text Embedding and Knowledge Graph Embedding Techniques for Academic Search Engines Combining Text Embedding and Knowledge Graph Embedding Techniques for Academic Search Engines SemDeep-4, Oct. 2018 Gengchen Mai Krzysztof Janowicz Bo Yan STKO Lab, University of California, Santa Barbara

More information

Top-N Recommendations from Implicit Feedback Leveraging Linked Open Data

Top-N Recommendations from Implicit Feedback Leveraging Linked Open Data Top-N Recommendations from Implicit Feedback Leveraging Linked Open Data Vito Claudio Ostuni, Tommaso Di Noia, Roberto Mirizzi, Eugenio Di Sciascio Polytechnic University of Bari, Italy {ostuni,mirizzi}@deemail.poliba.it,

More information

bibliotek-o : a BIBFRAME Implementation

bibliotek-o : a BIBFRAME Implementation bibliotek-o : a BIBFRAME Implementation Folsom, Steven Cornell University sf433@cornell.edu Kovari, Jason Cornell University jak473@cornell.edu Younes, Rebecca Cornell University rebecca.younes@cornell.edu

More information

Structured Models in. Dan Huttenlocher. June 2010

Structured Models in. Dan Huttenlocher. June 2010 Structured Models in Computer Vision i Dan Huttenlocher June 2010 Structured Models Problems where output variables are mutually dependent or constrained E.g., spatial or temporal relations Such dependencies

More information

Semantic Web Test

Semantic Web Test Semantic Web Test 24.01.2017 Group 1 No. A B C D 1 X X X 2 X X 3 X X 4 X X 5 X X 6 X X X X 7 X X 8 X X 9 X X X 10 X X X 11 X 12 X X X 13 X X 14 X X 15 X X 16 X X 17 X 18 X X 19 X 20 X X 1. Which statements

More information

OWL 2 Profiles. An Introduction to Lightweight Ontology Languages. Markus Krötzsch University of Oxford. Reasoning Web 2012

OWL 2 Profiles. An Introduction to Lightweight Ontology Languages. Markus Krötzsch University of Oxford. Reasoning Web 2012 University of Oxford Department of Computer Science OWL 2 Profiles An Introduction to Lightweight Ontology Languages Markus Krötzsch University of Oxford Reasoning Web 2012 Remark for the Online Version

More information

Semantic Web Technologies: Web Ontology Language

Semantic Web Technologies: Web Ontology Language Semantic Web Technologies: Web Ontology Language Motivation OWL Formal Semantic OWL Synopsis OWL Programming Introduction XML / XML Schema provides a portable framework for defining a syntax RDF forms

More information

USING CLASSIFIER CASCADES FOR SCALABLE CLASSIFICATION 9/1/2011

USING CLASSIFIER CASCADES FOR SCALABLE  CLASSIFICATION 9/1/2011 USING CLASSIFIER CASCADES FOR SCALABLE E-MAIL CLASSIFICATION Jay Pujara Hal Daumé III Lise Getoor jay@cs.umd.edu me@hal3.name getoor@cs.umd.edu 9/1/2011 Building a scalable e-mail system 1 Goal: Maintain

More information

Querying Data through Ontologies

Querying Data through Ontologies Querying Data through Ontologies Instructor: Sebastian Link Thanks to Serge Abiteboul, Ioana Manolescu, Philippe Rigaux, Marie-Christine Rousset and Pierre Senellart Web Data Management and Distribution

More information

FOUNDATIONS OF SEMANTIC WEB TECHNOLOGIES

FOUNDATIONS OF SEMANTIC WEB TECHNOLOGIES FOUNDATIONS OF SEMANTIC WEB TECHNOLOGIES Semantics of RDF(S) Sebastian Rudolph Dresden, 25 April 2014 Content Overview & XML Introduction into RDF RDFS Syntax & Intuition Tutorial 1 RDFS Semantics RDFS

More information

CSEP 573: Artificial Intelligence

CSEP 573: Artificial Intelligence CSEP 573: Artificial Intelligence Machine Learning: Perceptron Ali Farhadi Many slides over the course adapted from Luke Zettlemoyer and Dan Klein. 1 Generative vs. Discriminative Generative classifiers:

More information

Representing musicology knowledge on the Web using Linked Data

Representing musicology knowledge on the Web using Linked Data Representing musicology knowledge on the Web using Linked Data The Semantic Web and Why You Should Care Digital Humanities at Oxford Summer School Kevin Page Oxford e-research Centre, University of Oxford.

More information

4 The StdTrip Process

4 The StdTrip Process 4 The StdTrip Process 4.1 The a priori Approach As discussed in section 2.8 the a priori approach emphasizes the reuse of widely adopted standards for database design as a means to secure future interoperability.

More information

Estimating Human Pose in Images. Navraj Singh December 11, 2009

Estimating Human Pose in Images. Navraj Singh December 11, 2009 Estimating Human Pose in Images Navraj Singh December 11, 2009 Introduction This project attempts to improve the performance of an existing method of estimating the pose of humans in still images. Tasks

More information

Flexible Tools for the Semantic Web

Flexible Tools for the Semantic Web Flexible Tools for the Semantic Web (instead of Jans Aasman from Franz Inc.) Software Systems Group (STS) Hamburg University of Technology (TUHH) Hamburg-Harburg, Germany (and GmbH & Co. KG) 1 Flexible

More information

AN ABSTRACT OF THE THESIS OF

AN ABSTRACT OF THE THESIS OF AN ABSTRACT OF THE THESIS OF Sicheng Xiong for the degree of Master of Science in Computer Science presented on April 25, 2013. Title: Active Learning of Constraints for Semi-Supervised Clustering Abstract

More information

A Workflow for Improving Medical Visualization of Semantically Annotated CT-Images

A Workflow for Improving Medical Visualization of Semantically Annotated CT-Images A Workflow for Improving Medical Visualization of Semantically Annotated CT-Images Alexander Baranya 1,2, Luis Landaeta 1,2, Alexandra La Cruz 1, and Maria-Esther Vidal 2 1 Biophysic and Bioengeneering

More information

OWLIM Reasoning over FactForge

OWLIM Reasoning over FactForge OWLIM Reasoning over FactForge Barry Bishop, Atanas Kiryakov, Zdravko Tashev, Mariana Damova, Kiril Simov Ontotext AD, 135 Tsarigradsko Chaussee, Sofia 1784, Bulgaria Abstract. In this paper we present

More information

PROBABILISTIC GRAPHICAL MODELS SPECIFIED BY PROBABILISTIC LOGIC PROGRAMS: SEMANTICS AND COMPLEXITY

PROBABILISTIC GRAPHICAL MODELS SPECIFIED BY PROBABILISTIC LOGIC PROGRAMS: SEMANTICS AND COMPLEXITY PROBABILISTIC GRAPHICAL MODELS SPECIFIED BY PROBABILISTIC LOGIC PROGRAMS: SEMANTICS AND COMPLEXITY Fabio G. Cozman and Denis D. Mauá Universidade de São Paulo, Brazil September 8, 2016 1 / 23 Many languages

More information