KNOWLEDGE GRAPH CONSTRUCTION

Size: px
Start display at page:

Download "KNOWLEDGE GRAPH CONSTRUCTION"

Transcription

1 KNOWLEDGE GRAPH CONSTRUCTION Jay Pujara University of Maryland, College Park Max Planck Institute 7/9/2015

2 Can Computers Create Knowledge? Internet Massive source of publicly available information Knowledge

3 Computers + Knowledge =

4 What does it mean to create knowledge? What do we mean by knowledge?

5 Defining the Questions Extraction Representation Reasoning and Inference

6 Defining the Questions Extraction Representation Reasoning and Inference

7 A Revised Knowledge-Creation Diagram Internet Extraction Knowledge Graph (KG) Massive source of publicly available information Cutting-edge IE methods Structured representation of entities, their labels and the relationships between them

8 Knowledge Graphs in the wild

9 Motivating Problem: Real Challenges Internet Extraction Knowledge Graph Difficult! Noisy! Contains many errors and inconsistencies

10 NELL: The Never-Ending Language Learner Large-scale IE project (Carlson et al., AAAI10) Lifelong learning: aims to read the web Ontology of known labels and relations Knowledge base contains millions of facts

11 Examples of NELL errors

12 Entity co-reference errors Kyrgyzstan has many variants: Kyrgystan Kyrgistan Kyrghyzstan Kyrgzstan Kyrgyz Republic

13 Missing and spurious labels Kyrgyzstan is labeled a bird and a country

14 Missing and spurious relations Kyrgyzstan s location is ambiguous Kazakhstan, Russia and US are included in possible locations

15 Violations of ontological knowledge Equivalence of co-referent entities (sameas) SameEntity(Kyrgyzstan, Kyrgyz Republic) Mutual exclusion (disjointwith) of labels MUT(bird, country) Selectional preferences (domain/range) of relations RNG(countryLocation, continent) Enforcing these constraints requires jointly considering multiple extractions across documents

16 Examples where joint models have succeeded Information extraction ER+Segmentation: Poon & Domingos, AAAI07 SRL: Srikumar & Roth, EMNLP11 Within-doc extraction: Singh et al., AKBC13 Social and communication networks Fusion: Eldardiry & Neville, MLG10 Acts: Carvalho & Cohen, SIGIR05 GraphID: Namata et al., KDD11

17 GRAPH IDENTIFICATION

18 Slides courtesy Getoor, Namata, Kok Transformation Graph Identification Input Graph Available but inappropriate for analysis Output Graph Appropriate for further analysis

19 Slides courtesy Getoor, Namata, Kok Motivation: Different Networks Neil Smith Mary Taylor Robert Lee Anne Cole Mary Jones Label: CEO Manager Assistant Programmer Communication Network Nodes: Address Edges: Communication Node Attributes: Words Organizational Network Nodes: Person Edges: Manages Node Labels: Title

20 Slides courtesy Getoor, Namata, Kok Graph Identification Neil Smith Graph Iden+fica+on Mary Taylor Robert Lee Anne Cole Mary Jones Label: CEO Manager Assistant Programmer Input Graph: Communication Network Output Graph: Social Network

21 Slides courtesy Getoor, Namata, Kok Graph Identification Graph Iden+fica+on Input Graph: Communication Network Output Graph: Social Network What s involved?

22 Slides courtesy Getoor, Namata, Kok Graph Identification Neil Smith ER Mary Taylor Robert Lee Anne Cole Mary Jones Input Graph: Communication Network Output Graph: Social Network What s involved? Entity Resolution (ER): Map input graph nodes to output graph nodes

23 Slides courtesy Getoor, Namata, Kok Graph Identification Neil Smith ER+LP Mary Taylor Robert Lee Anne Cole Mary Jones Input Graph: Communication Network Output Graph: Social Network What s involved? Entity Resolution (ER): Map input graph nodes to output graph nodes Link Prediction (LP): Predict existence of edges in output graph

24 Slides courtesy Getoor, Namata, Kok Graph Identification Neil Smith ER+LP+NL Mary Taylor Robert Lee Anne Cole Mary Jones Label: CEO Manager Assistant Programmer Input Graph: Communication Network Output Graph: Social Network What s involved? Entity Resolution (ER): Map input graph nodes to output graph nodes Link Prediction (LP): Predict existence of edges in output graph Node Labeling (NL): Infer the labels of nodes in the output graph

25 Slides courtesy Getoor, Namata, Kok Problem Dependencies ER LP Input Graph NL Most work looks at these tasks in isolation In graph identification they are: Evidence-Dependent Inference depend on observed input graph e.g., ER depends on input graph Intra-Dependent Inference within tasks are dependent e.g., NL prediction depend on other NL predictions Inter-Dependent Inference across tasks are dependent e.g., LP depend on ER and NL predictions

26 KNOWLEDGE GRAPH IDENTIFICATION Pujara, Miao, Getoor, Cohen, ISWC 2013 (best student paper)

27 (Pujara et al., ISWC13) Motivating Problem (revised) (noisy) Extraction Graph Knowledge Graph Internet Large-scale IE Joint Reasoning =

28 (Pujara et al., ISWC13) Knowledge Graph Identification Problem: Knowledge Graph Extraction Graph Performs graph identification: entity resolution node labeling link prediction Knowledge Graph Identification Solution: Knowledge Graph Identification (KGI) Enforces ontological constraints Incorporates multiple uncertain sources =

29 Illustration of KGI: Extractions (Pujara et al., ISWC13) Uncertain Extractions:.5: Lbl(Kyrgyzstan, bird).7: Lbl(Kyrgyzstan, country).9: Lbl(Kyrgyz Republic, country).8: Rel(Kyrgyz Republic, Bishkek, hascapital)

30 (Pujara et al., ISWC13) Illustration of KGI: Ontology + ER Uncertain Extractions:.5: Lbl(Kyrgyzstan, bird).7: Lbl(Kyrgyzstan, country).9: Lbl(Kyrgyz Republic, country).8: Rel(Kyrgyz Republic, Bishkek, hascapital) Extraction Graph Kyrgyzstan Lbl bird country Kyrgyz Republic Rel(hasCapital) Bishkek

31 (Pujara et al., ISWC13) Illustration of KGI: Ontology + ER Uncertain Extractions:.5: Lbl(Kyrgyzstan, bird).7: Lbl(Kyrgyzstan, country).9: Lbl(Kyrgyz Republic, country).8: Rel(Kyrgyz Republic, Bishkek, hascapital) Ontology: Dom(hasCapital, country) Mut(country, bird) Entity Resolution: SameEnt(Kyrgyz Republic, Kyrgyzstan) (Annotated) Extraction Graph SameEnt Kyrgyzstan Lbl country bird Kyrgyz Republic Dom Rel(hasCapital) Bishkek

32 (Pujara et al., ISWC13) Illustration of KGI Uncertain Extractions:.5: Lbl(Kyrgyzstan, bird).7: Lbl(Kyrgyzstan, country).9: Lbl(Kyrgyz Republic, country).8: Rel(Kyrgyz Republic, Bishkek, hascapital) Ontology: Dom(hasCapital, country) Mut(country, bird) Entity Resolution: SameEnt(Kyrgyz Republic, Kyrgyzstan) (Annotated) Extraction Graph SameEnt Kyrgyzstan Lbl country bird Kyrgyz Republic Dom Rel(hasCapital) Bishkek After Knowledge Graph Identification Kyrgyzstan Lbl country Kyrgyz Republic Rel(hasCapital) Bishkek

33 (Pujara et al., ISWC13) Modeling Knowledge Graph Identification

34 (Pujara et al., ISWC13) Viewing KGI as a probabilistic graphical model Lbl(Kyrgyzstan, bird) Rel(hasCapital, Kyrgyzstan, Bishkek) Lbl(Kyrgyzstan, country) Lbl(Kyrgyz Republic, country) Lbl(Kyrgyz Republic, bird) Rel(hasCapital, Kyrgyz Republic, Bishkek)

35 (Pujara et al., ISWC13) Background: Probabilistic Soft Logic (PSL) (Broecheler et al., UAI10; Kimming et al., NIPS-ProbProg12) Templating language for hinge-loss MRFs, very scalable! Model specified as a collection of logical formulas SameEnt(E 1,E 2 ) ^ Lbl(E 1,L) ) Lbl(E 2,L) Uses soft-logic formulation Truth values of atoms relaxed to [0,1] interval Truth values of formulas derived from Lukasiewicz t-norm p ^q = max(0,p+ q 1) p _q =min(1,p+ q) p =1 p p )q =min(1,q p + 1)

36 Soft Logic Tutorial: Rules to Groundings Given a database of evidence, we can convert rule templates to instances (grounding) Rules are grounded by substituting literals into formulas SameEnt(E 1,E 2 ) ^ Lbl(E 1,L) ) Lbl(E 2,L) SameEnt(Kyrgyzstan, Kyrygyz Republic) ^ Lbl(Kyrgyzstan, country) ) Lbl(Kyrygyz Republic, country) The soft logic interpretation assigns a satisfaction value to each ground rule

37 Soft Logic Tutorial: Groundings to Satisfaction SameEnt(Kyrgyzstan, Kyrygyz Republic) : 0.9 ^ Lbl(Kyrgyzstan, country) : 0.8 p _q = max(0,p+ q 1) SameEnt(Kyrgyzstan, Kyrygyz Republic) ^ Lbl(Kyrgyzstan, country) = max(0, )

38 Soft Logic Tutorial: Groundings to Satisfaction (SameEnt(Kyrgyzstan, Kyrygyz Republic) ^ Lbl(Kyrgyzstan, country)) : 0.7 ) Lbl(Kyrygyz Republic, country) : 0.6 p )q =min(1,q p + 1) SameEnt(Kyrgyzstan, Kyrygyz Republic) ^ Lbl(Kyrgyzstan, country) ) Lbl(Kyrygyz Republic, country) =min(1, ) = 0.9

39 Soft Logic Tutorial: Inferring Satisfaction (SameEnt(Kyrgyzstan, Kyrygyz Republic) ^ Lbl(Kyrgyzstan, country)) : 0.7 ) Lbl(Kyrygyz Republic, country) :?

40 Soft Logic Tutorial: Distance to Satisfaction

41 (Pujara et al., ISWC13) Background: PSL Rules to Distributions Rules are grounded by substituting literals into formulas w EL : SameEnt(Kyrgyzstan, Kyrygyz Republic) ^ Lbl(Kyrgyzstan, country) ) Lbl(Kyrygyz Republic, country) Each ground rule has a weighted distance to satisfaction derived from the formula s truth value P(G E) = 1 Z exp $ % r R w r ϕ (G)& r ' The PSL program can be interpreted as a joint probability distribution over all variables in knowledge graph, conditioned on the extractions

42 (Pujara et al., ISWC13) Background: Finding the best knowledge graph MPE inference solves max G P(G) to find the best KG In PSL, inference solved by convex optimization Efficient: running time empirically scales with O( R ) (Bach et al., NIPS12)

43 (Pujara et al., ISWC13) PSL Rules for KGI Model

44 (Pujara et al., ISWC13) PSL Rules: Uncertain Extractions Weight for source T (relations) Predicate representing uncertain relation extraction from extractor T Relation in Knowledge Graph w CR-T : CandRel T (E 1,E 2,R) w CL-T : CandLbl T (E,L) ) Rel(E 1,E 2,R) ) Lbl(E,L) Weight for source T (labels) Predicate representing uncertain label extraction from extractor T Label in Knowledge Graph

45 (Pujara et al., ISWC13) PSL Rules: Entity Resolution SameEnt predicate captures confidence that entities are co-referent Rules require co-referent entities to have the same labels and relations Creates an equivalence class of co-referent entities

46 (Pujara et al., ISWC13) PSL Rules: Ontology Inverse: w O : Inv(R, S) ^ Rel(E 1,E 2,R) ) Rel(E 2,E 1,S) Selectional Preference: w O : Dom(R, L) ^ Rel(E 1,E 2,R) ) Lbl(E 1,L) w O : Rng(R, L) ^ Rel(E 1,E 2,R) ) Lbl(E 2,L) Subsumption: w O : Sub(L, P ) ^ Lbl(E,L) ) Lbl(E,P) w O : RSub(R, S) ^ Rel(E 1,E 2,R) ) Rel(E 1,E 2,S) Mutual Exclusion: w O : Mut(L 1,L 2 ) ^ Lbl(E,L 1 ) ) Lbl(E,L 2 ) w O : RMut(R, S) ^ Rel(E 1,E 2,R) ) Rel(E 1,E 2,S) Adapted from Jiang et al., ICDM 2012

47 [ 1] CandLbl struct (Kyrgyzstan, bird) ) Lbl(Kyrgyzstan, bird) φ 1 φ φ [ 2] CandRel pat (Kyrgyz Rep., Asia, locatedin) ) Rel(Kyrgyz Rep., Asia, locatedin) Lbl(Kyrgyzstan, bird) Lbl(Kyrgyz Rep., bird) [ 3] SameEnt(Kyrgyz Rep., Kyrgyzstan) ^ Lbl(Kyrgyz Rep., country) ) Lbl(Kyrgyzstan, country) φ 5 φ [ 4] Dom(locatedIn, country) ^ Rel(Kyrgyz Rep., Asia, locatedin) ) Lbl(Kyrgyz Rep., country) Lbl(Kyrgyzstan, country) Lbl(Kyrgyz Rep., country) [ 5] Mut(country, bird) ^ Lbl(Kyrgyzstan, country) ) Lbl(Kyrgyzstan, bird) φ φ 2 φ 3 φ 4 Rel(Kyrgyz Rep., Asia, locatedin) φ

48 Probability Distribution over KGs P(G E) = 1 Z exp $ % r R w r ϕ (G)& r ' CandLbl T (kyrgyzstan, bird) ) Lbl(kyrgyzstan, bird) Mut(bird, country) ^ Lbl(kyrgyzstan, bird) ) Lbl(kyrgyzstan, country) SameEnt(kyrgz republic, kyrgyzstan) ^ Lbl(kyrgz republic, country) ) Lbl(kyrgyzstan, country)

49 Evaluation (Pujara et al., ISWC13)

50 (Pujara et al., ISWC13) Two Evaluation Datasets Description LinkedBrainz Community-supplied data about musical artists, labels, and creative works NELL Real-world IE system extracting general facts from the WWW Noise Realistic synthetic noise Imperfect extractors and ambiguous web pages Candidate Facts 810K 1.3M Unique Labels and Relations Ontological Constraints K

51 (Pujara et al., ISWC13) LinkedBrainz Open source communitydriven structured database of music metadata Uses proprietary schema to represent data Built on popular ontologies such as FOAF and FRBR Widely used for music data (e.g. BBC Music Site) LinkedBrainz project provides an RDF mapping from MusicBrainz data to Music Ontology using the D2RQ tool

52 (Pujara et al., ISWC13) LinkedBrainz dataset for KGI mo:release mo:record mo:record mo:track mo:track mo:published_as mo:signal mo:label mo:label foaf:maker mo:musicalartist inverseof subclassof subclassof foaf:made mo:solomusicartist mo:musicgroup Mapping to FRBR/FOAF ontology DOM rdfs:domain RNG rdfs:range INV owl:inverseof SUB rdfs:subclassof RSUB rdfs:subpropertyof MUT owl:disjointwith

53 (Pujara et al., ISWC13) LinkedBrainz experiments Comparisons: Baseline PSL-EROnly PSL-OntOnly PSL-KGI Use noisy truth values as fact scores Only apply rules for Entity Resolution Only apply rules for Ontological reasoning Apply Knowledge Graph Identification model AUC Precision Recall F1 at.5 Max F1 Baseline PSL-EROnly PSL-OntOnly PSL-KGI

54 NELL Evaluation: two settings (Pujara et al., ISWC13) Target Set: restrict to a subset of KG (Jiang, ICDM12) Complete: Infer full knowledge graph?? Closed-world model Uses a target set: subset of KG Derived from 2-hop neighborhood Excludes trivially satisfied variables Open-world model All possible entities, relations, labels Inference assigns truth value to each variable

55 (Pujara et al., ISWC13) NELL experiments: Target Set Task: Compute truth values of a target set derived from the evaluation data Comparisons: Baseline Average confidences of extractors for each fact in the NELL candidates NELL Evaluate NELL s promotions (on the full knowledge graph) MLN Method of (Jiang, ICDM12) estimates marginal probabilities with MC-SAT PSL-KGI Apply full Knowledge Graph Identification model Running Time: Inference completes in 10 seconds, values for 25K facts AUC F1 Baseline NELL MLN (Jiang, 12) PSL-KGI

56 (Pujara et al., ISWC13) NELL experiments: Complete knowledge graph Task: Compute a full knowledge graph from uncertain extractions Comparisons: NELL NELL s strategy: ensure ontological consistency with existing KB PSL-KGI Apply full Knowledge Graph Identification model Running Time: Inference completes in 130 minutes, producing 4.3M facts AUC Precision Recall F1 NELL PSL-KGI

57 KNOWLEDGE GRAPH ENTITY RESOLUTION

58 [Pujara, BayLearn14] Problem: Merge domain KG to global KG

59 [Pujara, BayLearn14] Approach: Factored Entity Resolution model Goal: Build a generic entity resolution model for KGs Build on vast amount of work on Entity Resolution PSL provides an easy, flexible, sophisticated models Local Collective General String similarity Sparsity; Transitivity New Entity New Entity prior New Entity penalty Knowledge Graph Type compatibility Relation compatibility Domain-Specific (Album length) (Artist s country)

60 [Pujara, BayLearn14] Preliminary Results Task: ER from MusicBrainz to Google KG Data: 11K MusicBrainz entities (5/5-6/29/14) 330K Freebase entities 15.7M relations 11K human labels Methods F1 AUPRC General Collective NewEntity

61 FASTER KNOWLEDGE GRAPH CONSTRUCTION

62 Partitioning

63 (Pujara et al., AKBC13) Problem: Knowledge Graphs are HUGE

64 (Pujara et al., AKBC13) Solution: Partition the Knowledge Graph

65 (Pujara et al., AKBC13) Partitioning: advantages and drawbacks Advantages Smaller problems Parallel Inference Speed / Quality Tradeoff Drawbacks Partitioning large graph time-consuming Key dependencies may be lost New facts require re-partitioning

66 (Pujara et al., AKBC13) Key idea: Ontology-aware partitioning Partition the ontology graph, not the knowledge graph Location City Mut Dom citysportsteam Inv teamplaysincity locatedin Rng Dom teamplayssport SportsTeam State Rng Rng Sport Induce a partitioning of the knowledge graph based on the ontology partition

67 (Pujara et al., AKBC13) Considerations: Ontology-aware Partitions Advantages: Ontology is a smaller graph Ontology coupled with dependencies New facts can reuse partitions Disadvantages: Insensitive to data distribution All dependencies treated equally

68 (Pujara et al., AKBC13) Refinement: include data frequency Annotate each ontological element with its frequency Location 10 citysportsteam City 1706 Inv teamplaysincity 822 Mut locatedin teamplayssport State 1177 Dom Rng Rng Dom SportsTeam Rng Sport 2568 Partition ontology with constraint of equal vertex weights

69 (Pujara et al., AKBC13) Refinement: weight edges by type Weight edges by their ontological importance Location City Mut 3 Dom citysportsteam Inv 116 teamplaysincity locatedin Rng 116 Dom teamplayssport SportsTeam State Rng Rng Sport

70 (Pujara et al., AKBC13) Experiments: Partitioning Approaches Comparisons (6 partitions): NELL Default promotion strategy, no KGI KGI No partitioning, full knowledge graph model baseline KGI, Randomly assign extractions to partition Ontology KGI, Edge min-cut of ontology graph O+Vertex KGI, Weight ontology vertices by frequency O+V+Edge KGI, Weight ontology edges by inv. frequency AUPRC Running Time (min) Opt. Terms NELL KGI M baseline M Ontology M O+Vertex M O+V+Edge M

71 Evolving Models

72 Problem: Incremental Updates to KG? How do we add new extractions to the Knowledge Graph?

73 Naïve Approach: Full KGI over extractions

74 Improving the naïve approach Intuition: Much of previous KG does not change Online collective inference: Selectively update the MAP state Bound the regret of partial updates Efficiently determine which variables to infer

75 Key Idea: fix some variables, infer others

76 Approximation: KGI over subset of graph

77 Theory: Regret of approximating update R n (x, y S ; ẇ) apple O s Bkwk 2 n w p ky S ŷ S k 1!

78 Practice: Regret and Approximation Algo inference regret Do Nothing Random 50% Value 50% WLM 50% Relational 50% # epochs

79 Conclusion Knowledge Graph Identification is a powerful technique for producing knowledge graphs from noisy IE system output Using PSL we are able to enforce global ontological constraints and capture uncertainty in our model Unlike previous work, our approach infers complete knowledge graphs for datasets with millions of extractions Code available on GitHub:

80 Key Collaborators

KNOWLEDGE GRAPH CONSTRUCTION

KNOWLEDGE GRAPH CONSTRUCTION KNOWLEDGE GRAPH CONSTRUCTION Jay Pujara Karlsruhe Institute of Technology 7/7/2015 Can Computers Create Knowledge? Internet Massive source of publicly available information Knowledge Computers + Knowledge

More information

KNOWLEDGE GRAPH IDENTIFICATION

KNOWLEDGE GRAPH IDENTIFICATION KNOWLEDGE GRAPH IDENTIFICATION Jay Pujara 1, Hui Miao 1, Lise Getoor 1, William Cohen 2 1 University of Maryland, College Park, US 2 Carnegie Mellon University International Semantic Web Conference 10/25/2013

More information

Ontology-Aware Partitioning for Knowledge Graph Identification

Ontology-Aware Partitioning for Knowledge Graph Identification Ontology-Aware Partitioning for Knowledge Graph Identification Jay Pujara, Hui Miao, Lise Getoor Dept of Computer Science University of Maryland College Park, MD 20742 {jay,hui,getoor}@cs.umd.edu ABSTRACT

More information

Using Semantics & Statistics to Turn Data into Knowledge

Using Semantics & Statistics to Turn Data into Knowledge Using Semantics & Statistics to Turn Data into Knowledge Jay Pujara and Hui Miao Computer Science Dept. University of Maryland College Park, MD 20742 {jay,hui}@cs.umd.edu Lise Getoor Computer Science Dept.

More information

Knowledge Graph Completion. Mayank Kejriwal (USC/ISI)

Knowledge Graph Completion. Mayank Kejriwal (USC/ISI) Knowledge Graph Completion Mayank Kejriwal (USC/ISI) What is knowledge graph completion? An intelligent way of doing data cleaning Deduplicating entity nodes (entity resolution) Collective reasoning (probabilistic

More information

Agrowing body of research focuses on extracting knowledge

Agrowing body of research focuses on extracting knowledge Using Semantics and Statistics to Turn Data into Knowledge Jay Pujara, Hui Miao, Lise Getoor, William W. Cohen n Many information-extraction and knowledge base construction systems are addressing the challenge

More information

Building Dynamic Knowledge Graphs

Building Dynamic Knowledge Graphs Building Dynamic Knowledge Graphs Jay Pujara Department of Computer Science University of Maryland College Park, MD 20742 jay@cs.umd.edu Lise Getoor Department of Computer Science University of California

More information

arxiv: v1 [cs.ai] 4 Jul 2016

arxiv: v1 [cs.ai] 4 Jul 2016 Generic Statistical Relational Entity Resolution in Knowledge Graphs Jay Pujara University of California, Santa Cruz Santa Cruz, CA 95064 jay@cs.umd.edu Lise Getoor University of California, Santa Cruz

More information

Generic Statistical Relational Entity Resolution in Knowledge Graphs

Generic Statistical Relational Entity Resolution in Knowledge Graphs Generic Statistical Relational Entity Resolution in Knowledge Graphs Jay Pujara University of California, Santa Cruz Santa Cruz, CA 95064 jay@cs.umd.edu Lise Getoor University of California, Santa Cruz

More information

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Jianyong Wang Department of Computer Science and Technology Tsinghua University Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity

More information

Matthew Richardson Microsoft Research One Microsoft Way Redmond, WA

Matthew Richardson Microsoft Research One Microsoft Way Redmond, WA Speeding up Inference in Statistical Relational Learning by Clustering Similar Query Literals Lilyana Mihalkova 1 Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 lilyanam@cs.utexas.edu

More information

Collective Entity Resolution in Relational Data

Collective Entity Resolution in Relational Data Collective Entity Resolution in Relational Data I. Bhattacharya, L. Getoor University of Maryland Presented by: Srikar Pyda, Brett Walenz CS590.01 - Duke University Parts of this presentation from: http://www.norc.org/pdfs/may%202011%20personal%20validation%20and%20entity%20resolution%20conference/getoorcollectiveentityresolution

More information

foxpsl: A Fast, Optimized and extended PSL Implementation

foxpsl: A Fast, Optimized and extended PSL Implementation International Journal of Approximate Reasoning 00 (2015) 1 12 Journal Logo foxpsl: A Fast, Optimized and extended PSL Implementation Sara Magliacane a, Philip Stutz c, Paul Groth b, Abraham Bernstein c

More information

Link Mining & Entity Resolution. Lise Getoor University of Maryland, College Park

Link Mining & Entity Resolution. Lise Getoor University of Maryland, College Park Link Mining & Entity Resolution Lise Getoor University of Maryland, College Park Learning in Structured Domains Traditional machine learning and data mining approaches assume: A random sample of homogeneous

More information

Collective Graph Identification

Collective Graph Identification Collective Graph Identification Galileo Mark Namata Dept. of Computer Science University of Maryland College Park, MD 20742, USA namatag@cs.umd.edu Stanley Kok Dept. of Computer Science University of Maryland

More information

Constraint Propagation for Efficient Inference in Markov Logic

Constraint Propagation for Efficient Inference in Markov Logic Constraint Propagation for Efficient Inference in Tivadar Papai 1 Parag Singla 2 Henry Kautz 1 1 University of Rochester, Rochester NY 14627, USA 2 University of Texas, Austin TX 78701, USA September 13,

More information

Forward Chaining Reasoning Tool for Rya

Forward Chaining Reasoning Tool for Rya Forward Chaining Reasoning Tool for Rya Rya Working Group, 6/29/2016 Forward Chaining Reasoning Tool for Rya 6/29/2016 1 / 11 OWL Reasoning OWL (the Web Ontology Language) facilitates rich ontology definition

More information

Network Lasso: Clustering and Optimization in Large Graphs

Network Lasso: Clustering and Optimization in Large Graphs Network Lasso: Clustering and Optimization in Large Graphs David Hallac, Jure Leskovec, Stephen Boyd Stanford University September 28, 2015 Convex optimization Convex optimization is everywhere Introduction

More information

Structured Learning. Jun Zhu

Structured Learning. Jun Zhu Structured Learning Jun Zhu Supervised learning Given a set of I.I.D. training samples Learn a prediction function b r a c e Supervised learning (cont d) Many different choices Logistic Regression Maximum

More information

A Collective, Probabilistic Approach to Schema Mapping

A Collective, Probabilistic Approach to Schema Mapping A Collective, Probabilistic Approach to Schema Mapping Angelika Kimmig KU Leuven angelika.kimmig@cs.kuleuven.be Alex Memory University of Maryland memory@cs.umd.edu Renée J. Miller University of Toronto

More information

Random Walk Inference and Learning. Carnegie Mellon University 7/28/2011 EMNLP 2011, Edinburgh, Scotland, UK

Random Walk Inference and Learning. Carnegie Mellon University 7/28/2011 EMNLP 2011, Edinburgh, Scotland, UK Random Walk Inference and Learning in A Large Scale Knowledge Base Ni Lao, Tom Mitchell, William W. Cohen Carnegie Mellon University 2011.7.28 1 Outline Motivation Inference in Knowledge Bases The NELL

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

Semantic Web. Tahani Aljehani

Semantic Web. Tahani Aljehani Semantic Web Tahani Aljehani Motivation: Example 1 You are interested in SOAP Web architecture Use your favorite search engine to find the articles about SOAP Keywords-based search You'll get lots of information,

More information

Probabilistic Visitor Stitching on Cross-Device Web Logs

Probabilistic Visitor Stitching on Cross-Device Web Logs Probabilistic Visitor Stitching on Cross-Device Web Logs Sungchul Kim Adobe Research San Jose, CA 95110 sukim@adobe.com Eunyee Koh Adobe Research San Jose, CA 95110 eunyee@adobe.com Nikhil Kini UC Santa

More information

Leveraging Data and Structure in Ontology Integration

Leveraging Data and Structure in Ontology Integration Leveraging Data and Structure in Ontology Integration O. Udrea L. Getoor R.J. Miller Group 15 Enrico Savioli Andrea Reale Andrea Sorbini DEIS University of Bologna Searching Information in Large Spaces

More information

Learning Tractable Probabilistic Models Pedro Domingos

Learning Tractable Probabilistic Models Pedro Domingos Learning Tractable Probabilistic Models Pedro Domingos Dept. Computer Science & Eng. University of Washington 1 Outline Motivation Probabilistic models Standard tractable models The sum-product theorem

More information

Local higher-order graph clustering

Local higher-order graph clustering Local higher-order graph clustering Hao Yin Stanford University yinh@stanford.edu Austin R. Benson Cornell University arb@cornell.edu Jure Leskovec Stanford University jure@cs.stanford.edu David F. Gleich

More information

That Doesn't Make Sense! A Case Study in Actively Annotating Model Explanations

That Doesn't Make Sense! A Case Study in Actively Annotating Model Explanations That Doesn't Make Sense! A Case Study in Actively Annotating Model Explanations Sameer Singh University of California, Irvine NIPS 2017 Workshop on Learning with Limited Labeled Data Relation Extraction

More information

Constructing Popular Routes from Uncertain Trajectories

Constructing Popular Routes from Uncertain Trajectories Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei, Yu Zheng, Wen-Chih Peng presented by Slawek Goryczka Scenarios A trajectory is a sequence of data points recording location information

More information

arxiv: v1 [cs.si] 2 Jul 2016

arxiv: v1 [cs.si] 2 Jul 2016 Adaptive Neighborhood Graph Construction for Inference in Multi-Relational Networks Shobeir Fakhraei,2 Dhanya Sridhar 2 Jay Pujara 2 Lise Getoor 2 shobeir@cs.umd.edu dsridhar@ucsc.edu jay@cs.umd.edu getoor@soe.ucsc.edu

More information

Fast Nearest Neighbor Search on Large Time-Evolving Graphs

Fast Nearest Neighbor Search on Large Time-Evolving Graphs Fast Nearest Neighbor Search on Large Time-Evolving Graphs Leman Akoglu Srinivasan Parthasarathy Rohit Khandekar Vibhore Kumar Deepak Rajan Kun-Lung Wu Graphs are everywhere Leman Akoglu Fast Nearest Neighbor

More information

ACM MM Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang

ACM MM Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang ACM MM 2010 Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang Harbin Institute of Technology National University of Singapore Microsoft Corporation Proliferation of images and videos on the Internet

More information

Guiding Semi-Supervision with Constraint-Driven Learning

Guiding Semi-Supervision with Constraint-Driven Learning Guiding Semi-Supervision with Constraint-Driven Learning Ming-Wei Chang 1 Lev Ratinov 2 Dan Roth 3 1 Department of Computer Science University of Illinois at Urbana-Champaign Paper presentation by: Drew

More information

Syllabus. 1. Visual classification Intro 2. SVM 3. Datasets and evaluation 4. Shallow / Deep architectures

Syllabus. 1. Visual classification Intro 2. SVM 3. Datasets and evaluation 4. Shallow / Deep architectures Syllabus 1. Visual classification Intro 2. SVM 3. Datasets and evaluation 4. Shallow / Deep architectures Image classification How to define a category? Bicycle Paintings with women Portraits Concepts,

More information

Learning to Rank. Tie-Yan Liu. Microsoft Research Asia CCIR 2011, Jinan,

Learning to Rank. Tie-Yan Liu. Microsoft Research Asia CCIR 2011, Jinan, Learning to Rank Tie-Yan Liu Microsoft Research Asia CCIR 2011, Jinan, 2011.10 History of Web Search Search engines powered by link analysis Traditional text retrieval engines 2011/10/22 Tie-Yan Liu @

More information

Markov Logic: Representation

Markov Logic: Representation Markov Logic: Representation Overview Statistical relational learning Markov logic Basic inference Basic learning Statistical Relational Learning Goals: Combine (subsets of) logic and probability into

More information

Memory-Efficient Inference in Relational Domains

Memory-Efficient Inference in Relational Domains Memory-Efficient Inference in Relational Domains Parag Singla Pedro Domingos Department of Computer Science and Engineering University of Washington Seattle, WA 98195-2350, U.S.A. {parag, pedrod}@cs.washington.edu

More information

How Co-Occurrence can Complement Semantics?

How Co-Occurrence can Complement Semantics? How Co-Occurrence can Complement Semantics? Atanas Kiryakov & Borislav Popov ISWC 2006, Athens, GA Semantic Annotations: 2002 #2 Semantic Annotation: How and Why? Information extraction (text-mining) for

More information

Semantic Web. MPRI : Web Data Management. Antoine Amarilli Friday, January 11th 1/29

Semantic Web. MPRI : Web Data Management. Antoine Amarilli Friday, January 11th 1/29 Semantic Web MPRI 2.26.2: Web Data Management Antoine Amarilli Friday, January 11th 1/29 Motivation Information on the Web is not structured 2/29 Motivation Information on the Web is not structured This

More information

Development in Object Detection. Junyuan Lin May 4th

Development in Object Detection. Junyuan Lin May 4th Development in Object Detection Junyuan Lin May 4th Line of Research [1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection, CVPR 2005. HOG Feature template [2] P. Felzenszwalb,

More information

A Scalable Approach to Incrementally Building Knowledge Graphs

A Scalable Approach to Incrementally Building Knowledge Graphs A Scalable Approach to Incrementally Building Knowledge Graphs Gleb Gawriljuk 1, Andreas Harth 1, Craig A. Knoblock 2, and Pedro Szekely 2 1 Institute of Applied Informatics and Formal Description Methods

More information

Towards Incremental Grounding in Tuffy

Towards Incremental Grounding in Tuffy Towards Incremental Grounding in Tuffy Wentao Wu, Junming Sui, Ye Liu University of Wisconsin-Madison ABSTRACT Markov Logic Networks (MLN) have become a powerful framework in logical and statistical modeling.

More information

Coreference Resolution with Knowledge. Haoruo Peng March 20, 2015

Coreference Resolution with Knowledge. Haoruo Peng March 20, 2015 Coreference Resolution with Knowledge Haoruo Peng March 20, 2015 1 Coreference Problem 2 Coreference Problem 3 Online Demo Please check outhttp://cogcomp.cs.illinois.edu/page/demo_view/coref 4 General

More information

Automatic Domain Partitioning for Multi-Domain Learning

Automatic Domain Partitioning for Multi-Domain Learning Automatic Domain Partitioning for Multi-Domain Learning Di Wang diwang@cs.cmu.edu Chenyan Xiong cx@cs.cmu.edu William Yang Wang ww@cmu.edu Abstract Multi-Domain learning (MDL) assumes that the domain labels

More information

Tag Based Image Search by Social Re-ranking

Tag Based Image Search by Social Re-ranking Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule

More information

arxiv: v1 [cs.ai] 12 Jul 2015

arxiv: v1 [cs.ai] 12 Jul 2015 A Probabilistic Approach to Knowledge Translation Shangpu Jiang and Daniel Lowd and Dejing Dou Computer and Information Science University of Oregon, USA {shangpu,lowd,dou}@cs.uoregon.edu arxiv:1507.03181v1

More information

Graph Classification in Heterogeneous

Graph Classification in Heterogeneous Title: Graph Classification in Heterogeneous Networks Name: Xiangnan Kong 1, Philip S. Yu 1 Affil./Addr.: Department of Computer Science University of Illinois at Chicago Chicago, IL, USA E-mail: {xkong4,

More information

Learning Directed Probabilistic Logical Models using Ordering-search

Learning Directed Probabilistic Logical Models using Ordering-search Learning Directed Probabilistic Logical Models using Ordering-search Daan Fierens, Jan Ramon, Maurice Bruynooghe, and Hendrik Blockeel K.U.Leuven, Dept. of Computer Science, Celestijnenlaan 200A, 3001

More information

Bi-directional Joint Inference for Entity Resolution and Segmentation Using Imperatively-Defined Factor Graphs

Bi-directional Joint Inference for Entity Resolution and Segmentation Using Imperatively-Defined Factor Graphs Bi-directional Joint Inference for Entity Resolution and Segmentation Using Imperatively-Defined Factor Graphs Sameer Singh, Karl Schultz, and Andrew McCallum Department of Computer Science University

More information

Finding Topic-centric Identified Experts based on Full Text Analysis

Finding Topic-centric Identified Experts based on Full Text Analysis Finding Topic-centric Identified Experts based on Full Text Analysis Hanmin Jung, Mikyoung Lee, In-Su Kang, Seung-Woo Lee, Won-Kyung Sung Information Service Research Lab., KISTI, Korea jhm@kisti.re.kr

More information

Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs

Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs Alessandro Epasto J. Feldman*, S. Lattanzi*, S. Leonardi, V. Mirrokni*. *Google Research Sapienza U. Rome Motivation Recommendation

More information

Clustering. Bruno Martins. 1 st Semester 2012/2013

Clustering. Bruno Martins. 1 st Semester 2012/2013 Departamento de Engenharia Informática Instituto Superior Técnico 1 st Semester 2012/2013 Slides baseados nos slides oficiais do livro Mining the Web c Soumen Chakrabarti. Outline 1 Motivation Basic Concepts

More information

Constraint-Based Entity Matching

Constraint-Based Entity Matching Constraint-Based Entity Matching Warren Shen Xin Li AnHai Doan University of Illinois, Urbana, USA {whshen, xli1, anhai}@cs.uiuc.edu Abstract Entity matching is the problem of deciding if two given mentions

More information

A Collective, Probabilistic Approach to Schema Mapping

A Collective, Probabilistic Approach to Schema Mapping A Collective, Probabilistic Approach to Schema Mapping Angelika Kimmig, Alex Memory, Renée Miller, Lise Getoor ILP 2017 (published at ICDE 2017) 1 Context: Data Exchange & source emp id company 1 Alice

More information

Scholarly Big Data: Leverage for Science

Scholarly Big Data: Leverage for Science Scholarly Big Data: Leverage for Science C. Lee Giles The Pennsylvania State University University Park, PA, USA giles@ist.psu.edu http://clgiles.ist.psu.edu Funded in part by NSF, Allen Institute for

More information

Collective Spammer Detection in Evolving Multi-Relational Social Networks

Collective Spammer Detection in Evolving Multi-Relational Social Networks + Collective Spammer Detection in Evolving Multi-Relational Social Networks Shobeir Fakhraei (University of Maryland) James Foulds (University of California, Santa Cruz) Madhusudana Shashanka (if(we) Inc.,

More information

Separating Objects and Clutter in Indoor Scenes

Separating Objects and Clutter in Indoor Scenes Separating Objects and Clutter in Indoor Scenes Salman H. Khan School of Computer Science & Software Engineering, The University of Western Australia Co-authors: Xuming He, Mohammed Bennamoun, Ferdous

More information

Logic and Reasoning in the Semantic Web (part I RDF/RDFS)

Logic and Reasoning in the Semantic Web (part I RDF/RDFS) Logic and Reasoning in the Semantic Web (part I RDF/RDFS) Fulvio Corno, Laura Farinetti Politecnico di Torino Dipartimento di Automatica e Informatica e-lite Research Group http://elite.polito.it Outline

More information

Structuring Linked Data Search Results Using Probabilistic Soft Logic

Structuring Linked Data Search Results Using Probabilistic Soft Logic Structuring Linked Data Search Results Using Probabilistic Soft Logic Duhai Alshukaili, Alvaro A.A. Fernandes, and Norman W. Paton School of Computer Science, University of Manchester Oxford Road, Manchester

More information

Machine Learning for Annotating Semantic Web Services

Machine Learning for Annotating Semantic Web Services Machine Learning for Annotating Semantic Web Services Machine Learning for Annotating Semantic Web Services Andreas Heß, Nicholas Kushmerick University College Dublin, Ireland {andreas.hess, nick}@ucd.ie

More information

arxiv: v1 [cs.ai] 11 Aug 2015

arxiv: v1 [cs.ai] 11 Aug 2015 Type-Constrained Representation Learning in Knowledge Graphs Denis Krompaß 1,2, Stephan Baier 1, and Volker Tresp 1,2 arxiv:1508.02593v1 [cs.ai] 11 Aug 2015 Siemens AG, Corporate Technology, Munich, Germany

More information

Theoretical Foundations of Clustering. Margareta Ackerman

Theoretical Foundations of Clustering. Margareta Ackerman Theoretical Foundations of Clustering Margareta Ackerman The Theory-Practice Gap Clustering is one of the most widely used tools for exploratory data analysis. Identifying target markets Constructing phylogenetic

More information

Bryan Smith May 2010

Bryan Smith May 2010 Bryan Smith May 2010 Tool (Onto2SMem) to generate declarative knowledge base in SMem from ontology Sound (if incomplete) inference Proof of concept Baseline implementation Semantic memory (SMem) Store

More information

Network Based Hard/Soft Information Fusion Data Association Process Gregory Tauer, Kedar Sambhoos, Rakesh Nagi (co-pi), Moises Sudit (co-pi)

Network Based Hard/Soft Information Fusion Data Association Process Gregory Tauer, Kedar Sambhoos, Rakesh Nagi (co-pi), Moises Sudit (co-pi) Network Based Hard/Soft Information Fusion Data Association Process Gregory Tauer, Kedar Sambhoos, Rakesh Nagi (co-pi), Moises Sudit (co-pi) Objectives: Formulate and implement a workable, quantitativelybased

More information

Music Recommendation with Implicit Feedback and Side Information

Music Recommendation with Implicit Feedback and Side Information Music Recommendation with Implicit Feedback and Side Information Shengbo Guo Yahoo! Labs shengbo@yahoo-inc.com Behrouz Behmardi Criteo b.behmardi@criteo.com Gary Chen Vobile gary.chen@vobileinc.com Abstract

More information

Semantic Web Test

Semantic Web Test Semantic Web Test 24.01.2017 Group 1 No. A B C D 1 X X X 2 X X 3 X X 4 X X 5 X X 6 X X X X 7 X X 8 X X 9 X X X 10 X X X 11 X 12 X X X 13 X X 14 X X 15 X X 16 X X 17 X 18 X X 19 X 20 X X 1. Which statements

More information

Interacting with Linked Data Part I: General Introduction

Interacting with Linked Data Part I: General Introduction Interacting with Linked Data Part I: General Introduction Agenda Part 0: Welcome Part I: General Introduction to Semantic Technologies Part II: Advanced Concepts Part III: OWLIM Part IV: Information Workbench-

More information

CSE 573: Artificial Intelligence Autumn 2010

CSE 573: Artificial Intelligence Autumn 2010 CSE 573: Artificial Intelligence Autumn 2010 Lecture 16: Machine Learning Topics 12/7/2010 Luke Zettlemoyer Most slides over the course adapted from Dan Klein. 1 Announcements Syllabus revised Machine

More information

XETA: extensible metadata System

XETA: extensible metadata System XETA: extensible metadata System Abstract: This paper presents an extensible metadata system (XETA System) which makes it possible for the user to organize and extend the structure of metadata. We discuss

More information

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu Presented by: Dimitri Galmanovich Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu 1 When looking for Unstructured data 2 Millions of such queries every day

More information

De#anonymizing,Social,Networks, and,inferring,private,attributes, Using,Knowledge,Graphs,

De#anonymizing,Social,Networks, and,inferring,private,attributes, Using,Knowledge,Graphs, De#anonymizing,Social,Networks, and,inferring,private,attributes, Using,Knowledge,Graphs, Jianwei Qian Illinois Tech Chunhong Zhang BUPT Xiang#Yang Li USTC,/Illinois Tech Linlin Chen Illinois Tech Outline

More information

Table of Contents. iii

Table of Contents. iii Current Web 1 1.1 Current Web History 1 1.2 Current Web Characteristics 2 1.2.1 Current Web Features 2 1.2.2 Current Web Benefits 3 1.2.3. Current Web Applications 3 1.3 Why the Current Web is not Enough

More information

OWL 2 Profiles. An Introduction to Lightweight Ontology Languages. Маркус Крёч (Markus Krötzsch) University of Oxford. KESW Summer School 2012

OWL 2 Profiles. An Introduction to Lightweight Ontology Languages. Маркус Крёч (Markus Krötzsch) University of Oxford. KESW Summer School 2012 University of Oxford Department of Computer Science OWL 2 Profiles An Introduction to Lightweight Ontology Languages Маркус Крёч (Markus Krötzsch) University of Oxford KESW Summer School 2012 Remark for

More information

Learning the Structure of Sum-Product Networks. Robert Gens Pedro Domingos

Learning the Structure of Sum-Product Networks. Robert Gens Pedro Domingos Learning the Structure of Sum-Product Networks Robert Gens Pedro Domingos w 20 10x O(n) X Y LL PLL CLL CMLL Motivation SPN Structure Experiments Review Learning Graphical Models Representation Inference

More information

Combinatorial Selection and Least Absolute Shrinkage via The CLASH Operator

Combinatorial Selection and Least Absolute Shrinkage via The CLASH Operator Combinatorial Selection and Least Absolute Shrinkage via The CLASH Operator Volkan Cevher Laboratory for Information and Inference Systems LIONS / EPFL http://lions.epfl.ch & Idiap Research Institute joint

More information

Knowledge Discovery and Data Mining 1 (VO) ( )

Knowledge Discovery and Data Mining 1 (VO) ( ) Knowledge Discovery and Data Mining 1 (VO) (707.003) Data Matrices and Vector Space Model Denis Helic KTI, TU Graz Nov 6, 2014 Denis Helic (KTI, TU Graz) KDDM1 Nov 6, 2014 1 / 55 Big picture: KDDM Probability

More information

New Approach to Graph Databases

New Approach to Graph Databases Paper PP05 New Approach to Graph Databases Anna Berg, Capish, Malmö, Sweden Henrik Drews, Capish, Malmö, Sweden Catharina Dahlbo, Capish, Malmö, Sweden ABSTRACT Graph databases have, during the past few

More information

Flexible Tools for the Semantic Web

Flexible Tools for the Semantic Web Flexible Tools for the Semantic Web (instead of Jans Aasman from Franz Inc.) Software Systems Group (STS) Hamburg University of Technology (TUHH) Hamburg-Harburg, Germany (and GmbH & Co. KG) 1 Flexible

More information

Statistical Physics of Community Detection

Statistical Physics of Community Detection Statistical Physics of Community Detection Keegan Go (keegango), Kenji Hata (khata) December 8, 2015 1 Introduction Community detection is a key problem in network science. Identifying communities, defined

More information

Efficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling with Application to Network Structure Prediction

Efficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling with Application to Network Structure Prediction Efficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling with Application to Network Structure Prediction Takuya Akiba U Tokyo Takanori Hayashi U Tokyo Nozomi Nori Kyoto

More information

ECLT 5810 Clustering

ECLT 5810 Clustering ECLT 5810 Clustering What is Cluster Analysis? Cluster: a collection of data objects Similar to one another within the same cluster Dissimilar to the objects in other clusters Cluster analysis Grouping

More information

Papers for comprehensive viva-voce

Papers for comprehensive viva-voce Papers for comprehensive viva-voce Priya Radhakrishnan Advisor : Dr. Vasudeva Varma Search and Information Extraction Lab, International Institute of Information Technology, Gachibowli, Hyderabad, India

More information

4 The StdTrip Process

4 The StdTrip Process 4 The StdTrip Process 4.1 The a priori Approach As discussed in section 2.8 the a priori approach emphasizes the reuse of widely adopted standards for database design as a means to secure future interoperability.

More information

Scalable Web Service Composition with Partial Matches

Scalable Web Service Composition with Partial Matches Scalable Web Service Composition with Partial Matches Adina Sirbu and Jörg Hoffmann Digital Enterprise Research Institute (DERI) University of Innsbruck, Austria firstname.lastname@deri.org Abstract. We

More information

Online Pattern Recognition in Multivariate Data Streams using Unsupervised Learning

Online Pattern Recognition in Multivariate Data Streams using Unsupervised Learning Online Pattern Recognition in Multivariate Data Streams using Unsupervised Learning Devina Desai ddevina1@csee.umbc.edu Tim Oates oates@csee.umbc.edu Vishal Shanbhag vshan1@csee.umbc.edu Machine Learning

More information

Entity and Knowledge Base-oriented Information Retrieval

Entity and Knowledge Base-oriented Information Retrieval Entity and Knowledge Base-oriented Information Retrieval Presenter: Liuqing Li liuqing@vt.edu Digital Library Research Laboratory Virginia Polytechnic Institute and State University Blacksburg, VA 24061

More information

WISE: Large Scale Content Based Web Image Search. Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley

WISE: Large Scale Content Based Web Image Search. Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley WISE: Large Scale Content Based Web Image Search Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley 1 A picture is worth a thousand words. Query by Images What leaf?

More information

Lecture 26: Missing data

Lecture 26: Missing data Lecture 26: Missing data Reading: ESL 9.6 STATS 202: Data mining and analysis December 1, 2017 1 / 10 Missing data is everywhere Survey data: nonresponse. 2 / 10 Missing data is everywhere Survey data:

More information

CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks

CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks Archana Sulebele, Usha Prabhu, William Yang (Group 29) Keywords: Link Prediction, Review Networks, Adamic/Adar,

More information

Presented By Aditya R Joshi Neha Purohit

Presented By Aditya R Joshi Neha Purohit Presented By Aditya R Joshi Neha Purohit Pellet What is Pellet? Pellet is an OWL- DL reasoner Supports nearly all of OWL 1 and OWL 2 Sound and complete reasoner Written in Java and available from http://

More information

ECLT 5810 Clustering

ECLT 5810 Clustering ECLT 5810 Clustering What is Cluster Analysis? Cluster: a collection of data objects Similar to one another within the same cluster Dissimilar to the objects in other clusters Cluster analysis Grouping

More information

Combining Gradient Boosting Machines with Collective Inference to Predict Continuous Values

Combining Gradient Boosting Machines with Collective Inference to Predict Continuous Values Combining Gradient Boosting Machines with Collective Inference to Predict Continuous Values Iman Alodah Computer Science Department Purdue University West Lafayette, Indiana 47906 Email: ialodah@purdue.edu

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

Integrating Probabilistic Reasoning with Constraint Satisfaction

Integrating Probabilistic Reasoning with Constraint Satisfaction Integrating Probabilistic Reasoning with Constraint Satisfaction IJCAI Tutorial #7 Instructor: Eric I. Hsu July 17, 2011 http://www.cs.toronto.edu/~eihsu/tutorial7 Getting Started Discursive Remarks. Organizational

More information

Complex Prediction Problems

Complex Prediction Problems Problems A novel approach to multiple Structured Output Prediction Max-Planck Institute ECML HLIE08 Information Extraction Extract structured information from unstructured data Typical subtasks Named Entity

More information

The Semantic Web. Mansooreh Jalalyazdi

The Semantic Web. Mansooreh Jalalyazdi 1 هو العليم 2 The Semantic Web Mansooreh Jalalyazdi 3 Content Syntactic web XML Add semantics Representation Language RDF, RDFS OWL Query languages 4 History of the Semantic Web Tim Berners-Lee vision

More information

Re-using Cool URIs: Entity Reconciliation Against LOD Hubs

Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak, Vassilios Peristeras LDOW 2011 Copyright 2009. All rights reserved. The Web of Data The Web of Data The Web of Data

More information

Learning Markov Logic Network Structure by Template Constructing

Learning Markov Logic Network Structure by Template Constructing Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2017 Learning Markov Logic Network Structure by Template Constructing Yingbei Tong Iowa State University Follow

More information

Exact Algorithms Lecture 7: FPT Hardness and the ETH

Exact Algorithms Lecture 7: FPT Hardness and the ETH Exact Algorithms Lecture 7: FPT Hardness and the ETH February 12, 2016 Lecturer: Michael Lampis 1 Reminder: FPT algorithms Definition 1. A parameterized problem is a function from (χ, k) {0, 1} N to {0,

More information

GRAPH-BASED KNOWLEDGE MODELS JELENA JOVANOVIĆ. Web:

GRAPH-BASED KNOWLEDGE MODELS JELENA JOVANOVIĆ.   Web: GRAPH-BASED KNOWLEDGE MODELS JELENA JOVANOVIĆ Email: jeljov@gmail.com Web: http://jelenajovanovic.net OVERVIEW 2 Graphs and semantic networks for knowledge representation Data and knowledge graphs in the

More information