{Ontology: Resource} x {Matching : Mapping} x {Schema : Instance} :: Components of the Same Challenge

Size: px
Start display at page:

Download "{Ontology: Resource} x {Matching : Mapping} x {Schema : Instance} :: Components of the Same Challenge"

Transcription

1 Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) {Ontology: Resource} x {Matching : Mapping} x {Schema : Instance} :: Components of the Same Challenge Amit P. Sheth Wright State University - Main Campus, amit.sheth@wright.edu Follow this and additional works at: Part of the Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, and the Science and Technology Studies Commons Repository Citation Sheth, A. P. (2006). {Ontology: Resource} x {Matching : Mapping} x {Schema : Instance} :: Components of the Same Challenge.. This Presentation is brought to you for free and open access by the The Ohio Center of Excellence in Knowledge-Enabled Computing (Kno.e.sis) at CORE Scholar. It has been accepted for inclusion in Kno.e.sis Publications by an authorized administrator of CORE Scholar. For more information, please contact corescholar@

2 {Ontology: Resource} x {Matching : Mapping} x {Schema : Instance} :: Components of the same challenge? Invited Talk, International Workshop on Ontology Matching collocated with the 5th International Semantic Web Conference ISWC-2006, November 5, 2006, Athens GA Professor Amit Sheth Special Thanks: Meena Nagarajan Acknowledgment: SemDis project, funded by NSF

3 Information System needs and Ontology Matching goals Generation III (information brokering) Generation II (mediators) 1990s Generation I (federated DB/ multidatabases) 1980s SemDis, ISIS Semantics (Ontology, Context, Relationships, KB) VideoAnywhere Semantic Web, some DL-II projects, InfoQuilt Semagix SCORE, Applied Semantics OBSERVER VisualHarness InfoHarness Mermaid DDTS Metadata (Domain model) InfoSleuth, KMed, DL-I projects Infoscopes, HERMES, SIMS, Garlic,TSIMMIS,Harvest, RUFUS,... Data (Schema, semantic data modeling) Multibase, MRDSM, ADDS, IISS, Omnibase,...

4 Information systems - From mediators to information brokering Mediators between heterogeneous information sources InfoHarness, VisualHarness, InfoSleuth, SIMS, Garlic etc. Circa End User Web Browsers End User Web Browsers End User Web Browsers IH Clients Internet Repository... 1 IH Server Repository m Metadata Database (Metabase) (Oracle) Image VisualHarness Architecture IH administrative tools Raw Data Audio Text Video Information Resources

5 Information systems - From mediators to information brokers Information brokers InfoQuilt, OBSERVER etc. INFORMATION CONSUMERS Corporations Universities User Query People Programs Government User Query User Query INFORMATION BROKERING Domain Specific Ontologies Information System Data Repository Information System Circa Newswires Corporations Universities Research Labs INFORMATION PROVIDERS

6 Need for querying across multiple ontologies OBSERVER IRM Interontologies Relationships... Ontologies Mappings/ Ontology Server Query Processor User Query Query Mappings/ Processor Ontology Server Mappings/ Ontology Server Query Processor... Repositories Circa 1994, Ontologies Repositories Ontologies

7 Ontology Matching goals Goals of ontology matching (and mapping, or integration) Shallow analysis to identify dependencies for integration Deeper analysis to create mappings for query based transformations / integration Integrate schemas to create a global schema Integrate instance bases Sheth, Review of a real world experience in database schema integration (Bellcore, ca. 1993)

8 Ontology Matching changing notions Given the distributed nature of modeling domains and metadata, the need for matching advanced to Information Integration Now Query processing not limited to multiple databases or ontologies, but multiple domains and sources of information Exploiting structured, semi-structured and unstructured data sources, multi-model Web sources

9 The process of Ontology Matching Different for purposes of merging / aligning ontologies Type of relationships that suffice to be discovered are limited to equivalence / inclusion / disjointness / overlap mappings Different for purposes of information integration to analytics to discovery Need for discovering more Complex mappings Named relationships / associations Graph based / numerical mappings

10 Top down and bottom up view to ontology matching Top Down: schema + instance integration to provide information integration

11 Top down and bottom up view to ontology matching Bottom up: exploit external data sources to drive schema matching

12 A step back DB vs. Ontology - Fundamental differences

13 Schema integration goals DB vs. Ontology DB schema integration goal Defining an integrated view of the data for all applications using the data. Ontology schema integration goal Defining an agreement between multiple ontology schemas modeled for the same domain.

14 Goals are different because of differences in: The modeling paradigms A database schema is a model for the data that one more applications intend to use. An ontology is a model of knowledge for a bounded region of interest (also known as a domain) Data vs. Knowledge : A DB instance base is not the same as an ontology instance base A database models data to be used by one or more applications An ontology models knowledge about a domain, independent of the application

15 Modeling Database vs. Ontology schemas - Fundamental differences Axis of comparison Modeling perspective Structure vs. Semantics Database schemas Intended to model data being used by one or more applications Emphasis while modeling is on structure of the tables Ontology schemas Intended to model a domain Emphasis while modeling is on the semantics of the domain emphasis on relationships, also facts/knowledge/ground truth

16 Agreement Instance metadata modeling / expressiveness Limited to a syntactic agreement between applications using the data Limited expressivity in capturing instance level metadata due to static schemas Symbolizes agreement of the modeling of a domain possibly used by applications in varying contexts. In both Choice cases of modeling however, affects the schema the possible is only an More expressive abstraction space of heterogeneities of the real world; modeling and paradigm the real power/semantics lies at the therefore the process of matching. instance level. Context of modeling Well defined by applications using the data Modeling of a domain irrespective of applications

17 The space of heterogeneities in DB schema integration Conflicts/Heterogeneities in DB schema integration Model / representation : relational vs. network vs. hierarchical models Structural / schematic : Domain Incompatibilities Entity Definition Incompatibilities Data Value Incompatibilities Abstraction level Incompatibilities Largely syntactic and structural; relatively few semantic conflicts Sheth/Kashyap 1992, Kim/Seo 1993, Kashyap/Sheth 1996)

18 The space of heterogeneities in ontology schema integration Conflicts/Heterogeneities in ontology schema integration Significant conflicts in perception of a domain semantic conflicts Other heterogeneities are similar to those in the DB world Model / representation : OWL/RDF ; topic maps etc. Structural : modeling as an entity vs. an attribute/property; generalization vs. abstraction etc. Largely semantic conflicts; comparable syntactic conflicts

19 Key Observations There are significant philosophical differences in how a DB schema and an Ontology schema are modeled In spite of these distinctions, many schema matching techniques overlap significantly. Have we advanced the state of art in ontology schema matching?

20 Schema Integration DB vs. Ontology Have we advanced the state of art?

21 Schema Integration techniques used Schema matching techniques Schema level DB Information exploited Ontology Syntactic Linguistic: Matching names, descriptions, namespaces etc. Constraint-based: Constraint matches on data types, value ranges, uniqueness, cardinalities etc. Matching Matching Table and class, column properties/ level relationship, names and attribute level constraints names and constraints

22 Schema Integration techniques used Schema matching techniques Schema level DB Information exploited Ontology Structural Constraint-based: Tree / Graph structure matching Matching structures of relational tables Matching class hierarchies and structures

23 Schema Integration techniques used Schema matching techniques Instance level DB Information exploited Ontology Linguistic IR techniques, word frequencies, key terms, combination of key terms etc. Constraint based Numerical value patterns, ranges useful for recognizing phone numbers etc. Hybrid approaches use a combination of all techniques

24 Discovered semantic relationships State of the art in DBs and Ontologies Relationships with set semantics: overlap / disjointness / exclusion / equivalence / subsumption Their logical encodings are what they mean Of more interest is discovering arbitrary named relationships Relationships such as works_for or causes have real-world semantics. Their encoding in first order logic lacks semantic grounding. Matching and mapping closely tied. Ability to capture complex mapping (e.g., semantic proximity) puts significantly different demand on matching

25 Key Observation DB and Ontology schema matching techniques overlap significantly Not much advancement since DB schema integration efforts Ontologies formalize the semantics of a domain, but matching is still primarily syntactic / structural. The semantics of named relationships is largely unexploited The real semantics lies in the relationships connecting entities Modeled as first class objects in Ontologies In DB, they are not explicit and have to be inferred

26 (Complex) named relationships and Ontology Matching

27 (Complex) named relationships - example VOLCANO ENVIRON. LOCATION PYROCLASTIC FLOW ASH RAIN BUILDING DESTROYS WEATHER COOLS TEMP PLANT PEOPLE LOCATION DESTROYS KILLS

28 Discovering such (complex) named relationships Matching techniques have exhausted Schema + Instance properties Ontology modeling de couples schema + instance base Tremendous opportunity to exploit knowledge present outside the ontology knowledge base (External structured, semi-structured and unstructured data sources)

29 Knowledge discovery and validation Query and update Relevant docs Prediction of - Pathways - Symptoms of Diseases - Other complex relationship PubMe d etc. DBs

30 A Vision for Ontology Matching : Discovering simple to complex matches from schema, instances and corpus SIMPLE TO COMPLEX MATCHES Ontologies Possible identifiable matches: equivalence / inclusion / overlap / disjointness Semantic metadata Possible to identify more complex relationships from the corpus. Today, the Food and Drug Administration ( FDA ) is announcing that it has asked Pfizer, Inc. to voluntarily withdraw Bextra from the market. Pfizer has agreed to suspend sales and marketing of Bextra in the, pending further discussions with the agency. Heterogeneous data

31 Corpus based schema matching

32 The Intuition Biologically active substance Lipid instance_of Fish Oils affects causes causes affects??????? UMLS complicates Disease or Syndrome instance_of Raynaud s Disease MeSH 9284 documents 5 documents 4733 documents PubMed

33 The Method Identify entities and Relationships in Parse Tree Modifiers Modified entities Composite Entities

34 Key Observation What is interesting is not the entity estrogen or endometrium Current KR frameworks do not model this. Capturing this might affect the way we The real think knowledge of matching lies and in mapping. the complex and modified entities an excessive endogeneous stimulation by estrogen

35 Converting candidate relationships to ontology matches Linguistic and statistical challenges: Variations of entities, relationships and associations Translating instance level findings to the schema level GOING FROM several discovered relationships like Deficiency in migraine causes Migraine TO substance X causes condition Y

36 Discovery vs. Validation of relationships two sides of the coin Discovering complex relationships from text is a hard problem Natural Language challenges (not all sentences are well formed) Validating complex relationships / hypothesis is relatively simpler

37 Corpus based Hypothesis validation Migraine Patient affectedby Does magnesium alleviate effects of migraine in patients? Magnesium One possible Stress hypothesized inhibit connection isa between magnesium and migraine. Calcium Channel Blockers Complex Query PubMed Supporting Document sets retrieved

38 From matching to mappings several challenges Mappings are not always simple mathematical / string transformations Examples of complex mappings Associations / paths between classes Graph based / form fitting functions author _ of E 2 : Paper author _ of E 6 : Person author _ of E 1 : Reviewer author _ of E 7 : Submission E 4 : Paper knows author _ of E 3 : Person knows E 5 : Person Number of earthquakes with magnitude > 7 almost constant. So if at all, then nuclear tests only cause earthquakes with magnitude < 7

39 The take home message

40 A world beyond simple matches and mappings The distinction between schema and instances is slowly disappearing Need to go beyond Integrating well-mannered new and external schemas data and sources, mining and analyzing them is gaining importance. knowledge representations; and relatively simpler mappings Tremendous opportunities and challenges in using more information than what is modeled in a schema and captured in an instance base.

41 For more information LSDIS Lab: Kno.e.sis Center:

Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Perceptions

Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Perceptions Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 7-19-2011 Citizen Sensing: Opportunities and Challenges in Mining Social

More information

Semantic Web Applications in Industry, Government, Health Care and Life Sciences

Semantic Web Applications in Industry, Government, Health Care and Life Sciences Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 4-18-2007 Semantic Web Applications in Industry, Government, Health

More information

Sensor Data Management

Sensor Data Management Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 8-14-2007 Sensor Data Management Cory Andrew Henson Wright State University

More information

Trailblazing, Complex Hypothesis Evaluation, Abductive Reasoning and Semantic Web - Exploring Possible Synergy

Trailblazing, Complex Hypothesis Evaluation, Abductive Reasoning and Semantic Web - Exploring Possible Synergy Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 8-2007 Trailblazing, Complex Hypothesis Evaluation, Abductive Reasoning

More information

Semantic Web Technology Evaluation Ontology (SWETO): A Test Bed for Evaluating Tools and Benchmarking Applications

Semantic Web Technology Evaluation Ontology (SWETO): A Test Bed for Evaluating Tools and Benchmarking Applications Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 5-22-2004 Semantic Web Technology Evaluation Ontology (SWETO): A Test

More information

A Framework for Schema-Driven Relationship Discovery from Unstructured Text

A Framework for Schema-Driven Relationship Discovery from Unstructured Text Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 11-2006 A Framework for Schema-Driven Relationship Discovery from Unstructured

More information

Using Ontologies for Data and Semantic Integration

Using Ontologies for Data and Semantic Integration Using Ontologies for Data and Semantic Integration Monica Crubézy Stanford Medical Informatics, Stanford University ~~ November 4, 2003 Ontologies Conceptualize a domain of discourse, an area of expertise

More information

Semantics to Empower Services Science: Using Semantics at Middleware, Web Services and Business Levels

Semantics to Empower Services Science: Using Semantics at Middleware, Web Services and Business Levels Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 6-12-2007 Semantics to Empower Services Science: Using Semantics at

More information

SPARQL Query Re-Writing for Spatial Datasets Using Partonomy Based Transformation Rules

SPARQL Query Re-Writing for Spatial Datasets Using Partonomy Based Transformation Rules Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 12-2009 SPARQL Query Re-Writing for Spatial Datasets Using Partonomy

More information

SA-REST: Using Semantics to Empower RESTful Services and Smashups with Better Interoperability and Mediation

SA-REST: Using Semantics to Empower RESTful Services and Smashups with Better Interoperability and Mediation Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 5-22-2008 SA-REST: Using Semantics to Empower RESTful Services and

More information

A Modular Approach to Document Indexing and Semantic Search

A Modular Approach to Document Indexing and Semantic Search Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 7-2005 A Modular Approach to Document Indexing and Semantic Search

More information

iexplore: Interactive Browsing and Exploring Biomedical Knowledge

iexplore: Interactive Browsing and Exploring Biomedical Knowledge Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 11-2012 iexplore: Interactive Browsing and Exploring Biomedical Knowledge

More information

What is Text Mining? Sophia Ananiadou National Centre for Text Mining University of Manchester

What is Text Mining? Sophia Ananiadou National Centre for Text Mining   University of Manchester National Centre for Text Mining www.nactem.ac.uk University of Manchester Outline Aims of text mining Text Mining steps Text Mining uses Applications 2 Aims Extract and discover knowledge hidden in text

More information

Text mining tools for semantically enriching the scientific literature

Text mining tools for semantically enriching the scientific literature Text mining tools for semantically enriching the scientific literature Sophia Ananiadou Director National Centre for Text Mining School of Computer Science University of Manchester Need for enriching the

More information

Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A Comprehensive Path Towards Event Monitoring and Situational Awareness

Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A Comprehensive Path Towards Event Monitoring and Situational Awareness Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 2-17-2009 Semantic Integration of Citizen Sensor Data and Multilevel

More information

METEOR-S Web service Annotation Framework with Machine Learning Classification

METEOR-S Web service Annotation Framework with Machine Learning Classification METEOR-S Web service Annotation Framework with Machine Learning Classification Nicole Oldham, Christopher Thomas, Amit Sheth, Kunal Verma LSDIS Lab, Department of CS, University of Georgia, 415 GSRC, Athens,

More information

Matthew S. Perry.

Matthew S. Perry. Matthew S. Perry http://knoesis.wright.edu/students/mperry msperry@gmail.com RESEARCH INTERESTS I have broad research interests in database systems, geographic information systems, and the Semantic Web.

More information

Opus: University of Bath Online Publication Store

Opus: University of Bath Online Publication Store Patel, M. (2004) Semantic Interoperability in Digital Library Systems. In: WP5 Forum Workshop: Semantic Interoperability in Digital Library Systems, DELOS Network of Excellence in Digital Libraries, 2004-09-16-2004-09-16,

More information

Extending SPARQL to Support Spatially and Temporally Related Information

Extending SPARQL to Support Spatially and Temporally Related Information Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 6-16-2009 Extending SPARQL to Support Spatially and Temporally Related

More information

Precise Medication Extraction using Agile Text Mining

Precise Medication Extraction using Agile Text Mining Precise Medication Extraction using Agile Text Mining Chaitanya Shivade *, James Cormack, David Milward * The Ohio State University, Columbus, Ohio, USA Linguamatics Ltd, Cambridge, UK shivade@cse.ohio-state.edu,

More information

Protégé-2000: A Flexible and Extensible Ontology-Editing Environment

Protégé-2000: A Flexible and Extensible Ontology-Editing Environment Protégé-2000: A Flexible and Extensible Ontology-Editing Environment Natalya F. Noy, Monica Crubézy, Ray W. Fergerson, Samson Tu, Mark A. Musen Stanford Medical Informatics Stanford University Stanford,

More information

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How

More information

Adding formal semantics to the Web

Adding formal semantics to the Web Adding formal semantics to the Web building on top of RDF Schema Jeen Broekstra On-To-Knowledge project Context On-To-Knowledge IST project about content-driven knowledge management through evolving ontologies

More information

ONTOLOGY MATCHING: A STATE-OF-THE-ART SURVEY

ONTOLOGY MATCHING: A STATE-OF-THE-ART SURVEY ONTOLOGY MATCHING: A STATE-OF-THE-ART SURVEY December 10, 2010 Serge Tymaniuk - Emanuel Scheiber Applied Ontology Engineering WS 2010/11 OUTLINE Introduction Matching Problem Techniques Systems and Tools

More information

InfoHarness: Managing Distributed, Heterogeneous Information

InfoHarness: Managing Distributed, Heterogeneous Information Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 1999 InfoHarness: Managing Distributed, Heterogeneous Information Kshitij

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

Ontology-driven Integration and Analysis for Semantic Applications in Business Intelligence and National Security

Ontology-driven Integration and Analysis for Semantic Applications in Business Intelligence and National Security Ontology-driven Integration and Analysis for Semantic Applications in Business Intelligence and National Security Ontology and Semantic Web Technical Exchange Meeting MITRE, McLean, June 12, 2003 Amit

More information

Text Mining. Representation of Text Documents

Text Mining. Representation of Text Documents Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,

More information

INTEROPERABILITY IN INFORMATION SYSTEMS: FROM SYSTEM, SYNTAX, STRUCTURE TO SEMANTICS Amit P. Sheth

INTEROPERABILITY IN INFORMATION SYSTEMS: FROM SYSTEM, SYNTAX, STRUCTURE TO SEMANTICS Amit P. Sheth Interoperating Geographic Information Systems M F Goodchild, M J Egenhofer, R Fegeas and C A Kottman (eds). Kluwer. 0 CHANGING FOCUS ON INTEROPERABILITY IN INFORMATION SYSTEMS: FROM SYSTEM, SYNTAX, STRUCTURE

More information

Semantic Brokering over Dynamic Heterogeneous Web Resources. Anne H. H. Ngu. Department of Computer Science Southwest Texas State University

Semantic Brokering over Dynamic Heterogeneous Web Resources. Anne H. H. Ngu. Department of Computer Science Southwest Texas State University Semantic Brokering over Dynamic Heterogeneous Web s Anne H. H. Ngu Department of Computer Science Southwest Texas State University November 2002 Overview Objectives of data integration in InfoSleuth system.

More information

FIBO Shared Semantics. Ontology-based Financial Standards Thursday Nov 7 th 2013

FIBO Shared Semantics. Ontology-based Financial Standards Thursday Nov 7 th 2013 FIBO Shared Semantics Ontology-based Financial Standards Thursday Nov 7 th 2013 FIBO Conceptual and Operational Ontologies: Two Sides of a Coin FIBO Business Conceptual Ontologies Primarily human facing

More information

Part I: Data Mining Foundations

Part I: Data Mining Foundations Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?

More information

Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains

Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains Eloise Currie and Mary Parmelee SAS Institute, Cary NC About SAS: The Power to Know SAS: The Market Leader in

More information

DATA IS DEAD WITHOUT WHAT-IF MODELS

DATA IS DEAD WITHOUT WHAT-IF MODELS DATA IS DEAD WITHOUT WHAT-IF MODELS Peter J. Haas, Paul P. Maglio, Patricia G. Selinger, and Wang-Chiew Tan IBM Almaden Research Center Congratulations, Database Community! Transactions & Reports, IMS

More information

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD 10 Text Mining Munawar, PhD Definition Text mining also is known as Text Data Mining (TDM) and Knowledge Discovery in Textual Database (KDT).[1] A process of identifying novel information from a collection

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

Semantic Web. Semantic Web Services. Morteza Amini. Sharif University of Technology Spring 90-91

Semantic Web. Semantic Web Services. Morteza Amini. Sharif University of Technology Spring 90-91 بسمه تعالی Semantic Web Semantic Web Services Morteza Amini Sharif University of Technology Spring 90-91 Outline Semantic Web Services Basics Challenges in Web Services Semantics in Web Services Web Service

More information

CORE Scholar. Wright State University. Amit P. Sheth Wright State University - Main Campus,

CORE Scholar. Wright State University. Amit P. Sheth Wright State University - Main Campus, Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 9-28-2007 Realizing the Relationship Web: Morphing Information Access

More information

SELF-SERVICE SEMANTIC DATA FEDERATION

SELF-SERVICE SEMANTIC DATA FEDERATION SELF-SERVICE SEMANTIC DATA FEDERATION WE LL MAKE YOU A DATA SCIENTIST Contact: IPSNP Computing Inc. Chris Baker, CEO Chris.Baker@ipsnp.com (506) 721 8241 BIG VISION: SELF-SERVICE DATA FEDERATION Biomedical

More information

Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer

Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures Springer Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web

More information

Semantic Web: Promising Technologies and Current Applications in Health Care & Life Sciences

Semantic Web: Promising Technologies and Current Applications in Health Care & Life Sciences Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 7-25-2007 Semantic Web: Promising Technologies and Current Applications

More information

A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS

A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS Manoj Paul, S. K. Ghosh School of Information Technology, Indian Institute of Technology, Kharagpur 721302, India - (mpaul, skg)@sit.iitkgp.ernet.in

More information

The Semantic Sensor Network Ontology A Generic Language to Describe Sensor Assets

The Semantic Sensor Network Ontology A Generic Language to Describe Sensor Assets Ben Ridge Road Weather Station, South Esk River Catchment, Tasmania The Semantic Sensor Network Ontology A Generic Language to Describe Sensor Assets Holger Neuhaus Michael Compton Commonwealth Scientific

More information

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,

More information

IBM Research Report. Overview of Component Services for Knowledge Integration in UIMA (a.k.a. SUKI)

IBM Research Report. Overview of Component Services for Knowledge Integration in UIMA (a.k.a. SUKI) RC24074 (W0610-047) October 10, 2006 Computer Science IBM Research Report Overview of Component Services for Knowledge Integration in UIMA (a.k.a. SUKI) David Ferrucci, J. William Murdock, Christopher

More information

The Semantics of Semantic Interoperability: A Two-Dimensional Approach for Investigating Issues of Semantic Interoperability in Digital Libraries

The Semantics of Semantic Interoperability: A Two-Dimensional Approach for Investigating Issues of Semantic Interoperability in Digital Libraries The Semantics of Semantic Interoperability: A Two-Dimensional Approach for Investigating Issues of Semantic Interoperability in Digital Libraries EunKyung Chung, eunkyung.chung@usm.edu School of Library

More information

1. PUBLISHABLE SUMMARY

1. PUBLISHABLE SUMMARY D1.2.2. 12-Monthly Report FP7-ICT-2011.4.4 1. PUBLISHABLE SUMMARY This project has received funding from the European Union s Seventh Framework Programme for research, technological development and demonstration

More information

Text Mining and the. Text Mining and the Semantic Web. Semantic Web. Tim Finin. University of Maryland Baltimore County

Text Mining and the. Text Mining and the Semantic Web. Semantic Web. Tim Finin. University of Maryland Baltimore County Text Mining and the Text Mining and the Semantic Web Semantic Web Tim Finin University of Maryland Baltimore County recommend tell register Next Generation Data Mining Workshop Baltimore, November 2002

More information

The Model-Driven Semantic Web Emerging Standards & Technologies

The Model-Driven Semantic Web Emerging Standards & Technologies The Model-Driven Semantic Web Emerging Standards & Technologies Elisa Kendall Sandpiper Software March 24, 2005 1 Model Driven Architecture (MDA ) Insulates business applications from technology evolution,

More information

Acquiring Experience with Ontology and Vocabularies

Acquiring Experience with Ontology and Vocabularies Acquiring Experience with Ontology and Vocabularies Walt Melo Risa Mayan Jean Stanford The author's affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended

More information

Fausto Giunchiglia and Mattia Fumagalli

Fausto Giunchiglia and Mattia Fumagalli DISI - Via Sommarive 5-38123 Povo - Trento (Italy) http://disi.unitn.it FROM ER MODELS TO THE ENTITY MODEL Fausto Giunchiglia and Mattia Fumagalli Date (2014-October) Technical Report # DISI-14-014 From

More information

Lecture Telecooperation. D. Fensel Leopold-Franzens- Universität Innsbruck

Lecture Telecooperation. D. Fensel Leopold-Franzens- Universität Innsbruck Lecture Telecooperation D. Fensel Leopold-Franzens- Universität Innsbruck First Lecture: Introduction: Semantic Web & Ontology Introduction Semantic Web and Ontology Part I Introduction into the subject

More information

SciMiner User s Manual

SciMiner User s Manual SciMiner User s Manual Copyright 2008 Junguk Hur. All rights reserved. Bioinformatics Program University of Michigan Ann Arbor, MI 48109, USA Email: juhur@umich.edu Homepage: http://jdrf.neurology.med.umich.edu/sciminer/

More information

Contents Contents 1 Introduction Entity Types... 37

Contents Contents 1 Introduction Entity Types... 37 1 Introduction...1 1.1 Functions of an Information System...1 1.1.1 The Memory Function...3 1.1.2 The Informative Function...4 1.1.3 The Active Function...6 1.1.4 Examples of Information Systems...7 1.2

More information

Lily: Ontology Alignment Results for OAEI 2009

Lily: Ontology Alignment Results for OAEI 2009 Lily: Ontology Alignment Results for OAEI 2009 Peng Wang 1, Baowen Xu 2,3 1 College of Software Engineering, Southeast University, China 2 State Key Laboratory for Novel Software Technology, Nanjing University,

More information

Mining Class Hierarchies from XML Data: Representation Techniques

Mining Class Hierarchies from XML Data: Representation Techniques Mining Class Hierarchies from XML Data: Representation Techniques Paolo Ceravolo 1 and Ernesto Damiani 1 Department of Information Technology University of Milan Via Bramante, 65-26013 Crema (Italy) damiani,

More information

A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications

A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications Mid June, 2007 Department of Computer Science, University of Pise, Italy Why Semantic Web Biological information: an underused resource

More information

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016 + Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis kostas.stefanidis@uta.fi http://www.uta.fi/sis/tie/dbir/index.html http://people.uta.fi/~kostas.stefanidis/dbir16/dbir16-main.html

More information

0.1 Upper ontologies and ontology matching

0.1 Upper ontologies and ontology matching 0.1 Upper ontologies and ontology matching 0.1.1 Upper ontologies Basics What are upper ontologies? 0.1 Upper ontologies and ontology matching Upper ontologies (sometimes also called top-level or foundational

More information

Semantic Sensor Web. CORE Scholar. Wright State University. Amit P. Sheth Wright State University - Main Campus,

Semantic Sensor Web. CORE Scholar. Wright State University. Amit P. Sheth Wright State University - Main Campus, Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 12-17-2008 Semantic Sensor Web Amit P. Sheth Wright State University

More information

Introduction to Data Management for Ocean Science Research

Introduction to Data Management for Ocean Science Research Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological and Chemical Oceanography Data Management Office 12 November 2009 Ocean Acidification Short Course Woods Hole, MA USA

More information

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science Information Retrieval CS 6900 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Information Retrieval Information Retrieval (IR) is finding material of an unstructured

More information

Chapter 27 Introduction to Information Retrieval and Web Search

Chapter 27 Introduction to Information Retrieval and Web Search Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval

More information

MDA & Semantic Web Services Integrating SWSF & OWL with ODM

MDA & Semantic Web Services Integrating SWSF & OWL with ODM MDA & Semantic Web Services Integrating SWSF & OWL with ODM Elisa Kendall Sandpiper Software March 30, 2006 Level Setting An ontology specifies a rich description of the Terminology, concepts, nomenclature

More information

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS 1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,

More information

Data Integration and Data Warehousing Database Integration Overview

Data Integration and Data Warehousing Database Integration Overview Data Integration and Data Warehousing Database Integration Overview Sergey Stupnikov Institute of Informatics Problems, RAS ssa@ipi.ac.ru Outline Information Integration Problem Heterogeneous Information

More information

Knowledge Engineering with Semantic Web Technologies

Knowledge Engineering with Semantic Web Technologies This file is licensed under the Creative Commons Attribution-NonCommercial 3.0 (CC BY-NC 3.0) Knowledge Engineering with Semantic Web Technologies Lecture 5: Ontological Engineering 5.3 Ontology Learning

More information

Semantics Enhanced Services: METEOR-S, SAWSDL and SA-REST

Semantics Enhanced Services: METEOR-S, SAWSDL and SA-REST Semantics Enhanced Services: METEOR-S, SAWSDL and SA-REST Amit P. Sheth, Karthik Gomadam, Ajith Ranabahu Services Research Lab, kno.e.sis center, Wright State University, Dayton, OH {amit,karthik, ajith}@knoesis.org

More information

Web Services Annotation and Reasoning

Web Services Annotation and Reasoning Web Services Annotation and Reasoning, W3C Workshop on Frameworks for Semantics in Web Services Web Services Annotation and Reasoning Peter Graubmann, Evelyn Pfeuffer, Mikhail Roshchin Siemens AG, Corporate

More information

Ranking Documents Semantically Using Ontological Relationships

Ranking Documents Semantically Using Ontological Relationships Ranking Documents Semantically Using Ontological Relationships Boanerges Aleman-Meza +, I. Budak Arpinar *, Mustafa V. Nural * and Amit P. Sheth δ + Rice University, Houston, TX, 77005, USA, ba8@rice.edu

More information

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND 41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia

More information

A method for recommending ontology alignment strategies

A method for recommending ontology alignment strategies A method for recommending ontology alignment strategies He Tan and Patrick Lambrix Department of Computer and Information Science Linköpings universitet, Sweden This is a pre-print version of the article

More information

Database Heterogeneity

Database Heterogeneity Database Heterogeneity Lecture 13 1 Outline Database Integration Wrappers Mediators Integration Conflicts 2 1 1. Database Integration Goal: providing a uniform access to multiple heterogeneous information

More information

Introduction to Data Management. Lecture #3 (Conceptual DB Design) Instructor: Chen Li

Introduction to Data Management. Lecture #3 (Conceptual DB Design) Instructor: Chen Li Introduction to Data Management Lecture #3 (Conceptual DB Design) Instructor: Chen Li 1 Announcements v HW #1 is now available v Today s plan Conceptual DB design, cont. Advanced ER concepts 2 Weak Entities

More information

An Improving for Ranking Ontologies Based on the Structure and Semantics

An Improving for Ranking Ontologies Based on the Structure and Semantics An Improving for Ranking Ontologies Based on the Structure and Semantics S.Anusuya, K.Muthukumaran K.S.R College of Engineering Abstract Ontology specifies the concepts of a domain and their semantic relationships.

More information

Data Warehousing Fundamentals by Mark Peco

Data Warehousing Fundamentals by Mark Peco Data Warehousing Fundamentals by Mark Peco All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks of their

More information

Review for Exam 1 CS474 (Norton)

Review for Exam 1 CS474 (Norton) Review for Exam 1 CS474 (Norton) What is a Database? Properties of a database Stores data to derive information Data in a database is, in general: Integrated Shared Persistent Uses of Databases The Integrated

More information

New Approach to Graph Databases

New Approach to Graph Databases Paper PP05 New Approach to Graph Databases Anna Berg, Capish, Malmö, Sweden Henrik Drews, Capish, Malmö, Sweden Catharina Dahlbo, Capish, Malmö, Sweden ABSTRACT Graph databases have, during the past few

More information

Interoperability ~ An Introduction

Interoperability ~ An Introduction Interoperability ~ An Introduction Cyndy Chandler Biological and Chemical Oceanography Data Management Office (BCO-DMO) Woods Hole Oceanographic Institution 26 July 2008 MMI OOS Interoperability Planning

More information

Towards Summarizing the Web of Entities

Towards Summarizing the Web of Entities Towards Summarizing the Web of Entities contributors: August 15, 2012 Thomas Hofmann Director of Engineering Search Ads Quality Zurich, Google Switzerland thofmann@google.com Enrique Alfonseca Yasemin

More information

Parmenides. Semi-automatic. Ontology. construction and maintenance. Ontology. Document convertor/basic processing. Linguistic. Background knowledge

Parmenides. Semi-automatic. Ontology. construction and maintenance. Ontology. Document convertor/basic processing. Linguistic. Background knowledge Discover hidden information from your texts! Information overload is a well known issue in the knowledge industry. At the same time most of this information becomes available in natural language which

More information

Information Management Fundamentals by Dave Wells

Information Management Fundamentals by Dave Wells Information Management Fundamentals by Dave Wells All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks

More information

Semantics and Ontologies for Geospatial Information. Dr Kristin Stock

Semantics and Ontologies for Geospatial Information. Dr Kristin Stock Semantics and Ontologies for Geospatial Information Dr Kristin Stock Introduction The study of semantics addresses the issue of what data means, including: 1. The meaning and nature of basic geospatial

More information

Knowledge Integration Environment

Knowledge Integration Environment Knowledge Integration Environment Aka Knowledge is Everything D.Sottara, PhD OMG Technical Meeting Spring 2013, Reston, VA Outline Part I The Consolidated Past : Drools 5.x Drools Expert Object-Oriented,

More information

Describe The Differences In Meaning Between The Terms Relation And Relation Schema

Describe The Differences In Meaning Between The Terms Relation And Relation Schema Describe The Differences In Meaning Between The Terms Relation And Relation Schema describe the differences in meaning between the terms relation and relation schema. consider the bank database of figure

More information

Ontologies and Database Schema: What s the Difference? Michael Uschold, PhD Semantic Arts.

Ontologies and Database Schema: What s the Difference? Michael Uschold, PhD Semantic Arts. Ontologies and Database Schema: What s the Difference? Michael Uschold, PhD Semantic Arts. Objective To settle once and for all the question: What is the difference between an ontology and a database schema?

More information

Semantic Technologies for Nuclear Knowledge Modelling and Applications

Semantic Technologies for Nuclear Knowledge Modelling and Applications Semantic Technologies for Nuclear Knowledge Modelling and Applications D. Beraha 3 rd International Conference on Nuclear Knowledge Management 7.-11.11.2016, Vienna, Austria Why Semantics? Machines understanding

More information

Document Retrieval using Predication Similarity

Document Retrieval using Predication Similarity Document Retrieval using Predication Similarity Kalpa Gunaratna 1 Kno.e.sis Center, Wright State University, Dayton, OH 45435 USA kalpa@knoesis.org Abstract. Document retrieval has been an important research

More information

A probabilistic logic incorporating posteriors of hierarchic graphical models

A probabilistic logic incorporating posteriors of hierarchic graphical models A probabilistic logic incorporating posteriors of hierarchic graphical models András s Millinghoffer, Gábor G Hullám and Péter P Antal Department of Measurement and Information Systems Budapest University

More information

2 Which Methodology for Building Ontologies? 2.1 A Work Still in Progress Many approaches (for a complete survey, the reader can refer to the OntoWeb

2 Which Methodology for Building Ontologies? 2.1 A Work Still in Progress Many approaches (for a complete survey, the reader can refer to the OntoWeb Semantic Commitment for Designing Ontologies: A Proposal Bruno Bachimont 1,Antoine Isaac 1;2, Raphaël Troncy 1;3 1 Institut National de l'audiovisuel, Direction de la Recherche 4, Av. de l'europe - 94366

More information

OKKAM-based instance level integration

OKKAM-based instance level integration OKKAM-based instance level integration Paolo Bouquet W3C RDF2RDB This work is co-funded by the European Commission in the context of the Large-scale Integrated project OKKAM (GA 215032) RoadMap Using the

More information

SEMANTIC SOLUTIONS FOR OIL & GAS: ROLES AND RESPONSIBILITIES

SEMANTIC SOLUTIONS FOR OIL & GAS: ROLES AND RESPONSIBILITIES SEMANTIC SOLUTIONS FOR OIL & GAS: ROLES AND RESPONSIBILITIES Jeremy Carroll, Ralph Hodgson, {jeremy,ralph}@topquadrant.com This paper is submitted to The W3C Workshop on Semantic Web in Energy Industries

More information

Semi-Automatic Conceptual Data Modeling Using Entity and Relationship Instance Repositories

Semi-Automatic Conceptual Data Modeling Using Entity and Relationship Instance Repositories Semi-Automatic Conceptual Data Modeling Using Entity and Relationship Instance Repositories Ornsiri Thonggoom, Il-Yeol Song, Yuan An The ischool at Drexel Philadelphia, PA USA Outline Long Term Research

More information

Information Retrieval and Knowledge Organisation

Information Retrieval and Knowledge Organisation Information Retrieval and Knowledge Organisation Knut Hinkelmann Content Information Retrieval Indexing (string search and computer-linguistic aproach) Classical Information Retrieval: Boolean, vector

More information

Open Data Integration. Renée J. Miller

Open Data Integration. Renée J. Miller Open Data Integration Renée J. Miller miller@northeastern.edu !2 Open Data Principles Timely & Comprehensive Accessible and Usable Complete - All public data is made available. Public data is data that

More information

Realizing the Army Net-Centric Data Strategy (ANCDS) in a Service Oriented Architecture (SOA)

Realizing the Army Net-Centric Data Strategy (ANCDS) in a Service Oriented Architecture (SOA) Realizing the Army Net-Centric Data Strategy (ANCDS) in a Service Oriented Architecture (SOA) A presentation to GMU/AFCEA symposium "Critical Issues in C4I" Michelle Dirner, James Blalock, Eric Yuan National

More information

Wither OWL in a knowledgegraphed, Linked-Data World?

Wither OWL in a knowledgegraphed, Linked-Data World? Wither OWL in a knowledgegraphed, Linked-Data World? Jim Hendler @jahendler Tetherless World Professor of Computer, Web and Cognitive Science Director, Rensselaer Institute for Data Exploration and Applications

More information

Today s topic CS347. Results list clustering example. Why cluster documents. Clustering documents. Lecture 8 May 7, 2001 Prabhakar Raghavan

Today s topic CS347. Results list clustering example. Why cluster documents. Clustering documents. Lecture 8 May 7, 2001 Prabhakar Raghavan Today s topic CS347 Clustering documents Lecture 8 May 7, 2001 Prabhakar Raghavan Why cluster documents Given a corpus, partition it into groups of related docs Recursively, can induce a tree of topics

More information

Causal Models for Scientific Discovery

Causal Models for Scientific Discovery Causal Models for Scientific Discovery Research Challenges and Opportunities David Jensen College of Information and Computer Sciences Computational Social Science Institute Center for Data Science University

More information

A cell-cycle knowledge integration framework

A cell-cycle knowledge integration framework A cell-cycle knowledge integration framework Erick Antezana Dept. of Plant Systems Biology. Flanders Interuniversity Institute for Biotechnology/Ghent University. Ghent BELGIUM. erant@psb.ugent.be http://www.psb.ugent.be/cbd/

More information