COMA A system for flexible combination of schema matching approaches. Hong-Hai Do, Erhard Rahm University of Leipzig, Germany dbs.uni-leipzig.
|
|
- Claire Ball
- 5 years ago
- Views:
Transcription
1 COMA A system for flexible combination of schema matching approaches Hong-Hai Do, Erhard Rahm University of Leipzig, Germany dbs.uni-leipzig.de
2 Content Motivation The COMA approach Comprehensive matcher library Flexible combination scheme Novel reuse-oriented match approach Evaluation setup and results Conclusions and future work 2
3 Motivation Schema matching: Finding semantic correspondences between two schemas Crucial step in many applications Data integration: mediators, data warehouses E-Business: XML message mapping... Currently manual, time-consuming, tedious Need for approaches to automate the task as much as possible PO1 PO2 Customer ShipTo DeliverTo BillTo custname custstreet shiptostreet shiptocity Address custcity custzip shiptozip City Street Zip PO1.ShipTo.shipToCity PO2.DeliverTo.Address.City 3
4 Individual Match Approaches Schema-based Instance-based Reuse-oriented Element Structure Element Element Structure Linguistic Constraintbased Constraintbased Linguistic Constraintbased Dictionaries Thesauries Previous match results Names Descriptions Types Keys Parents Children Leaves IR (word frequencies, key terms) Value pattern and ranges Survey paper [Rahm, Bernstein - VLDB Journal 01] 4
5 Combining Match Approaches Combination of match algorithms Hybrid: fixed combination, difficult to extend and improve currently most common: Cupid, SemInt, SimilarityFlooding, DIKE, MOMIS, TranScm Composite: combination of the results of independently executed matchers currently only for machine learning-based techniques: LSD, GLUE COMA: Framework for flexible COmbination of MAtch algorithms Extensible matcher library Combination scheme with various combination strategies 5
6 System Architecture Matcher Library Schema Import S1 S2 UserFeedback Match Iteration Similarity cube Matcher 1 Matcher 2 Matcher 3 Matcher execution Combination of match results S1 S2 S2 S1 Mapping User Interaction (optional) Combination Scheme 6
7 Combination Scheme 1. Aggregation of matcherspecific results Average, Max, Min, Weighted SmallLarge, LargeSmall, Both 2. Match direction Matchers S1 S2 Similarity cube S1 S2 Similarity matrix S1 S2 s1 s S2 S1 s2 s Match results [S1, S2, 0.7] Combined similarity 3. Selection of match candidates MaxN (Max1), Threshold, MaxDelta, Threshold+MaxN, Threshold+MaxDelta 4. Computation of combined similarity Dice, Average 7
8 Match Processing: Example S1 shiptocity Matcher1: 0.6 Matcher2: 0.8 S2 City 1. Aggregation shiptostreet Matcher1: 0.8 Matcher2: 0.4 S1 shiptocity Average: 0.7 S2 City 2. Direction shiptostreet Average: 0.6 S1 > S2 LargeSmall (Match candidates for smaller schema S2) S2 elements City S1 elements shiptocity shiptostreet Sim Selection Max1 S2 elements City S1 elements shiptocity Sim 0.7 Threshold(0.5) S2 elements S1 elements City shiptocity City shiptostreet Sim
9 Matcher Library Type Matcher Schema Info Auxiliary Info Constituent Matchers Simple Affix Element names n-gram Element names Soundex Element names EditDistance Element names Synonym Element names External dictionaries DataType Data types Data type compatibility table UserFeedback User-specified (mis-) matches Hybrid Name Element names Affix, 3-Gram, Synonym TypeName Data Types+Names DataType, Name NamePath Names+Paths Name Children Child elements TypeName Leaves Leaf elements TypeName Reuseoriented Schema Existing schema-level match results 9
10 Reuse-oriented Matching S1 firstname lastname m S FName LName m S2 Name S1 m S2 m = MatchCompose (m1, m2) firstname lastname Name The MatchCompose operation: Transitivity of element similarity Composition of similarity relationships Reuse of multiple match correspondences vs. reuse of single element-level correspondences from synonym tables, thesauries 10
11 Schema-level Reuse The Schema matcher: Search repository S1 Si, S2 Si Match- Compose Aggregation Direction Selection S1 S2 Match problem S1 Sj, Sj S2 Sk S1, S2 Sk Existing match results Similarity cube S1 S2 Match result Reuse complete match results at the schema level Exploit all possible reuse opportunities Limit negative effects of transitivity 11
12 Real-world Evaluation 5 real-world schemas (XML Purchase order), 10 match tasks CIDX, Excel, Noris, Paragon, Apertum from biztalk.org elements Systematic evaluation (automatic mode) 1 Series = 10 Experiments: Test of 1 configuration of (Matcher, Aggregation, Direction, Selection, Combined similarity) with 10 match tasks 12,312 series = 123,120 experiments Matchers Aggregation Direction Selection Combined Sim No reuse Reuse 5 single 11 combinations 2 single 12 combinations -Max -Average -Min -Max -Average -Min -LargeSmall -SmallLarge -Both -MaxN(1-4) -Delta( ) -Threshold( ) -Threshold(0.5)+ MaxN(1-4) -Threshold(0.5)+ Delta( ) -Average -Dice -Average Σ =
13 Match Quality Measures Comparison of automatically with manually (i.e. real) derived match correspondences Real matches Suggested matches A B C A: Missed matches B: Correct matches C: False matches Quality measures: SimilarityFlooding [ICDE02]: B Precision = B + C A+ C Overall = 1 A+ B Recall = B C = A+ B B A+ B 1 = Recall* 2 Precision Overall: post-match effort to add missed and to remove false matches; negative Overall no gain Computed for single experiments and averaged over 10 experiments for each series (average Overall, etc.) 13
14 Results: Combination Strategies (1) #Series #All Series = Min Overall Most no-reuse series have negative average Overall Good matcher/strategy: Positive average Overall High presence in higher Overall ranges Series share 100% 90% 80% 70% 60% 50% 40% 30% 20% Max Average is used by all series with average Overall > 0.6 Aggregation (2376 series/strategy) Min Average 10% 0% Min Overall Aggregation: Average (compensating) 14
15 Results: Combination Strategies (2) Series share Direction: Both (considering both directions) Selection: Threshold+Delta (above threshold + within tolerance) Combined similarity: Average (pessimistic) Matcher: All (combination of all hybrid matchers) 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% Best selection (228 series/strategy) Thr(0.8) Delta(0.02) MaxN(1) Thr(0.5)+MaxN(1) Thr(0.5)+Delta(0.02) 0% Min Overall Series share Series share 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% Sm alllarge Direction (2736 series/strategy) LargeSmall 0% Min Overall Computation of combined similarity (4104 series/strategy) 100% Dice Both Average Min Overall 15
16 Results: Single Matchers 0,9 1 0,8 0,7 0,6 0,5 0,4 0,3 0,2 0,1 0-0,1-0,2-0,3 NamePath TypeName No reuse a) Single matchers Leaves Children Name SchemaM Reuse SchemaA avg Precision avg Recall avg Overall SchemaM: Schema with manually derived (real) match results SchemaA: Schema with match results automatically derived using the default match operation Instability of some single (hybrid) matchers (negative Overall) because of shared elements E.g. DeliverTo.Address and BillTo.Address Considering hierarchical names (NamePath) more accurate Schema-level reuse very effective: Essential improvement over no-reuse hybrid matchers Reusing approved match results better than automatically derived match results 16
17 Results: Combined Match Approaches 1 0,9 0,8 0,7 0,6 0,5 0,4 0,3 0,2 0,1 0 All+SchemaM SchemaM+NamePath SchemaM+Name SchemaM+TypeName b) Matcher combinations avg Precision avg Recall avg Overall SchemaM+Leaves SchemaM+Children All NamePath+Leaves NamePath+Children NamePath+TypeName NamePath+Name All: Combination of all no-reuse hybrid matchers All+SchemaM: Combination of all no-reuse hybrid matchers and SchemaM Reuse matchers outperform no-reuse matchers Best no-reuse All : 0.73 average Overall (Precision 0.95, Recall 0.78) Best reuse All+SchemaM: 0.82 average Overall (Precision 0.93, Recall 0.89) Combinations outperform single hybrid matchers Combined matchers, e.g. All, consider many aspects at the same time NamePath+Leaves: effective scheme, considering paths to identify context of shared elements, and leaves to cope with structural conflicts 17
18 Results: Match Sensitivity Overall 1 0,9 0,8 0,7 0,6 0,5 0,4 0,3 0,2 0,1 0 Overall(No Reuse) #All Elements 1<->2 1<->3 2<->3 1<->4 2<->4 3<->4 1<->5 2<->5 3<->5 4<->5 Match tasks Overall(Manual Reuse) Impact of schema characteristics : Degrading match quality with increase of schema size Best combinations: no-reuse All and reuse-oriented All+Schema High stability across different match tasks Little tuning effort for the default match operation # Elements 18
19 Conclusions and Future Work The COMA framework Extensible matcher library, including novel reuse approach Powerful combination scheme for both specifying match operations and constructing new matchers from existing ones Comprehensive evaluation on real-world schemas High effectiveness on large schemas Reuse: essential improvement over no-reuse Composite approach as THE solution for matcher combination Future work Matchers: more powerful reuse strategies, instance-based matchers More intelligent combination strategies Application to more real-world scenarios, esp. in bioinformatics 19
Comparison of Schema Matching Evaluations
Comparison of Schema Matching Evaluations Hong-Hai Do, Sergey Melnik, and Erhard Rahm Department of Computer Science, University of Leipzig Augustusplatz 10-11, 04109 Leipzig, Germany {hong, melnik, rahm}@informatik.uni-leipzig.de
More informationOutline A Survey of Approaches to Automatic Schema Matching. Outline. What is Schema Matching? An Example. Another Example
A Survey of Approaches to Automatic Schema Matching Mihai Virtosu CS7965 Advanced Database Systems Spring 2006 April 10th, 2006 2 What is Schema Matching? A basic problem found in many database application
More informationXML Schema Matching Using Structural Information
XML Schema Matching Using Structural Information A.Rajesh Research Scholar Dr.MGR University, Maduravoyil, Chennai S.K.Srivatsa Sr.Professor St.Joseph s Engineering College, Chennai ABSTRACT Schema matching
More informationA Generic Algorithm for Heterogeneous Schema Matching
You Li, Dongbo Liu, and Weiming Zhang A Generic Algorithm for Heterogeneous Schema Matching You Li1, Dongbo Liu,3, and Weiming Zhang1 1 Department of Management Science, National University of Defense
More informationA Tool for Semi-Automated Semantic Schema Mapping: Design and Implementation
A Tool for Semi-Automated Semantic Schema Mapping: Design and Implementation Dimitris Manakanatas, Dimitris Plexousakis Institute of Computer Science, FO.R.T.H. P.O. Box 1385, GR 71110, Heraklion, Greece
More informationMatching Large XML Schemas
Matching Large XML Schemas Erhard Rahm, Hong-Hai Do, Sabine Maßmann University of Leipzig, Germany rahm@informatik.uni-leipzig.de Abstract Current schema matching approaches still have to improve for very
More informationGeneric Schema Matching with Cupid
Generic Schema Matching with Cupid Jayant Madhavan Philip A. Bernstein Erhard Rahm University of Washington Microsoft Corporation University of Leipzig jayant@cs.washington.edu philbe@microsoft.com rahm@informatik.uni-leipzig.de
More informationGeneric Model Management
Generic Model Management A Database Infrastructure for Schema Manipulation Philip A. Bernstein Microsoft Corporation April 29, 2002 2002 Microsoft Corp. 1 The Problem ithere is 30 years of DB Research
More informationNAME SIMILARITY MEASURES FOR XML SCHEMA MATCHING
NAME SIMILARITY MEASURES FOR XML SCHEMA MATCHING Ali El Desoukey Mansoura University, Mansoura, Egypt Amany Sarhan, Alsayed Algergawy Tanta University, Tanta, Egypt Seham Moawed Middle Delta Company for
More informationMatching Anatomy Ontologies
Matching Anatomy Ontologies - Quality and Performance Anika Groß Interdisciplinary Centre for Bioinformatics http://www.izbi.uni-leipzig.de Database Group Leipzig http://dbs dbs.uni-leipzig.de Leipzig,
More informationGeneric Schema Matching with Cupid
Generic Schema Matching with Cupid J. Madhavan, P. Bernstein, E. Rahm Overview Schema matching problem Cupid approach Experimental results Conclusion presentation by Viorica Botea The Schema Matching Problem
More informationGeneric Schema Matching with Cupid
Generic Schema Matching with Cupid Jayant Madhavan 1 Philip A. Bernstein Erhard Rahm 1 University of Washington Microsoft Corporation University of Leipzig jayant@cs.washington.edu philbe@microsoft.com
More informationRewrite Techniques for Performance Optimization of Schema Matching Processes
Rewrite Techniques for Performance Optimization of Schema Matching Processes Eric Peukert SAP Research 01187 Dresden, Germany eric.peukert@sap.com Henrike Berthold SAP Research 01187 Dresden, Germany henrike.berthold@sap.com
More informationSimilarity Flooding: A versatile Graph Matching Algorithm and its Application to Schema Matching
Similarity Flooding: A versatile Graph Matching Algorithm and its Application to Schema Matching Sergey Melnik, Hector Garcia-Molina (Stanford University), and Erhard Rahm (University of Leipzig), ICDE
More informationMASTER THESIS. Schema Matching and Automatic Web Data Extraction. carried out at the
... Georg Gottlob MASTER THESIS Schema Matching and Automatic Web Data Extraction carried out at the Institute of Information Systems Database and Artificial Intelligence Group of the Vienna University
More informationThesis report. A Data Transformation Walkthrough. Final research project, Universiteit Twente. R.J.G. van Bloem
Thesis report A Data Transformation Walkthrough Final research project, Universiteit Twente name student nr.: cluster: graduation advisor: 2nd graduation advisor: external company: external advisor: date:
More informationOn Matching Schemas Automatically
UNIVERSITÄT LEIPZIG Institut für Informatik On Matching Schemas Automatically Erhard Rahm Philip A. Bernstein Report Nr. 1 (2001) On Matching Schemas Automatically Erhard Rahm University of Leipzig Leipzig,
More informationA Survey of Schema-based Matching Approaches
A Survey of Schema-based Matching Approaches Pavel Shvaiko 1 and Jérôme Euzenat 2 1 University of Trento, Povo, Trento, Italy, pavel@dit.unitn.it 2 INRIA, Rhône-Alpes, France, Jerome.Euzenat@inrialpes.fr
More informationMapping Composition for Matching Large Life Science Ontologies
Mapping Composition for Matching Large Life Science Ontologies Anika Groß 1,2, Michael Hartung 1,2, Toralf Kirsten 2,3, Erhard Rahm 1,2 1 Department of Computer Science, University of Leipzig 2 Interdisciplinary
More informationIndustrial-Strength Schema Matching
Industrial-Strength Schema Matching Philip A. Bernstein, Sergey Melnik, Michalis Petropoulos, Christoph Quix Microsoft Research Redmond, WA, U.S.A. {philbe, melnik}@microsoft.com, mpetropo@cs.ucsd.edu,
More informationQuickMig - Automatic Schema Matching for Data Migration Projects
QuickMig - Automatic Schema Matching for Data Migration Projects Christian Drumm SAP Research, Karlsruhe, Germany christian.drumm@sap.com Matthias Schmitt SAP AG, St.Leon-Rot, Germany ma.schmitt@sap.com
More informationA LIBRARY OF SCHEMA MATCHING ALGORITHMS FOR DATASPACE MANAGEMENT SYSTEMS
A LIBRARY OF SCHEMA MATCHING ALGORITHMS FOR DATASPACE MANAGEMENT SYSTEMS A dissertation submitted to the University of Manchester for the degree of Master of Science in the Faculty of Engineering and Physical
More informationA Framework for Domain-Specific Interface Mapper (DSIM)
56 IJCSNS International Journal of Computer Science and Network Security, VOL.8 No.12, December 2008 A Framework for Domain-Specific Interface Mapper (DSIM) Komal Kumar Bhatia1 and A. K. Sharma2, YMCA
More informationAN APPROACH TO SCHEMA MAPPING GENERATION FOR DATA WAREHOUSING
Department of Computer Science and Engineering University of Texas at Arlington Arlington, TX 76019 AN APPROACH TO SCHEMA MAPPING GENERATION FOR DATA WAREHOUSING Karthik Jagannathan Technical Report CSE-2003-4
More informationA GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS
A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS Manoj Paul, S. K. Ghosh School of Information Technology, Indian Institute of Technology, Kharagpur 721302, India - (mpaul, skg)@sit.iitkgp.ernet.in
More informationMOMA - A Mapping-based Object Matching System
MOMA - A Mapping-based Object Matching System Andreas Thor University of Leipzig, Germany thor@informatik.uni-leipzig.de Erhard Rahm University of Leipzig, Germany rahm@informatik.uni-leipzig.de ABSTRACT
More information3 Classifications of ontology matching techniques
3 Classifications of ontology matching techniques Having defined what the matching problem is, we attempt at classifying the techniques that can be used for solving this problem. The major contributions
More informationRestricting the Overlap of Top-N Sets in Schema Matching
Restricting the Overlap of Top-N Sets in Schema Matching Eric Peukert SAP Research Dresden 01187 Dresden, Germany eric.peukert@sap.com Erhard Rahm University of Leipzig Leipzig, Germany rahm@informatik.uni-leipzig.de
More informationInteractive Schema Matching With Semantic Functions
Interactive Schema Matching With Semantic Functions Guilian Wang 1 Joseph Goguen 1 Young-Kwang Nam 2 Kai Lin 3 1 Department of Computer Science and Engineering, U.C. San Diego. {guilian, goguen}@cs.ucsd.edu
More informationInstance-based matching in COMA++
Erhard Rahm http://dbs.uni-leipzig.de June 3, 2009 Ontologies Ontology Matching Problem Match techniques and prototypes (e.g., GLUE) Instance-based matching in COMA++ Constraint- / Content-based Matching
More informationA Sequence-based Ontology Matching Approach
A Sequence-based Ontology Matching Approach Alsayed Algergawy, Eike Schallehn and Gunter Saake 1 Abstract. The recent growing of the Semantic Web requires the need to cope with highly semantic heterogeneities
More informationReducing the Cost of Validating Mapping Compositions by Exploiting Semantic Relationships
Reducing the Cost of Validating Mapping Compositions by Exploiting Semantic Relationships Eduard Dragut 1 and Ramon Lawrence 2 1 Department of Computer Science, University of Illinois at Chicago 2 Department
More informationXML Grammar Similarity: Breakthroughs and Limitations
XML Grammar Similarity: Breakthroughs and Limitations Joe TEKLI, Richard CHBEIR* and Kokou YETONGNON LE2I Laboratory University of Bourgogne Engineer s Wing BP 47870 21078 Dijon CEDEX FRANCE Phone: (+33)
More informationXBenchMatch: a Benchmark for XML Schema Matching Tools
XBenchMatch: a Benchmark for XML Schema Matching Tools Fabien Duchateau, Zohra Bellahsene, Ela Hunt To cite this version: Fabien Duchateau, Zohra Bellahsene, Ela Hunt. XBenchMatch: a Benchmark for XML
More informationPoster Session: An Indexing Structure for Automatic Schema Matching
Poster Session: An Indexing Structure for Automatic Schema Matching Fabien Duchateau LIRMM - UMR 5506 Université Montpellier 2 34392 Montpellier Cedex 5 - France duchatea@lirmm.fr Mark Roantree Interoperable
More informationComposing Mappings between Schemas using a Reference Ontology
Composing Mappings between Schemas using a Reference Ontology Eduard Dragut, Ramon Lawrence IDEA Lab, Department of Computer Science, University of Iowa Iowa City, IA, USA, 52242 {eduard-dragut, ramon-lawrence}@uiowa.edu
More informationEnabling Product Comparisons on Unstructured Information Using Ontology Matching
Enabling Product Comparisons on Unstructured Information Using Ontology Matching Maximilian Walther, Niels Jäckel, Daniel Schuster, and Alexander Schill Technische Universität Dresden, Faculty of Computer
More informationLeveraging Data and Structure in Ontology Integration
Leveraging Data and Structure in Ontology Integration O. Udrea L. Getoor R.J. Miller Group 15 Enrico Savioli Andrea Reale Andrea Sorbini DEIS University of Bologna Searching Information in Large Spaces
More informationDesigning a Benchmark for the Assessment of XML Schema Matching Tools
Designing a Benchmark for the Assessment of XML Schema Matching Tools Fabien Duchateau, Zohra Bellahsene To cite this version: Fabien Duchateau, Zohra Bellahsene. Designing a Benchmark for the Assessment
More informationBMatch: A Quality/Performance Balanced Approach for Large Scale Schema Matching
BMatch: A Quality/Performance Balanced Approach for Large Scale Schema Matching Fabien Duchateau 1 and Zohra Bellahsene 1 and Mathieu Roche 1 LIRMM - Université Montpellier 2 161 rue Ada 34000 Montpellier,
More informationOKKAM-based instance level integration
OKKAM-based instance level integration Paolo Bouquet W3C RDF2RDB This work is co-funded by the European Commission in the context of the Large-scale Integrated project OKKAM (GA 215032) RoadMap Using the
More informationA Flexible Approach for Planning Schema Matching Algorithms
A Flexible Approach for Planning Schema Matching Algorithms Fabien Duchateau, Zohra Bellahsene, Remi Coletta To cite this version: Fabien Duchateau, Zohra Bellahsene, Remi Coletta. A Flexible Approach
More informationSchema-based Semantic Matching: Algorithms, a System and a Testing Methodology
Schema-based Semantic Matching: Algorithms, a System and a Testing Methodology Abstract. Schema/ontology/classification matching is a critical problem in many application domains, such as, schema/ontology/classification
More informationXML Schema Automatic Matching Solution
XML Schema Automatic Matching Solution Huynh Quyet Thang, Vo Sy Nam Abstract Schema matching plays a key role in many different applications, such as schema integration, data integration, data warehousing,
More informationEffective Mapping Composition for Biomedical Ontologies
Effective Mapping Composition for Biomedical Ontologies Michael Hartung 1,2, Anika Gross 1,2, Toralf Kirsten 2, and Erhard Rahm 1,2 1 Department of Computer Science, University of Leipzig 2 Interdisciplinary
More informationPRIOR System: Results for OAEI 2006
PRIOR System: Results for OAEI 2006 Ming Mao, Yefei Peng University of Pittsburgh, Pittsburgh, PA, USA {mingmao,ypeng}@mail.sis.pitt.edu Abstract. This paper summarizes the results of PRIOR system, which
More informationXML Data in (Object-) Relational Databases
XML Data in (Object-) Relational Databases RNDr. Irena Mlýnková irena.mlynkova@mff.cuni.cz Charles University Faculty of Mathematics and Physics Department of Software Engineering Prague, Czech Republic
More informationMatching Schemas for Geographical Information Systems Using Semantic Information
Matching Schemas for Geographical Information Systems Using Semantic Information Christoph Quix, Lemonia Ragia, Linlin Cai, and Tian Gan Informatik V, RWTH Aachen University, Germany {quix, ragia, cai,
More informationOntology Matching Techniques: a 3-Tier Classification Framework
Ontology Matching Techniques: a 3-Tier Classification Framework Nelson K. Y. Leung RMIT International Universtiy, Ho Chi Minh City, Vietnam nelson.leung@rmit.edu.vn Seung Hwan Kang Payap University, Chiang
More informationTarget and source schemas may contain integrity constraints. source schema(s) assertions relating elements of the global schema to elements of the
Data integration Data Integration System: target (integrated) schema source schema (maybe more than one) assertions relating elements of the global schema to elements of the source schema(s) Target and
More informationSemantic Interoperability. Being serious about the Semantic Web
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA 1 Being serious about the Semantic Web It is not one person s ontology It is not several people s common
More informationCOMPUSOFT, An international journal of advanced computer technology, 4 (6), June-2015 (Volume-IV, Issue-VI)
ISSN:2320-0790 Duplicate Detection in Hierarchical XML Multimedia Data Using Improved Multidup Method Prof. Pramod. B. Gosavi 1, Mrs. M.A.Patel 2 Associate Professor & HOD- IT Dept., Godavari College of
More informationOntology Matching Using an Artificial Neural Network to Learn Weights
Ontology Matching Using an Artificial Neural Network to Learn Weights Jingshan Huang 1, Jiangbo Dang 2, José M. Vidal 1, and Michael N. Huhns 1 {huang27@sc.edu, jiangbo.dang@siemens.com, vidal@sc.edu,
More informationUniversity of Rome Tor Vergata GENOMA. GENeric Ontology Matching Architecture
University of Rome Tor Vergata GENOMA GENeric Ontology Matching Architecture Maria Teresa Pazienza +, Roberto Enea +, Andrea Turbati + + ART Group, University of Rome Tor Vergata, Via del Politecnico 1,
More informationarxiv: v1 [cs.db] 23 Feb 2016
SIFT: An Algorithm for Extracting Structural Information From Taxonomies Jorge Martinez-Gil, Software Competence Center Hagenberg (Austria), jorgemar@acm.org Keywords: Algorithms; Knowledge Engineering;
More informationOntology matching using vector space
University of Wollongong Research Online University of Wollongong in Dubai - Papers University of Wollongong in Dubai 2008 Ontology matching using vector space Zahra Eidoon University of Tehran, Iran Nasser
More informationD3.1 Micro-mapping composition
D3.1 Micro-mapping composition Author: Contributors: Dissemination: Contributing to: WP 3 Date: October 25, 2011 Revision: V1.4 Zoltan Miklos EPFL Quoc Viet Hung Nguyen, Nguyen Thanh Tam, Duong Chi Thang,
More informationOntology matching techniques: a 3-tier classification framework
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 Ontology matching techniques: a 3-tier classification framework Nelson
More informationSchema and constraints-based matching and merging of Topic Maps
Information Processing and Management 43 (2007) 930 945 www.elsevier.com/locate/infoproman Schema and constraints-based matching and merging of Topic Maps Jung-Mn Kim a, *, Hyopil Shin b, Hyoung-Joo Kim
More informationComparing Information Structures Used in the Maritime Defence and Security Domain
DRDC-RDDC-2016-P106 Comparing Information Structures Used in the Maritime Defence and Security Domain Anthony W. ISENOR a,1, Yannick ALLARD b, Michel MAYRAND b a Defence Research and Development Canada
More informationSemantic-based Optimal XML Schema Matching:
2010 International Conference on E-business, Management and Economics IPEDR vol.3 (2011) (2011) IACSIT Press, Hong Kong Semantic-based Optimal XML Schema Matching: A Mathematical Programming Approach Jaewook
More informationA User-Guided Approach for Large-Scale Multi-schema Integration
A User-Guided Approach for Large-Scale Multi-schema Integration Muhammad Wasimullah Khan 1 and Jelena Zdravkovic 2 1 School of Information and Communication Technology, Royal Institute of Technology 2
More informationAutomatic Complex Schema Matching Across Web Query Interfaces: A Correlation Mining Approach
Automatic Complex Schema Matching Across Web Query Interfaces: A Correlation Mining Approach BIN HE and KEVIN CHEN-CHUAN CHANG University of Illinois at Urbana-Champaign To enable information integration,
More informationChapter 3. The Multidimensional Model: Basic Concepts. Introduction. The multidimensional model. The multidimensional model
Chapter 3 The Multidimensional Model: Basic Concepts Introduction Multidimensional Model Multidimensional concepts Star Schema Representation Conceptual modeling using ER, UML Conceptual modeling using
More informationLiSTOMS: a Light-weighted Self-tuning Ontology Mapping System
2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology LiSTOMS: a Light-weighted Self-tuning Ontology Mapping System Zhen Zhen Junyi Shen Institute of Computer
More informationAn Experiment on the Matching and Reuse of XML Schemas
An Experiment on the Matching and Reuse of XML s Jianguo Lu 1, Shengrui Wang 2, Ju Wang 1 1 School of Computer Science, University of Windsor jlu@cs.uwindsor.ca, ju_wang@yahoo.com 2 Department of Computer
More informationA method for recommending ontology alignment strategies
A method for recommending ontology alignment strategies He Tan and Patrick Lambrix Department of Computer and Information Science Linköpings universitet, Sweden This is a pre-print version of the article
More informationInnovation Lab at Univ. of Leipzig on semantic Web Data Integration Funded by BMBF (German ministry for research and education)
Erhard Rahm http://dbs.uni-leipzig.de http://dbs.uni-leipzig.de/wdi-lab November 25, 2009 Innovation Lab at Univ. of Leipzig on semantic Web Data Integration Funded by BMBF (German ministry for research
More informationAn Improving for Ranking Ontologies Based on the Structure and Semantics
An Improving for Ranking Ontologies Based on the Structure and Semantics S.Anusuya, K.Muthukumaran K.S.R College of Engineering Abstract Ontology specifies the concepts of a domain and their semantic relationships.
More informationComparing Similarity Combination Methods for Schema Matching
Comparing Similarity Combination Methods for Schema Matching Eric Peukert SAP AG, SAP Research Chemnitzer Str. 48 01187 Dresden, Germany eric.peukert@sap.com Sabine Maßmann, Kathleen König WDI-Lab - Institut
More informationTRIE BASED METHODS FOR STRING SIMILARTIY JOINS
TRIE BASED METHODS FOR STRING SIMILARTIY JOINS Venkat Charan Varma Buddharaju #10498995 Department of Computer and Information Science University of MIssissippi ENGR-654 INFORMATION SYSTEM PRINCIPLES RESEARCH
More informationCombining Multiple Query Interface Matchers Using Dempster-Shafer Theory of Evidence
Combining Multiple Query Interface Matchers Using Dempster-Shafer Theory of Evidence Jun Hong, Zhongtian He and David A. Bell School of Electronics, Electrical Engineering and Computer Science Queen s
More informationIntroduction to Query Processing and Query Optimization Techniques. Copyright 2011 Ramez Elmasri and Shamkant Navathe
Introduction to Query Processing and Query Optimization Techniques Outline Translating SQL Queries into Relational Algebra Algorithms for External Sorting Algorithms for SELECT and JOIN Operations Algorithms
More informationInstance-based matching of hierarchical ontologies
Instance-based matching of hierarchical ontologies Andreas Thor, Toralf Kirsten, Erhard Rahm University of Leipzig {thor,tkirsten,rahm}@informatik.uni-leipzig.de Abstract: We study an instance-based approach
More informationA Flexible Approach Based on the user Preferences for Schema Matching
A Flexible Approach Based on the user Preferences for Schema Matching Wided Guédria Faculté des Sciences de Tunis, Campus universitaire, 1060 Tunis, Tunisie, Email: wided.guedria@lirmm.fr Zohra Bellahsène
More information38050 Povo Trento (Italy), Via Sommarive 14 A SURVEY OF SCHEMA-BASED MATCHING APPROACHES. Pavel Shvaiko and Jerome Euzenat
UNIVERSITY OF TRENTO DEPARTMENT OF INFORMATION AND COMMUNICATION TECHNOLOGY 38050 Povo Trento (Italy), Via Sommarive 14 http://www.dit.unitn.it A SURVEY OF SCHEMA-BASED MATCHING APPROACHES Pavel Shvaiko
More informationEvolution of the COMA Match System
Evolution of the COMA Match System Sabine Massmann, Salvatore Raunich, David Aumüller, Patrick Arnold, Erhard Rahm WDI Lab, Institute of Computer Science University of Leipzig, Leipzig, Germany {lastname}@informatik.uni-leipzig.de
More informationCreating a Mediated Schema Based on Initial Correspondences
Creating a Mediated Schema Based on Initial Correspondences Rachel A. Pottinger University of Washington Seattle, WA, 98195 rap@cs.washington.edu Philip A. Bernstein Microsoft Research Redmond, WA 98052-6399
More informationOn Matching Large Life Science Ontologies in Parallel
On Matching Large Life Science Ontologies in Parallel Anika Groß 1,2, Michael Hartung 1,2, Toralf Kirsten 2,3, Erhard Rahm 1,2 1 Department of Computer Science, University of Leipzig 2 Interdisciplinary
More informationXML Schema Structural Equivalence
XML Schema Structural Equivalence Angela C. Duta, Ken Barker, Reda Alhajj University of Calgary, Department of Computer Science 2500 University Drive N.W. Calgary, Alberta Canada, T2N 1N4 {duta, barker,
More informationRA: An XML Schema Reduction Algorithm
RA: An XML Schema Reduction Algorithm Angela C. Duta 1, Ken Barker 1, and Reda Alhajj 12 1 ADSA Laboratory, Department of Computer Science, University of Calgary 2500 University Drive NW, Calgary, Canada
More informationINTEROPERABILITY IN COLLABORATIVE NETWORK OF BIODIVERSITY ORGANIZATIONS
54 INTEROPERABILITY IN COLLABORATIVE NETWORK OF BIODIVERSITY ORGANIZATIONS Ozgul Unal and Hamideh Afsarmanesh University of Amsterdam, THE NETHERLANDS {ozgul, hamideh}@science.uva.nl Schematic and semantic
More informationSemantic Schema Matching
Semantic Schema Matching Fausto Giunchiglia, Pavel Shvaiko, and Mikalai Yatskevich University of Trento, Povo, Trento, Italy {fausto pavel yatskevi}@dit.unitn.it Abstract. We view match as an operator
More informationConfiguration for Registry*, Repository or Hybrid** Master Data Management. *aka Federated or Virtual **aka Coexistence
Configuration for Registry*, Repository or Hybrid** Master Data Management *aka Federated or Virtual **aka Coexistence October 2017 (Best viewed in slideshow mode) Revision 3.6 Copyright 2017 WhamTech,
More informationA survey of schema-based matching approaches
A survey of schema-based matching approaches Pavel Shvaiko, Jérôme Euzenat To cite this version: Pavel Shvaiko, Jérôme Euzenat. A survey of schema-based matching approaches. Journal on Data Semantics,
More informationDiscovering Complex Matchings across Web Query Interfaces: A Correlation Mining Approach
Discovering Complex Matchings across Web Query Interfaces: A Correlation Mining Approach Bin He, Kevin Chen-Chuan Chang, Jiawei Han Computer Science Department University of Illinois at Urbana-Champaign
More informationYet Another Matcher. Fabien Duchateau, Remi Coletta, Zohra Bellahsene, Renée Miller
Yet Another Matcher Fabien Duchateau, Remi Coletta, Zohra Bellahsene, Renée Miller To cite this version: Fabien Duchateau, Remi Coletta, Zohra Bellahsene, Renée Miller. Yet Another Matcher. RR-916, 29.
More informationAn Automatic Domain Independent Schema Matching in Integrating Schemas of Heterogeneous Relational Databases
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING XX, XXX-XXX (201X) An Automatic Domain Independent Schema Matching in Integrating Schemas of Heterogeneous Relational Databases HAMIDAH IBRAHIM 1, YASER KARASNEH
More informationSeMap: A Generic Schema Matching System
SeMap: A Generic Schema Matching System by Ting Wang B.Sc., Zhejiang University, 2004 A THESIS SUBMITTED IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Science in The Faculty of
More informationHotMatch Results for OEAI 2012
HotMatch Results for OEAI 2012 Thanh Tung Dang, Alexander Gabriel, Sven Hertling, Philipp Roskosch, Marcel Wlotzka, Jan Ruben Zilke, Frederik Janssen, and Heiko Paulheim Technische Universität Darmstadt
More informationCSE 880:Database Systems. ER Model and Relation Schemas
CSE 880:Database Systems ER Model and Relation Schemas 1 Major Steps for Database Design and Implementation 1. Requirements Collection and Analysis: Produces database requirements such as types of data,
More informationCaravela: Semantic Content Management with Automatic Information Integration and Categorization
Caravela: Semantic Content Management with Automatic Information Integration and Categorization (System Description) David Aumueller and Erhard Rahm University of Leipzig {david,rahm}@informatik.uni-leipzig.de
More informationETL and OLAP Systems
ETL and OLAP Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, first semester
More informationLearning mappings and queries
Learning mappings and queries Marie Jacob University Of Pennsylvania DEIS 2010 1 Schema mappings Denote relationships between schemas Relates source schema S and target schema T Defined in a query language
More informationElements of the E-R Model
Chapter 3: The Entity Relationship Model Agenda Basic Concepts of the E-R model (Entities, Attributes, Relationships) Basic Notations of the E-R model ER Model 1 Elements of the E-R Model E-R model was
More informationWhat you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web
What you have learned so far Interoperability Introduction to the Semantic Web Tutorial at ISWC 2010 Jérôme Euzenat Data can be expressed in RDF Linked through URIs Modelled with OWL ontologies & Retrieved
More informationData Cleansing. LIU Jingyuan, Vislab WANG Yilei, Theoretical group
Data Cleansing LIU Jingyuan, Vislab WANG Yilei, Theoretical group What is Data Cleansing Data cleansing (data cleaning) is the process of detecting and correcting (or removing) errors or inconsistencies
More informationAlgorithms for Query Processing and Optimization. 0. Introduction to Query Processing (1)
Chapter 19 Algorithms for Query Processing and Optimization 0. Introduction to Query Processing (1) Query optimization: The process of choosing a suitable execution strategy for processing a query. Two
More informationSemi-automatic Generation of Active Ontologies from Web Forms
Semi-automatic Generation of Active Ontologies from Web Forms Martin Blersch, Mathias Landhäußer, and Thomas Mayer (IPD) 1 KIT The Research University in the Helmholtz Association www.kit.edu "Create an
More informationSangam: A Framework for Modeling Heterogeneous Database Transformations
Sangam: A Framework for Modeling Heterogeneous Database Transformations Kajal T. Claypool University of Massachusetts-Lowell Lowell, MA Email: kajal@cs.uml.edu Elke A. Rundensteiner Worcester Polytechnic
More information