E-Fuice. Integration of E-Commerce Data

Similar documents
Data Querying, Extraction and Integration II: Applications. Recuperación de Información 2007 Lecture 5.

Instance-based matching of hierarchical ontologies

A Framework for Generic Integration of XML Sources. Wolfgang May Institut für Informatik Universität Freiburg Germany

Outline A Survey of Approaches to Automatic Schema Matching. Outline. What is Schema Matching? An Example. Another Example

Prabhat Shah. Finding the right products to sell on Amazon and best practise selling techniques. Online Seller UK

How does online survey mode affect answers to a customer feedback loyalty survey? PAPOR 2013 Aarti Gupta & Jason Lee

OWLS-SLR An OWL-S Service Profile Matchmaker

Instance-based matching in COMA++

Implementing Games User Research Processes Throughout Development: Beyond Playtesting

Approach for Mapping Ontologies to Relational Databases

Limitations of XPath & XQuery in an Environment with Diverse Schemes

A LEXICAL APPROACH FOR TAXONOMY MAPPING

Camera Phones: Handsets Hot, Prints Not

Fundamentals of Digital Media Lesson Objectives

Ontology Matching with CIDER: Evaluation Report for the OAEI 2008

Semantic Web and Web2.0. Dr Nicholas Gibbins

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

Case-based Recommendation. Peter Brusilovsky with slides of Danielle Lee

Changes to Underlying Architecture Impact Universal Search Results

Information Integration of Partially Labeled Data

AMAZON.COM RECOMMENDATIONS ITEM-TO-ITEM COLLABORATIVE FILTERING PAPER BY GREG LINDEN, BRENT SMITH, AND JEREMY YORK

Taxonomy Tools: Collaboration, Creation & Integration. Dow Jones & Company

CONFIGURING LDAP. What is LDAP? Getting Started with LDAP

An e-commerce Framework for Wearable Devices

DOC // POLAROID A520 DIGITAL CAMERA MANUAL ARCHIVE

A Survey of Schema-based Matching Approaches

Knowledge Engineering with Semantic Web Technologies

Slice Intelligence!

FUNimation UX Research Roadmap

HOW TO BUY A TICKET ONLINE GENERIC GUIDE

NBA 600: Day 15 Online Search 116 March Daniel Huttenlocher

OPENSTACK BEIJING CONFERENCE. by: Steven Hallett Head of Cloud Infrastructure Engineering and Operations

Bingo punch cards Bingo Bingo Bingo Bingo Bingo Bingo Bingo Punch Bingo Cards Punch Bingo Cards Bingo Cards bingo Bingo Cards PUNCH bingo

Introduction to the Semantic Web

Source Code Search System Using The Knowledge Framework of The Semantic Web. The Graduate School of Science and Technology Kobe University

DISNEY PIXAR INSIDE OUT: THE ESSENTIAL GUIDE (DK ESSENTIAL GUIDES) BY DK

A Tagging Approach to Ontology Mapping

WHAT IS WEB 3.0? Abstract. While the concept of Web2.0 has made a significant impact on the

How Big Data is Changing User Engagement

Welcome to Create Share Sell

JavaScript: The Definitive Guide By David Flanagan

Semantic Interoperability. Being serious about the Semantic Web

INCREASED PPC & WEBSITE CONVERSIONS. AMMA MARKETING Online Marketing - Websites Big Agency Background & Small Agency Focus

Amazon E-Commerce Service 4.0 Release Notes

38050 Povo Trento (Italy), Via Sommarive 14 A SURVEY OF SCHEMA-BASED MATCHING APPROACHES. Pavel Shvaiko and Jerome Euzenat

Zurich Open Repository and Archive. Private Cross-page Movie Recommendations with the Firefox add-on OMORE

Widgets for Faceted Browsing

Domain-specific Concept-based Information Retrieval System

D&R Advertisement Program

PDF - KODAK EASYSHARE Z760 CAMERA MEMORY CARD EBOOK

[EPUB] AKAI PRO MPK MINI MANUAL

For Immediate Release 20 January, 2015

Competitive Analysis. Client MSC Apex Software UX Team Defenders of UX 2nd Presentation Date

For Immediate Release 10 February, 2015

SAMSUNG DIGITAL CAMCORDER SCD23 MANUAL PDF

PANASONIC LUMIX DMC GH2 E-BOOK

Refining searches. Refine initially: query. Refining after search. Explicit user feedback. Explicit user feedback

AUDIO VIDEO & NAVIGATION SYSTEM OWNER S MANUAL

The 2 nd Generation Web - Opportunities and Problems

SIG-SWO-A OWL. Semantic Web

CONFIGURING ENTERGROUP. What is LDAP? Getting Started with LDAP

These phone numbers are for Human Resources use only.

vs Other CCTV cameras Multimedia & Communication Technology

Analysing Ferret XML Reports to Estimate the Density of Copied Code

Windows Azure Mobile Services

Instructions For Nintendo Wii U Release Date. Australian Price >>>CLICK HERE<<<

Linking FRBR Entities to LOD through Semantic Matching

Sitecore E-Commerce Cookbook

Graybar E-Commerce Site Welcome to our new online store! Easier to use Improved product search Streamlined availability and fulfillment

Enriching Ontology Mappings with Semantic Relations

Web Search Basics Introduction to Information Retrieval INF 141/ CS 121 Donald J. Patterson

Evaluation of ontology matching

DOC / PANASONIC LUMIX DMC G1K SERVICE MANUAL DOWNLOAD

In-class activities: Sep 25, 2017

Active Image Database Management Jau-Yuen Chen

: Semantic Web (2013 Fall)

Preliminary Design Report

Opus: University of Bath Online Publication Store

Part 12: Advanced Topics in Collaborative Filtering. Francesco Ricci

Matching Anatomy Ontologies

Learning & Development Online. Quick reference Guide. Version 2.0. Page 1

Disney Junior Encyclopedia Of Animated Characters: Including Characters From Your Favorite Disney*Pixar Films By Disney Book Group;M.L.

Metadata Standards and Applications. 6. Vocabularies: Attributes and Values

Innovation Lab at Univ. of Leipzig on semantic Web Data Integration Funded by BMBF (German ministry for research and education)

WINTER. GCSE Theory. Template. Lesson 6: Personal Digital Devices. PC s

Hello kitty digital camera driver optical zoom. Hello kitty digital camera driver optical zoom.zip

CONCEPT A [TYPE] CONCEPT B

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

GAMES CONTROL. Manual English Version GfK 2014 Games Control Manual

The Semantic Web & Ontologies

Enabling Product Comparisons on Unstructured Information Using Ontology Matching

Scalable Algorithmic Techniques Decompositions & Mapping. Alexandre David

E-BOOK // RICOH CAPLIO R10 DIGITAL CAMERA ARCHIVE

An Improving for Ranking Ontologies Based on the Structure and Semantics

Project Management Learning & coaching User Adoption E-learning Governance Support

A Location Model for Ambient Intelligence

Elementary Statistics Using The TI-83/84 Plus Calculator (3rd Edition) (Triola Statistics Series) By Mario F. Triola

THE MOBILE LOVE AFFAIR

Real-Time 3D Rendering With DirectX And HLSL: A Practical Guide To Graphics Programming (Game Design) By Paul Varcholik

Instructions For Using A Kindle Fire Hdx Screen

Transcription:

E-Fuice Integration of E-Commerce Data

Agenda ifuice applications in the E-Commerce domain E-Fuice prototype Matching instances and ontologies using ifuice scripts First results of matching Outlook

E-Fuice: Product Search and Recommendations For end users: Structured product search with query language: Find all products which conform to query conditions: (ObjectType= Product@Amazon OR ObjectType= Product@Hugendubel ) AND ProductType= Book AND Category = Fiction AND (Cover= Hardcover OR Audiobook=true) AND Price<80 Product recommendations based on different data sources: Following products from our assortment received the best client reviews on Amazon.de

E-Fuice: Product Comparison For end users: Product comparison: compare characteristics of different products : Digital-Kamera DC 4400 4 Mio. Pixel 3x optisches Zoom 4x digitales Zoom Digital-Kamera DC 50slim 5 Mio. Pixel 3x optisches Zoom 4x digitales Zoom 119.95 179.95 Price comparison: compare prices of identical/similar products in different shops : Quelle Canon IXUS 40 Preis: 289.00 Amazon IXUS 40 Preis: 229.00 Ebay CANON IXUS 40! NEUWARE! RECHNUNG! HÄNDLER! OVP Preis: 249.00

E-Fuice: Analysis of the Market Competition For business users: Analyze competitors product assortment: Product listings: which identical/similar products cost more/less at competitors shops? Price level: what is the average price of all products in the same category at competitors shops? Breadth of the price spectrum: how large is the price difference between the cheapest and the most expensive product in a product category? How large is the price variance?

E-Fuice Prototype Integrating 4 Online- Shops: www.amazon.com www.ebay.de www.softunity.com www.bestpreis24.de Product Assortment: Software Games, Game Console Accessories Movies (DVD,VHS)

E-Fuice Prototype For business users -- executing queries over integrated data from ifuice interface For end users -- Web portal featuring: Navigation in the integrated data Complex queries Recommendations

E-Fuice: Source Mapping Model Ontology@Softunity =466 Ontology@Amazon =1930, manually pruned Ontology@Ebay =1141 Product@Softunity =2711 Product@Amazon =93897 Product@Ebay =? (many) Product@Bestpreis24 Review@Amazon

Matching Ontologies and Instances Using ifuice Scripts #Loading ontologies $suonto:=queryinstances(ontology@softunity,all) $amonto:=queryinstances(ontology@amazon,all) #Loading mapping from ontologies to products and product details $amontoprod:=map($amonto,amazon.ontoprod) $suontoprod:=map($suonto,softunity.ontoprod) $amprod:=loadinstances(product@amazon) $suprod:=loadinstances(product@softunity) #Matching products $su_am_instance_eqean:=joinmapresult($suprod,$amprod,[domain.ean]=[range.ean]) $su_am_instance_title_trigram09:=attrmatch($suprod,$amprod,0.9,[title],[title])

Matching Ontologies and Instances Using ifuice Scripts (continued) #Loading manual ontology mapping $am_su_onto_mm:=map($amonto,softunity.ontoamazon2softunitymanualmerged) #Executing COMA ontology mapping $am_su_onto_coma:=map($amonto, ontoamazon2softunitycomacontextssyn) #Creating instance-based match for ontologies $su_am_onto_over_ean:=composematchdice(compose($suontoprod,$su_am_instance _eqean), inverse($amontoprod),0.5) #Creating combined mapping (COMA + Instances) #Other combination algorithms -> further research $su_am_combined:=union(inverse($am_su_onto_coma), $su_am_onto_over_ean) #Checking correct correspondences $correct_coma:=intersect($su_am_onto_coma, $am_su_onto_mm) $correct_ean:=intersect($su_am_onto_over_ean, $am_su_onto_mm) $correct_combined:=intersect($su_am_combined,$am_su_onto_mm) #Calculate recall/precision $recall_coma:=count($correct_coma)/count($am_su_onto_mm) $precision_coma:=count($correct_coma)/count($am_su_onto_coma)

Matching Amazon and Ebay Ontologies with COMA++ Both ontologies loaded as OWL files into COMA++: Ebay Ontology: 1137 nodes, Amazon Ontology: 1930 nodes (manually pruned) Match Strategy Nr of corr. Nr of corr. Recall Precision correct MANUAL 374 374 1 1 FILTEREDCONTEXT: NODES/COMA 179 51 0.1363 0.2849 FILTEREDCONTEXT:NODES/CONTEXT 123 51 0.1363 0.4146 ALLCONTEXT: TAXONOMY 2846 189 0.5121 0.0664 (softunity as global taxonomy) ALLCONTEXT: TAXONOMY_AVERAGE 2264 171 0.4634 0.0755 (softunity as global taxonomy) ALLCONTEXT: COMA 244 101 0.27 0.4139 ALLCONTEXT: CONTEXTS 336 185 0.4946 0.5505 ALLCONTEXT: CONTEXTS with SYNONYMS, 341 202 0.5401 0.5923 SELECTION 0, 0.01, 0.5 (DEFAULT) ALLCONTEXT: CONTEXTS with SYNONYMS, 347 218 0.5828 0.6282 SELECTION 2, 0.03, 0.7

Matching Softunity and Amazon Ontologies with COMA++ Both ontologies loaded as OWL files into COMA: Softunity Ontology: 466 nodes, Amazon Ontology: 1930 nodes (manually pruned) Match Strategy Nr of corr. Nr of corr. correct Recall Precision MANUAL 212 212 1 1 ALLCONTEXT: COMA 64 25 0.11792453 0.390625 ALLCONTEXT: CONTEXTS_SYN SEL 0,0.01,0.5 (default) 156 112 0.5283019 0.71794873 ALLCONTEXT: CONTEXTS_SYN SEL 0,0.02,0.5 194 117 0.5518868 0.6030928 ALLCONTEXT: CONTEXTS_SYN SEL 0,0.008,0.5 174 114 0.5377358 0.6551724 ALLCONTEXT: CONTEXTS_SYN SEL 2,0.03,0.7 45 32 0.1509434 0.7111111

Matching Products: Softunity vs. Amazon Products matched using IFuice. Softunity: 2711 products, Amazon : 42942 (Software and Games) Match Strategy Nr of corr. Nr of corr. correct Recall Precision UNAMBIGUOUS MATCHING (With EAN) 2133 2133 1 1 Exact matching based on product title 925 779 0,3652 0,8421 Trigram matching based on title, threshold 0.99 926 780 0,3656 0,8423 Trigram matching based on title, threshold 0.9 1296 801 0,3755 0,6180 Trigram matching based on title, threshold 0.7 3915 1500 0,7032 0,3831 Trigram matching based on title, threshold 0.5 32647 1947 0,9127 0,0596 MSSQL Fuzzy Matching, threshold 0.9 1684 932 0,4369 0,5534 MSSQL Fuzzy Matching, threshold 0.7 3942 1435 0,6727 0,3655

Combined Mapping: Softunity Ontology vs. Amazon Ontology Using Instance(Product) Mappings Softunity Ontology: 466 nodes, Amazon Ontology: 1930 nodes (manually pruned) Softunity: 2711 products, Amazon : 93897 products Match Strategy Nr of corr. Nr of corr. correct Recall Precision MANUAL COMBINED MAPPING 351 351 1 1 INSTANCE EAN Dice Sim. >0.5 286 209 0.5954 0.7307 INSTANCE EAN Dice Sim. >0.7 135 109 0.3105 0.8074 INSTANCE EAN Dice Sim. >0.9 68 53 0.1509 0.7794 COMA CONTEXTS_SYN 156 112 0.3190 0.7179 COMBINED INSTANCE EAN Dice Sim. >0.5 AND CONTEXTS_SYN 410 287 0.8176 0.7 COMBINED INSTANCE EAN Dice Sim. >0.7 AND CONTEXTS_SYN 265 193 0.5498 0.7283 COMBINED INSTANCE EAN Dice Sim. >0.9 AND CONTEXTS_SYN 210 148 0.4216 0.7047 Dice coefficient: D= 2Nc/(Na+Nb), where Nc number of instance correspondences for a concept, Na, Nb number of instances in the concept in the ontologies being matched

Combined Mapping (continued) Overlap between COMA mapping and instance Mapping (Dice>0.5): ~7%. The correspondences which are found only in COMA mapping can be attributed to the fact that only 39% of the concepts in softunity ontology have instances. The correspondences which are found only in the Instance mapping are based on background knowledge, which is not explicitly found in the names of ontology concepts. Examples: Unterhaltung- und Gesellschaftsspiele -> Fun-Spiele Spiele/Konsolen/Zubehör/Nach Hersteller/Microsoft -> Spiele/X-Box/Originalzubehör Problem: orthogonal subontologies, for example: Filme: Nach Genre Nach Produktionsland Because of this, Zeichentrick is mapped to Japan, Classic Western Collection to Italien. Special heuristics are needed to resolve this.

Further Research Questions: E-Commerce data are highly dynamic (ontologies may change once in several month, instances change up to several times a week, ebay -- constantly): Automatic ontology evolution support using new ontology and instance data, could be implemented as ifuice script (Reuse for negative results)

Further Research Questions: Instance-based correspondences can be used not only directly for matching ontologies, but also to amend synonym tables for reuse by name matchers. (example: Unterhaltung->Fun, etc.) Orthogonal subontologies: often correspond to attribute names and values in objects. (example: Nach Produktionsland -> attribute Produktionsland)

Further Research Questions: Ontology-Correspondences in our implementation are very simple: no distinction between different types of relations is made (subclassing, equality). This is sufficient for web recommendations, but may be insufficient for structured queries.

Thank you for your attention.