Best Practices for World-Class Search

Size: px
Start display at page:

Download "Best Practices for World-Class Search"

Transcription

1 Best Practices for World-Class Search MARY HOLSTEGE Distinguished Engineer, 4 June 2018 MARKLOGIC CORPORATION

2 SLIDE: 2 4 June 2018 MARKLOGIC CORPORATION

3 Search Application: Search for a Purpose To answer questions about the world To discover useful information To find relevant information so it can be acted upon To bring important information to someone's attention To partition data into useful chunks for analysis SLIDE: 3 4 June 2018 MARKLOGIC CORPORATION

4 Search Application: Search for a Purpose To answer questions about the world To discover useful information To find relevant information so it can be acted upon To bring important information to someone's attention To partition data into useful chunks for analysis SLIDE: 4 4 June 2018 MARKLOGIC CORPORATION

5 Oh, the humanities! Oh, the humanities! Great search needs great understanding of humans Linguistics Psychology Anthropology SLIDE: 5 4 June 2018 MARKLOGIC CORPORATION

6 CC at: rs Blocks some melanin. Often gives light colored eyes. GG at: rs Blocks some melanin. Often gives light colored eyes. CC at: rs Low Melanin. Basis for Gray, Blue, Green, or Yellow Eyes if no other pigmentation is present. CC at: rs Blocks some melanin. Often gives light colored eyes. CT at: rs Blue. CT at: rs Blue. CT at: rs Contrasting sphincter around pupil. AA at: rs Med Brown on Sphincter AA at: rs Weak Amber Gradient TT at: rs Penetrance modifier. Blue. GG at: rs Gray ring around outer edge TT at: rs Starburst (Collarette) SLIDE: 6 4 June 2018 MARKLOGIC CORPORATION

7 Search Application: Search for a Purpose To answer questions about the world To discover useful information To find relevant information so it can be acted upon To bring important information to someone's attention To partition data into useful chunks for analysis SLIDE: 7 4 June 2018 MARKLOGIC CORPORATION

8 Improve Search Application: Fitter for Purpose Improve answers to questions Improve discoverability of information Improve usefulness of information Improve relevance of information Improve ability to act on information Improve visibility of important information Improve ability to partition information SLIDE: 8 4 June 2018 MARKLOGIC CORPORATION

9 SLIDE: 9 4 June 2018 MARKLOGIC CORPORATION

10 The Search Tripod Data Query Response SLIDE: 10 4 June 2018 MARKLOGIC CORPORATION

11 Battleplan Understanding Analysis Continual improvement Pertinence Feedback Augmentation Interaction Use SLIDE: 11 4 June 2018 MARKLOGIC CORPORATION

12 Search for a Purpose: Improve answers to questions Understand question, understand answer Make answer pertinent to question Make answer useful and usable SLIDE: 12 4 June 2018 MARKLOGIC CORPORATION

13 Data

14 Data Analysis Augmentation Interaction Feedback Keywords Classification Data modeling Entity recognition Link relationships Quality rankings Metrics Metadata Collections Semantic markup Entity markup Quality Range dimensions Linked data Reference data "SEO" Document proxies Visualizations Clustering Linking Popularity Reviews/ratings Folk taxonomies Annotations Linking SLIDE: 14 4 June 2018 MARKLOGIC CORPORATION

15 SLIDE: 15 4 June 2018 MARKLOGIC CORPORATION

16 Semantic Markup With Linkage to Facts Augment data with RDF triples, ontology Create entity dictionary from SKOS ontology Analyze and augment data with entity markup SLIDE: 16 4 June 2018 MARKLOGIC CORPORATION

17 geo:cayman_islands a skos:concept; skos:inscheme geo:area; skos:preflabel "Cayman Islands"^^xsd:string; geo:agriculturalareanotes "Manual Estimation"^^xsd:string; geo:agriculturalareatotal 2.7; geo:agriculturalareaunit "1000 Ha"^^xsd:string; geo:agriculturalareayear "2009"^^xsd:int; geo:populationtotal 56.0; geo:populationunit "1000"^^xsd:string; geo:populationyear "2010"^^xsd:int;; geo:codedbpediaid "Cayman_Islands"^^xsd:string; geo:countryareatotal 26.4; geo:countryareaunit "1000 Ha"^^xsd:string; SLIDE: 17 4 June 2018 MARKLOGIC CORPORATION

18 <para> <ex:business id=" Partners</ex:business> has a controlling interest in several companies with accounts in the <geo:area id=" Islands</geo:area>, regulators have learned. Other offshore accounts in <geo:area id=" and <geo:area id=" have been tied to the chairman, Mr. Q. A spokesperson for Mr. Q declined to comment. </para> <para> Investigations continue. </para> SLIDE: 18 4 June 2018 MARKLOGIC CORPORATION

19 SLIDE: 19 4 June 2018 MARKLOGIC CORPORATION

20 SLIDE: 20 4 June 2018 MARKLOGIC CORPORATION

21 SLIDE: 21 4 June 2018 MARKLOGIC CORPORATION

22 SLIDE: 22 4 June 2018 MARKLOGIC CORPORATION

23 SLIDE: 23 4 June 2018 MARKLOGIC CORPORATION

24 SLIDE: 24 4 June 2018 MARKLOGIC CORPORATION

25 SLIDE: 25 4 June 2018 MARKLOGIC CORPORATION

26 Query

27 Query Analysis Augmentation Interaction Feedback Canned queries FAQs Query patterns Entity recognition Natural(istic) language Metrics User interests Synonyms Related terms Entity queries Disambiguation Semantic queries Parsed string Facets Sliders Timelines Maps Breadcrumbs Scatter/gather User behaviors User interests Common queries SLIDE: 27 4 June 2018 MARKLOGIC CORPORATION

28 SLIDE: 28 4 June 2018 MARKLOGIC CORPORATION

29 Analyze and Transform Query Create special dictionaries for query analysis Normalize query words, strip stopwords Replace with tagged forms Match tagged query to query patterns Parse tagged query with appropriate bindings SLIDE: 29 4 June 2018 MARKLOGIC CORPORATION

30 // Input query "Does XYZW Partners have business in the Cayman Islands?" // Normalized "XYZW Partners has business Cayman Islands" // Analyzed and tagged query string (business:221932) (control:hold OR get:receive OR suffer:have) (person:clientele OR organization:enterprise OR event:commerce) (location:cayman_islands) SLIDE: 30 4 June 2018 MARKLOGIC CORPORATION

31 // Matched to query pattern "business:* control:hold organization:*" to disambiguate "business: control:hold organization:enterprise location:cayman_islands" // Parsed with appropriate bindings cts:and-query(( cts:field-value-query("id", " cts:or-query(("control","hold","govern","run","have"),"synonym"), cts:or-query(("organization","business","enterprise","insitution"),"synonym"), cts:or-query(( cts:field-value-query("id"," "George Town", "West Bay", "Bodden Town", "East End", "North Side", "West End" )) )) SLIDE: 31 4 June 2018 MARKLOGIC CORPORATION

32 SLIDE: 32 4 June 2018 MARKLOGIC CORPORATION

33 SLIDE: 33 COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

34 SLIDE: 34 4 June 2018 MARKLOGIC CORPORATION

35 SLIDE: 35 4 June 2018 MARKLOGIC CORPORATION

36 SLIDE: 36 4 June 2018 MARKLOGIC CORPORATION

37 SLIDE: 37 4 June 2018 MARKLOGIC CORPORATION

38 SLIDE: 38 4 June 2018 MARKLOGIC CORPORATION

39 Response

40 Response Analysis Augmentation Interaction Feedback Clusters Links Disambiguation Re-ranking Best bets Metrics Document proxies Result clusters Facts Info boxes Related queries Relevance tuning Navigational cues Annotated TOCs Scatter/gather Linked data views Facets Sliders Timelines Maps User behavior SLIDE: 40 4 June 2018 MARKLOGIC CORPORATION

41 SLIDE: 41 4 June 2018 MARKLOGIC CORPORATION

42 Infobox Extract concept IDs from query string Select best entity for results Query for facts related to ID SLIDE: 42 4 June 2018 MARKLOGIC CORPORATION

43 // Input query "Does XYZW Partners have business in the Cayman Islands?" // Extracted entity IDs " " SLIDE: 43 4 June 2018 MARKLOGIC CORPORATION

44 // Facts for chosen ID (predicate + object) "Manual Estimation"^^xsd:string "1000 Ha"^^xsd:string "2009"^^xsd:int "1000"^^xsd:string "2010"^^xsd:int "Cayman_Islands"^^xsd:string "1000 Ha"^^xsd:string SLIDE: 44 4 June 2018 MARKLOGIC CORPORATION

45 SLIDE: 45 4 June 2018 MARKLOGIC CORPORATION

46 SLIDE: 46 4 June 2018 MARKLOGIC CORPORATION

47 SLIDE: 47 4 June 2018 MARKLOGIC CORPORATION

48 SLIDE: 48 4 June 2018 MARKLOGIC CORPORATION

49 SLIDE: 49 4 June 2018 MARKLOGIC CORPORATION

50 SUMMARY Building Great Search Applications Humans Understand user purposes Answers Discovery Analysis Action Humanities Search tripod Strengthen all legs Data Query Response Battleplan Analysis Understanding Augmentation Pertinence Interaction Use Feedback Continual improvement SLIDE: 50 4 June 2018 MARKLOGIC CORPORATION

51 Questions?

52 Appendix

53 SLIDE: 53 4 June 2018 MARKLOGIC CORPORATION

54 SLIDE: 54 4 June 2018 MARKLOGIC CORPORATION

55 SLIDE: 55 4 June 2018 MARKLOGIC CORPORATION

56 SLIDE: 56 4 June 2018 MARKLOGIC CORPORATION

57 SLIDE: 57 4 June 2018 MARKLOGIC CORPORATION

58 SLIDE: 58 4 June 2018 MARKLOGIC CORPORATION

59 SLIDE: 59 4 June 2018 MARKLOGIC CORPORATION

60 SLIDE: 60 COPYRIGHT 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

61 SLIDE: 61 4 June 2018 MARKLOGIC CORPORATION

62 SLIDE: 62 4 June 2018 MARKLOGIC CORPORATION

63 Data

64 Goals Improve searchability - Precision: specific simplified scoping - Recall: related information explicit - Ranking: explicit context, quality Improve usability - Facts from text - Summarization, TOCs, other proxies Improve discoverability - Facets, classifications SLIDE: 64 4 June 2018 MARKLOGIC CORPORATION

65 Analysis Keywords and classification Modeling Entity recognition Link analysis Quality rankings SLIDE: 65 4 June 2018 MARKLOGIC CORPORATION

66 Augmentation Metadata Collections Semantic markup Entity markup Quality Range dimensions Linked data Contextual information Reference data SLIDE: 66 4 June 2018 MARKLOGIC CORPORATION

67 Interaction Document proxies - KWIC snippets - Summaries - TileBar - Color lines Relationships - Clustering - Link visualizations SLIDE: 67 4 June 2018 MARKLOGIC CORPORATION

68 Feedback Popularity, reviews, ratings => quality adjustments, range adjustments Folk taxonomies => classification Annotations => related information Additional linked data => context SLIDE: 68 4 June 2018 MARKLOGIC CORPORATION

69 Selected Techniques Expert indexers (keywords, classifications) - Primed/augmented with classification engines Semantic markup - Authoring toolchain - Harmonization/projection Entity markup - NER integrations - Ontology-driven or query-based entity extraction Linked data, reference data SLIDE: 69 4 June 2018 MARKLOGIC CORPORATION

70 Queries

71 Goals Improve expressibility - Simpler expression => better query Improve search effectiveness - Precision, recall, ranking - Focus in, focus out SLIDE: 71 4 June 2018 MARKLOGIC CORPORATION

72 Analysis Canned queries, query patterns, FAQs Identify known entities Natural(istic) language interpretation SLIDE: 72 4 June 2018 MARKLOGIC CORPORATION

73 Augmentation Boost queries - User interests, related terms Synonym, thesaurus expansion Canned query recognition Entity recognition Query expansion Contextual augmentation Relevance tweaks SLIDE: 73 4 June 2018 MARKLOGIC CORPORATION

74 Interaction Natural(istic) language input Query builders: facets, timelines, maps, links, forms More like this, less like that Result zooming/slicing Breadcrumbs SLIDE: 74 4 June 2018 MARKLOGIC CORPORATION

75 Feedback User behavior => disambiguation User interests => query augmentation Common queries => canned queries and results SLIDE: 75 4 June 2018 MARKLOGIC CORPORATION

76 Selected Techniques Query string parsing/pre-processing - NLP - Regex analysis - NER, ontology-driven entity extraction Augmentation with profiles, thesauri, etc. Reverse query against FAQs Range/geospatial indexes SLIDE: 76 4 June 2018 MARKLOGIC CORPORATION

77 Response

78 Goals Improve usefulness - Immediate answers - Navigation - Context Improve conversation - What happens next? SLIDE: 78 4 June 2018 MARKLOGIC CORPORATION

79 Analysis Clustering Disambiguation Re-ranking Best bets SLIDE: 79 4 June 2018 MARKLOGIC CORPORATION

80 Augmentation Document proxies and context Results clustering Specific facts Info boxes Related queries Relevance tuning Navigational cues (within/across documents) Exploration interfaces SLIDE: 80 4 June 2018 MARKLOGIC CORPORATION

81 Interaction Results cluster navigation Linked data views Facet/timeline/map/sliders Annotated TOCs Counts/coloring Immediate action affordances SLIDE: 81 4 June 2018 MARKLOGIC CORPORATION

82 Feedback User actions => adjust rankings, adjust interests Hide/expose => streamlined personal interface SLIDE: 82 4 June 2018 MARKLOGIC CORPORATION

83 Selected Techniques Snippeting, summarization Clustering Re-ranking, LTR Query entities => result entities => related information Range/geospatial indexes => display widgets Linked data presentations SLIDE: 83 4 June 2018 MARKLOGIC CORPORATION

AutoFocus, an Open Source Facet-Driven Enterprise Search Solution

AutoFocus, an Open Source Facet-Driven Enterprise Search Solution AutoFocus, an Open Source Facet-Driven Enterprise Search Solution ISKO UK Event, November 5, 2007 RANGANATHAN REVISITED: FACETS FOR THE FUTURE presentation by Jeroen Wester, CTO Aduna key facts Open source

More information

HOW TO BUILD AN AWESOME SEARCH APP

HOW TO BUILD AN AWESOME SEARCH APP HOW TO BUILD AN AWESOME SEARCH APP Stu McLean, Ph.D., Principal Consultant, Public Sector - Civilian, MarkLogic Ganesh Vaideeswaran, Senior Director, Development, MarkLogic Agenda Components of a search

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

Fusing Corporate Thesaurus Management with Linked Data using PoolParty

Fusing Corporate Thesaurus Management with Linked Data using PoolParty Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl PoolParty at a glance Developed by punkt. netservices Current release: PoolParty 2.8 Main focus on three application

More information

ISSUES IN INFORMATION RETRIEVAL Brian Vickery. Presentation at ISKO meeting on June 26, 2008 At University College, London

ISSUES IN INFORMATION RETRIEVAL Brian Vickery. Presentation at ISKO meeting on June 26, 2008 At University College, London ISSUES IN INFORMATION RETRIEVAL Brian Vickery Presentation at ISKO meeting on June 26, 2008 At University College, London NEEDLE IN HAYSTACK MY BACKGROUND Plant chemist, then reports librarian Librarian,

More information

Natural Language Processing with PoolParty

Natural Language Processing with PoolParty Natural Language Processing with PoolParty Table of Content Introduction to PoolParty 2 Resolving Language Problems 4 Key Features 5 Entity Extraction and Term Extraction 5 Shadow Concepts 6 Word Sense

More information

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany Information Systems & University of Koblenz Landau, Germany Semantic Search examples: Swoogle and Watson Steffen Staad credit: Tim Finin (swoogle), Mathieu d Aquin (watson) and their groups 2009-07-17

More information

Annotation Component in KiWi

Annotation Component in KiWi Annotation Component in KiWi Marek Schmidt and Pavel Smrž Faculty of Information Technology Brno University of Technology Božetěchova 2, 612 66 Brno, Czech Republic E-mail: {ischmidt,smrz}@fit.vutbr.cz

More information

Historical Text Mining:

Historical Text Mining: Historical Text Mining Historical Text Mining, and Historical Text Mining: Challenges and Opportunities Dr. Robert Sanderson Dept. of Computer Science University of Liverpool azaroth@liv.ac.uk http://www.csc.liv.ac.uk/~azaroth/

More information

Ravel Law Quick Start Guide

Ravel Law Quick Start Guide Ravel Law Quick Start Guide Table of Contents Start Your Search 2 Search Results: The Visualization Map 4 Case Reading 6 Frequently Asked Questions 11 Appendix: Judge Analytics 13 Start Your Judge Search

More information

Ontology Summit2007 Survey Response Analysis. Ken Baclawski Northeastern University

Ontology Summit2007 Survey Response Analysis. Ken Baclawski Northeastern University Ontology Summit2007 Survey Response Analysis Ken Baclawski Northeastern University Outline Communities Ontology value, issues, problems, solutions Ontology languages Terms for ontology Ontologies April

More information

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper. Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...

More information

Enhanced retrieval using semantic technologies:

Enhanced retrieval using semantic technologies: Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008

More information

2015 Search Ranking Factors

2015 Search Ranking Factors 2015 Search Ranking Factors Introduction Abstract Technical User Experience Content Social Signals Backlinks Big Picture Takeaway 2 2015 Search Ranking Factors Here, at ZED Digital, our primary concern

More information

Ontology-based Architecture Documentation Approach

Ontology-based Architecture Documentation Approach 4 Ontology-based Architecture Documentation Approach In this chapter we investigate how an ontology can be used for retrieving AK from SA documentation (RQ2). We first give background information on the

More information

Architecting Knowledge Middleware

Architecting Knowledge Middleware Architecting Knowledge Middleware WWW 2002, Honolulu, May 9, 2002 Alfred Z. Spector Vice President, Services and Software IBM Research Division aspector@us.ibm.com Thomas J. Watson Research Center PO Box

More information

0.1 Knowledge Organization Systems for Semantic Web

0.1 Knowledge Organization Systems for Semantic Web 0.1 Knowledge Organization Systems for Semantic Web 0.1 Knowledge Organization Systems for Semantic Web 0.1.1 Knowledge Organization Systems Why do we need to organize knowledge? Indexing Retrieval Organization

More information

Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide

Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide 1 www.microsoft.com/sharepoint The information contained in this document represents the current view of Microsoft Corporation on the issues

More information

B2FIND and Metadata Quality

B2FIND and Metadata Quality B2FIND and Metadata Quality 3 rd EUDAT Conference 25 September 2014 Heinrich Widmann and B2FIND team 1 Outline B2FIND the EUDAT Metadata Service Semantic Mapping of Metadata Quality of Metadata Summary

More information

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies Taxonomy Strategies July 17, 2012 Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata A Tale of Two Types of Vocabularies What is semantic metadata? Semantic relationships in the

More information

Semantic Retrieval of the TIB AV-Portal. Dr. Sven Strobel IATUL 2015 July 9, 2015; Hannover

Semantic Retrieval of the TIB AV-Portal. Dr. Sven Strobel IATUL 2015 July 9, 2015; Hannover Semantic Retrieval of the TIB AV-Portal Dr. Sven Strobel IATUL 2015 July 9, 2015; Hannover Semantic Retrieval of the TIB AV-Portal Contents 1. TIB AV-Portal 2. Automatic Video Analysis 3. Named-Entity

More information

The Allen Human Brain Atlas offers three types of searches to allow a user to: (1) obtain gene expression data for specific genes (or probes) of

The Allen Human Brain Atlas offers three types of searches to allow a user to: (1) obtain gene expression data for specific genes (or probes) of Microarray Data MICROARRAY DATA Gene Search Boolean Syntax Differential Search Mouse Differential Search Search Results Gene Classification Correlative Search Download Search Results Data Visualization

More information

Improving information retrieval through metadata tagging

Improving information retrieval through metadata tagging Improving information retrieval through metadata tagging Jonathan Engel, Information Architect Date 25/09/15 Introduction Organising and tagging content using standard structures will always enhance navigation,

More information

Maximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009

Maximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009 Maximizing the Value of STM Content through Semantic Enrichment Frank Stumpf December 1, 2009 What is Semantics and Semantic Processing? Content Knowledge Framework Technology Framework Search Text Images

More information

Exploring the Use of Semantic Technologies for Cross-Search of Archaeological Grey Literature and Data

Exploring the Use of Semantic Technologies for Cross-Search of Archaeological Grey Literature and Data Exploring the Use of Semantic Technologies for Cross-Search of Archaeological Grey Literature and Data Presented by Keith May @keith_may Based on the work of Andreas Vlachidis, Ceri Binding, Keith May,

More information

IBE101: Introduction to Information Architecture. Hans Fredrik Nordhaug 2008

IBE101: Introduction to Information Architecture. Hans Fredrik Nordhaug 2008 IBE101: Introduction to Information Architecture Hans Fredrik Nordhaug 2008 Objectives Defining IA Practicing IA User Needs and Behaviors The anatomy of IA Organizations Systems Labelling Systems Navigation

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

An Ontology Based Question Answering System on Software Test Document Domain

An Ontology Based Question Answering System on Software Test Document Domain An Ontology Based Question Answering System on Software Test Document Domain Meltem Serhatli, Ferda N. Alpaslan Abstract Processing the data by computers and performing reasoning tasks is an important

More information

Things to consider when using Semantics in your Information Management strategy. Toby Conrad Smartlogic

Things to consider when using Semantics in your Information Management strategy. Toby Conrad Smartlogic Things to consider when using Semantics in your Information Management strategy Toby Conrad Smartlogic toby.conrad@smartlogic.com +1 773 251 0824 Some of Smartlogic s 250+ Customers Awards Trend Setting

More information

Content analysis and classification in mathematics

Content analysis and classification in mathematics Content analysis and classification in mathematics Wolfram Sperber (Zentralblatt Math) Patrick Ion (Math Reviews) UDC Seminar 2011 CLASSIFICATION & ontology Formal approaches and Access to Knowledge The

More information

Overview of Web Mining Techniques and its Application towards Web

Overview of Web Mining Techniques and its Application towards Web Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous

More information

A service based on Linked Data to classify Web resources using a Knowledge Organisation System

A service based on Linked Data to classify Web resources using a Knowledge Organisation System A service based on Linked Data to classify Web resources using a Knowledge Organisation System A proof of concept in the Open Educational Resources domain Abstract One of the reasons why Web resources

More information

Falcon-AO: Aligning Ontologies with Falcon

Falcon-AO: Aligning Ontologies with Falcon Falcon-AO: Aligning Ontologies with Falcon Ningsheng Jian, Wei Hu, Gong Cheng, Yuzhong Qu Department of Computer Science and Engineering Southeast University Nanjing 210096, P. R. China {nsjian, whu, gcheng,

More information

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE YING DING 1 Digital Enterprise Research Institute Leopold-Franzens Universität Innsbruck Austria DIETER FENSEL Digital Enterprise Research Institute National

More information

Domain Independent Knowledge Base Population From Structured and Unstructured Data Sources

Domain Independent Knowledge Base Population From Structured and Unstructured Data Sources Domain Independent Knowledge Base Population From Structured and Unstructured Data Sources Michelle Gregory, Liam McGrath, Eric Bell, Kelly O Hara, and Kelly Domico Pacific Northwest National Laboratory

More information

NMR Guide. 3/14/ Broadbean

NMR Guide. 3/14/ Broadbean NMR Guide //08 0 Broadbean What is the new manage responses tab and where can I access? The New Manage Responses has been built as a dynamic page that allows users to keyword search and filter their adverts'

More information

IJCSC Volume 5 Number 1 March-Sep 2014 pp ISSN

IJCSC Volume 5 Number 1 March-Sep 2014 pp ISSN Movie Related Information Retrieval Using Ontology Based Semantic Search Tarjni Vyas, Hetali Tank, Kinjal Shah Nirma University, Ahmedabad tarjni.vyas@nirmauni.ac.in, tank92@gmail.com, shahkinjal92@gmail.com

More information

SKOS. COMP62342 Sean Bechhofer

SKOS. COMP62342 Sean Bechhofer SKOS COMP62342 Sean Bechhofer sean.bechhofer@manchester.ac.uk Ontologies Metadata Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies

More information

State of the Art and Trends in Search Engine Technology. Gerhard Weikum

State of the Art and Trends in Search Engine Technology. Gerhard Weikum State of the Art and Trends in Search Engine Technology Gerhard Weikum (weikum@mpi-inf.mpg.de) Commercial Search Engines Web search Google, Yahoo, MSN simple queries, chaotic data, many results key is

More information

Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets

Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets Rahul Potharaju (Purdue University) Navendu Jain (Microsoft Research) Cristina Nita-Rotaru (Purdue University) April

More information

Oracle Endeca Information Discovery

Oracle Endeca Information Discovery Oracle Endeca Information Discovery Glossary Version 2.4.0 November 2012 Copyright and disclaimer Copyright 2003, 2013, Oracle and/or its affiliates. All rights reserved. Oracle and Java are registered

More information

Chapter 27 Introduction to Information Retrieval and Web Search

Chapter 27 Introduction to Information Retrieval and Web Search Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval

More information

Eleven+ Views of Semantic Search

Eleven+ Views of Semantic Search Eleven+ Views of Semantic Search Denise A. D. Bedford, Ph.d. Goodyear Professor of Knowledge Management Information Architecture and Knowledge Management Kent State University Presentation Focus Long-Term

More information

MarkLogic. A Modern Data Platform To Support Your Critical Path COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

MarkLogic. A Modern Data Platform To Support Your Critical Path COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. MarkLogic A Modern Data Platform To Support Your Critical Path SLIDE: 2 Inception Pre- Post- Distribution Archive Taxonomies Semantics Technical Descriptive Customers Usage SLIDE: 4 Inception Pre- Post-

More information

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies Taxonomy Strategies October 28, 2012 Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata A Tale of Two Types of Vocabularies What is the semantic web? Making content web-accessible

More information

Ontologies SKOS. COMP62342 Sean Bechhofer

Ontologies SKOS. COMP62342 Sean Bechhofer Ontologies SKOS COMP62342 Sean Bechhofer sean.bechhofer@manchester.ac.uk Metadata Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies

More information

User Configurable Semantic Natural Language Processing

User Configurable Semantic Natural Language Processing User Configurable Semantic Natural Language Processing Jason Hedges CEO and Founder Edgetide LLC info@edgetide.com (443) 616-4941 Table of Contents Bridging the Gap between Human and Machine Language...

More information

Domain-specific Concept-based Information Retrieval System

Domain-specific Concept-based Information Retrieval System Domain-specific Concept-based Information Retrieval System L. Shen 1, Y. K. Lim 1, H. T. Loh 2 1 Design Technology Institute Ltd, National University of Singapore, Singapore 2 Department of Mechanical

More information

Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains

Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains Eloise Currie and Mary Parmelee SAS Institute, Cary NC About SAS: The Power to Know SAS: The Market Leader in

More information

HELP ON THE VIRTUAL LIBRARY

HELP ON THE VIRTUAL LIBRARY HELP ON THE VIRTUAL LIBRARY The Virtual Library search system allows accessing in a quick way to the information the students are interested in and that are available in the Didactic Cyberspace. In its

More information

Text mining tools for semantically enriching the scientific literature

Text mining tools for semantically enriching the scientific literature Text mining tools for semantically enriching the scientific literature Sophia Ananiadou Director National Centre for Text Mining School of Computer Science University of Manchester Need for enriching the

More information

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science Information Retrieval CS 6900 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Information Retrieval Information Retrieval (IR) is finding material of an unstructured

More information

Part I: Data Mining Foundations

Part I: Data Mining Foundations Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?

More information

Natural Language Processing. SoSe Question Answering

Natural Language Processing. SoSe Question Answering Natural Language Processing SoSe 2017 Question Answering Dr. Mariana Neves July 5th, 2017 Motivation Find small segments of text which answer users questions (http://start.csail.mit.edu/) 2 3 Motivation

More information

Taxonomy and Search Patterns for Enhanced Search and Discovery

Taxonomy and Search Patterns for Enhanced Search and Discovery Taxonomy and Search Patterns for Enhanced Search and Discovery Patrick Lambe Taxonomy alone is limited in what it can do. Search alone is also limited. Together, they become much smarter. If taxonomy and

More information

Enterprise Multimedia Integration and Search

Enterprise Multimedia Integration and Search Enterprise Multimedia Integration and Search José-Manuel López-Cobo 1 and Katharina Siorpaes 1,2 1 playence, Austria, 2 STI Innsbruck, University of Innsbruck, Austria {ozelin.lopez, katharina.siorpaes}@playence.com

More information

Clustering Results. Result List Example. Clustering Results. Information Retrieval

Clustering Results. Result List Example. Clustering Results. Information Retrieval Information Retrieval INFO 4300 / CS 4300! Presenting Results Clustering Clustering Results! Result lists often contain documents related to different aspects of the query topic! Clustering is used to

More information

Handling Place References in Text

Handling Place References in Text Handling Place References in Text Introduction Most (geographic) information is available in the form of textual documents Place reference resolution involves two-subtasks: Recognition : Delimiting occurrences

More information

SEO. Drivers You Are Missing in Content Marketing

SEO. Drivers You Are Missing in Content Marketing SEO Drivers You Are Missing in Content Marketing SEO IS ALWAYS CHANGING. HICH MEANS your content strategy what you create and how it is found is ALWAYS CHANGING AS WELL. BUT IS SEO ALWAYS CHANGING? BECAUSE

More information

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How

More information

Terminologies, Knowledge Organization Systems, Ontologies

Terminologies, Knowledge Organization Systems, Ontologies Terminologies, Knowledge Organization Systems, Ontologies Gerhard Budin University of Vienna TSS July 2012, Vienna Motivation and Purpose Knowledge Organization Systems In this unit of TSS 12, we focus

More information

Rapid Information Discovery System (RAID)

Rapid Information Discovery System (RAID) Int'l Conf. Artificial Intelligence ICAI'17 321 Rapid Information Discovery System (RAID) B. Gopal, P. Benjamin, and K. Madanagopal Knowledge Based Systems, Inc. (KBSI), College Station, TX, USA Summary

More information

A B2B Search Engine. Abstract. Motivation. Challenges. Technical Report

A B2B Search Engine. Abstract. Motivation. Challenges. Technical Report Technical Report A B2B Search Engine Abstract In this report, we describe a business-to-business search engine that allows searching for potential customers with highly-specific queries. Currently over

More information

ACCELERATE YOUR SHAREPOINT ADOPTION AND ROI WITH CONTENT INTELLIGENCE

ACCELERATE YOUR SHAREPOINT ADOPTION AND ROI WITH CONTENT INTELLIGENCE June 30, 2012 San Diego Convention Center ACCELERATE YOUR SHAREPOINT ADOPTION AND ROI WITH CONTENT INTELLIGENCE Stuart Laurie, Senior Consultant #SPSSAN Agenda 1. Challenges 2. What comes out of the box

More information

Mapping between Digital Identity Ontologies through SISM

Mapping between Digital Identity Ontologies through SISM Mapping between Digital Identity Ontologies through SISM Matthew Rowe The OAK Group, Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield S1 4DP, UK m.rowe@dcs.shef.ac.uk

More information

Semantic Enrichment ARMA Chicago Spring Seminar April 18, 2018

Semantic Enrichment ARMA Chicago Spring Seminar April 18, 2018 Semantic Enrichment ARMA Chicago Spring Seminar April 18, 2018 Presentation Overview What is Semantics? Semantic Building Blocks Why Use Semantic Technology Semantic Layers Source of Semantic Information

More information

New Manage Responses. 11/6/ Broadbean

New Manage Responses. 11/6/ Broadbean New Manage Responses /6/08 0 Broadbean About new manage responses (NMR) and access The New Manage Responses function has been designed around our customer suggestions to allow quick, fast, accurate access

More information

CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS

CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS 82 CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS In recent years, everybody is in thirst of getting information from the internet. Search engines are used to fulfill the need of them. Even though the

More information

Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer

Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures Springer Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web

More information

Information Retrieval and Knowledge Organisation

Information Retrieval and Knowledge Organisation Information Retrieval and Knowledge Organisation Knut Hinkelmann Content Information Retrieval Indexing (string search and computer-linguistic aproach) Classical Information Retrieval: Boolean, vector

More information

Semantic Searching. John Winder CMSC 676 Spring 2015

Semantic Searching. John Winder CMSC 676 Spring 2015 Semantic Searching John Winder CMSC 676 Spring 2015 Semantic Searching searching and retrieving documents by their semantic, conceptual, and contextual meanings Motivations: to do disambiguation to improve

More information

National Documentation Centre Open access in Cultural Heritage digital content

National Documentation Centre Open access in Cultural Heritage digital content National Documentation Centre Open access in Cultural Heritage digital content Haris Georgiadis, Ph.D. Senior Software Engineer EKT hgeorgiadis@ekt.gr The beginning.. 42 institutions documented & digitalized

More information

OVERVIEW. In depth. Smartlogic Semaphore. The what? and how? of our Content Intelligence solution FIND OUT MORE >

OVERVIEW. In depth. Smartlogic Semaphore. The what? and how? of our Content Intelligence solution FIND OUT MORE > In depth Smartlogic Semaphore The what? and how? of our Content Intelligence solution Page 1 of 28 Executive Summary Enterprises no longer face an acute information access challenge. This is mainly because

More information

Search Engine Architecture II

Search Engine Architecture II Search Engine Architecture II Primary Goals of Search Engines Effectiveness (quality): to retrieve the most relevant set of documents for a query Process text and store text statistics to improve relevance

More information

Case-based Recommendation. Peter Brusilovsky with slides of Danielle Lee

Case-based Recommendation. Peter Brusilovsky with slides of Danielle Lee Case-based Recommendation Peter Brusilovsky with slides of Danielle Lee Where we are? Search Navigation Recommendation Content-based Semantics / Metadata Social Modern E-Commerce Site The Power of Metadata

More information

PoolParty. Thesaurus Management Semantic Search Linked Data. ISKO UK, London September 14, Andreas Blumauer

PoolParty. Thesaurus Management Semantic Search Linked Data. ISKO UK, London September 14, Andreas Blumauer PoolParty Thesaurus Management Semantic Search Linked Data ISKO UK, London September 14, 2010 Andreas Blumauer Some thoughts on the Semantic Web In the Semantic Web, it is not the Semantic which is new,

More information

Ex Libris Accessibility Conformance Report

Ex Libris Accessibility Conformance Report Name of Product/Version: Ex Libris Primo / February 2018 release Ex Libris Accessibility Conformance Report Level A and AA VPAT Version 2.0 Product Description: Ex Libris Primo provides a fast, comprehensive,

More information

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered.

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered. Content Enrichment An essential strategic capability for every publisher Enriched content. Delivered. An essential strategic capability for every publisher Overview Content is at the centre of everything

More information

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,

More information

Vocabulary Alignment for archaeological Knowledge Organization Systems

Vocabulary Alignment for archaeological Knowledge Organization Systems Vocabulary Alignment for archaeological Knowledge Organization Systems 14th Workshop on Networked Knowledge Organization Systems TPDL 2015 Poznan Lena-Luise Stahn September 17, 2015 1 / 20 Summary Introduction

More information

A Quick Start Guide On How To Promote Your Website Using the Total SEO Toolkit

A Quick Start Guide On How To Promote Your Website Using the Total SEO Toolkit A Quick Start Guide On How To Promote Your Website Using the Total SEO Toolkit Welcome to the Total SEO Toolkit, a turn-key SEO Platform with state-of-the-art reporting functionality! We thought it would

More information

New Approach to Graph Databases

New Approach to Graph Databases Paper PP05 New Approach to Graph Databases Anna Berg, Capish, Malmö, Sweden Henrik Drews, Capish, Malmö, Sweden Catharina Dahlbo, Capish, Malmö, Sweden ABSTRACT Graph databases have, during the past few

More information

Web UI Dos and Don ts

Web UI Dos and Don ts Web UI Dos and Don ts 1. A One Column Layout instead of multi-columns a. A one column layout gives you more control over your narrative. It guides your readers in a more predictable way from top to bottom.

More information

Catching the wave Tools and Technology for Taxonomists Taxonomy Bootcamp London October 16, 2018

Catching the wave Tools and Technology for Taxonomists Taxonomy Bootcamp London October 16, 2018 Catching the wave Dave Clarke CEO Synaptica Catching the wave Tools and Technology for Taxonomists Taxonomy Bootcamp London October 16, 2018 Agenda Three questions AI Linked Data Ontology Semantic Web

More information

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad

More information

Semantic web. Tapas Kumar Mishra 11CS60R32

Semantic web. Tapas Kumar Mishra 11CS60R32 Semantic web Tapas Kumar Mishra 11CS60R32 1 Agenda Introduction What is semantic web Issues with traditional web search The Technology Stack Architecture of semantic web Meta Data Main Tasks Knowledge

More information

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Taking a view on bio-ontologies Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Who we are European Bioinformatics Institute one of world s largest bio data and service providers

More information

Vocabulary and Semantics in the Virtual Observatory

Vocabulary and Semantics in the Virtual Observatory Vocabulary and Semantics in the Virtual Observatory Norman Gray VO-TECH / AstroGrid / Uni. Leicester / Uni. Glasgow, UK VOEvent BoF, ADASS, London, 2007 September 24 rdf Resource Description Framework

More information

Exploring and Using the Semantic Web

Exploring and Using the Semantic Web Exploring and Using the Semantic Web Mathieu d Aquin KMi, The Open University m.daquin@open.ac.uk What?? Exploring the Semantic Web Vocabularies Ontologies Linked Data RDF documents Example: Exploring

More information

Converting a thesaurus into an ontology: the use case of URBISOC

Converting a thesaurus into an ontology: the use case of URBISOC Advanced Information Systems Laboratory Cost Action C2 Converting a thesaurus into an ontology: the use case of URBISOC J. Nogueras-Iso, J. Lacasta Alcalá de Henares, 4-5 May 2007 http://iaaa.cps.unizar.es

More information

A Semantic MediaWiki-Empowered Terminology Registry

A Semantic MediaWiki-Empowered Terminology Registry Proc. Int l Conf. on Dublin Core and Metadata Applications 2009 A Semantic MediaWiki-Empowered Terminology Registry Qing Zou School of Information Studies McGill University, Canada qing.zou2@mail.mcgill.ca

More information

The World Bank Enterprise Search Program. Luisita Guanlao The World Bank Group May 10, 2005

The World Bank Enterprise Search Program. Luisita Guanlao The World Bank Group May 10, 2005 The World Bank Enterprise Search Program Luisita Guanlao The World Bank Group May 10, 2005 Agenda Background Enterprise Search Strategy Key Challenges and Lessons Learned History Pre-Internet Search by

More information

Sec. 8.7 RESULTS PRESENTATION

Sec. 8.7 RESULTS PRESENTATION Sec. 8.7 RESULTS PRESENTATION 1 Sec. 8.7 Result Summaries Having ranked the documents matching a query, we wish to present a results list Most commonly, a list of the document titles plus a short summary,

More information

Chapter 6. Queries and Interfaces

Chapter 6. Queries and Interfaces Chapter 6 Queries and Interfaces Keyword Queries Simple, natural language queries were designed to enable everyone to search Current search engines do not perform well (in general) with natural language

More information

Watson & WMR2017. (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself)

Watson & WMR2017. (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself) Watson & WMR2017 (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself) R. BASILI A.A. 2016-17 Overview Motivations Watson Jeopardy NLU in Watson

More information

Microsoft SharePoint Server 2013 Plan, Configure & Manage

Microsoft SharePoint Server 2013 Plan, Configure & Manage Microsoft SharePoint Server 2013 Plan, Configure & Manage Course 20331-20332B 5 Days Instructor-led, Hands on Course Information This five day instructor-led course omits the overlap and redundancy that

More information

MEASURING SEMANTIC SIMILARITY BETWEEN WORDS AND IMPROVING WORD SIMILARITY BY AUGUMENTING PMI

MEASURING SEMANTIC SIMILARITY BETWEEN WORDS AND IMPROVING WORD SIMILARITY BY AUGUMENTING PMI MEASURING SEMANTIC SIMILARITY BETWEEN WORDS AND IMPROVING WORD SIMILARITY BY AUGUMENTING PMI 1 KAMATCHI.M, 2 SUNDARAM.N 1 M.E, CSE, MahaBarathi Engineering College Chinnasalem-606201, 2 Assistant Professor,

More information

Metadata Standards and Applications. 6. Vocabularies: Attributes and Values

Metadata Standards and Applications. 6. Vocabularies: Attributes and Values Metadata Standards and Applications 6. Vocabularies: Attributes and Values Goals of Session Understand how different vocabularies are used in metadata Learn about relationships in vocabularies Understand

More information

Data formats for exchanging classifications UNSD

Data formats for exchanging classifications UNSD ESA/STAT/AC.234/22 11 May 2011 UNITED NATIONS DEPARTMENT OF ECONOMIC AND SOCIAL AFFAIRS STATISTICS DIVISION Meeting of the Expert Group on International Economic and Social Classifications New York, 18-20

More information