Best Practices for World-Class Search
Transcription
1 Best Practices for World-Class Search MARY HOLSTEGE Distinguished Engineer, 4 June 2018 MARKLOGIC CORPORATION
3 Search Application: Search for a Purpose
- To answer questions about the world
- To discover useful information
- To find relevant information so it can be acted upon
- To bring important information to someone's attention
- To partition data into useful chunks for analysis
5 Oh, the humanities!
Great search needs great understanding of humans
- Linguistics
- Psychology
- Anthropology
6
- CC at: rs Blocks some melanin. Often gives light colored eyes.
- GG at: rs Blocks some melanin. Often gives light colored eyes.
- CC at: rs Low Melanin. Basis for Gray, Blue, Green, or Yellow Eyes if no other pigmentation is present.
- CC at: rs Blocks some melanin. Often gives light colored eyes.
- CT at: rs Blue.
- CT at: rs Blue.
- CT at: rs Contrasting sphincter around pupil.
- AA at: rs Med Brown on Sphincter
- AA at: rs Weak Amber Gradient
- TT at: rs Penetrance modifier. Blue.
- GG at: rs Gray ring around outer edge
- TT at: rs Starburst (Collarette)
8 Improve Search Application: Fitter for Purpose
- Improve answers to questions
- Improve discoverability of information
- Improve usefulness of information
- Improve relevance of information
- Improve ability to act on information
- Improve visibility of important information
- Improve ability to partition information
10 The Search Tripod
- Data
- Query
- Response
11 Battleplan
- Analysis => Understanding
- Augmentation => Pertinence
- Interaction => Use
- Feedback => Continual improvement
12 Search for a Purpose: Improve answers to questions
- Understand question, understand answer
- Make answer pertinent to question
- Make answer useful and usable
13 Data
14 Data
- Analysis: Keywords; Classification; Data modeling; Entity recognition; Link relationships; Quality rankings; Metrics
- Augmentation: Metadata; Collections; Semantic markup; Entity markup; Quality; Range dimensions; Linked data; Reference data; "SEO"
- Interaction: Document proxies; Visualizations; Clustering; Linking
- Feedback: Popularity; Reviews/ratings; Folk taxonomies; Annotations; Linking
16 Semantic Markup With Linkage to Facts
- Augment data with RDF triples, ontology
- Create entity dictionary from SKOS ontology
- Analyze and augment data with entity markup
17
    geo:cayman_islands a skos:Concept;
        skos:inScheme geo:area;
        skos:prefLabel "Cayman Islands"^^xsd:string;
        geo:agriculturalareanotes "Manual Estimation"^^xsd:string;
        geo:agriculturalareatotal 2.7;
        geo:agriculturalareaunit "1000 Ha"^^xsd:string;
        geo:agriculturalareayear "2009"^^xsd:int;
        geo:populationtotal 56.0;
        geo:populationunit "1000"^^xsd:string;
        geo:populationyear "2010"^^xsd:int;
        geo:codedbpediaid "Cayman_Islands"^^xsd:string;
        geo:countryareatotal 26.4;
        geo:countryareaunit "1000 Ha"^^xsd:string;
18
    <para> <ex:business id=" Partners</ex:business> has a controlling interest in several companies with accounts in the <geo:area id=" Islands</geo:area>, regulators have learned. Other offshore accounts in <geo:area id=" and <geo:area id=" have been tied to the chairman, Mr. Q. A spokesperson for Mr. Q declined to comment. </para>
    <para> Investigations continue. </para>
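The entity markup step on slides 16 to 18 can be sketched roughly as follows. This is a minimal illustration in Python, not the MarkLogic implementation from the talk: the dictionary entries, the ID values, and the function name `mark_up_entities` are assumptions made up for the example, and a real dictionary would be generated from the SKOS ontology's labels.

```python
import re
from xml.sax.saxutils import escape, quoteattr

# Hypothetical entity dictionary: label -> (element name, concept ID).
# The talk derives this from a SKOS ontology; these entries are invented.
ENTITY_DICTIONARY = {
    "Cayman Islands": ("geo:area", "geo:cayman_islands"),
    "XYZW Partners": ("ex:business", "ex:xyzw_partners"),
}


def mark_up_entities(text, dictionary=ENTITY_DICTIONARY):
    """Wrap each dictionary label found in text in an element carrying its ID."""
    # Try longer labels first so a long label wins over a shorter one it contains.
    labels = sorted(dictionary, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(label) for label in labels))
    pieces = []
    position = 0
    for match in pattern.finditer(text):
        element, concept_id = dictionary[match.group(0)]
        # Escape the plain text between matches, then emit the tagged entity.
        pieces.append(escape(text[position:match.start()]))
        pieces.append(
            f"<{element} id={quoteattr(concept_id)}>"
            f"{escape(match.group(0))}</{element}>"
        )
        position = match.end()
    pieces.append(escape(text[position:]))
    return "".join(pieces)


marked = mark_up_entities("XYZW Partners has accounts in the Cayman Islands.")
print(marked)
```

The sketch matches labels as exact, case-sensitive strings without word-boundary checks; a production entity extractor would also need to handle alternate labels, inflection, and ambiguity.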
26 Query
27 Query
- Analysis: Canned queries; FAQs; Query patterns; Entity recognition; Natural(istic) language; Metrics
- Augmentation: User interests; Synonyms; Related terms; Entity queries; Disambiguation; Semantic queries; Parsed string
- Interaction: Facets; Sliders; Timelines; Maps; Breadcrumbs; Scatter/gather
- Feedback: User behaviors; User interests; Common queries
29 Analyze and Transform Query
- Create special dictionaries for query analysis
- Normalize query words, strip stopwords
- Replace with tagged forms
- Match tagged query to query patterns
- Parse tagged query with appropriate bindings
30
    // Input query
    "Does XYZW Partners have business in the Cayman Islands?"
    // Normalized
    "XYZW Partners has business Cayman Islands"
    // Analyzed and tagged query string
    (business:221932)
    (control:hold OR get:receive OR suffer:have)
    (person:clientele OR organization:enterprise OR event:commerce)
    (location:cayman_islands)
31
    // Matched to query pattern "business:* control:hold organization:*" to disambiguate
    "business: control:hold organization:enterprise location:cayman_islands"
    // Parsed with appropriate bindings
    cts:and-query((
      cts:field-value-query("id", "
      cts:or-query(("control","hold","govern","run","have"),"synonym"),
      cts:or-query(("organization","business","enterprise","institution"),"synonym"),
      cts:or-query((
        cts:field-value-query("id","
        "George Town", "West Bay", "Bodden Town", "East End", "North Side", "West End"
      ))
    ))
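The first four steps from slide 29 (dictionary, normalization, tagging, pattern matching) might look something like the following Python sketch. It is not the speaker's code: the stopword list, the tag dictionary, and the function names are assumptions for illustration, and the dictionary is hand-picked so that each phrase has a single tag, whereas the slides show several candidate tags per word that the pattern match then disambiguates.

```python
import re
from fnmatch import fnmatchcase

# Hypothetical dictionaries; real ones would come from the ontology and query logs.
STOPWORDS = {"does", "in", "the"}
TAG_DICTIONARY = {
    ("xyzw", "partners"): "business:221932",
    ("cayman", "islands"): "location:cayman_islands",
    ("have",): "control:hold",
    ("business",): "organization:enterprise",
}


def normalize(query):
    """Lowercase the query, split it into words, and strip stopwords."""
    tokens = re.findall(r"[a-z0-9]+", query.lower())
    return [token for token in tokens if token not in STOPWORDS]


def tag(tokens, dictionary=TAG_DICTIONARY):
    """Replace the longest known phrase at each position with its tagged form."""
    longest = max(len(phrase) for phrase in dictionary)
    tagged = []
    i = 0
    while i < len(tokens):
        for n in range(min(longest, len(tokens) - i), 0, -1):
            phrase = tuple(tokens[i:i + n])
            if phrase in dictionary:
                tagged.append(dictionary[phrase])
                i += n
                break
        else:
            # Unknown word: keep it untagged.
            tagged.append(tokens[i])
            i += 1
    return tagged


def matches_pattern(tagged, pattern):
    """Check whether the leading tags match a pattern such as 'business:* control:hold'."""
    parts = pattern.split()
    return len(tagged) >= len(parts) and all(
        fnmatchcase(item, part) for item, part in zip(tagged, parts)
    )


tagged_query = tag(normalize("Does XYZW Partners have business in the Cayman Islands?"))
print(tagged_query)
print(matches_pattern(tagged_query, "business:* control:hold organization:*"))
```

The final step on the slide, parsing the tagged query into a `cts:` query with bindings, is specific to MarkLogic and is left out of this sketch.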
39 Response
40 Response
- Analysis: Clusters; Links; Disambiguation; Re-ranking; Best bets; Metrics
- Augmentation: Document proxies; Result clusters; Facts; Info boxes; Related queries; Relevance tuning; Navigational cues
- Interaction: Annotated TOCs; Scatter/gather; Linked data views; Facets; Sliders; Timelines; Maps
- Feedback: User behavior
42 Infobox
- Extract concept IDs from query string
- Select best entity for results
- Query for facts related to ID
43
    // Input query
    "Does XYZW Partners have business in the Cayman Islands?"
    // Extracted entity IDs
    " "
44
    // Facts for chosen ID (predicate + object)
    "Manual Estimation"^^xsd:string
    "1000 Ha"^^xsd:string
    "2009"^^xsd:int
    "1000"^^xsd:string
    "2010"^^xsd:int
    "Cayman_Islands"^^xsd:string
    "1000 Ha"^^xsd:string
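A rough Python sketch of the infobox steps on slide 42, using a small in-memory triple list in place of a triple store. The fact values are taken from slide 17; the entity IDs, the "most frequently mentioned in the results" selection rule, and the function names are assumptions, since the slides do not say how the best entity is chosen.

```python
from collections import Counter

# A few facts from slide 17, held as (subject, predicate, object) tuples.
TRIPLES = [
    ("geo:cayman_islands", "skos:prefLabel", "Cayman Islands"),
    ("geo:cayman_islands", "geo:populationtotal", 56.0),
    ("geo:cayman_islands", "geo:populationunit", "1000"),
    ("geo:cayman_islands", "geo:populationyear", 2010),
    ("geo:cayman_islands", "geo:countryareatotal", 26.4),
    ("geo:cayman_islands", "geo:countryareaunit", "1000 Ha"),
]


def select_best_entity(query_ids, result_entity_ids):
    """Pick the query entity mentioned in the most results (assumed heuristic)."""
    counts = Counter()
    for ids in result_entity_ids:
        counts.update(set(ids) & set(query_ids))
    if not counts:
        return None
    # max() keeps the first maximum, so ties fall back to query order.
    return max(query_ids, key=lambda entity_id: counts[entity_id])


def facts_for(entity_id, triples=TRIPLES):
    """Return the (predicate, object) pairs recorded for one entity."""
    return [(p, o) for s, p, o in triples if s == entity_id]


# Hypothetical IDs extracted from the query, and the IDs marked up in each result.
query_ids = ["ex:xyzw_partners", "geo:cayman_islands"]
results = [
    ["geo:cayman_islands"],
    ["geo:cayman_islands", "ex:xyzw_partners"],
    ["geo:cayman_islands"],
]
best = select_best_entity(query_ids, results)
infobox = facts_for(best)
print(best, infobox)
```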
50 SUMMARY: Building Great Search Applications
- Humans
  - Understand user purposes
    - Answers
    - Discovery
    - Analysis
    - Action
  - Humanities
- Search tripod
  - Strengthen all legs
    - Data
    - Query
    - Response
- Battleplan
  - Analysis => Understanding
  - Augmentation => Pertinence
  - Interaction => Use
  - Feedback => Continual improvement
51 Questions?
52 Appendix
63 Data
64 Goals
- Improve searchability
  - Precision: specific simplified scoping
  - Recall: related information explicit
  - Ranking: explicit context, quality
- Improve usability
  - Facts from text
  - Summarization, TOCs, other proxies
- Improve discoverability
  - Facets, classifications
65 Analysis
- Keywords and classification
- Modeling
- Entity recognition
- Link analysis
- Quality rankings
66 Augmentation
- Metadata
- Collections
- Semantic markup
- Entity markup
- Quality
- Range dimensions
- Linked data
- Contextual information
- Reference data
67 Interaction
- Document proxies
  - KWIC snippets
  - Summaries
  - TileBar
  - Color lines
- Relationships
  - Clustering
  - Link visualizations
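As one example of a document proxy from slide 67, a KWIC (keyword in context) snippet can be produced along these lines. This is a minimal Python sketch under simple assumptions: plain-text input, exact case-insensitive keyword matching, and a fixed character window; the function name and the bracket highlighting are choices made for the example.

```python
import re


def kwic_snippets(text, keyword, width=30):
    """Return one keyword-in-context snippet per occurrence of keyword."""
    snippets = []
    for match in re.finditer(re.escape(keyword), text, flags=re.IGNORECASE):
        start = max(0, match.start() - width)
        end = min(len(text), match.end() + width)
        # Mark truncation with an ellipsis on whichever side was cut.
        left = ("..." if start > 0 else "") + text[start:match.start()]
        right = text[match.end():end] + ("..." if end < len(text) else "")
        snippets.append(f"{left}[{match.group(0)}]{right}")
    return snippets


document = (
    "Regulators found offshore accounts in the Cayman Islands. "
    "Other offshore accounts were tied to the chairman."
)
for snippet in kwic_snippets(document, "offshore", width=20):
    print(snippet)
```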
68 Feedback
- Popularity, reviews, ratings => quality adjustments, range adjustments
- Folk taxonomies => classification
- Annotations => related information
- Additional linked data => context
69 Selected Techniques
- Expert indexers (keywords, classifications)
  - Primed/augmented with classification engines
- Semantic markup
  - Authoring toolchain
  - Harmonization/projection
- Entity markup
  - NER integrations
  - Ontology-driven or query-based entity extraction
- Linked data, reference data
70 Queries
71 Goals
- Improve expressibility
  - Simpler expression => better query
- Improve search effectiveness
  - Precision, recall, ranking
  - Focus in, focus out
72 Analysis
- Canned queries, query patterns, FAQs
- Identify known entities
- Natural(istic) language interpretation
73 Augmentation
- Boost queries
  - User interests, related terms
- Synonym, thesaurus expansion
- Canned query recognition
- Entity recognition
- Query expansion
- Contextual augmentation
- Relevance tweaks
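Synonym and thesaurus expansion from slide 73 could be sketched in Python as below. The synonym lists reuse the words shown on slide 31; the thesaurus structure, the 0.5 down-weighting of synonyms, and the function name are assumptions for illustration rather than anything stated in the talk.

```python
# Hypothetical thesaurus keyed by head term; synonyms are the words from slide 31.
THESAURUS = {
    "hold": ["control", "govern", "run", "have"],
    "enterprise": ["organization", "business", "institution"],
}


def expand_query(terms, thesaurus=THESAURUS, synonym_weight=0.5):
    """Return (term, weight) pairs: original terms at 1.0, added synonyms lower."""
    weights = {}
    for term in terms:
        weights[term] = 1.0
    for term in terms:
        for synonym in thesaurus.get(term, []):
            # setdefault keeps the full weight of a term the user actually typed.
            weights.setdefault(synonym, synonym_weight)
    return list(weights.items())


expanded = expand_query(["hold", "enterprise"])
print(expanded)
```

Weighting the added terms below the original ones is one way to gain recall without letting synonyms outrank what the user actually asked for.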
74 Interaction
- Natural(istic) language input
- Query builders: facets, timelines, maps, links, forms
- More like this, less like that
- Result zooming/slicing
- Breadcrumbs
75 Feedback
- User behavior => disambiguation
- User interests => query augmentation
- Common queries => canned queries and results
76 Selected Techniques
- Query string parsing/pre-processing
  - NLP
  - Regex analysis
  - NER, ontology-driven entity extraction
- Augmentation with profiles, thesauri, etc.
- Reverse query against FAQs
- Range/geospatial indexes
77 Response
78 Goals
- Improve usefulness
  - Immediate answers
  - Navigation
  - Context
- Improve conversation
  - What happens next?
79 Analysis
- Clustering
- Disambiguation
- Re-ranking
- Best bets
80 Augmentation
- Document proxies and context
- Results clustering
- Specific facts
- Info boxes
- Related queries
- Relevance tuning
- Navigational cues (within/across documents)
- Exploration interfaces
81 Interaction
- Results cluster navigation
- Linked data views
- Facet/timeline/map/sliders
- Annotated TOCs
- Counts/coloring
- Immediate action affordances
82 Feedback
- User actions => adjust rankings, adjust interests
- Hide/expose => streamlined personal interface
83 Selected Techniques
- Snippeting, summarization
- Clustering
- Re-ranking, LTR
- Query entities => result entities => related information
- Range/geospatial indexes => display widgets
- Linked data presentations
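The idea of feeding user actions back into rankings (slides 82 and 83) can be illustrated with a very small re-ranking sketch in Python. The scoring formula, the `feedback_weight` parameter, and the use of click counts as the feedback signal are all assumptions for the example; this is a hand-tuned boost, not learning to rank (LTR), which would fit the weights from data.

```python
import math


def rerank(results, click_counts, feedback_weight=0.1):
    """Re-order (doc_id, score) pairs, boosting documents users clicked before."""
    def adjusted(result):
        doc_id, score = result
        # log1p dampens the boost so heavily clicked documents do not dominate.
        return score + feedback_weight * math.log1p(click_counts.get(doc_id, 0))

    return sorted(results, key=adjusted, reverse=True)


# Hypothetical base relevance scores and click history.
results = [("doc-a", 1.0), ("doc-b", 0.95), ("doc-c", 0.5)]
clicks = {"doc-b": 20}
reranked = rerank(results, clicks)
print([doc_id for doc_id, _ in reranked])
```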
Case-based Recommendation Peter Brusilovsky with slides of Danielle Lee Where we are? Search Navigation Recommendation Content-based Semantics / Metadata Social Modern E-Commerce Site The Power of Metadata
More informationPoolParty. Thesaurus Management Semantic Search Linked Data. ISKO UK, London September 14, Andreas Blumauer
PoolParty Thesaurus Management Semantic Search Linked Data ISKO UK, London September 14, 2010 Andreas Blumauer Some thoughts on the Semantic Web In the Semantic Web, it is not the Semantic which is new,
More informationEx Libris Accessibility Conformance Report
Name of Product/Version: Ex Libris Primo / February 2018 release Ex Libris Accessibility Conformance Report Level A and AA VPAT Version 2.0 Product Description: Ex Libris Primo provides a fast, comprehensive,
More informationContent Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered.
Content Enrichment An essential strategic capability for every publisher Enriched content. Delivered. An essential strategic capability for every publisher Overview Content is at the centre of everything
More informationA Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2
A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,
More informationVocabulary Alignment for archaeological Knowledge Organization Systems
Vocabulary Alignment for archaeological Knowledge Organization Systems 14th Workshop on Networked Knowledge Organization Systems TPDL 2015 Poznan Lena-Luise Stahn September 17, 2015 1 / 20 Summary Introduction
More informationA Quick Start Guide On How To Promote Your Website Using the Total SEO Toolkit
A Quick Start Guide On How To Promote Your Website Using the Total SEO Toolkit Welcome to the Total SEO Toolkit, a turn-key SEO Platform with state-of-the-art reporting functionality! We thought it would
More informationNew Approach to Graph Databases
Paper PP05 New Approach to Graph Databases Anna Berg, Capish, Malmö, Sweden Henrik Drews, Capish, Malmö, Sweden Catharina Dahlbo, Capish, Malmö, Sweden ABSTRACT Graph databases have, during the past few
More informationWeb UI Dos and Don ts
Web UI Dos and Don ts 1. A One Column Layout instead of multi-columns a. A one column layout gives you more control over your narrative. It guides your readers in a more predictable way from top to bottom.
More informationCatching the wave Tools and Technology for Taxonomists Taxonomy Bootcamp London October 16, 2018
Catching the wave Dave Clarke CEO Synaptica Catching the wave Tools and Technology for Taxonomists Taxonomy Bootcamp London October 16, 2018 Agenda Three questions AI Linked Data Ontology Semantic Web
More informationMIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion
MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad
More informationSemantic web. Tapas Kumar Mishra 11CS60R32
Semantic web Tapas Kumar Mishra 11CS60R32 1 Agenda Introduction What is semantic web Issues with traditional web search The Technology Stack Architecture of semantic web Meta Data Main Tasks Knowledge
More informationTaking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria
Taking a view on bio-ontologies Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Who we are European Bioinformatics Institute one of world s largest bio data and service providers
More informationVocabulary and Semantics in the Virtual Observatory
Vocabulary and Semantics in the Virtual Observatory Norman Gray VO-TECH / AstroGrid / Uni. Leicester / Uni. Glasgow, UK VOEvent BoF, ADASS, London, 2007 September 24 rdf Resource Description Framework
More informationExploring and Using the Semantic Web
Exploring and Using the Semantic Web Mathieu d Aquin KMi, The Open University m.daquin@open.ac.uk What?? Exploring the Semantic Web Vocabularies Ontologies Linked Data RDF documents Example: Exploring
More informationConverting a thesaurus into an ontology: the use case of URBISOC
Advanced Information Systems Laboratory Cost Action C2 Converting a thesaurus into an ontology: the use case of URBISOC J. Nogueras-Iso, J. Lacasta Alcalá de Henares, 4-5 May 2007 http://iaaa.cps.unizar.es
More informationA Semantic MediaWiki-Empowered Terminology Registry
Proc. Int l Conf. on Dublin Core and Metadata Applications 2009 A Semantic MediaWiki-Empowered Terminology Registry Qing Zou School of Information Studies McGill University, Canada qing.zou2@mail.mcgill.ca
More informationThe World Bank Enterprise Search Program. Luisita Guanlao The World Bank Group May 10, 2005
The World Bank Enterprise Search Program Luisita Guanlao The World Bank Group May 10, 2005 Agenda Background Enterprise Search Strategy Key Challenges and Lessons Learned History Pre-Internet Search by
More informationSec. 8.7 RESULTS PRESENTATION
Sec. 8.7 RESULTS PRESENTATION 1 Sec. 8.7 Result Summaries Having ranked the documents matching a query, we wish to present a results list Most commonly, a list of the document titles plus a short summary,
More informationChapter 6. Queries and Interfaces
Chapter 6 Queries and Interfaces Keyword Queries Simple, natural language queries were designed to enable everyone to search Current search engines do not perform well (in general) with natural language
More informationWatson & WMR2017. (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself)
Watson & WMR2017 (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself) R. BASILI A.A. 2016-17 Overview Motivations Watson Jeopardy NLU in Watson
More informationMicrosoft SharePoint Server 2013 Plan, Configure & Manage
Microsoft SharePoint Server 2013 Plan, Configure & Manage Course 20331-20332B 5 Days Instructor-led, Hands on Course Information This five day instructor-led course omits the overlap and redundancy that
More informationMEASURING SEMANTIC SIMILARITY BETWEEN WORDS AND IMPROVING WORD SIMILARITY BY AUGUMENTING PMI
MEASURING SEMANTIC SIMILARITY BETWEEN WORDS AND IMPROVING WORD SIMILARITY BY AUGUMENTING PMI 1 KAMATCHI.M, 2 SUNDARAM.N 1 M.E, CSE, MahaBarathi Engineering College Chinnasalem-606201, 2 Assistant Professor,
More informationMetadata Standards and Applications. 6. Vocabularies: Attributes and Values
Metadata Standards and Applications 6. Vocabularies: Attributes and Values Goals of Session Understand how different vocabularies are used in metadata Learn about relationships in vocabularies Understand
More informationData formats for exchanging classifications UNSD
ESA/STAT/AC.234/22 11 May 2011 UNITED NATIONS DEPARTMENT OF ECONOMIC AND SOCIAL AFFAIRS STATISTICS DIVISION Meeting of the Expert Group on International Economic and Social Classifications New York, 18-20
More information