The World Bank Enterprise Search Program. Luisita Guanlao The World Bank Group May 10, 2005

Agenda
- Background
- Enterprise Search Strategy
- Key Challenges and Lessons Learned

History
[Timeline figure, 1992 / 2000 / 2005. Stage labels: Pre-Internet, Search by Browse, Search Blank, Enterprise Search. Example labels: Personal Network, Yahoo, Alta Vista, Google. Trend labels: Finding right information, Searchable content collections, # of irrelevant search results.]

Bank Search Structure Now

Feedback from Client Community
Search Does Not Work!

Findings
1. Absolute success rate per search: 93%. However, this result is sometimes achieved at a high cost in staff time and productivity.
2. Absolute success rate per search task: 43.18%.
3. Source with the fewest steps: Colleague or Personal Contact. Source with the greatest number of steps: External Web Search Browse.
4. The Intranet search was the most logical place to look for information in over 65% of cases, yet the success rate for the Intranet search was 35%.
5. Colleagues or contact people were selected as a last resort, even though users were always successful when they did this. This may mean that users have good expectations of other sources such as the Intranet search or browse.

Findings
6. Types of searches: known items (48%), learning and discovery searching (20%), searches by multiple parameters (14%), topical searching (10%).
7. The quick-reference and directional kinds of search tasks had higher success rates than research-oriented searching.

Findings
8. Each source has its own behavior, business rules, and functional architecture, so users need to learn each system.
9. The search experience generally consists of multiple steps and multiple searches within and across sources.
10. There is a disconnect between what and why we publish to the Intranet or External Web sites and users' expectations regarding what they will find:
    - There are several searchable resources, and users do not know where to start looking.
    - The purpose of, and expected content in, our individual repositories is not always clear.
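The gap between findings 1 and 2 (success per search versus success per search task) and the step counts in finding 3 can be illustrated with a small sketch. The slides do not define how either rate was measured, so everything here is an assumption: the `SearchStep` record, the `resolved_tasks` set (tasks whose information need was fully met), and the sample data are hypothetical and do not reproduce the study's figures.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class SearchStep:
    """One logged search step (hypothetical record layout)."""
    task_id: str                  # the information need this step belongs to
    source: str                   # e.g. "intranet_search", "colleague"
    returned_useful_result: bool  # did this single search return something usable?


def success_rate_per_search(steps):
    """Share of individual search steps that returned a useful result."""
    if not steps:
        return 0.0
    return sum(s.returned_useful_result for s in steps) / len(steps)


def success_rate_per_task(steps, resolved_tasks):
    """Share of distinct tasks whose information need was fully met."""
    task_ids = {s.task_id for s in steps}
    if not task_ids:
        return 0.0
    return len(task_ids & set(resolved_tasks)) / len(task_ids)


def mean_steps_by_source(steps):
    """Average number of steps per task, computed separately for each source."""
    counts = defaultdict(lambda: defaultdict(int))
    for s in steps:
        counts[s.source][s.task_id] += 1
    return {
        source: sum(per_task.values()) / len(per_task)
        for source, per_task in counts.items()
    }


# Hypothetical sample log: three tasks, seven search steps.
STEPS = [
    SearchStep("t1", "intranet_search", True),
    SearchStep("t1", "intranet_search", True),
    SearchStep("t2", "intranet_search", False),
    SearchStep("t2", "colleague", True),
    SearchStep("t3", "external_web_browse", True),
    SearchStep("t3", "external_web_browse", True),
    SearchStep("t3", "external_web_browse", True),
]
RESOLVED_TASKS = {"t2"}  # assumed: only task t2 ended with the need fully met

if __name__ == "__main__":
    print("per search:", success_rate_per_search(STEPS))
    print("per task:  ", success_rate_per_task(STEPS, RESOLVED_TASKS))
    print("steps:     ", mean_steps_by_source(STEPS))
```

Under these assumptions, most individual searches can return something usable while few tasks are actually resolved, which is one way the two rates can diverge.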

Agenda
- Background
- Enterprise Search Strategy
- Key Challenges and Lessons Learned

Staff Expectations
On the whole, staff are looking for:
- An enterprise view that encompasses all WB institutional repositories and external collections
- Ability to support known-item searching
- Ability to find the right answer to their information query
- Lowest level of effort to achieve a successful search result
- Consistent behavior across sources
- Ability to extract and customize content based on individual needs

Enterprise Search
- Does not preclude search within existing systems
- Deals with the findability problem
- Goal: fewer and more relevant results
- Initial focus is on surfacing information stored in institutional repositories
- Email, files in network drives, and desktop are not in initial scope

Search Improvement Strategy
- Search Governance
    - Structure and processes
    - Search Framework and Standards
    - Metrics
- Search as a Service
    - Search Service Provisioning
    - Training
    - Communications
    - Feedback Loop
- Implementation
    - Search within Application
    - Enterprise Search

Guiding Principles
- Data-driven search with disciplined data
    - Metadata enrichment
    - Institutional Reference Sources
- Standardize Search
    - Enterprise Search
    - Existing Systems
- Continuous Improvement Through Metrics
    - Governance
    - Process

Search Governance
- Business Sponsorship
- Alignment with corporate priorities
- Policies
- Standards
- Metrics
- Funding

Search as a Service
- Support
- Training
- Change Management/Communications

Components of Enterprise Search
- Search Portal/Interface: search interface, results set display, browsing structures, recommender and similarity linking
- Search Engine: query filtering, query processing algorithms, indexing
- Metadata Repository: metadata store, metadata tools and utilities, reporting, metamodel repository
- Metadata Improvement in Institutional Systems: concept extraction, categorization, and summarization
- Institutional Reference Sources

Enterprise Search Functional Architecture: Positioning for Semantic Search
[Architecture diagram. Labels recoverable from the slide, in source order:]
- Content Aggregator, Recommender Engine, Content Syndication, Personalization Profiles
- Social or Task Filtering, Threshold Filtering, Query Processing Algorithms, Query Manipulation Options
- Classification Schemes, Vocabulary Support, Results Display & Manipulation, Cross Language Searching
- Search Interface (Simple and Fielded Search)
- Index Utilities, Utilities, Metadata Extracts, Metadata Loads, Metadata Maintenance Utilities
- Security Policy, Change Mgmt. Processes, Interface Templates
- Union Index, Parametric Indexes, MDR Tools & Utilities, Enhanced Common Data Stores* (*includes thesaurus support and taxonomies)
- MD IRIS, MD CMS, MD LMS, MD Image Bank, MD Global JOLIS, MD IRAMS, MD JOLIS
- Search tools, Consolidated Metadata Store, Metadata Extracts, Automated Metadata Capture
- MetaModel Repository: Application MetaModel, Metadata Repository MetaModel, Relational MetaModel, Business MetaModel, Logical MetaModel
- Including: transformation rules, reporting specs, loader programs, data standards, data rationalization
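To make the "Union Index" and "Simple and Fielded Search" labels more concrete, here is a minimal sketch of how metadata extracts from several repositories might be merged into one index with thesaurus support. This is not the Bank's implementation: the repository names come from the slide, but the records, the `THESAURUS` mapping, and every function name are hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical metadata extracts, one list of records per repository.
EXTRACTS = {
    "IRIS": [
        {"id": "iris-1", "title": "Water supply project report", "language": "en"},
    ],
    "JOLIS": [
        {"id": "jolis-7", "title": "Irrigation handbook", "language": "en"},
        {"id": "jolis-9", "title": "Rapport sur l'eau", "language": "fr"},
    ],
    "Image Bank": [
        {"id": "img-3", "title": "Water pump photo", "language": "en"},
    ],
}

# Hypothetical thesaurus: a query term maps to related terms to search as well.
THESAURUS = {"water": {"irrigation"}}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_union_index(extracts):
    """Merge all repository extracts into one inverted index over titles."""
    index = defaultdict(set)
    records = {}
    for repository, recs in extracts.items():
        for rec in recs:
            key = (repository, rec["id"])
            records[key] = rec
            for token in tokenize(rec["title"]):
                index[token].add(key)
    return index, records


def search(index, records, query, thesaurus=None, **fields):
    """Simple search (all query terms must match) plus optional fielded filters."""
    thesaurus = thesaurus or {}
    result = None
    for term in tokenize(query):
        # Expand each term with its thesaurus entries before lookup.
        variants = {term} | thesaurus.get(term, set())
        hits = set().union(*(index.get(v, set()) for v in variants))
        result = hits if result is None else result & hits
    result = result or set()
    # Fielded search: keep only records whose metadata matches every filter.
    return sorted(
        key for key in result
        if all(records[key].get(f) == v for f, v in fields.items())
    )


INDEX, RECORDS = build_union_index(EXTRACTS)

if __name__ == "__main__":
    print(search(INDEX, RECORDS, "water", thesaurus=THESAURUS))
    print(search(INDEX, RECORDS, "rapport", language="fr"))
```

One index over all repositories lets a single query reach every collection, while the fielded filters narrow the results set using the same metadata.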

Agenda
- Background
- Enterprise Search Strategy
- Key Challenges and Lessons Learned

Key Challenges
- Quality of Metadata in Institutional Systems
- Comprehensiveness of collection
- Multi-lingual support
- Cross-lingual Search
- Relevant results set
- Contextualized
- Personalized
- Recommendations
- Expanded content types (e.g., video)
- Googlesque

Going beyond
- Full text searching
    - Not doable given volume of information at Bank
    - High noise level/irrelevant results set
- Google
    - Limited to text documents, not other formats (e.g., audio, video)
- Limitations of a search blank
    - Lacks ability to provide personalized or contextualized results

Lessons Learned
- Search is not a project; it is a program
- Search projects are never complete
- Search is not solely a technology problem
- Search is not a byproduct of application systems (storing vs. access)
- Contextualization, semantic interoperability begins in legacy systems
- Establish metrics to benchmark progress in Search investments
- Continuous improvement through metrics
- Search competency center
    - Information management
    - Technology management
    - Metrics management
    - Program management
    - Domain experts

Thank You lguanlao@worldbank.org