Taxonomy Tools: Collaboration, Creation & Integration. Dow Jones & Company

Taxonomy Tools: Collaboration, Creation & Integration. Dave Clarke, Global Taxonomy Director, dave.clarke@dowjones.com. Dow Jones & Company

Introduction Software Tools for Taxonomy 1. Collaboration 2. Creation 3. Integration

Dow Jones
Handles massive volumes of data 24x7, every day: over 500,000 documents per day; 10,000+ sources; 22 languages
Expertise to create and maintain a robust taxonomy, including: 310,000+ company codes; 820+ industries; 520+ subjects; 340+ regions
3.6 million documents/month; 700 feeds; 152 countries; 60-terabyte content server

Taxonomy Tools


Collaboration WHO needs to get involved? WHAT do they need to do? HOW do they work together?

WHO
Cross-functional team: Categorization; Content Management; Information Technology; Knowledge Management; Knowledge Workers; Library; Metadata; Ontology; Search; Subject Matter Expertise; Taxonomy

WHAT
Assess: Business Goals; Content; IT; Metadata; Taxonomy; Standards & Best Practices
Design: Audience Segmentation & Definition; Facet Analysis; Information Architecture; Editorial Guidelines & Workflow
Build: Entity Extraction (machine and/or human); Content Tagging Rules (machine and/or human); Taxonomy Construction & Mapping
Maintain: Continuous Work-in-progress; Engage end users (query log analysis, focus groups, folksonomy); Governance Process
Users

HOW
Web workspace: location-independent access for in-house stakeholders and, very often, external consultants and SMEs
Task-oriented: work-oriented views for teams of people performing different tasks; the flip side of the collaboration coin is compartmentalization
Role-based: multiple levels of functional permission for fine-tuning what users can do to particular sets of terms
Workflow: New Candidates; Primary Review; Rejected for Rework; Secondary QC; Deactivated / Deleted; Approved & Published; Withdrawn & Replaced By
Governance: design need-to-know reports for each stakeholder group / stage in the workflow
Alerts: schedule the reports to be generated automatically and email alerts to designated recipients
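
The workflow statuses listed above can be sketched as a small state machine. The status names come from the slide; the permitted transitions between them are an assumption for illustration, since the slide names the stages but not the arrows connecting them.

```python
from enum import Enum


class Status(Enum):
    NEW_CANDIDATE = "New Candidate"
    PRIMARY_REVIEW = "Primary Review"
    REJECTED_FOR_REWORK = "Rejected for Rework"
    SECONDARY_QC = "Secondary QC"
    APPROVED_PUBLISHED = "Approved & Published"
    WITHDRAWN_REPLACED = "Withdrawn & Replaced By"
    DEACTIVATED_DELETED = "Deactivated / Deleted"


# Assumed transitions (not stated in the source): each status maps to the
# statuses a term may move to next.
ALLOWED = {
    Status.NEW_CANDIDATE: {Status.PRIMARY_REVIEW},
    Status.PRIMARY_REVIEW: {Status.SECONDARY_QC, Status.REJECTED_FOR_REWORK},
    Status.REJECTED_FOR_REWORK: {Status.PRIMARY_REVIEW, Status.DEACTIVATED_DELETED},
    Status.SECONDARY_QC: {Status.APPROVED_PUBLISHED, Status.REJECTED_FOR_REWORK},
    Status.APPROVED_PUBLISHED: {Status.WITHDRAWN_REPLACED, Status.DEACTIVATED_DELETED},
    Status.WITHDRAWN_REPLACED: set(),
    Status.DEACTIVATED_DELETED: set(),
}


def advance(current: Status, target: Status) -> Status:
    """Return the new status, or raise if the move is not in ALLOWED."""
    if target not in ALLOWED[current]:
        raise ValueError(f"{current.value} -> {target.value} is not permitted")
    return target


status = advance(Status.NEW_CANDIDATE, Status.PRIMARY_REVIEW)
status = advance(status, Status.SECONDARY_QC)
status = advance(status, Status.APPROVED_PUBLISHED)
print(status.value)
```

Keeping the transitions in one table makes the governance rules easy to review and change without touching the code that applies them.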

Creation (models, methods and trends for building taxonomies) Folksonomies Taxonomies Semantic webs

Classic Taxonomy Classification based Web portals Navigation aids File-folder metaphor Ad-hoc groupings 2-dimensional

Faceted Taxonomy Separate taxonomies for individual attributes Content tagged to facets separately, not pre-coordinated Used as orthogonal search filters n-dimensional
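
A minimal sketch of facets used as orthogonal search filters. The facet names (industry, region, subject) echo the Dow Jones code sets mentioned earlier; the documents, the tag values, and the `facet_filter` helper are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Document:
    title: str
    # Each facet is tagged separately, e.g. {"industry": {"Banking"}}
    facets: Dict[str, Set[str]] = field(default_factory=dict)


def facet_filter(docs: List[Document], **selected: str) -> List[Document]:
    """Keep documents that carry every selected facet value (filters combine with AND)."""
    return [
        d for d in docs
        if all(value in d.facets.get(facet, set()) for facet, value in selected.items())
    ]


# Illustrative data only.
docs = [
    Document("Bank merger", {"industry": {"Banking"}, "region": {"Europe"}, "subject": {"Mergers"}}),
    Document("Oil prices", {"industry": {"Energy"}, "region": {"Europe"}, "subject": {"Commodities"}}),
    Document("Bank earnings", {"industry": {"Banking"}, "region": {"Asia"}, "subject": {"Earnings"}}),
]

europe_banking = facet_filter(docs, industry="Banking", region="Europe")
print([d.title for d in europe_banking])
```

Because each facet is stored independently, any combination of filters can be applied at query time without a pre-built category for that combination.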

Faceted Taxonomy in Action


Folksonomy Tag Clouds Web 2.0 Folksonomy Un-controlled Un-structured Social tagging User participation Wikis Collaboration Blogs

Pros & Cons Folksonomy lets users create (and adopt) terminology that is meaningful to themselves, but does so at the expense of precision and recall for the general user (meta noise). Controlled vocabularies solve the precision-recall trade-off, but their insistence on preferred terminology imposes one-size-fits-all order on a heterogeneous user community.

A Middle Path Audience-Centric Taxonomy 1. Segment a user community into Audiences 2. Develop a core-taxonomy but append extensions to it which store the terminology and hierarchy preferences of each audience 3. Leverage folksonomy and social tagging systems to help inform the evolution of the audience-centric taxonomies
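
One way to picture the three steps above: a core concept with per-audience extensions for labels and hierarchy, plus a rule that promotes a popular social tag into an audience's vocabulary. The class, the audience names, the sample labels, and the `min_uses` threshold are all assumptions for illustration.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Concept:
    core_label: str
    core_parent: Optional[str] = None
    # Extensions keyed by audience name (step 2).
    audience_labels: Dict[str, str] = field(default_factory=dict)
    audience_parents: Dict[str, str] = field(default_factory=dict)

    def label_for(self, audience: str) -> str:
        """Audience-preferred label, falling back to the core taxonomy."""
        return self.audience_labels.get(audience, self.core_label)

    def parent_for(self, audience: str) -> Optional[str]:
        """Audience-preferred parent, falling back to the core hierarchy."""
        return self.audience_parents.get(audience, self.core_parent)


def adopt_popular_tag(concept: Concept, audience: str,
                      tag_counts: Counter, min_uses: int = 25) -> bool:
    """Step 3: adopt the most-used social tag as the audience label if it is popular enough."""
    if not tag_counts:
        return False
    tag, uses = tag_counts.most_common(1)[0]
    if uses >= min_uses:
        concept.audience_labels[audience] = tag
        return True
    return False


ma = Concept(
    core_label="Mergers & Acquisitions",
    core_parent="Corporate Actions",
    audience_labels={"traders": "M&A"},
    audience_parents={"traders": "Deal News"},
)
adopted = adopt_popular_tag(ma, "analysts", Counter({"M&A deals": 40, "takeovers": 12}))
print(ma.label_for("traders"), "|", ma.label_for("librarians"), "|", ma.label_for("analysts"))
```

The core taxonomy stays the single source of truth; audiences only override what they need to.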

Audience-Centric Views The world of your content Audience-centric views provide access and navigation orientated for different user perspectives Conceptual representation of the content as a semantic web

Semantic Webs & Ontologies Concept-oriented rather than terminology-oriented semantic web Formally defined relationships Extensible concept types & extensible relationship types Resource Description Framework (RDF)

Integration Components Talking to each other RDF Files & Web Service Calls

Components Taxonomy Categorization Search Content

Talking to Each Other
n components, 1 common integration: Taxonomy; Content; Categorization; Search
W3C RDF-based open standards (SKOS & OWL)
Web Services: ad hoc transactions and small data sets
XML File Libraries: published versions and large data sets
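
As a rough illustration of a taxonomy published in an RDF-based open standard, the sketch below renders concepts as SKOS in Turtle syntax using only string formatting. The SKOS namespace and property names are the W3C ones; the `ex:` namespace, the concept identifiers, and the labels are hypothetical, and a production system would more likely use an RDF library than hand-built strings.

```python
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."
EX_PREFIX = "@prefix ex: <http://example.org/taxonomy/> ."  # hypothetical namespace


def escape(text: str) -> str:
    """Escape backslashes and double quotes for a Turtle string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def concept_to_turtle(concept_id, pref_label, alt_labels=(), broader=None, lang="en"):
    """Render one concept as a Turtle block using SKOS properties."""
    lines = [
        f"ex:{concept_id} a skos:Concept ;",
        f'    skos:prefLabel "{escape(pref_label)}"@{lang}',
    ]
    for alt in alt_labels:
        lines[-1] += " ;"
        lines.append(f'    skos:altLabel "{escape(alt)}"@{lang}')
    if broader:
        lines[-1] += " ;"
        lines.append(f"    skos:broader ex:{broader}")
    lines[-1] += " ."
    return "\n".join(lines)


def taxonomy_to_turtle(concepts):
    """Join the prefix declarations and one block per concept."""
    body = "\n\n".join(concept_to_turtle(**c) for c in concepts)
    return f"{SKOS_PREFIX}\n{EX_PREFIX}\n\n{body}\n"


turtle = taxonomy_to_turtle([
    {"concept_id": "financialServices", "pref_label": "Financial Services"},
    {"concept_id": "banking", "pref_label": "Banking",
     "alt_labels": ["Banks"], "broader": "financialServices"},
])
print(turtle)
```

A file like this suits the "published versions and large data sets" path, while the same concepts could be served one at a time over a web service for ad hoc lookups.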

EXAMPLE From Idea to Published Output

Whiteboard your entities and relationships
[Diagram. Entities: John Doe; Widgets; ABC Corporation; PQR Corporation; XYZ Corporation; New York. Relationship pairs: Employer Of / Employed By; Manufacturer Of / Manufactured By; Located In / Location Of; Client Of / Vendor To]

Step 1 Design the conceptual structure Concept types Data elements Relationship types Semantic rules
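
A hedged sketch of what a conceptual structure might look like in code: relationship types with inverses, and a semantic rule restricting which concept types each relationship may connect. The relationship names come from the whiteboard; the concept types (Person, Company, Product, Place), the domain and range assignments, and the specific pairing of entities are assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RelationshipType:
    name: str
    inverse: str
    domain: str   # concept type allowed as the subject (assumed)
    range_: str   # concept type allowed as the object (assumed)


RELATIONSHIP_TYPES = {
    r.name: r for r in [
        RelationshipType("Employed By", "Employer Of", "Person", "Company"),
        RelationshipType("Manufacturer Of", "Manufactured By", "Company", "Product"),
        RelationshipType("Located In", "Location Of", "Company", "Place"),
        RelationshipType("Client Of", "Vendor To", "Company", "Company"),
    ]
}

# Entity -> concept type. The typing of each entity is illustrative.
entities = {
    "John Doe": "Person",
    "ABC Corporation": "Company",
    "Widgets": "Product",
    "New York": "Place",
}


def relate(subject: str, rel_name: str, obj: str):
    """Check a statement against the domain/range rule; return it with its inverse."""
    rel = RELATIONSHIP_TYPES[rel_name]
    if entities[subject] != rel.domain or entities[obj] != rel.range_:
        raise ValueError(f"'{subject} {rel_name} {obj}' violates the domain/range rule")
    return [(subject, rel.name, obj), (obj, rel.inverse, subject)]


# Hypothetical statement: the whiteboard does not say who employs John Doe.
statements = relate("John Doe", "Employed By", "ABC Corporation")
print(statements)
```

Declaring inverses once in the model means editors enter a relationship in one direction and the reciprocal is derived rather than typed by hand.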

Step 2 Input entities and build relationships Key data via GUI Import Excel files Import XML files

Step 3 Publish HTML Browser CSV Download XML/RDF Export
From whiteboard to published RDF in 30 minutes
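
The publishing step could be approximated as follows: the same set of subject-relationship-object statements written out once as CSV and once as RDF. N-Triples is used here for brevity rather than the XML/RDF export named on the slide; the `http://example.org/` namespace and the sample statement are hypothetical.

```python
import csv
import io
from urllib.parse import quote

BASE = "http://example.org/"  # hypothetical namespace


def iri(name: str) -> str:
    """Turn a display name into an IRI reference under BASE."""
    return f"<{BASE}{quote(name.replace(' ', '_'))}>"


def to_csv(statements) -> str:
    """Serialize statements as CSV with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["subject", "relationship", "object"])
    writer.writerows(statements)
    return buf.getvalue()


def to_ntriples(statements) -> str:
    """Serialize statements as N-Triples, one triple per line."""
    return "".join(f"{iri(s)} {iri(p)} {iri(o)} .\n" for s, p, o in statements)


# Illustrative statement only.
published = [("John Doe", "Employed By", "ABC Corporation")]
print(to_csv(published))
print(to_ntriples(published))
```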

Thank You Questions Comments dave.clarke@dowjones.com