Frankfurt 16 June 2010 Local Metadatamanagement in a global environment Daniel van Spanje Global Productmanager Metadata Services OCLC
metadata has become the structure on which we re building information services. In: Metadata everywhere: the catalog is out of the box, p. 5. (Nextspace, no. 15, p. 4-9) 2
The catalogue is changing 3
The catalogue is a mash-up 4
Dewey T2 94 Australia displayed with Google Earth
The paradigm shift in cataloging Main Entry: 2 catalog Variant(s): or catalogue Function: verb Inflected Form(s): cataloged or catalogued; catalog ing or catalogu ing Date: 1598 Transitive verb 1 : to make a catalog of 2 a : to enter in a catalog b : to classify (as books or information) descriptively intransitive verb 1 : to make or work on a catalog 2 : to become listed in a catalog at a specified price <this stamp catalogs at $2> cat a log er or cat a logu er noun (http://www.merriam-webster.com/dictionary/cataloging) 6
The catalog as the direct result of cataloging http://www.rug.nl/cit/organisatie/pictogram/archief/2000-7- 3/leidseboekjes.htm
Cataloging changes into Structured metadata storage of Efficiencies data in workflows management Index engines search algorithms relevance ranking The OPAC changes into a mash-up and Discovery and access discovery Content experience Integration
The user changed 9
From Paris Principles (1961) to International Cataloguing Principles (2009) 10
Key findings: End users bring their expectations from popular Web sites to online catalogs The end user s delivery experience is as important, if not more important than the discovery experience Most important for analog materials: summaries, tables of contents, etc. http://www.oclc.org/reports/onlinecatalogs/default.htm Most important for e- content: linking to the content itself
Librarian/Staff Results: Highlighted Differences End-User Results: Recommended Enhancements 14 Recommended enhancements to WorldCat Total end-user responses 1 4
Meet the user primary personas of worldcat.org
Metadata Sources are also changing Professionally produced Bibliography cataloging ; abstracting and indexing services Authority and classification data Terminologies Publication supply chain data Author/User contributed Institutional repositories Scholarly portals (e.g., arxiv.org) Tags, reviews, lists, etc. Mined WorldCat Identities VIAF Facets for topics, places, events FRBR Work Sets 3 rd party content Databases and article content, Evaluative content Full text
Globalization James Rosenau (NWU) 1924 -.. Web page at GWU People count (2007) Distant proximities (2003) 15
16
17
The OCLC strategy part A: web scale Outward Integration Into the Cloud Integration should be outward rather than inward, with libraries seeking to use their components in new ways --Interviewee for LC report on future of the catalog (Karen Calhoun)
Create system-wide efficiencies in library management WorldCat: an inclusive platform Arabic Bengali Chinese Cyrillic Devanagari Greek Hebrew Japanese Korean Latin Tamil Thai MARC ONIX Dublin Core FRBR Full Text New Contributors Links Unicode Graphics, sound, motion
Create system-wide efficiencies in library management WorldCat Growth since 1998 180 160 Millions of records 183 140 120 139 100 108 80 86 60 40 20 39 41 44 47 50 52 55 61 67 0 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
Create system-wide efficiencies in library management Multilingual WorldCat Total Records 1998 37.5m records 2010 117.2 m records Percentage of Non-English Records English German French 23.9 m 2.2 m 2.3 m 64.3 m 17.9 m 16.3 m 36% Spanish 1.6 m 4.5 m Chinese Dutch.7m.2m 4.3 m 2.9 m 1998 Japanese.8 m 2.8 m Russian Italian Latin.8 m.7 m.3 m 2.3 m 2.1 m 1.9 m 53.8% Hebrew.3 m 1.2 m Portuguese.2 m 1.1 m 2009
Create system-wide efficiencies in library management Managing the collective collection 1.9 billion items and growing! Physical holdings in WorldCat 175 million bib records 3.6 million digital items 1.5 billion holdings Licensed digital content in library collections 325 million electronic database records JSTOR: 4.5 million records Gale: 100 million records Local library content being digitized 30 million items (Google, HathiTrust, OAIster)
More metadata: building the knowledge base Core resources are the same EBSCO KB release 87 (june 2010) totals: ProQuest Links Holdings HW Wilson 119 providers 1721 collections 1383 full-text collections Elsevier ScienceDirect SWETS Collections Titles etc 4,317,813 records: 949,348 full-text records 1,102,359 records for free content 2,285,148 ebook records
WorldCat knowledge base: library input Each library can provide us with a feed of the holdings from their existing link resolver / knowledge base system. The data is matched to the WorldCat knowledge base and holdings are set on records that are already present in our knowledge base data. Links Holdings Collections Titles
The OCLC strategy part B: make OCLC Web Services a valued part of library operations WorldCat Search API WorldCat Registry API s xisbn xissn WorldCat Identities Dewey.info WorldCAT Knowledge base API Google Book Syndication SRU-based metadatasynchronization
http://dewey.info - Experimental space for linked DDC data - Available now: Linked data version of the DDC Summaries in ten languages - Features include: - Access to the top three levels of the DDC in eleven languages - Actionable URIs for every class of the DDC Summaries - Classification semantics encoded in SKOS - Representations for machines and for humans - Data is reusable under a Creative Commons BY-NC-ND license
HTML view across all versions in all languages
Putting it together: multilingual and mobile
WorldCat Knowledge Base API model WorldCat knowledge base API: Designed to expose KB metadata Includes read access Includes write access Based on OpenURL 1.0 syntax Documented protocol syntax Same API used for internal and external services Supports service-neutral principles All OCLC Discovery, Delivery, Metadata & Management System Services KBWC API Librarians KBWC Web-scale Management Services Links Holdings WorldCat Local Service WMS ERM WMS License Manager Collection Builder Collections Titles Third Party Service
Create system-wide efficiencies in library management Embedded Google Book Search content
OCLC The world s libraries. Connected. More collaboration More institutions More Web-scale More innovation More synchronization Local Global Group 33
Create system-wide efficiencies in library management Synchronization: real-time update Records added automatically to WorldCat Record id s sent back to original database 4 million holdings contributed to date SRU based (Search and Retrieve via URL / Online process) NCC (Dutch Union Catalogue) this FY prev this FY this mth Total records to Wcat this FY 350.937 338.358 12.579 Total adds to Wcat this FY 268.742 260.274 8.468 Total merges to Wcat this FY 82.195 78.084 4.111
Create system-wide efficiencies in library management Synchronization: real-time 2Way update Operational between NCC (Netherlands) and WorldCAT since March 2010 Supported actions: replace and delete (not: validate, create) Tags can be added, modified or deleted Current options: add only or add only if tag not yet present Tags involved: UDC, DDC, LCC, NLM, LCSH, LC geographical names, URL s to additional content Actions per week between 164.996 in week 11 and 703.105 in week 19 Success rate of the actions between 96.6 % and 97.3 % 35
Create system-wide efficiencies in library management Synchronization: real-time 2Way update SRU statistics GGC (NL) from WorldCAT (SRU2Way): tags added to bib records UDC NLM DDC LCC LCSH LCGN kmc 5000 (045G) kmc 5040 (045C) kmc 5010 (045F) kmc 5030 (045A) kmc 5500 (044A) kmc 5501 (044B) Week 11 (March) 303 10.714 48.841 74.441 172.845 48.637 Week 12 (March) 761 16.305 84.145 120.080 293.403 70.194 Week 13 (March) 222 10.464 57.451 86.632 196.781 37.365 Week 14 (April) 235 21.044 64.274 98.540 247.242 48.799 Week 15 (April) 234 12.656 54.770 96.816 208.134 46.909 Week 16 (April) 198 7.878 32.887 55.732 135.115 25.317 Week 17 (April) 228 6.592 37.019 60.638 139.762 29.063 Week 18 (May) 721 17.591 108.470 170.567 559.669 115.988 Week 19 (May) 772 26.159 135.209 264.316 786.668 165.367 Week 20 (May) 355 10.471 56.563 291.181 291.181 57.362 Week 21 (May) 652 8.687 51.521 82.535 396.628 101.342 Week 22 (June) 507 12.738 53.905 92.738 376.651 73.705 Week 23 (June) 169 2.459 14.525 36.400 76.537 12.018 Total 5.357 163.758 799.580 1.530.616 3.880.616 832.066 Average per week 412 12.597 61.506 117.740 298.509 64.005 36
Synchronisation Gateway Z39.50 Client SQL Query Library System OAI ILS Local PMH API Scripts Widget Tailor SRU update Send S R U Detect / Elicit
Conclusion: metadata management workflows are changing Before the Web (B.W.) For finding and managing library collections (mostly print) After the Web (A.W.) For finding and managing many types of materials, for many user communities Catalogue records (well-understood rules and encoding conventions) Create a Many types of records, many sources Create a Shared cooperative cataloguing systems Loosely database coupled metadata management, reuse and exchange services among multiple repositories catalogue Usually handcrafted, one record at a time Multiple batch creation and metadata in text extract, conversion, mapping, ingest and transfer services Record creation and editing generally a solitary activity undertaken by librarytrained professionals with actionable Metadata creation data and manipulation a fundamentally collaborative activity involving various specialists 38
Create system-wide efficiencies in library management Tomorrow s cataloging platform and next generation cataloging: An open cataloging and metadata platform that supports the organization and management of library collections (physical, licensed electronic and digital collections) Manage the collection Functionality to efficiently create, manage and share metadata both at the Manage single information object collection level and at the level of a single information object Shared metadata creation and management not only for individual titles and works, but also for people, places, concepts and other types of information objects Easy, fast ways to ingest an OCLC cataloging subscriber s data and keep member holdings synchronized with WorldCAT A vibrant, worldwide Expert Community working with OCLC to extend and improve WorldCAT quality Integration with WorldCAT Local and WMS, and support use by external Webscale and outward integration39 applications for collections and metadata management (Nextspace, no. 15, p.3) Share metadata and authority control Create efficient workflows Support manual metadata management
Frankfurt 16 June 2010 Thank you!