The Smithsonian/NASA Astrophysics Data System

Similar documents
The Role of Repositories and Journals in the Astronomy Research Lifecycle

Semantics at ADS Labs. Rahul Dave, Alberto Accomazzi Sponsored by Microsoft Research and ADS

Astronomy Dataverse: enabling astronomer data publishing.

The Virtual Observatory and the IVOA

Demystifying Scopus APIs

Invenio: A Modern Digital Library for Grey Literature

American Institute of Physics

Demos: DMP Assistant and Dataverse

Brown University Libraries Technology Plan

Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University

Designing an institutional research data management infrastructure for the life sciences

How Primo Works VE. 1.1 Welcome. Notes: Published by Articulate Storyline Welcome to how Primo works.

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Implementation of Open-World, Integrative, Transparent, Collaborative Research Data Platforms: the University of Things (UoT)

INSTITUTIONAL REPOSITORY SERVICES

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel

Scitation A User Guide

The IPAC Research Archives. Steve Groom IPAC / Caltech

Editorial Workflow Tasks. Michaela Barton Account Coordinator

Invenio: a modern digital library system. <

Next-Generation Scholarly Discovery. Director Portfolio Strategy Microsoft Research

PDS, DOIs, and the Literature. Anne Raugh, University of Maryland Edwin Henneken, Harvard-Smithsonian Center for Astrophysics

Introduction to Bibliometrics and Tools for Organizing References. Uta Grothkopf ESO Library

Extending the Facets concept by applying NLP tools to catalog records of scientific literature

The Portal Aspect of the LSST Science Platform. Gregory Dubois-Felsmann Caltech/IPAC. LSST2017 August 16, 2017

Discovery to Delivery with WorldCat Local

Queen s University Library. Research Data Management (RDM) Workflow

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST)

CrossRef tools for small publishers

Web of Science. Platform Release Nina Chang Product Release Date: December 10, 2017 EXTERNAL RELEASE DOCUMENTATION

Introducing the Springer Nature Data Support Services

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin. Frankfurt Book Fair 2017 October 11, 2017

SHARING YOUR RESEARCH DATA VIA

Data publication and discovery with Globus

SciVerse Scopus. 1. Scopus introduction and content coverage. 2. Scopus in comparison with Web of Science. 3. Basic functionalities of Scopus

IVOA and European VO Efforts Status and Plans

Semantic Scholar. ICSTI Towards a More Efficient Review of Research Literature 11 September

Next Generation Library Catalogs: opportunities. September 26, 2008

Electronic Resources Simplexity

Science Panel Discussion presentation: "A Data Sharing Story"

TDNet Discover User Manual

Research Elsevier

NTUST Library Service & E-Resource

Educating a New Breed of Data Scientists for Scientific Data Management

ProQuest Dissertations and Theses Overview. Austin McLean and Marlene Coles CGS Summer Workshop, July 2017

Data Curation Handbook Steps

Performing searches on Érudit

Web of Science 8.0. Alain Frey, Customer Education and Support

(

Enhanced retrieval using semantic technologies:

Your Open Science and Research Publishing Platform. 1st SciShops Summer School

Federal STI Managers Group Presentation to Board on Research Data and Information Ellen Herbst Director, NTIS CENDI Chair

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

PRELIDA. D2.3 Deployment of the online infrastructure

Mendeley Institutional Edition Group Owner Guide

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Issues and Opportunities Associated with Federated Searching

Data Curation Profile Plant Genomics

Digital The Harold B. Lee Library

OCLC Global Council November 9, WorldCat Local. Maria Savova. Collection Development and Special Projects Librarian McGill University Library

User Guide: Navigating PNAS Online

CDL s Web Archiving System

21. Search Models and UIs for IR

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

SciX Open, self organising repository for scientific information exchange. D15: Value Added Publications IST

DOIs for Research Data

September Development of favorite collections & visualizing user search queries in CERN Document Server (CDS)

Scuola di dottorato in Scienze molecolari Information literacy in chemistry 2015 SCOPUS

Your Research Social Media: Leverage the Mendeley platform for your needs

Searchable. Readable. Relatable. E-Journal Platform for Japanese Academic Societies

Smart Federated Search for Egyptian Knowledge Bank

Maximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009

CatalogWS Library Catalog as Versatile Discovery Platform. Tito Sierra and Joseph Ryan Digital Library Initiatives NCSU Libraries

Protecting Future Access Now Models for Preserving Locally Created Content

EBSCO Host - H W Wilson ( Click on continue button. Select OminiFile Full-text Mega.

Blackwell Synergy New Design Preview

Serving communication needs of physical scientists

Glossary: AAS American Astronomical Society AURA Association of Universities for Research in Astronomy

A document-inspired way for tracking changes of RDF data The case of the OpenCitations Corpus

Implementing a Discovery Tool

ACM Digital Library. LIBRARY SERVICES

User guide. Created by Ilse A. Rasmussen & Allan Leck Jensen. 27 August You ll find Organic Eprints here:

The worlds largest collection of optics & photonics research. SPIEDigitalLibrary.org

DATA SHARING FOR BETTER SCIENCE

Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France

Reproducibility and FAIR Data in the Earth and Space Sciences

Full-text scientific databases, electronic resources and improving the quality of scientific research

The Canadian CyberSKA Project

What do we need to build a successful knowledge base?

Personal User Guide : adding content to Pure 1

PrototypeofaDiscoveryToolforQuerying Heterogeneous Services

SCOPUS. Scuola di Dottorato di Ricerca in Bioscienze e Biotecnologie. Polo bibliotecario di Scienze, Farmacologia e Scienze Farmaceutiche

USER S GUIDE FOR THE ECONOMICS ELECTRONIC LIBRARY

LUND UNIVERSITY Open Access Journals dissemination and integration in modern library services

DCC Disciplinary Metadata

ScienceDirect. Goes beyond search to research. Genevieve Musasa - Customer Consultant Africa

<Metadata>! What is Crossref metadata Who uses Crossref metadata services How Crossref provides this content

Interoperability Framework Recommendations

Colorado Alliance of Research Libraries 3801 E. Florida, Ste. 515 Denver, CO (303) FAX: (303) Copyright Colorado Alliance

Transcription:

The Smithsonian/NASA Astrophysics Data System Status Report Alberto Accomazzi Michael Kurtz Harvard-Smithsonian Center for Astrophysics http://ads.harvard.edu

The ADS Project Established in 1989 (before the web!) as a portal for accessing astronomical data and bibliographic metadata Was restructured in 1994 to become an A&I service for astronomers and astrophysicists, with fulltext archive Has 100% penetration in astronomical community, with take-up in other areas of space sciences, engineering and physics

History of the ADS project (1/2) 1992: literature database implemented 1993: implemented federated search to object database 1994: web-based version of ADS launched 1995: fulltext of ApJ Letters digitized and online 1996: citation data incorporated in ADS; links between bib records and datasets created; first mirror site online 1997: indexed astronomy preprints from arxiv 1998: online readership via ADS > worldwide print readership 1999: incorporated 1.2M citations via text mining A. Accomazzi - The Smithsonian/NASA Astrophysics Data System IWGDD 11/19/2009

History of the ADS project (2/2) 2000: incorporated usage data, citation ranking 2001: 10th mirror site comes online 2003: myads notification service launched 2006: introduced basic search, openurl links 2007: implemented user login system 2008: introduced topic search 2002: 300K historical scans from microfilms online 2004: introduced fulltext search, private libraries 2005: introduced daily database updates, RSS feeds A. Accomazzi - The Smithsonian/NASA Astrophysics Data System IWGDD 11/19/2009

ADS Collaborators GScholar NED Vizier GSky IVOA WWT MAST Chandra SIMBAD Libraries Services Archives HEASARC ADS Libraries Repositories Publishers Springer ComPADRE JSTAGE arxiv SPIRES Elsevier IOP Wiley T&F CrossRef A. Accomazzi - The Smithsonian/NASA Astrophysics Data System IWGDD 11/19/2009

Who uses ADS Astronomers & Astrophysicists Solar physicists, planetary scientists Space scientists, engineers Geophysicists Physicists Amateur astronomers

ADS Data Holdings Over 8M bibliographic metadata records: Astronomy: 1.7M Physics: 5.4M Arxiv e-prints: 590K Citations: 40M (over 3.4M papers with citations) Curated links: 23M (fulltext, data products, citations) Fulltext archive of literature: 4M scanned pages, 544K articles 650K pages historical material

Over past 12 months ingested 800K metadata records bibliographic metadata coverage up by 10% citations up by 20% to 40M ingested 336K pages of fulltext (180K pages historical)

ADS Usage ( account Over 40K registered users (with login Over 8,000 myads users signed up for notifications Over 30K regular users (> 10 queries/month) 1.2M users from search engines every month 8M database queries every month 16M bibliographic records retrieved every month 2.5M scanned article pages downloaded every month month 1.2M fulltext downloads via ADS every

Operations Collaborations and agreements with over 100 partners: 21 major publishers contributing ~600 journals CrossRef metadata services other like-minded projects 72 minor publishers contributing 100+ journals astrophysics data archives, observatories, missions

Metadata Ingest Coverage: Astronomy, Planetary Sciences Physics, Geophysics, general science (Nature, Science, PNAS, etc.) Journals, conference proceedings, reports, observing proposals, bulletins, newsletters, PhD Thesis Database updates: daily from e-prints, weekly for Astronomy database, 2x month for Physics database Metadata feeds, completeness, quality all over the map; lots of effort goes in curation, merging of metadata, tracking provenance: high-throughput, high/low quality (publishers) low-throughput, hand-generated (observatories, other projects, small publications), conference proceedings, staff and contractors A. Accomazzi - The Smithsonian/NASA Astrophysics Data System IWGDD 11/19/2009

Future Plans Restructure system architecture Improve database completeness and metadata curation Improve user interface Extend search capabilities Enhance personalization, recommendations Incorporate semantic web technologies

System Architecture New architecture for metadata curation and search based on CERN Invenio/INSPIRE Currently extending Invenio search interface and capability to match ADS s requirements Implementing pipeline for metadata ingest in new system from ADS classic Goal: alpha system online by Jan 2011

System Customizations ADS-style author indexing and searching ADS-style word parsing and indexing Implementation of synonym search from knowledge base (authors, text words) Simplified use of word weights for scoring Federated search of object databases Selection/filtering by bibliographic groups

ADS Labs A playground for testing new ideas, technologies A bridge present with the future, without having to worry about legacy too much Aim: receive feedback from users, test performance and impact of new tools, services It s OK if we break things in the labs

Labs Ideas Metadata enhancement (normalized keywords, affiliations, ORCID experiments) Infrastructure (caching, federated searching, RDF output) New services (recommendation, fulltext search) User interface (search, facets, widgets, topic browser)

New User Interface Looks TBD Remove clutter, hide complexity, use javascript/ajax where appropriate Use facets throughout Provide recommendations, context when appropriate

Fulltext Search Enable searching and viewing of papers in ADS fulltext archive Allow searching of current content, with links to publisher s fulltext offerings Support text mining efforts, metadata enrichment Currently investigating available technologies (SOLR, Invenio)

Links to Data ADS provides links to data catalogs, astronomical objects, data archives Metadata about links provided by publishers, collaborators, librarians Involved in VAO Data Curation & Preservation effort to expand data linking via semantics Goal: enable data discovery, use literature as filter on data (and vice-versa)

Collaborators Notifications: myads My citations Recent papers Hottest papers

Review articles A. Accomazzi - The Smithsonian/NASA Astrophysics Data System IWGDD 11/19/2009

objects in papers A. Accomazzi - The Smithsonian/NASA Astrophysics Data System IWGDD 11/19/2009

WWT links back to ADS A. Accomazzi - The Smithsonian/NASA Astrophysics Data System IWGDD 11/19/2009

Recommendations

Conclusions ADS project has 100% penetration in community, used by other disciplines (physics) and general public Current project focus is IT restructuring, technology realignment, UI redesign (2 yr timeline) Many new features and technologies being tested and considered, ready to collaborate with partners (e.g. ORCID) Future plans include support for text mining, linked data applications, integration of observational metadata, increased personalization & recommendations

Demo Faceted Topic Search Keyword Topic Cluster Article Recommeder (alpha)