Engaging and Connecting Faculty: Research Discovery, Access, Re-use, and Archiving
Janet McCue and Jon Corson-Rikert
Albert R. Mann Library, Cornell University
CNI Spring 2007 Task Force Meeting, April 16, 2007

"An essential process is the joining together of subcultures when a wider common language is needed. Often two groups independently develop very similar concepts, and describing the relationship between them brings great benefits. Like a Finnish-English dictionary, or a weights-and-measures conversion table, the relations allow communication and collaboration even when the commonality of concept has not (yet) led to a commonality of terms."
Tim Berners-Lee, James Hendler, and Ora Lassila, "The Semantic Web," Scientific American, May 2001

Focus
  NSF Small Grant for Exploratory Research (SGER) to explore collaboration between scientists and research library staff
  Semantic web applications that create a virtual research community

Research questions
  When is laboratory-library collaboration feasible and desirable?
  How should the responsibilities of the library and the laboratory be balanced?
  What significant challenges and costs are associated with various activities?
  What is a conceptual model for collaboration that would allow us to focus on promoting the preservation and discovery of resources valuable for interdisciplinary research?

Library goals
  Direct: better information
    More timely, accurate, and complete documentation of data
    Formats that can more easily be preserved
    More standard metadata to promote discovery
  Indirect: better community
    Promote awareness and exchange of data prior to and after publication
    Promote collaboration within and across disciplines
    Promote new methods of publication that include data
    Keep libraries front and center in the academy

Cornell Language Acquisition Lab (Barbara Lust)
  Large digitization requirement from analog audio tapes
  Collecting and standardizing metadata

Upper Susquehanna Applied Ecology Program
  Multiple investigators
  Born-digital data

"NSF's data policies will be redesigned as necessary to mitigate existing sociological and cultural barriers to data sharing and access, and to bring them into accord across programs and ensure coherence. This will lead to the development of a suite of harmonized policy statements supporting data open access and usability. NSF's actions will promote a change in culture such that the collection and deposition of all appropriate digital data and associated metadata become a matter of routine for investigators in all fields. This change will be encouraged through an NSF-wide requirement for data management plans in all proposals. These plans will be considered in the merit review process and will be actively monitored post-award."
Cyberinfrastructure Vision for 21st Century Discovery (March 2007)

The approach
  Involve the library early on
  Provide tangible, low-barrier, near-term assistance
  Know the culture (librarians with disciplinary expertise)
  Offer a blend of services through one point of contact
  Offer patterns, tools, and training rather than data processing
  Demonstrate benefits for collaboration and exchange as well as long-term stewardship

Think from the faculty perspective
  Facilitate a primary goal: publication
    Communicates the science
    Validates the research project
    Enhances personal and lab reputations (especially for young investigators)
    Lays the groundwork for future funding
  New: help faculty comply with funding agency requirements for data management plans

Provide synergistic services
  Faculty know how to collect and analyze data
    Help with collaboration, data sharing, special analysis or display (e.g., GIS), and metadata creation
    Publicize the availability of additional services such as high-performance computing and large dataset storage
  Faculty are familiar with the publication process
    May fall short on the data formats and metadata that enable data to be re-used and ultimately preserved
    May appreciate help preparing data for a repository

Provide wikis
  Goals
    Internal project use: minutes, posters, mini-grants
    Informal data review, comment, and sharing
    Handoffs between data producers and modelers
  Results
    Ideally one willing, tech-savvy person on the project to manage and reorganize content
    Projects asked for more storage for data and documents
    Word gets out: three other projects now have wikis

Provide portals
  Public access to:
    Background information
    Research plans
    Participants
    Activities
    Public datasets (available now or anticipated)

Provide staging repositories
  Expand from the wiki to support larger data sets and/or data backup
  Provide project-level access and exchange prior to publication
  Coordinate with statistical consulting, computer and information science, and high-performance computing for value-added services
  Provide a platform for appraisal and selection prior to submission to an institutional or disciplinary repository
  Begin to tap the services of the Grid

Provide metadata tools & training
  Local expertise geared toward domain standards
    Open Language Archives Community (OLAC)
    Ecological Metadata Language (EML)
  Community has developed tools for metadata creation (Morpho) and management (Metacat)
  Library hosts, customizes, and teaches these tools
  Workshop attended by three of four USAEP PIs
  Researcher enters metadata only once, in the format closest to their domain
  Conversion process to create Dublin Core for DSpace
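The "enter once, convert to Dublin Core" step can be pictured with a small crosswalk. The sketch below is illustrative only and is not the conversion process used in the project: the sample record is a hypothetical, heavily simplified EML-like document (real EML is namespaced and much richer), and the `CROSSWALK` table and `to_dublin_core` function are assumed names invented for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified EML-like record; not a valid EML document.
EML_SAMPLE = """
<eml>
  <dataset>
    <title>Stream nitrate concentrations</title>
    <creator><individualName><surName>Doe</surName></individualName></creator>
    <abstract>Weekly grab samples.</abstract>
    <keywordSet><keyword>nitrate</keyword><keyword>streams</keyword></keywordSet>
  </dataset>
</eml>
"""

# Assumed crosswalk: path in the simplified record -> Dublin Core element name.
CROSSWALK = [
    ("dataset/title", "title"),
    ("dataset/creator/individualName/surName", "creator"),
    ("dataset/abstract", "description"),
    ("dataset/keywordSet/keyword", "subject"),
]


def to_dublin_core(eml_xml):
    """Return a dict mapping Dublin Core element names to lists of values."""
    root = ET.fromstring(eml_xml.strip())
    record = {}
    for path, dc_element in CROSSWALK:
        for node in root.findall(path):
            text = (node.text or "").strip()
            if text:
                # Dublin Core elements are repeatable, so values accumulate.
                record.setdefault(dc_element, []).append(text)
    return record


dc = to_dublin_core(EML_SAMPLE)
for element, values in dc.items():
    print(element, values)
```

Keeping the mapping in a table rather than in code means a second domain format (OLAC, for instance) would need only its own table, which matches the idea that the researcher works in the domain format and the generic record is derived from it.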

Metadata challenges
  Varied formats require specialized editing and storage software
  XML DTDs and schemas enforce syntactic correctness, but most content is in free-format data literals
  Meaning may be implicitly rather than explicitly encoded, visible only via local transformation tools
  Cross-references between elements cannot be relied on to be interpreted consistently

Promising approaches
  Ontologies
    Better support for automated processing
    Capture object relations at creation time through explicit properties, not just free-text values
    Global resource identifiers for interoperability
  Store independent statements
    Collectively encode meaning as well as structure
    Transform metadata into necessary output formats on demand
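A minimal sketch of what "independent statements" with global identifiers might look like, assuming a plain in-memory set of (subject, predicate, object) triples rather than any particular triple store. The `http://example.org/` namespace, the sample data, and the helper names `objects` and `citation_line` are hypothetical; only the DCMI Terms namespace is real.

```python
EX = "http://example.org/"          # hypothetical namespace for this example
DCT = "http://purl.org/dc/terms/"   # DCMI Metadata Terms namespace

# Each statement stands alone; relations are explicit properties that point
# at identifiers, not names buried in free text.
STATEMENTS = {
    (EX + "dataset/1", DCT + "title", "Stream nitrate concentrations"),
    (EX + "dataset/1", DCT + "creator", EX + "person/doe"),
    (EX + "person/doe", EX + "name", "J. Doe"),
    (EX + "person/doe", EX + "memberOf", EX + "lab/ecology"),
}


def objects(store, subject, predicate):
    """All objects of statements matching the given subject and predicate."""
    return sorted(o for s, p, o in store if s == subject and p == predicate)


def citation_line(store, dataset):
    """One output format, produced on demand by following the creator links."""
    title = objects(store, dataset, DCT + "title")[0]
    names = [
        objects(store, person, EX + "name")[0]
        for person in objects(store, dataset, DCT + "creator")
    ]
    return f"{', '.join(names)}. {title}."


line = citation_line(STATEMENTS, EX + "dataset/1")
print(line)
```

Because the creator is an identifier and not a string, correcting the person's name in one statement changes every output generated afterwards, with no metadata record to re-edit.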

LiLaC Conceptual Framework
Library-Laboratory Collaboration for Research Data (NSF 0437603, SGER: Planning Information Infrastructure Through a New Library-Research Partnership)
  Level 1: Search Engines, Harvesters, and Inter-Repository Exchange
    Search engines, OAI harvesters, Semantic Web initiatives, library research portals, etc.
  Level 2: Domain and Institutional Repositories
    GOLD data
    GOLD COPE
    OLAC metadata
    Institutional repositories (DSpace, etc.): data and metadata
    KNB data and metadata
    NBII
  Level 3: Staging Repository (library service)
    VCLA staging-level metadata
    USAEP staging-level metadata
  Level 4: Individual Laboratories / Researchers
    MIT, CLAL, project participants

VIVO: integrated discovery
  Provides consistent and highly visible information on researchers, facilities, grants, publications, and data
  Makes content accessible independently of Cornell's administrative structure
  Provides a Google-like search while adding rich contextual navigation
  Includes information on research in progress

VIVO as harvester and distributor
  Faculty updates
  Central course listings
  LDAP directory
  Grants, publications
  News Service RSS feed
  Events calendar
  Departmental seminars

Property editing

VIVO architecture
  Manage content in small units
  Integrate content from multiple sources
    Import from databases of record (OHR, OSP, events, news)
    Allow direct entry and update through faculty reporting
  Leverage semantic structure for display, filtering, and reporting, including areas of new development
    Standardizing on OWL data model
    OWLIM for dynamic updates of inferred classes
    SPARQL for more complex queries than SQL
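To make "inferred classes" concrete, here is a toy sketch of subclass inference in plain Python. It is not OWLIM or SPARQL and says nothing about VIVO's actual ontology: the class names, individuals, and functions are all hypothetical, and a real reasoner handles far more than a single-parent subclass chain.

```python
# Hypothetical class hierarchy: child class -> parent class.
SUBCLASS_OF = {
    "Professor": "FacultyMember",
    "FacultyMember": "Researcher",
    "Postdoc": "Researcher",
}

# Hypothetical asserted types, as they might arrive from a database of record.
ASSERTED_TYPES = {
    "alice": {"Professor"},
    "bob": {"Postdoc"},
    "lab1": {"Facility"},
}


def superclasses(cls):
    """Walk up the hierarchy, guarding against accidental cycles."""
    found = []
    while cls in SUBCLASS_OF:
        cls = SUBCLASS_OF[cls]
        if cls in found:
            break
        found.append(cls)
    return found


def inferred_types(individual):
    """Asserted types plus every superclass they imply."""
    types = set(ASSERTED_TYPES.get(individual, ()))
    for cls in list(types):
        types.update(superclasses(cls))
    return types


def instances_of(cls):
    """Roughly what a query for all members of a class would return
    once inferred types are available."""
    return sorted(i for i in ASSERTED_TYPES if cls in inferred_types(i))


print(instances_of("Researcher"))
```

Nobody asserted that alice is a Researcher; the membership follows from the hierarchy. This is the kind of question that is awkward to express in SQL over flat tables and natural to express against a semantic model.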

Institutional relationships
  Database exposed via web services on multiple websites
  Integrated content offered back to distributed units
  VIVO data model (Vitro) selected for new Cornell-wide faculty reporting database
    Collaborative development project starting up for the faculty self-reporting interface and workflow
  Colleges keep editorial control while gaining wider exposure for their faculty and research

Next steps
  Collaborate across institutions
  Explore interoperability and semantic integration
  Leverage the value of the grid through improved semantics as well as computation, workflow, and networking

Agents of Integration

Questions?

Citations
  Berners-Lee, Tim, James Hendler, and Ora Lassila (2001), The Semantic Web, Scientific American, May 2001.
  Cyberinfrastructure Council, National Science Foundation (2007), Cyberinfrastructure Vision for 21st Century Discovery. http://www.nsf.gov/pubs/2007/nsf0728/index.jsp
  Green, Ann G. and Myron Gutmann (2007), Building Partnerships Among Social Science Researchers, Institution-based Repositories and Domain Specific Data Archives. OCLC Systems and Services: International Digital Library Perspectives. 23: 35-53. http://hdl.handle.net/2027.42/41214
  Steinhart, Gail and Brian J. Lowe (2007), Data Curation and Distribution in Support of Cornell University's Upper Susquehanna Agricultural Ecology Program, presented at DigCCurr2007, April 19, 2007.
  Warner, Simeon, Jeroen Bekaert, Carl Lagoze, Xiaoming Liu, Sandy Payette, and Herbert Van de Sompel (2006), Pathways: Augmenting interoperability across scholarly repositories. arXiv:cs/0610031v1 [cs.DL]