re3data.org Registry of Research Data Repositories Peter Schirmbacher Humboldt-Universität zu Berlin CNI San Antonio, April 05. 2013
Outline Background Research Data Repositories Mission Status quo Partners
Background 1. Research data are valuable and ubiquitous New technologies facilitate data-intensive science 2. Broad discussion about the permanent access to research data Increasing requirements from funders to make data openly available 3. Growing demand for trustable and sustainable research data repositories
Reference: Michael Nentwich, 2003, Cyberscience Research in the Age of the Internet, S. 24
Enquete-Kommission of the German Parliament: Internet and digital Society Enquete-Kommission Internet und digitale Gesellschaft, 2012.
The Royal Society The Royal Society, 2012. Scientists should communicate the data they collect and the models they create, to allow free and open access, and in ways that are intelligible, assessable and usable for other specialists in the same or linked fields wherever they are in the world. Where data justify it, scientists should make them available in an appropriate data repository. Where possible, communication with a wider public
EC: Recommendations for the European Commission. (2012). Commission Recommendation on access to and preservation of scientific information. C(2012) 4890 final. Retrieved from http://ec.europa.eu/research/science-society/document_library/pdf_06/recommendation-accessand-preservation-scientific-information_en.pdf
Research-Data-Repositories EC: ICT infrastructures for e-science The landscape of data repositories across Europe is fairly heterogeneous, but there is a solid basis to develop a coherent strategy to overcome the fragmentation and enable research communities to better manage, use, European Commission. (2009). ICT infrastructures for e-science. Communication from the Commission to the European Parliament, share and preserve data. the Council, the European Economic and Social Committee and the Committee of the Regions. COM(2009) 108 final. Retrieved from http://eur-lex.europa.eu/lexuriserv/lexuriserv.do?uri=com:2009:0108:fin:en:pdf
Vision 2030 Researchers and practitioners from any discipline are able to find, access and process the data they need. They can be confident in their ability to use and understand data, and they can evaluate the degree to which that data can be trusted. Producers of data benefit from opening it to broad access, and prefer to deposit their data with confidence in reliable repositories. A framework of repositories is guided by international standards, to High Level Expert Group on Scientific Data, 2010. ensure they are trustworthy.
Key Drivers for the Vision 2030 Incentives for scholars Training the researcher and users Building the infrastructure The three main challenges in developing an ecosystem of data repositories are (1) gaps in the present data infrastructure and (2) connectivity issues (between the workflow of researchers and the institutional data infrastructure and between institutional and national data infrastructures) and (3) long-term financial basis. Funding the infrastructure Van der Graaf, M., & Waaijers, L. 2011.
The Research Data Repositories Landscape funders journals research data repositories scientists universities and research labs RRZE Icon Set (CC: BY- SA)
The Research Data Repositories Landscape funders journals Where can I store my data? research data repositories scientists universities and research labs RRZE Icon Set (CC: BY- SA)
The Research Data Repositories Landscape funders Where can I store my data? research data repositories journals Should we offer repositories for all disciplines? scientists universities and research labs RRZE Icon Set (CC: BY- SA)
The Research Data Repositories Landscape Underlying data must be accessible! funders Where can I store my data? research data repositories journals Should we offer repositories for all disciplines? scientists universities and research labs RRZE Icon Set (CC: BY- SA)
The Research Data Repositories Landscape How is it to organize? Underlying data must be accessible! funders Where can I store my data? research data repositories journals Should we offer repositories for all disciplines? scientists universities and research labs RRZE Icon Set (CC: BY- SA)
The Research Data Repositories Landscape How is it to organize? Underlying data must be accessible! funders Where can I store my data? research data repositories journals Should we offer repositories for all disciplines? Where can I find data? scientists universities and research labs RRZE Icon Set (CC: BY- SA)
The Research Data Repositories Landscape Investigators are expected to share their data! How is it to organize? Underlying data must be accessible! funders Where can I store my data? research data repositories journals Should we offer repositories for all disciplines? Where can I find data? scientists universities and research labs RRZE Icon Set (CC: BY- SA)
Research Data Repositories GEO, http://www.ncbi.nlm.nih.gov/ geo/ PANGAEA, http://www.pangaea.de
Research Data Repositories SDDB, http:// www.scientificdrilling.org BDPP, http://www.digitalpantheon.ch
Research Data Repositories PURR, http:// research.hub.purdue.edu Open Data LMU, http://data.ub.uni-muenchen.de
Mission re3data.org will be a global registry of research data repositories will cover research data repositories from all academic disciplines will help researchers, funding bodies, publishers and scholarly institutions to find research data repositories aims to promote a culture of sharing, increased access and better visibility of research data
General Workflow
re3data.org: Aspects
re3data.org: Vocabulary Vierkant, P., Spier, S., Rücknagel, J., Gundlach, J., Fichtmüller, D., Pampel, H., Kindling, M., et al. (2012). Vocabulary for the Registration and Description of Research Data Repositories. Version 2.0. DOI:10.2312/re3.002
Indexing Indexing and reviewing 31 metadata elements 25 child elements 22 controlled vocabularies Icon set to easily grasp basic information Useful for institutions and researchers
plate
Results
ensive Dataset
ensive Dataset
Partners and Sustainability Berlin School of Library and Information Science Peter Schirmbacher (Principal Investigator), Maxi Kindling, Paul Vierkant, Jessika Rücknagel, Shaked Spier GFZ German Research Centre for Geosciences Roland Bertelmann ((Principal Investigator), Jens Klump, Heinz Pampel Karlsruhe Institute of Technology (KIT), KIT Library Frank Scholze ((Principal Investigator) Hans-Jürgen Goebelbecker, Jens Gundlach, All partners are actively involved in the German Initiative for Network Information (DINI). re3data.org is funded by the German Research Foundation DFG in the period 2012 to 2014
Thanks for your With the exception of all photos and graphics, this slides are licensed under the Creative Commons Attribution 3.0 Germany (CC BY 3.0) Licence.
Quotation of References Baker, M. (2012). Databases fight funding cuts. Nature, 489(7414), 19 19. doi:10.1038/489019a ELIXIR. (n.d.). The ELIXIR Strategy for Data Resources. Draft Report from Workpackage 2. The ELIXIR Preparatory Phase. Retrieved from http://www.elixir-europe.org/bcms/elixir/documents/reports/ ELIXIR Strategy for Data Resources report.pdf European Commission. (2009). ICT infrastructures for e-science. Communication from the Commission to the European Parliament, the Council, the European Economic and Social Committee and the Committee of the Regions. COM(2009) 108 final. Retrieved from http://eur-lex.europa.eu/ LexUriServ/LexUriServ.do?uri=COM:2009:0108:FIN:EN:PDF European Commission. (n.d.). Main references to open access in the European Commission s proposals for Horizon 2020. Retrieved from http://ec.europa.eu/research/science-society/ document_library/pdf_06/open-access-in-horizon-2020_en.pdf High Level Expert Group on Scientific Data. (2010). Riding the wave. How Europe can gain from the rising tide of scientific data. Retrieved from http://cordis.europa.eu/fp7/ict/e-infrastructure/docs/ hlg-sdi-report.pdf Pampel, H., Goebelbecker, H.-J., & Vierkant, P. (2012). re3data.org: Aufbau eines Verzeichnisses von Forschungsdaten-Repositorien. Eine Werkstattbericht. (Forthcoming). WissKom2012. Jülich: Forschungszentrum Jülich. Van der Graaf, M., & Waaijers, L. (2011). A Surfboard for Riding the Wave. Towards a four country action programme on research data. Retrieved from http://www.knowledge-exchange.info/admin/ Public/DWSDownload.aspx?File=/Files/Filer/downloads/Primary+Research+Data/Surfboard+for +Riding+the+Wave/KE_Surfboard_Riding_the_Wave_Screen.pdf Vierkant, P., Spier, S., Rücknagel, J., Gundlach, J., Fichtmüller, D., Pampel, H., Kindling, M., et al. (2012). Vocabulary for the Registration and Description of Research Data Repositories. Version 2.0. DOI:10.2312/re3.002