irods - An Overview Jason Executive Director, irods Consortium CS Department of Computer Science, AGH Kraków, Poland

Similar documents
Future plans for irods. John Constable Informatics Support Group

irods for Data Management and Archiving UGM 2018 Masilamani Subramanyam

Microsoft SharePoint Server 2013 Plan, Configure & Manage

Richard Marciano Alexandra Chassanoff David Pcolar Bing Zhu Chien-Yi Hu. March 24, 2010

Cyber Defense Maturity Scorecard DEFINING CYBERSECURITY MATURITY ACROSS KEY DOMAINS

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours

Advanced Solutions of Microsoft SharePoint 2013

irods 4.0 and Beyond Presented at the irods & DDN User Group Meeting 2014

irods workflows for the data management in the EUDAT pan-european infrastructure

EUDAT - Open Data Services for Research

July SNIA Technology Affiliate Membership Overview

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

Advanced Solutions of Microsoft SharePoint Server 2013

I D C T E C H N O L O G Y S P O T L I G H T. V i r t u a l and Cloud D a t a Center Management

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

irods and Objectstorage UGM 2016, Chapel Hill / Othmar Weber, Bayer Business Services / v0.2

Cloud Computing the VMware Perspective. Bogomil Balkansky Product Marketing

Fusion Registry 9 SDMX Data and Metadata Management System

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

TECHNICAL OVERVIEW irods Technical Overview 2016 edition RCI_iROD_Report_final2.indd 1-2 5/26/16 9:21 AM

Automating Elasticity. March 2018

Policy Based Distributed Data Management Systems

Inventory (input to ECOMP and ONAP Roadmaps)

Virtustream Managed Services Drive value from technology investments through IT management solutions. Tim Calahan, Manager Managed Services

Microsoft Core Solutions of Microsoft SharePoint Server 2013

Electronic Records Archives: Philadelphia Federal Executive Board

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

Simplifying Collaboration in the Cloud

ORCID: A simple basis for digital data governance

Data Center Management and Automation Strategic Briefing

The International Journal of Digital Curation Issue 1, Volume

20331B: Core Solutions of Microsoft SharePoint Server 2013

Creating a Corporate Taxonomy. Internet Librarian November 2001 Betsy Farr Cogliano

CIAM: Need for Identity Governance & Assurance. Yash Prakash VP of Products

By Julián Fernández-Campón Solutions Maximizing storage Storage Anywhere

SOLUTION ARCHITECTURE AND TECHNICAL OVERVIEW. Decentralized platform for coordination and administration of healthcare and benefits

JBoss DNA. Randall Hauch Principal Software Engineer JBoss Data Services

Accelerate Your Enterprise Private Cloud Initiative

DOCAVE ONLINE. Your Cloud. Our SaaS. A Powerful Combination. Online Services. Technical Overview ADMINISTRATION BACKUP & RESTORE

Planning and Administering SharePoint 2016

EUDAT- Towards a Global Collaborative Data Infrastructure

NorStore. a national infrastructure for scientific data. Andreas O Jaunsen UNINETT Sigma as

Unity and Interoperability Among Decentralized Systems. Chris Gebhardt. The InfoCentral Project

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

CERT Symposium: Cyber Security Incident Management for Health Information Exchanges

Global Reference Architecture: Overview of National Standards. Michael Jacobson, SEARCH Diane Graski, NCSC Oct. 3, 2013 Arizona ewarrants

2 The IBM Data Governance Unified Process

Technical Overview. Access control lists define the users, groups, and roles that can access content as well as the operations that can be performed.

Building Open Source IoT Ecosystems. November 2017

The iplant Data Commons

Welcome to Islandora Camp UK. London, May 7-9, 2014

Data Curation Handbook Steps

STATE BROADBAND ACTION PLAN MAY 2015 Nevada Economic Development Conference PREPARED BY CONNECT NEVADA AND THE NEVADA BROADBAND TASK FORCE

COURSE OUTLINE MOC : PLANNING AND ADMINISTERING SHAREPOINT 2016

1Z0-560 Oracle Unified Business Process Management Suite 11g Essentials

Best Practices for Cloud Security at Scale. Phil Rodrigues Security Solutions Architect Amazon Web Services, ANZ

Sentinet for Windows Azure VERSION 2.2

IBM Advantage: IBM Watson Compare and Comply Element Classification

Evaluating Encryption Products

Metadata and the Rise of Big Data Governance: Active Open Source Initiatives. October 23, 2018

Enterprise Data Architect

NIST Public Working Group on Federated Cloud (PWGFC) IEEE P2302 Intercloud Kickoff

Implementing the Army Net Centric Data Strategy in a Service Oriented Environment

ISAO SO Product Outline

Developing a social science data platform. Ron Dekker Director CESSDA

Grid Architectural Models

ODPi and Data Governance Free Your MetaData! October 10, 2018

EUDAT Training 2 nd EUDAT Conference, Rome October 28 th Introduction, Vision and Architecture. Giuseppe Fiameni CINECA Rob Baxter EPCC EUDAT members

NIEM. National. Information. Exchange Model. NIEM and Information Exchanges. <Insert Picture Here> Deploy. Requirements. Model Data.

UTAP UNIFIED TEST AUTOMATION PLATFORM

High Availability Distributed (Micro-)services. Clemens Vasters Microsoft

ehealth in Southwestern Ontario

Active Directory Services with Windows Server

Standards Readiness Criteria. Tier 2

AUTOMATING IBM SPECTRUM SCALE CLUSTER BUILDS IN AWS PROOF OF CONCEPT

Oregon State Police. Information Technology. Honor Loyalty. Pride Dedication

FeduShare Update. AuthNZ the SAML way for VOs

About the DISA Cloud Playbook

Course : Planning and Administering SharePoint 2016

Data Center 3.0: Transforming the Data Center via the Network

einfrastructures Concertation Event

P a g e 1. Teknologisk Institut. Online kursus k SysAdmin & DevOps Collection

21ST century enterprise. HCL Technologies Presents. Roadmap for Data Center Transformation

ONUG SDN Federation/Operability

DreamFactory Security Guide

Privilege Security & Next-Generation Technology. Morey J. Haber Chief Technology Officer

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

strategy IT Str a 2020 tegy

MCSA Windows Server 2012

Striving for efficiency

Secure, scalable storage made simple. OEM Storage Portfolio

GÉANT Community Programme

TRANSFORMING TO IT-AS-A- SERVICE

CLOUD GOVERNANCE SPECIALIST Certification

Merging Enterprise Applications with Docker* Container Technology

National Research Data Cloud

Introduction to Microsoft Flow

July 13, Via to RE: International Internet Policy Priorities [Docket No ]

Transcription:

irods - An Overview Jason Coposky @jason_coposky Executive Director, irods Consortium CS3 2018 Department of Computer Science, AGH Kraków, Poland 1

What is irods irods is Distributed Open source Metadata Driven Data Centric A flexible framework for the abstraction of infrastructure 2

irods as the Integration Layer 3

Data Virtualization Combine various distributed storage technologies into a Unified Namespace Existing file systems Cloud storage On premises object storage Archival storage systems irods provides a logical view into the complex physical representation of your data, distributed geographically, and at scale. 4

Data Virtualization Logical Path Physical Paths(s) 5

Data Virtualization $ ils -L /tempzone/home/rods/thefile.txt rods 0 demoresc 29606 2016-10-05.09:05 & thefile.txt generic /var/lib/irods/irods/vault/home/rods/thefile.txt rods 1 repl;u2 29606 2016-10-05.09:06 & thefile.txt generic /tmp/u2vault/home/rods/thefile.txt rods 2 repl;u1 29606 2016-10-05.09:06 & thefile.txt generic /tmp/u1vault/home/rods/thefile.txt Logical Path Physical Paths /tempzone/home/rods/thefile.txt /var/lib/irods/irods/vault/home/rods/thefile.txt /tmp/u2vault/home/rods/thefile.txt /tmp/u1vault/home/rods/thefile.txt 6

Data Discovery Attach metadata to any first class entity within the irods Zone Data Objects Collections Users Storage Resources The Namespace irods provides automated and user-provided metadata which makes your data and infrastructure more discoverable, operational and valuable. 7

Metadata Everywhere 8

Workflow Automation Integrated scripting language which is triggered by any operation within the framework Authentication Storage Access Database Interaction Network Activity Extensible RPC API The irods rule engine provides the ability to capture real world policy as computer actionable rules which may allow, deny, or add context to operations within the system. 9

Dynamic Policy Enforcement The irods rule may: restrict access log for audit and reporting provide additional context send a notification 10

Dynamic Policy Enforcement A single API call expands to many plugin operations all of which may invoke policy enforcement Plugin Interfaces: Authentication Database Storage Network Rule Engine Microservice RPC API 11

Provenance and Reporting 12

Secure Collaboration irods allows for collaboration across administrative boundaries after deployment No need for common infrastructure No need for shared funding Affords temporary collaborations irods provides the ability to federate namespaces across organizations without pre-coordinated funding or effort. 13

irods Service Interface 14

Federation - Shared Data and Services 15

Institutional repositories As data matures and reaches a broader community, data management policy must also evolve to meet these additional requirements. 16

irods Use Cases 17

On Premises to Any Cloud Infrastructure 18

Data to Compute Use Case 19

Compute to Data Use Case 20

The Wellcome Trust Sanger Institute 21

Sanger - Replication Data preferentially placed on resource servers in the green data center (fallback to red) Data replicated to the other room. Checksums applied Green and red centers both used for read access. 22

Sanger - Metadata Example metadata attributes Users query and access data from local compute clusters Users access irods locally via the command line interface attribute: library attribute: total_reads attribute: type attribute: lane attribute: is_paired_read attribute: study_accession_number attribute: library_id attribute: sample_accession_number attribute: sample_public_name attribute: manual_qc attribute: tag attribute: sample_common_name attribute: md5 attribute: tag_index attribute: study_title attribute: study_id attribute: reference attribute: sample attribute: target attribute: sample_id attribute: id_run attribute: study attribute: alignment 23

Sanger - Federation 24

University College London UK sponsored research requirements: last date of access request plus 10 years irods tiers data across storage technologies Enables federated access from other centers 25

irods Software Roadmap 26

The Roadmap irods 4.3 Packaged irods Capabilities Multipart Transfer Cacheless Object Storage Query Arrow Metadata Templates Filesystem Integration 27

The Roadmap - irods 4.3 Hardening Release Logging irods Monitor Delegate Checksum to Storage Plugins 28

Packaged irods Capabilities 29

Multipart Transfer Provide reliable transfer with restart - object parts tracked in the catalog Later versions will provide fast, first class access to object storage 30

irods 4.2 and Beyond - The Scatter 31

Next Generation Query Interface 32

irods 4.3 and Beyond - The Gather 33

Shared Data - Shared Infrastructure 34

Metadata Templates 35

irods Consortium Business Model 36

The irods Consortium Our Mission Write Good Software Grow the Community Show Value to our Membership 37

Why Open Source Transparency Quality Persistence Vendor Neutrality Customization Community Try before you buy 38

Our Membership 39

Our Business Model Consortium Membership Participate in roadmap development Participate in consortium governance Direct support from the team Tier 3 support agreements Discount for support agreements 40

Our Business Model Service & Support Contracts Billed hourly Implement Proofs of Concept Custom rule and plugin development Expand to new use cases Discounted rate for consortium members 41

Membership Committees Technology Working Group Monthly web conferences Build irods Roadmap Propose new technology direction Propose inclusion of new software Propose new working groups 42

Membership Committees Planning Committee Monthly web conferences Discuss consortium policy and business practices Propose conferences and workshops Vote on inclusion of new software Vote on roadmap 43

Membership Committees Executive Board Meets twice yearly Votes on consortium budget and bylaw changes Determines the thematic priorities of the consortium Additional working groups are formed as required 44

Our Consortium Participation 45