Galaxy a community driven platform for accessible, transparent, and reproducible data science

Size: px
Start display at page:

Download "Galaxy a community driven platform for accessible, transparent, and reproducible data science"

Transcription

1 Galaxy a community driven platform for accessible, transparent, and reproducible data science / #usegalaxy

2 A continuing crisis in genomics research: reproducibility

3 What is reproducibility? (for computational analyses) Reproducibility is not provenance, reusability/ generalizability, or correctness Reproducibility means that an analysis is described/captured in sufficient detail that it can be precisely reproduced (given the data) Yet most published analyses are not reproducible (see e.g. Ioannadis et al /18 microarray experiments reproducible; Nekrutenko and Taylor 2012, 7/50 resequencing experiments reproducible) Missing software, versions, parameters, data

4 Galaxy: accessible analysis system

5 A free (for everyone) web service integrating a wealth of tools, compute resources, terabytes of reference data and permanent storage Open source software that makes integrating your own tools and data and customizing for your own site simple An open extensible platform for sharing tools, datatypes, workflows,...

6 Galaxy s ideological goals: How best can data intensive methods be accessible to scientists? How best to facilitate transparent communication of computational analyses? How best to ensure that analyses are reproducible?

7 Galaxy s practical goals: How to arm researchers with access to powerful compute and latest tools How to build a community of tool developers How to run Galaxy on any HPC

8 Describe analysis tool behavior abstractly Workflow system for complex analysis, constructed explicitly or automatically Analysis environment automatically and transparently tracks details Pervasive sharing, and publication of documents with integrated analysis

9 Visualization and visual analytics

10 To grow Galaxy we needed to actively engage the community Users: support the tools and workflows they want, engage in outreach, training, infrastructure to support communication (e.g. biostar) Developers: make it easy to integrate new tools, to run and test locally, be as flexible as possible, avoid anything specific to a given scientific domain Great value in being able to run anywhere, not building a gateway but a framework to build gateways

11 The Galaxy ecosystem

12 More than 70 known public Galaxy servers 15+ general servers Domain specific servers including: Ballaxy for structure based computational biology, Cistrome for regulatory sequence analysis, Genomic Hyperbrowser: statistical integration of genomic data, GigaGalaxy: integrating workflows published in GigaScience, Pathogen Portal:comparative analysis of host response to pathogens,... Dozens of large scale private Galaxy instances

13

14

15

16

17 Ways to use Galaxy The public web service at Install locally with many compute environments Deploy on a cloud using Cloudman Atmosphere

18 Galaxy can scale: for example Galaxy main Dedicated resources Shared resources SDSC, San Diego Galaxy Cluster 256 cores 2 TB memory Rodeo 128 cores 1 TB memory Corral/Stockyard 20 PB disk Stampede 462,462 cores 205 TB memory PSC, Pittsburgh Blacklight 4,096 cores 32 TB memory Trestles 10,368 cores 20.7 TB memory TACC Austin Nate Coraor

19 Bringing it all together: automate all the things! Unified ansible playbook for Galaxy main, cloud, and local deployments

20 Collaboration among Galaxy instances

21 Galaxies on private clouds Galaxies on public clouds Galaxy Tool Shed Private Tool Sheds private Galaxy installations Greg von Kuster, Dave Bouvier

22 Repositories are owned by the contributor, can contain tools, workflows, etc. Backed by version control, a complete version history is retained for everything that passes through the toolshed Galaxy instance admins can install tools directly from the toolshed using only a web UI Support for recipes for installing the underlying software that tools depend on (also versioned)

23

24 Command line tools to support tool developers Command-line tools to aid development. Test tools quickly without worrying about configuration files. Check tools for common bugs and best practices. Optimized publishing to the ToolShed. Testbed for new dependency management - Homebrew and Homebrew-science

25 git[hub] centric development workflow

26 Tool citations, credit and incentivization Embed DOIs in Tool Configuration, Galaxy resolves and provides a list of citations, with links, which can exported for reference managers

27 Summary Galaxy is an (obsessively) open framework for making data analysis accessible and reproducible Nearly everything in Galaxy is pluggable, allowing it to customized for myriad purposes By supporting and leveraging developers the Galaxy community can collectively keep up with rapid changes in available tools

28 The Core Galaxy Team Engineering Enis Afgan Dannon Baker Dan Blankenberg Dave Bouvier Nate Coraor Martin Čech John Chilton Carl Eberhard Sam Guerler Nitesh Turaga Support and outreach Custodians Dave Clements Jennifer Jackson James Taylor Anton Nekrutenko Jeremy Goecks Supported by the NHGRI (HG005542, HG004909, HG005133, HG006620), NSF (DBI ), Penn State University, Johns Hopkins University, and the Pennsylvania Department of Public Health

29 Extended team and other contributors Björn Grüning Uni Freiburg Peter Cock TJHI Kyle Ellrott UCSC Eric Rasche CPT Nicola Soranzo TGAC Brad Chapman HSPH Nuwan Goonasekera VeRSI Yousef Kowsar VLSCI And many others who have contributed to the main Galaxy code, tools to the ToolShed, participated in discussions, attended the Galaxy conferences,

Galaxy. Data intensive biology for everyone. / #usegalaxy

Galaxy. Data intensive biology for everyone. / #usegalaxy Galaxy Data intensive biology for everyone. www.galaxyproject.org @jxtx / #usegalaxy Engineering Dannon Baker Dan Blankenberg Dave Bouvier Nate Coraor Carl Eberhard Jeremy Goecks Sam Guerler Greg von Kuster

More information

Galaxy. Data intensive biology for everyone. / #usegalaxy

Galaxy. Data intensive biology for everyone. / #usegalaxy Galaxy Data intensive biology for everyone. www.galaxyproject.org @jxtx / #usegalaxy High-Throughput v I SEQUENCING! High-throughput sequencing is transformative Resequencing De novo genome sequencing

More information

Web-Based Visualization and Visual Analysis for High-Throughput Genomics. Jeremy Goecks! Computational Biology Institute

Web-Based Visualization and Visual Analysis for High-Throughput Genomics. Jeremy Goecks! Computational Biology Institute Web-Based Visualization and Visual Analysis for High-Throughput Genomics with Galaxy! Jeremy Goecks! Computational Biology Institute Topics Galaxy Visualization framework Large-scale visualization Integrated

More information

Galaxy. Daniel Blankenberg The Galaxy Team

Galaxy. Daniel Blankenberg The Galaxy Team Galaxy Daniel Blankenberg The Galaxy Team http://galaxyproject.org Overview What is Galaxy? What you can do in Galaxy analysis interface, tools and datasources data libraries workflows visualization sharing

More information

Galaxy Project Update

Galaxy Project Update Galaxy Project Update 2013 GMOD Meeting Cambridge, UK Dave Clements Emory University Agenda Project Introduction Project Update What is Galaxy? An open, web-based platform for accessible, reproducible,

More information

Accessible, Transparent and Reproducible Analysis with Galaxy

Accessible, Transparent and Reproducible Analysis with Galaxy Accessible, Transparent and Reproducible Analysis with Galaxy Application of Next Generation Sequencing Technologies for Whole Transcriptome and Genome Analysis ABRF 2013 Saturday, March 2, 2013 Palm Springs,

More information

Galaxy Community Update

Galaxy Community Update Galaxy Community Update PAG XXVI January 17, 2018 San Diego, California, United States Dave Clements Johns Hopkins University Galaxy Team / Galaxy Community #usegalaxy @galaxyproject bit.ly/gxy-pag2018-upd

More information

Introduction to Galaxy

Introduction to Galaxy Introduction to Galaxy Saint Louis University St. Louis, Missouri April 30, 2013 Dave Clements, Emory University http://galaxyproject.org/ Agenda 9:00 Welcome 9:20 Basic Analysis with Galaxy 10:30 Basic

More information

Using Galaxy: RNA-seq

Using Galaxy: RNA-seq Using Galaxy: RNA-seq Stanford University September 23, 2014 Jennifer Hillman-Jackson Galaxy Team Penn State University http://galaxyproject.org/ The Agenda Introduction RNA-seq Example - Data Prep: QC

More information

Reproducible & Transparent Computational Science with Galaxy. Jeremy Goecks The Galaxy Team

Reproducible & Transparent Computational Science with Galaxy. Jeremy Goecks The Galaxy Team Reproducible & Transparent Computational Science with Galaxy Jeremy Goecks The Galaxy Team 1 Doing Good Science Previous talks: performing an analysis setting up and scaling Galaxy adding tools libraries

More information

Get your own Galaxy within minutes

Get your own Galaxy within minutes Get your own Galaxy within minutes Enis Afgan, Nitesh Turaga, Nuwan Goonasekera GCC 2016 Bloomington, IN Access slides from bit.ly/gcc2016_usecloud Today s agenda Introduction Hands on, part 1 Launch your

More information

GCC 2017 Community Update. Dan Blankenberg and Jeremy Goecks

GCC 2017 Community Update. Dan Blankenberg and Jeremy Goecks GCC 2017 Community Update Dan Blankenberg and Jeremy Goecks The Past Year has been Amazing Fantastic advances in many areas of Galaxy: so many new features, from user interface to workflow engine to cloud

More information

Adding Transparency and Automation into the Galaxy Tool Installation Process

Adding Transparency and Automation into the Galaxy Tool Installation Process Adding Transparency and Automation into the Galaxy Tool Installation Process Enis Afgan Johns Hopkins University Galaxy Team Galaxy Admins Web Meetup August 20, 2015. Outline Installing tools in bulk (i.e.,

More information

The Data exacell DXC. J. Ray Scott DXC PI May 17, 2016

The Data exacell DXC. J. Ray Scott DXC PI May 17, 2016 The Data exacell DXC J. Ray Scott DXC PI May 17, 2016 DXC Leadership Mike Levine Co-Scientific Director Co-PI Nick Nystrom Senior Director of Research Co-PI Ralph Roskies Co-Scientific Director Co-PI Robin

More information

DNA Sequence Bioinformatics Analysis with the Galaxy Platform

DNA Sequence Bioinformatics Analysis with the Galaxy Platform DNA Sequence Bioinformatics Analysis with the Galaxy Platform University of São Paulo, Brazil 28 July - 1 August 2014! Dave Clements Johns Hopkins University Robson Francisco de Souza University of São

More information

The GISandbox: A Science Gateway For Geospatial Computing. Davide Del Vento, Eric Shook, Andrea Zonca

The GISandbox: A Science Gateway For Geospatial Computing. Davide Del Vento, Eric Shook, Andrea Zonca The GISandbox: A Science Gateway For Geospatial Computing Davide Del Vento, Eric Shook, Andrea Zonca 1 Paleoscape Model and Human Origins Simulate Climate and Vegetation during the Last Glacial Maximum

More information

The Future of Galaxy. Nate Coraor galaxyproject.org

The Future of Galaxy. Nate Coraor galaxyproject.org The Future of Galaxy Nate Coraor galaxyproject.org Galaxy is... A framework for scientists Enables usage of complicated command line tools Deals with file formats as transparently as possible Provides

More information

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research Dr Paul Calleja Director of Research Computing University of Cambridge Global leader in science & technology

More information

Using Galaxy to provide a NGS Analysis Platform

Using Galaxy to provide a NGS Analysis Platform 11/15/11 Using Galaxy to provide a NGS Analysis Platform Friedrich Miescher Institute - part of the Novartis Research Foundation - affiliated institute of Basel University - member of Swiss Institute of

More information

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel National Center for Supercomputing Applications University of Illinois

More information

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development Jeremy Fischer Indiana University 9 September 2014 Citation: Fischer, J.L. 2014. ACCI Recommendations on Long Term

More information

The Data Exacell (DXC): Data Infrastructure Building Blocks for Integrating Analytics with Data Management

The Data Exacell (DXC): Data Infrastructure Building Blocks for Integrating Analytics with Data Management The Data Exacell (DXC): Data Infrastructure Building Blocks for Integrating Analytics with Data Management Nick Nystrom, Michael J. Levine, Ralph Roskies, and J Ray Scott Pittsburgh Supercomputing Center

More information

Science-as-a-Service

Science-as-a-Service Science-as-a-Service The iplant Foundation Rion Dooley Edwin Skidmore Dan Stanzione Steve Terry Matthew Vaughn Outline Why, why, why! When duct tape isn t enough Building an API for the web Core services

More information

Building the Genomics Virtual Lab

Building the Genomics Virtual Lab Building the Genomics Virtual Lab Ron Horst, Uni QLD Analysis and visualisation platform Community Resources Australian Research Cloud Agenda Objectives Scalable, on demand Latest tools, reproducible Build

More information

CloudMan cloud clusters for everyone

CloudMan cloud clusters for everyone CloudMan cloud clusters for everyone Enis Afgan usecloudman.org This is accessibility! But only sometimes So, there are alternatives BUT WHAT IF YOU WANT YOUR OWN, QUICKLY The big picture A. Users in different

More information

Galaxy Pasteur. Patchwork of experiences and improvements. Olivia Doppelt-Azeroual, Sophie Créno et Fabien Mareuil CIB, Institut Pasteur, Paris

Galaxy Pasteur. Patchwork of experiences and improvements. Olivia Doppelt-Azeroual, Sophie Créno et Fabien Mareuil CIB, Institut Pasteur, Paris Galaxy Pasteur Patchwork of experiences and improvements Olivia Doppelt-Azeroual, Sophie Créno et Fabien Mareuil CIB, Institut Pasteur, Paris Summary Part 0 : Galaxy Pasteur Part 1 : Adaptations to the

More information

Cyberinfrastructure!

Cyberinfrastructure! Cyberinfrastructure! David Minor! UC San Diego Libraries! San Diego Supercomputer Center! January 4, 2012! Cyberinfrastructure:! History! Definitions! Examples! History! mid-1990s:! High performance computing

More information

Nuts and Bolts: Lessons Learned in Creating a User-Friendly FOSS Cluster Configuration Tool

Nuts and Bolts: Lessons Learned in Creating a User-Friendly FOSS Cluster Configuration Tool Nuts and Bolts: Lessons Learned in Creating a User-Friendly FOSS Cluster Configuration Tool Presenters: Barbara Hallock, Indiana University, bahalloc@iu.edu Resa Reynolds, Cornell University, rda1@cornell.edu

More information

Secure, scalable storage made simple. OEM Storage Portfolio

Secure, scalable storage made simple. OEM Storage Portfolio Secure, scalable storage made simple. OEM Storage Portfolio P Data is the currency of the digital economy. It s the new oil and the lifeblood of your organization. But, how to manage it all? How can you

More information

NUIT Tech Talk Topics in Research Computing: XSEDE and Northwestern University Campus Champions

NUIT Tech Talk Topics in Research Computing: XSEDE and Northwestern University Campus Champions NUIT Tech Talk Topics in Research Computing: XSEDE and Northwestern University Campus Champions Pradeep Sivakumar pradeep-sivakumar@northwestern.edu Contents What is XSEDE? Introduction Who uses XSEDE?

More information

ArcGIS Enterprise: Architecture & Deployment. Anthony Myers

ArcGIS Enterprise: Architecture & Deployment. Anthony Myers ArcGIS Enterprise: Architecture & Deployment Anthony Myers 1 2 3 4 5 Web GIS Overview of ArcGIS Enterprise Federation & Hosted Server Deployment Patterns Implementation 1 Web GIS ArcGIS Enabling GIS for

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information

Big Data 2015: Sponsor and Participants Research Event ""

Big Data 2015: Sponsor and Participants Research Event Big Data 2015: Sponsor and Participants Research Event "" Center for Large-scale Data Systems Research, CLDS! San Diego Supercomputer Center! UC San Diego! Agenda" Welcome and introductions! SDSC: Who

More information

Datasheet. Only Workspaces delivers the features users want and the control that IT needs.

Datasheet. Only Workspaces delivers the features users want and the control that IT needs. Datasheet Secure SECURE Enterprise ENTERPRISE File FILE Sync, SYNC, Sharing SHARING and AND Content CONTENT Collaboration COLLABORATION BlackBerry Workspaces makes enterprises more mobile and collaborative,

More information

HPC Capabilities at Research Intensive Universities

HPC Capabilities at Research Intensive Universities HPC Capabilities at Research Intensive Universities Purushotham (Puri) V. Bangalore Department of Computer and Information Sciences and UAB IT Research Computing UAB HPC Resources 24 nodes (192 cores)

More information

The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists

The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists Internet2 Network Infrastructure for the Life Sciences Focused Technical Workshop. Berkeley, CA July 17-18, 2013

More information

RENKU - Reproduce, Reuse, Recycle Research. Rok Roškar and the SDSC Renku team

RENKU - Reproduce, Reuse, Recycle Research. Rok Roškar and the SDSC Renku team RENKU - Reproduce, Reuse, Recycle Research Rok Roškar and the SDSC Renku team Renku-Reana workshop @ CERN 26.06.2018 Goals of Renku 1. Provide the means to create reproducible data science 2. Facilitate

More information

Arkadin helps you achieve more at work: The voice expert for Microsoft Skype for Business and Office 365 For Large Enterprises

Arkadin helps you achieve more at work: The voice expert for Microsoft Skype for Business and Office 365 For Large Enterprises Arkadin helps you achieve more at work: The voice expert for Microsoft Skype for Business and Office 365 For Large Enterprises Arkadin is the world s expert at bringing voice to Office 365. We understand

More information

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Unleash Your Data Center s Hidden Power September 16, 2014 Molly Rector CMO, EVP Product Management & WW Marketing

More information

Walkthrough OCCAM. Be on the lookout for this fellow: The callouts are ACTIONs for you to do!

Walkthrough OCCAM. Be on the lookout for this fellow: The callouts are ACTIONs for you to do! Walkthrough OCCAM Be on the lookout for this fellow: The callouts are ACTIONs for you to do! When you see the check mark, compare your work to the marked element Objectives In this presentation you ll

More information

Modernizing Healthcare IT for the Data-driven Cognitive Era Storage and Software-Defined Infrastructure

Modernizing Healthcare IT for the Data-driven Cognitive Era Storage and Software-Defined Infrastructure Modernizing Healthcare IT for the Data-driven Cognitive Era Storage and Software-Defined Infrastructure An IDC InfoBrief, Sponsored by IBM April 2018 Executive Summary Today s healthcare organizations

More information

Future of Enzo. Michael L. Norman James Bordner LCA/SDSC/UCSD

Future of Enzo. Michael L. Norman James Bordner LCA/SDSC/UCSD Future of Enzo Michael L. Norman James Bordner LCA/SDSC/UCSD SDSC Resources Data to Discovery Host SDNAP San Diego network access point for multiple 10 Gbs WANs ESNet, NSF TeraGrid, CENIC, Internet2, StarTap

More information

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill Introduction to FREE National Resources for Scientific Computing Dana Brunson Oklahoma State University High Performance Computing Center Jeff Pummill University of Arkansas High Peformance Computing Center

More information

HPC Cloud at SURFsara

HPC Cloud at SURFsara HPC Cloud at SURFsara Offering cloud as a service SURF Research Boot Camp 21st April 2016 Ander Astudillo Markus van Dijk What is cloud computing?

More information

ElastiCluster Automated provisioning of computational clusters in the cloud

ElastiCluster Automated provisioning of computational clusters in the cloud ElastiCluster Automated provisioning of computational clusters in the cloud Riccardo Murri (with contributions from Antonio Messina, Nicolas Bär, Sergio Maffioletti, and Sigve

More information

Simon Mercer Director, Health & Wellbeing Microsoft Corporation

Simon Mercer Director, Health & Wellbeing Microsoft Corporation Simon Mercer Director, Health & Wellbeing Microsoft Corporation An open-source library of reusable bioinformatics algorithms and functions built on the.net platform Proteomics Customer Challenges Dependency

More information

JANUARY Migrating standalone ArcGIS Server to ArcGIS Enterprise

JANUARY Migrating standalone ArcGIS Server to ArcGIS Enterprise JANUARY 2018 Migrating standalone ArcGIS Server to ArcGIS Enterprise Copyright 2018 Esri All rights reserved. Printed in the United States of America. The information contained in this document is the

More information

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations October 29, 2014 Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations XSEDE for HPC Users What is XSEDE? XSEDE mo/va/on and goals XSEDE Resources XSEDE for HPC Users: Before

More information

Multi-Cloud and Application Centric Modeling, Deployment and Management with Cisco CloudCenter (CliQr)

Multi-Cloud and Application Centric Modeling, Deployment and Management with Cisco CloudCenter (CliQr) Multi-Cloud and Application Centric Modeling, Deployment and Management with Cisco CloudCenter (CliQr) Jeremy Oakey Senior Director, Technical Marketing and Integrations Agenda Introduction Architecture

More information

Enterprise Data Architect

Enterprise Data Architect Enterprise Data Architect Position Summary Farmer Mac maintains a considerable repository of financial data that spans over two decades. Farmer Mac is looking for a hands-on technologist and data architect

More information

Reflections on Three Decades in Internet Time

Reflections on Three Decades in Internet Time This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States of America License. Reflections on Three Decades in Internet Time Christine Borgman, Paul

More information

WVU RESEARCH COMPUTING INTRODUCTION. Introduction to WVU s Research Computing Services

WVU RESEARCH COMPUTING INTRODUCTION. Introduction to WVU s Research Computing Services WVU RESEARCH COMPUTING INTRODUCTION Introduction to WVU s Research Computing Services WHO ARE WE? Division of Information Technology Services Funded through WVU Research Corporation Provide centralized

More information

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team Isilon: Raising The Bar On Performance & Archive Use Cases John Har Solutions Product Manager Unstructured Data Storage Team What we ll cover in this session Isilon Overview Streaming workflows High ops/s

More information

irods at TACC: Secure Infrastructure for Open Science Chris Jordan

irods at TACC: Secure Infrastructure for Open Science Chris Jordan irods at TACC: Secure Infrastructure for Open Science Chris Jordan What is TACC? Texas Advanced Computing Center Cyberinfrastructure Resources for Open Science University of Texas System 9 Academic, 6

More information

Microsoft Office 365 for Business. Your office-on-the-go. Get more work done virtually anytime, anywhere, on any device.

Microsoft Office 365 for Business. Your office-on-the-go. Get more work done virtually anytime, anywhere, on any device. Microsoft Office 365 for Business Your office-on-the-go. Get more work done virtually anytime, anywhere, on any device. Unified Communications and Collaboration (UC&C) tools are becoming popular with modern

More information

EGI: Linking digital resources across Eastern Europe for European science and innovation

EGI: Linking digital resources across Eastern Europe for European science and innovation EGI- InSPIRE EGI: Linking digital resources across Eastern Europe for European science and innovation Steven Newhouse EGI.eu Director 12/19/12 EPE 2012 1 EGI European Over 35 countries Grid Secure sharing

More information

Galaxy workshop at the Winter School Igor Makunin

Galaxy workshop at the Winter School Igor Makunin Galaxy workshop at the Winter School 2016 Igor Makunin i.makunin@uq.edu.au Winter school, UQ, July 6, 2016 Plan Overview of the Genomics Virtual Lab Introduce Galaxy, a web based platform for analysis

More information

A curated Domain centric shared Docker registry linked to the Galaxy toolshed

A curated Domain centric shared Docker registry linked to the Galaxy toolshed A curated Domain centric shared Docker registry linked to the Galaxy toolshed François Moreews 1, Olivier Sallou 2, Yvan le Bras 2, Marie Grosjean 3, Cyril Monjeaud 2, Thomas Darde 4, Olivier Collin 2,

More information

A Robust, Flexible Platform for Expanding Your Storage without Limits

A Robust, Flexible Platform for Expanding Your Storage without Limits White Paper SUSE Enterprise A Robust, Flexible Platform for Expanding Your without Limits White Paper A Robust, Flexible Platform for Expanding Your without Limits Unlimited Scalability That s Cost-Effective

More information

Amadeus Technology Journey

Amadeus Technology Journey 265ced1609a17cf1a5979880a2ad364653895ae8 Amadeus Technology Journey A user driven Open Source roadmap experience Christophe Defayet Security & Communication Systems Amadeus in a few words Amadeus is a

More information

modencode Galaxy: Uniform ChIP-Seq Processing Tools for modencode and ENCODE Data

modencode Galaxy: Uniform ChIP-Seq Processing Tools for modencode and ENCODE Data modencode Galaxy: Uniform ChIP-Seq Processing Tools for modencode and ENCODE Data Quang M Trinh Ontario Institute for Cancer Research qtrinh@oicr.on.ca Outline Model Organism ENCyclopedia Of DNA Elements

More information

7 th International Digital Curation Conference December 2011

7 th International Digital Curation Conference December 2011 Golden Trail 1 Golden-Trail: Retrieving the Data History that Matters from a Comprehensive Provenance Repository Practice Paper Paolo Missier, Newcastle University, UK Bertram Ludäscher, Saumen Dey, Michael

More information

Services to Make Sense of Data. Patricia Cruse, Executive Director, DataCite Council of Science Editors San Diego May 2017

Services to Make Sense of Data. Patricia Cruse, Executive Director, DataCite Council of Science Editors San Diego May 2017 Services to Make Sense of Data Patricia Cruse, Executive Director, DataCite Council of Science Editors San Diego May 2017 How many journals make data sharing a requirement of publication? https://jordproject.wordpress.com/2013/07/05/going-back-to-basics-reusing-data/

More information

Scientific Workflow Tools. Daniel Crawl and Ilkay Altintas San Diego Supercomputer Center UC San Diego

Scientific Workflow Tools. Daniel Crawl and Ilkay Altintas San Diego Supercomputer Center UC San Diego Scientific Workflow Tools Daniel Crawl and Ilkay Altintas San Diego Supercomputer Center UC San Diego 1 escience Today Increasing number of Cyberinfrastructure (CI) technologies Data Repositories: Network

More information

UCT Application Development Lifecycle. UCT Business Applications

UCT Application Development Lifecycle. UCT Business Applications UCT Business Applications Page i Table of Contents Planning Phase... 1 Analysis Phase... 2 Design Phase... 3 Implementation Phase... 4 Software Development... 4 Product Testing... 5 Product Implementation...

More information

Informatica Data Quality Product Family

Informatica Data Quality Product Family Brochure Informatica Product Family Deliver the Right Capabilities at the Right Time to the Right Users Benefits Reduce risks by identifying, resolving, and preventing costly data problems Enhance IT productivity

More information

Reproducibility and FAIR Data in the Earth and Space Sciences

Reproducibility and FAIR Data in the Earth and Space Sciences Reproducibility and FAIR Data in the Earth and Space Sciences December 2017 Brooks Hanson Sr. VP, Publications, American Geophysical Union bhanson@agu.org Earth and Space Science is Essential for Society

More information

Cisco Unified Data Center Strategy

Cisco Unified Data Center Strategy Cisco Unified Data Center Strategy How can IT enable new business? Holger Müller Technical Solutions Architect, Cisco September 2014 My business is rapidly changing and I need the IT and new technologies

More information

From command-line bioinformatics to biogui

From command-line bioinformatics to biogui From command-line bioinformatics to biogui Markus Joppich 1 and Ralf Zimmer 1 1 Institute for Informatics, LFE Bioinformatik, LMU München, München, Germany Corresponding author: Markus Joppich 1 Email

More information

Introduction to Grid Computing

Introduction to Grid Computing Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able

More information

THE STATE OF CONTAINERS

THE STATE OF CONTAINERS THE STATE OF CONTAINERS Engines & Runtimes in RHEL & OpenShift Scott McCarty Principal Technology Product Manager - Containers 10/15/2018 What if... I told you there is container innovation happening in

More information

The Value of Data Governance for the Data-Driven Enterprise

The Value of Data Governance for the Data-Driven Enterprise Solution Brief: erwin Data governance (DG) The Value of Data Governance for the Data-Driven Enterprise Prepare for Data Governance 2.0 by bringing business teams into the effort to drive data opportunities

More information

Docker Universal Control Plane Deploy and Manage On-Premises, Your Dockerized Distributed Applications

Docker Universal Control Plane Deploy and Manage On-Premises, Your Dockerized Distributed Applications Technical Brief Docker Universal Control Plane Deploy and Manage On-Premises, Your Dockerized Distributed Applications As application teams deploy their Dockerized applications into production environments,

More information

Data Intensive Scalable Computing

Data Intensive Scalable Computing Data Intensive Scalable Computing Randal E. Bryant Carnegie Mellon University http://www.cs.cmu.edu/~bryant Examples of Big Data Sources Wal-Mart 267 million items/day, sold at 6,000 stores HP built them

More information

Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University

Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University Update on Dataverse Image credit: David Bygott (CC-BY-NC-SA) 2014 Dryad-Dataverse Community Meeting Mercè Crosas, Elizabeth Quigley & Eleni Castro Data Science > IQSS > Harvard University Introduction

More information

Introduction and Datacenter Topology For Your System

Introduction and Datacenter Topology For Your System Introduction and Datacenter Topology For Your System This chapter provides an introduction, a datacenter overview, and VMware vcenter requirements for your system. Introducing Cisco WebEx Meetings Server,

More information

IBM Bluemix compute capabilities IBM Corporation

IBM Bluemix compute capabilities IBM Corporation IBM Bluemix compute capabilities After you complete this section, you should understand: IBM Bluemix infrastructure compute options Bare metal servers Virtual servers IBM Bluemix Container Service IBM

More information

2013 AWS Worldwide Public Sector Summit Washington, D.C.

2013 AWS Worldwide Public Sector Summit Washington, D.C. 2013 AWS Worldwide Public Sector Summit Washington, D.C. EMR for Fun and for Profit Ben Butler Sr. Manager, Big Data butlerb@amazon.com @bensbutler Overview 1. What is big data? 2. What is AWS Elastic

More information

Journey Towards Science DMZ. Suhaimi Napis Technical Advisory Committee (Research Computing) MYREN-X Universiti Putra Malaysia

Journey Towards Science DMZ. Suhaimi Napis Technical Advisory Committee (Research Computing) MYREN-X Universiti Putra Malaysia Malaysia's Computational Journey Towards Science DMZ Suhaimi Napis Technical Advisory Committee (Research Computing) MYREN-X Universiti Putra Malaysia suhaimi@upm.my In the Beginning... Research on parallel/distributed

More information

OUR VISION To be a global leader of computing research in identified areas that will bring positive impact to the lives of citizens and society.

OUR VISION To be a global leader of computing research in identified areas that will bring positive impact to the lives of citizens and society. Join the Innovation Qatar Computing Research Institute (QCRI) is a national research institute established in 2010 by Qatar Foundation for Education, Science and Community Development. As a primary constituent

More information

Georgia State University Cyberinfrastructure Plan

Georgia State University Cyberinfrastructure Plan Georgia State University Cyberinfrastructure Plan Summary Building relationships with a wide ecosystem of partners, technology, and researchers are important for GSU to expand its innovative improvements

More information

Achieving Digital Transformation: FOUR MUST-HAVES FOR A MODERN VIRTUALIZATION PLATFORM WHITE PAPER

Achieving Digital Transformation: FOUR MUST-HAVES FOR A MODERN VIRTUALIZATION PLATFORM WHITE PAPER Achieving Digital Transformation: FOUR MUST-HAVES FOR A MODERN VIRTUALIZATION PLATFORM WHITE PAPER Table of Contents The Digital Transformation 3 Four Must-Haves for a Modern Virtualization Platform 3

More information

One Body, Many Heads for Repository-Powered Library Applications

One Body, Many Heads for Repository-Powered Library Applications One Body, Many Heads for Repository-Powered Library Applications Tom Cramer! Chief Technology Strategist! Stanford University Libraries!! CNI * 13 December 2011! Repositories make strange bedfellows University

More information

Vblock Infrastructure Packages: Accelerating Deployment of the Private Cloud

Vblock Infrastructure Packages: Accelerating Deployment of the Private Cloud Vblock Infrastructure Packages: Accelerating Deployment of the Private Cloud Roberto Missana - Channel Product Sales Specialist Data Center, Cisco 1 IT is undergoing a transformation Enterprise IT solutions

More information

Multi-Cloud and Application Centric Modeling, Deployment and Management with Cisco CloudCenter (CliQr)

Multi-Cloud and Application Centric Modeling, Deployment and Management with Cisco CloudCenter (CliQr) Multi-Cloud and Application Centric Modeling, Deployment and Management with Cisco CloudCenter (CliQr) Jeremy Oakey - Sr. Director, Technical Marketing & Integrations BRKCLD-2008 Agenda Introduction Architecture

More information

Video Conferencing & Skype for Business: Your Need-to-Know Guide

Video Conferencing & Skype for Business: Your Need-to-Know Guide Video Conferencing & Skype for Business: Your Need-to-Know Guide Effective, engaging collaboration that leverages video conferencing should incorporate features like content sharing, clear participant

More information

ICME: Status & Perspectives

ICME: Status & Perspectives ICME: Status & Perspectives from Materials Science and Engineering Surya R. Kalidindi Georgia Institute of Technology New Strategic Initiatives: ICME, MGI Reduce expensive late stage iterations Materials

More information

Building Bridges: A System for New HPC Communities

Building Bridges: A System for New HPC Communities Building Bridges: A System for New HPC Communities HPC User Forum 59 LRZ, Garching October 16, 2015 Presenter: Jim Kasdorf Director, Special Projects Pittsburgh Supercomputing Center kasdorf@psc.edu 2015

More information

Software Defined Storage

Software Defined Storage Software Defined Storage IBM Spectrum Portfolio Ian Hancock ian.hancock@uk.ibm.com Business challenges are IT challenges Create new business models (CEO) Transform financial & management processes (CFO)

More information

globus online The Galaxy Project and Globus Online

globus online The Galaxy Project and Globus Online globus online The Galaxy Project and Globus Online Ravi K Madduri Argonne National Lab University of Chicago Outline What is Globus Online? Globus Online and Sequencing Centers What is Galaxy? Integra;ng

More information

Cisco Smart+Connected Communities

Cisco Smart+Connected Communities Brochure Cisco Smart+Connected Communities Helping Cities on Their Digital Journey Cities worldwide are becoming digital or are evaluating strategies for doing so in order to make use of the unprecedented

More information

VMware on IBM Cloud:

VMware on IBM Cloud: VMware on IBM Cloud: How VMware customers can deploy new or existing applications with SoftLayer resources. Introduction This paper focuses on how existing VMware customers can gain a strategic advantage

More information

Cloud Foundry and OpenStack

Cloud Foundry and OpenStack Free Signup: www.cloudfoundry.com, code: openstack2013 Cloud Foundry and OpenStack Ferran Rodenas, Dekel Tankel Cloud Foundry, Pivotal frodenas@vmware.com, twitter: @ferdy dekel@vmware.com, twitter: @dekt

More information

How to Leverage Containers to Bolster Security and Performance While Moving to Google Cloud

How to Leverage Containers to Bolster Security and Performance While Moving to Google Cloud PRESENTED BY How to Leverage Containers to Bolster Security and Performance While Moving to Google Cloud BIG-IP enables the enterprise to efficiently address security and performance when migrating to

More information

BUYING SERVER HARDWARE FOR A SCALABLE VIRTUAL INFRASTRUCTURE

BUYING SERVER HARDWARE FOR A SCALABLE VIRTUAL INFRASTRUCTURE E-Guide BUYING SERVER HARDWARE FOR A SCALABLE VIRTUAL INFRASTRUCTURE SearchServer Virtualization P art 1 of this series explores how trends in buying server hardware have been influenced by the scale-up

More information

EGI federated e-infrastructure, a building block for the Open Science Commons

EGI federated e-infrastructure, a building block for the Open Science Commons EGI federated e-infrastructure, a building block for the Open Science Commons Yannick LEGRÉ Director, EGI.eu www.egi.eu EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union

More information

Security as Code: The Time is Now. Dave Shackleford Founder, Voodoo Security Sr. Instructor, SANS

Security as Code: The Time is Now. Dave Shackleford Founder, Voodoo Security Sr. Instructor, SANS Security as Code: The Time is Now Dave Shackleford Founder, Voodoo Security Sr. Instructor, SANS Introduction Business is moving faster to the cloud, and DevOps is accelerating scale and pushing automation

More information

Cisco ACI Simulator VM Installation Guide

Cisco ACI Simulator VM Installation Guide Cisco ACI Simulator VM Installation Guide New and Changed Information 2 About the Application Policy Infrastructure Controller 2 About the ACI Simulator Virtual Machine 2 Simulator VM Topology and Connections

More information

Virtual Appliances and Education in FutureGrid. Dr. Renato Figueiredo ACIS Lab - University of Florida

Virtual Appliances and Education in FutureGrid. Dr. Renato Figueiredo ACIS Lab - University of Florida Virtual Appliances and Education in FutureGrid Dr. Renato Figueiredo ACIS Lab - University of Florida Background l Traditional ways of delivering hands-on training and education in parallel/distributed

More information

Overview of HPC at LONI

Overview of HPC at LONI Overview of HPC at LONI Le Yan HPC Consultant User Services @ LONI What Is HPC High performance computing is to use supercomputers to solve problems computationally The most powerful supercomputer today

More information