globus online The Galaxy Project and Globus Online

Size: px
Start display at page:

Download "globus online The Galaxy Project and Globus Online"

Transcription

1 globus online The Galaxy Project and Globus Online Ravi K Madduri Argonne National Lab University of Chicago

2 Outline What is Globus Online? Globus Online and Sequencing Centers What is Galaxy? Integra;ng Galaxy and Globus Online Why? How? Demo Future Work Q & A 2

3 3

4 Benefits of Globus Online Reliable file transfer. Easy fire and forget file transfers Automa;c fault recovery High performance Across mul;ple security domains No IT required. No client sopware installa;on New features automa;cally available Consolidated support and troubleshoo;ng Works with exis;ng GridFTP servers Globus Connect solves last mile problem I moved 400 GB of files and didn t even have to think about it. Lawrence Berkeley Na.onal Lab It s just not a big deal to move big data anymore. Ini.a.ve for Biomedical Informa.cs Fantas.c! I have started using globus connect to transfer data, and it only took me 5 minutes to set up. Thank you! NERSC user 4

5 DNA Sequencing Costs are Declining 13 years $300 million ~1 week $1,000 5

6 Sequencing Data is Increasing Source: Pennisi, E., Science :6018 pp

7 PredicGon of Number of Sequenced Genomes 1,000 Genomes Learning the Ropes 25 Million Genomes A Brave New World 250,000 Genomes Clinical Early Adop;on 5 Million Genomes Consumer Reality Source: Resnick, Richard, Implica;ons of exponen;al growth of global whole genome sequencing capacity. GenomeQuest. July 9, Retrieved April 7,

8 Current CompuGng Infrastructure Won t Support Increasing Data Will Computers Crash Genomics? Pennisi, E., Science :6018 pp. 666 The various members of the genome informa;cs ecosystem are now facing a potengal tsunami of genome data that will swamp our storage systems and crush our compute clusters. Stein, L.D., Genome Biology :5 pp. 207 CREDIT: ALVARO ARTEAGA/ALVAREJO.COM. 8

9 Sequencing Center Workflow Delivery of biological sample to sequencing center by scien;st Data deliver to scien;st. Data produc;on by sequencing center 9

10 ABI SOLiD Sequencing Instrument 1.2 TB of data every 7 to 14 days 2 to 1600 files 1 to 16 customers File Transfer = Product Delivery 10

11 Current File Transfer Methods 11 11

12 Current File Transfer Methods providers and their customers open resort to the sneaker net : overnight shipment of data-laden hard drives. Pennisi, E., Science :6018 pp

13 Globus Online Improves Current Methods providers and their customers open resort to the sneaker net : overnight shipment of data-laden hard drives. Pennisi, E., Science :6018 pp

14 Why Globus Online? Fire- and- forget usage Re- trying failed transfers Logs to iden;fy reasons behind failed transfers Simplicity Simple logon and authen;ca;on Web interface for execu;on and monitoring Globus Connect for sequencing facility endpoint Reliability Checksum op;on (~$10K/genome) Performance Secure enablement Now, with Globus Online, the process is trivial and our scien.sts can move data to the right loca.on with just a few clicks. File size is no longer a barrier to produc.vity. Ini.a.ve for Biomedical Informa.cs 14

15 Globus Online at UC Sequencing Facility Mac using Globus Connect Mount drive Delivery of data to customer ibi File Server Sequencing Instrument 15 ibi General Purpose Compute Cluster Sequencing Specific Compute Cluster

16 Extend to Other Sequencing Centers Over 300 public sequencing centers 16

17 Extend Beyond Sequencing Centers Future Use Case = Biologists Moreover, as so- called third genera;on machines which promise even cheaper, faster produc;on of DNA sequences (Science, 5 March 2010, p. 1190) become available, more, and smaller, labs will start genome projects of their own. Pennisi, E., Science :6018 pp

18 What about Analysis of this Data? 18

19 Enter Galaxy A free (for everyone) web service integra;ng a wealth of tools, compute resources, terabytes of reference data and permanent storage Open source so^ware that makes it easy to integrate your own tools and data and customize your own site Flexible architecture - > Customizable 19

20 Galaxy AdopGon ~50 deployments of Galaxy Galaxy for MicroArray analysis, Machine Learning, Drug Discovery etc ~130,000 jobs a month and growing on the public instance of Galaxy 1 TB/week in user uploads 60TB from China 150+ aqendees in the Galaxy users conference From 6 con;nents 20

21 Globus Online and Galaxy Transferring large quan;;es of data in and out of Galaxy reliably Downloading results to user s laptop Running Galaxy in Produc;on for mul;ple users Running analysis on a cluster in the user space Federated Iden;ty management Flexible group management system (VO) Mechanisms to share data and workflows Parallel execu;on Condor and SwiP 21

22 What works Iden;ty Management Used Globus Provision to bootstrap the provisioned ec2 cluster with GO creden;als so one can put data from GO- enabled endpoints into galaxy and script transferring data from galaxy to any GO- enabled endpoint Data Management Added a GO Transfer task to Galaxy Ability to script the transfer tasks along with the workflow Execu;on Management We know more about this than anything Condor runner for galaxy: The executable along with command line op;ons get passed to a condor runner (runner is galaxy term for execu;on mechanism) Group Management Metadata Management 22

23 Automated deployment of Galaxy clusters on EC2 Uses Globus Provision An NFS server providing global directories for home directories, sopware, and scratch space. An NIS server providing an authen;ca;on domain within the cluster. The list of users must be specified when the cluster is created A GridFTP server, which is automa;cally registered as a GO endpoint. A Condor pool. The number of worker nodes is fixed at the ;me the cluster is created. A Galaxy server with the modifica;ons outlined above. Each user in the cluster has a user cer;ficate signed by the Globus Provision CA (this CA is trusted by GO). If a GO username matches a cluster username, and that user s Globus Provision cer;ficate is uploaded to GO as a trusted cer;ficate, then the user will be able to transfer files to/from the cluster s endpoint (and will also be able to use Galaxy s GO transfer tasks; this further requires that the user s public name in Galaxy matches his GO and cluster username). 23

24 GO Galaxy DescripGon The deployment of clusters on EC2 is completely automated, and we can deploy fully- func;onal Galaxy clusters on EC2 within minutes We created galaxy runners for Condor and SwiP So the applica;ons are run on a worker node instead of the galaxy node Reusable CHEF recipes to provision produc;on galaxy instances on demand on EC2(- ish) clusters 24

25 IntegraGon of Globus Online with Galaxy 25

26 Demo of Current CapabiliGes 26

27 Globus Online: Part of the CompuGng Infrastructure to Support Genomics CREDIT: ALVARO ARTEAGA/ALVAREJO.COM 27 27

28 Globus Online: Part of the CompuGng Infrastructure to Support Genomics CREDIT: ALVARO ARTEAGA/ALVAREJO.COM 28

29 GO Galaxy Future CapabiliGes Integrate flexible Globus Online iden;ty management, group management with Galaxy Ability to create a VO with the above capabili;es (User/Groups management, File system, Transfer, HTC, Parallel execu;on, EC2 Cluster, data sharing) with click of a buqon Integrate with Globus Online on- demand storage solu;on Ability to easily share data, workflows within a Virtual Organiza;on 29

30 References More details about Globus Online can be found here: hqp:// Details on Galaxy: hqp://usegalaxy.org Details on work we did integra;ng Globus Online and Galaxy: bit.ly/qdohhw 30

31 globus online Thanks! Questions? (or on twitter)

globus online Software-as-a-Service for Research Data Management

globus online Software-as-a-Service for Research Data Management globus online Software-as-a-Service for Research Data Management Steve Tuecke Deputy Director, Computation Institute University of Chicago & Argonne National Laboratory Big Science built on Globus Toolkit

More information

Welcome to the CyVerse Data Store. Manage and share your data across all CyVerse pla8orms

Welcome to the CyVerse Data Store. Manage and share your data across all CyVerse pla8orms Welcome to the CyVerse Data Store Manage and share your data across all CyVerse pla8orms ü Follow Along ü Workshop Packet ü mcbios.readthedocs.org Logis;cs Working with Big Data What? Challenges: the scope

More information

HIDDEN SLIDE Summary These slides are meant to be used as is to give an upper level view of perfsonar for an audience that is not familiar with the

HIDDEN SLIDE Summary These slides are meant to be used as is to give an upper level view of perfsonar for an audience that is not familiar with the HIDDEN SLIDE Summary These slides are meant to be used as is to give an upper level view of perfsonar for an audience that is not familiar with the concept. You *ARE* allowed to delete things you don t

More information

October 5 th 2010 NANOG 50 Jason Zurawski Internet2

October 5 th 2010 NANOG 50 Jason Zurawski Internet2 October 5 th 2010 NANOG 50 Jason Zurawski Internet2 Overview Introduc=on and Mo=va=on Design & Func=onality Current Deployment Footprint Conclusion 2 10/5/10, 2010 Internet2 Why Worry About Network Performance?

More information

ThinManager and FactoryTalk View SE. John Ter8n; ESE, Inc.

ThinManager and FactoryTalk View SE. John Ter8n; ESE, Inc. ThinManager and FactoryTalk View SE John Ter8n; ESE, Inc. Who Am I John Ter8n Director of Manufacturing Informa8on Systems Who We Are Founded in 1981 Headquartered in Marshfield, Wisconsin 100% Employee-

More information

COMPUTE CANADA GLOBUS PORTAL

COMPUTE CANADA GLOBUS PORTAL COMPUTE CANADA GLOBUS PORTAL Fast, user-friendly data transfer and sharing Jason Hlady University of Saskatchewan WestGrid / Compute Canada February 4, 2015 Why Globus? I need to easily, quickly, and reliably

More information

Chris Awre & Diane Leeson LLI Seminar 20 th November 2012

Chris Awre & Diane Leeson LLI Seminar 20 th November 2012 Chris Awre & Diane Leeson LLI Seminar 20 th November 2012 Blacklight Background What is Blacklight and what can it do? Func5onality Technology Blacklight and Hydra Blacklight and the catalogue Community

More information

REsources linkage for E-scIence - RENKEI -

REsources linkage for E-scIence - RENKEI - REsources linkage for E-scIence - - hlp://www.e- sciren.org/ REsources linkage for E- science () is a research and development project for new middleware technologies to enable e- science communi?es. ""

More information

Visualization for Scientists. We discuss how Deluge and Complexity call for new ideas in data exploration. Learn more, find tools at layerscape.

Visualization for Scientists. We discuss how Deluge and Complexity call for new ideas in data exploration. Learn more, find tools at layerscape. Visualization for Scientists We discuss how Deluge and Complexity call for new ideas in data exploration. Learn more, find tools at layerscape.org Transfer and synchronize files Easy fire-and-forget transfers

More information

globus online Globus Nexus Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory

globus online Globus Nexus Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory globus online Globus Nexus Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory Computation Institute (CI) Apply to challenging problems Accelerate by building the research

More information

Climate Data Management using Globus

Climate Data Management using Globus Climate Data Management using Globus Computation Institute Rachana Ananthakrishnan (ranantha@uchicago.edu) Data Management Challenges Transfers often take longer than expected based on available network

More information

escience in the Cloud: A MODIS Satellite Data Reprojection and Reduction Pipeline in the Windows

escience in the Cloud: A MODIS Satellite Data Reprojection and Reduction Pipeline in the Windows escience in the Cloud: A MODIS Satellite Data Reprojection and Reduction Pipeline in the Windows Jie Li1, Deb Agarwal2, Azure Marty Platform Humphrey1, Keith Jackson2, Catharine van Ingen3, Youngryel Ryu4

More information

S2E is proud to partner with Intercede helping organizations to create and use trusted digital identities.

S2E is proud to partner with Intercede helping organizations to create and use trusted digital identities. S2E is proud to partner with Intercede helping organizations to create and use trusted digital identities. 3 Who are Intercede? So%ware company specializing in iden5ty and creden5al management Focus on

More information

Distributed Monte Carlo Production for

Distributed Monte Carlo Production for Distributed Monte Carlo Production for Joel Snow Langston University DOE Review March 2011 Outline Introduction FNAL SAM SAMGrid Interoperability with OSG and LCG Production System Production Results LUHEP

More information

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations October 29, 2014 Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations XSEDE for HPC Users What is XSEDE? XSEDE mo/va/on and goals XSEDE Resources XSEDE for HPC Users: Before

More information

Symantec Data Loss Preven2on 12.5 Demo Presenta2on

Symantec Data Loss Preven2on 12.5 Demo Presenta2on Symantec Data Loss Preven2on 12.5 Demo Presenta2on 1 Our Understanding PROJECT DRIVERS & DATA TO PROTECT Regulatory compliance PCI, GLBA Data inventory and cleansing SSNs, CCNs [Replace these bullet points

More information

The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists

The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists Internet2 Network Infrastructure for the Life Sciences Focused Technical Workshop. Berkeley, CA July 17-18, 2013

More information

6,000 Cameras in Time Square 210 million Cameras worldwide

6,000 Cameras in Time Square 210 million Cameras worldwide SMILE!! You are on camera 75 $mes per day Average American ci$zen can be caught on camera 1:29 Camera to person ra$o World Wide 6,000 Cameras in Time Square 210 million Cameras worldwide What is the LTO

More information

Senate Subcommi-ee on Flooding & Evacua5ons. Presenta5on by Commission on State Emergency Communica5ons August 24, 2010 Houston, Texas

Senate Subcommi-ee on Flooding & Evacua5ons. Presenta5on by Commission on State Emergency Communica5ons August 24, 2010 Houston, Texas Senate Subcommi-ee on Flooding & Evacua5ons Presenta5on by Commission on State Emergency Communica5ons August 24, 2010 Houston, Texas Overview of 9-1- 1 in Texas "9-1- 1 service" means a telecommunica5ons

More information

The Purge Threat: Scien*sts thoughts on peta- scale usability

The Purge Threat: Scien*sts thoughts on peta- scale usability The Purge Threat: Scien*sts thoughts on peta- scale usability Alexandra Holloway Storage Systems Research Center + Assis*ve Technology Lab University of California, Santa Cruz Introduc*on

More information

Design patterns for data-driven research acceleration

Design patterns for data-driven research acceleration Design patterns for data-driven research acceleration Rachana Ananthakrishnan, Kyle Chard, and Ian Foster The University of Chicago and Argonne National Laboratory Contact: rachana@globus.org Introduction

More information

The Future of Galaxy. Nate Coraor galaxyproject.org

The Future of Galaxy. Nate Coraor galaxyproject.org The Future of Galaxy Nate Coraor galaxyproject.org Galaxy is... A framework for scientists Enables usage of complicated command line tools Deals with file formats as transparently as possible Provides

More information

Oracle VM Workshop Applica>on Driven Virtualiza>on

Oracle VM Workshop Applica>on Driven Virtualiza>on Oracle VM Workshop Applica>on Driven Virtualiza>on Simon COTER Principal Product Manager Oracle VM & VirtualBox simon.coter@oracle.com hnps://blogs.oracle.com/scoter November 25th, 2015 Copyright 2014

More information

Federated Services for Scientists Thursday, December 9, p.m. EST

Federated Services for Scientists Thursday, December 9, p.m. EST IAM Online Federated Services for Scientists Thursday, December 9, 2010 1 p.m. EST Rachana Ananthakrishnan Argonne National Laboratory & University of Chicago Jim Basney National Center for Supercomputing

More information

Big Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago

Big Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago Assembly Annota3on Modeling Design Big Data, Big Compute, Big Interac3on Machines for Future Biology Rick Stevens stevens@anl.gov Argonne Na3onal Laboratory The University of Chicago There are no solved

More information

The new perfsonar: a global tool for global network monitoring

The new perfsonar: a global tool for global network monitoring The new perfsonar: a global tool for global network monitoring Domenico Vicinanza (on behalf of the perfsonar Project) http://www.perfsonar.net GÉANT Product Management Team GÉANT Global Connec/vity -

More information

Kaseya Fundamentals Workshop DAY TWO. Developed by Kaseya University. Powered by IT Scholars

Kaseya Fundamentals Workshop DAY TWO. Developed by Kaseya University. Powered by IT Scholars Kaseya Fundamentals Workshop DAY TWO Developed by Kaseya University Powered by IT Scholars Kaseya Version 6.5 Last updated March, 2014 Day One Review IT- Scholars Virtual LABS System Management Organiza@on

More information

Modifying an Exis.ng Commercial Product for Cryptographic Module Evalua.on

Modifying an Exis.ng Commercial Product for Cryptographic Module Evalua.on Modifying an Exis.ng Commercial Product for Cryptographic Module Evalua.on ICMC16 O?awa, Canada 18-20 May 2016 Presented by Alan Gornall Introduc.on I provide cer.fica.on support to my clients: compliance

More information

CONTAINERIZING JOBS ON THE ACCRE CLUSTER WITH SINGULARITY

CONTAINERIZING JOBS ON THE ACCRE CLUSTER WITH SINGULARITY CONTAINERIZING JOBS ON THE ACCRE CLUSTER WITH SINGULARITY VIRTUAL MACHINE (VM) Uses so&ware to emulate an en/re computer, including both hardware and so&ware. Host Computer Virtual Machine Host Resources:

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information

IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns. 6/25/14 Archive Analy3cs Solu3ons 1

IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns. 6/25/14 Archive Analy3cs Solu3ons 1 IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns 6/25/14 Archive Analy3cs Solu3ons 1 Credits Archive Analy3cs Solu3ons is presen3ng an archive system that embodies best prac3ce for long- term, high integrity

More information

Project Blackbird. U"lizing Condor and HTC to address archiving online courses at Clemson on a weekly basis. Sam Hoover

Project Blackbird. Ulizing Condor and HTC to address archiving online courses at Clemson on a weekly basis. Sam Hoover Project Blackbird U"lizing Condor and HTC to address archiving online courses at Clemson on a weekly basis Sam Hoover shoover@clemson.edu 1 Project Blackbird Blackboard at Clemson End of Semester archives

More information

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Why the Grid? Science is becoming increasingly digital and needs to deal with increasing amounts of

More information

A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System

A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System Ilkay Al(ntas and Daniel Crawl San Diego Supercomputer Center UC San Diego Jianwu Wang UMBC WorDS.sdsc.edu Computa3onal

More information

Outline. ASP 2012 Grid School

Outline. ASP 2012 Grid School Distributed Storage Rob Quick Indiana University Slides courtesy of Derek Weitzel University of Nebraska Lincoln Outline Storage Patterns in Grid Applications Storage

More information

Using the Open Science Data Cloud for Data Science Research. Robert Grossman University of Chicago Open Cloud Consor=um June 17, 2013

Using the Open Science Data Cloud for Data Science Research. Robert Grossman University of Chicago Open Cloud Consor=um June 17, 2013 Using the Open Science Data Cloud for Data Science Research Robert Grossman University of Chicago Open Cloud Consor=um June 17, 2013 Discoveries Team: you and your colleagues correla=on + algorithms +

More information

Engaging Employees and Customers with Video. The Benefits of Corporate Webcas3ng

Engaging Employees and Customers with Video. The Benefits of Corporate Webcas3ng Engaging Employees and Customers with Video The Benefits of Corporate Webcas3ng Agenda Introduc9on UnityLivestream Teradek Wowza Workflow Produc9on Streaming Delivery Case Studies Demo - Live Solu9on -

More information

Kaseya Fundamentals Workshop DAY FOUR. Developed by Kaseya University. Powered by IT Scholars

Kaseya Fundamentals Workshop DAY FOUR. Developed by Kaseya University. Powered by IT Scholars Kaseya Fundamentals Workshop DAY FOUR Developed by Kaseya University Powered by IT Scholars Kaseya Version 6.5 Last updated March, 2014 Day Three Review State Based Monitoring Event Based Monitoring Monitoring

More information

User Community Driven Development in Trust and Identity Services

User Community Driven Development in Trust and Identity Services User Community Driven Development in Trust and Identity Services Ann Harding, SWITCH Internet2 Global Summit 27 April 2015 Washington DCs Agenda Trust and Iden.ty Landscape GÉANT Research Community Engagement

More information

Securing Hadoop. Keys Botzum, MapR Technologies Jan MapR Technologies - Confiden6al

Securing Hadoop. Keys Botzum, MapR Technologies Jan MapR Technologies - Confiden6al Securing Hadoop Keys Botzum, MapR Technologies kbotzum@maprtech.com Jan 2014 MapR Technologies - Confiden6al 1 Why Secure Hadoop Historically security wasn t a high priority Reflec6on of the type of data

More information

Advanced Photon Source Data Management. S. Veseli, N. Schwarz, C. Schmitz (SDM/XSD) R. Sersted, D. Wallis (IT/AES)

Advanced Photon Source Data Management. S. Veseli, N. Schwarz, C. Schmitz (SDM/XSD) R. Sersted, D. Wallis (IT/AES) Advanced Photon Source Data Management S. Veseli, N. Schwarz, C. Schmitz (SDM/XSD) R. Sersted, D. Wallis (IT/AES) APS Data Management - Globus World 2018 Growing Beamline Data Needs X-ray detector capabilities

More information

Clouds: An Opportunity for Scientific Applications?

Clouds: An Opportunity for Scientific Applications? Clouds: An Opportunity for Scientific Applications? Ewa Deelman USC Information Sciences Institute Acknowledgements Yang-Suk Ki (former PostDoc, USC) Gurmeet Singh (former Ph.D. student, USC) Gideon Juve

More information

biokepler: A Comprehensive Bioinforma2cs Scien2fic Workflow Module for Distributed Analysis of Large- Scale Biological Data

biokepler: A Comprehensive Bioinforma2cs Scien2fic Workflow Module for Distributed Analysis of Large- Scale Biological Data biokepler: A Comprehensive Bioinforma2cs Scien2fic Workflow Module for Distributed Analysis of Large- Scale Biological Data Ilkay Al/ntas 1, Jianwu Wang 2, Daniel Crawl 1, Shweta Purawat 1 1 San Diego

More information

FutureGrid 101. Part 2: Ge*ng Started Craig Stewart

FutureGrid 101. Part 2: Ge*ng Started Craig Stewart FutureGrid 101 Part 2: Ge*ng Started Craig Stewart Futuregrid.org We re s9ll in early adopter mode But we are very much interested in applica9ons experiments, computa9onal science experiments, and computer

More information

Tools for Handling Big Data and Compu5ng Demands. Humani5es and Social Science Scholars

Tools for Handling Big Data and Compu5ng Demands. Humani5es and Social Science Scholars Tools for Handling Big Data and Compu5ng Demands Humani5es and Social Science Scholars Outline Overview of Compute Canada and WestGrid Focus on Humani5es and Social Sciences The Resource Alloca5on Compe55on

More information

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science T. Maeno, K. De, A. Klimentov, P. Nilsson, D. Oleynik, S. Panitkin, A. Petrosyan, J. Schovancova, A. Vaniachine,

More information

MulG-Vendor Key Management with KMIP

MulG-Vendor Key Management with KMIP MulG-Vendor Key Management with KMIP Tim Hudson CTO Cryptso2 tjh@cryptso2.com GS13A 19-May-2016 1:35pm Key Management 1000011010100100101100101010000010101000101001101001111010001100 Key Management Standards

More information

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing New Paradigms: Clouds, Virtualization and Co. EGEE08, Istanbul, September 25, 2008 An Introduction to Virtualization and Cloud Technologies to Support Grid Computing Distributed Systems Architecture Research

More information

Building social services: Social TV case study

Building social services: Social TV case study Building social services: Social TV case study CFP All-Members Meeting May 14, 2009 Venice, Italy Natalie Klym nklym@cfp.mit.edu The Social TV study combines two research streams The future of television

More information

CANARIE: Providing Essential Digital Infrastructure for Canada

CANARIE: Providing Essential Digital Infrastructure for Canada CANARIE: Providing Essential Digital Infrastructure for Canada Mark Wolff; CTO April 16, 2014 A Transformation of the Science Paradigm thousands of years ago last few hundred years last few decades today

More information

Introduction to Grid Computing

Introduction to Grid Computing Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able

More information

COSC 310: So*ware Engineering. Dr. Bowen Hui University of Bri>sh Columbia Okanagan

COSC 310: So*ware Engineering. Dr. Bowen Hui University of Bri>sh Columbia Okanagan COSC 310: So*ware Engineering Dr. Bowen Hui University of Bri>sh Columbia Okanagan 1 Admin A2 is up Don t forget to keep doing peer evalua>ons Deadline can be extended but shortens A3 >meframe Labs This

More information

A High-Performance Storage and Ultra- High-Speed File Transfer Solution for Collaborative Life Sciences Research

A High-Performance Storage and Ultra- High-Speed File Transfer Solution for Collaborative Life Sciences Research A High-Performance Storage and Ultra- High-Speed File Transfer Solution for Collaborative Life Sciences Research Storage Platforms with Aspera Overview A growing number of organizations with data-intensive

More information

EGI: Linking digital resources across Eastern Europe for European science and innovation

EGI: Linking digital resources across Eastern Europe for European science and innovation EGI- InSPIRE EGI: Linking digital resources across Eastern Europe for European science and innovation Steven Newhouse EGI.eu Director 12/19/12 EPE 2012 1 EGI European Over 35 countries Grid Secure sharing

More information

Leveraging Globus Identity for the Grid. Suchandra Thapa GlobusWorld, April 22, 2016 Chicago

Leveraging Globus Identity for the Grid. Suchandra Thapa GlobusWorld, April 22, 2016 Chicago Leveraging Globus Identity for the Grid Suchandra Thapa GlobusWorld, April 22, 2016 Chicago Open Science Grid Helps researchers speed up their research using high throughput computing methods Helps campus

More information

Galaxy. Data intensive biology for everyone. / #usegalaxy

Galaxy. Data intensive biology for everyone. / #usegalaxy Galaxy Data intensive biology for everyone. www.galaxyproject.org @jxtx / #usegalaxy Engineering Dannon Baker Dan Blankenberg Dave Bouvier Nate Coraor Carl Eberhard Jeremy Goecks Sam Guerler Greg von Kuster

More information

TPP On The Cloud. Joe Slagel

TPP On The Cloud. Joe Slagel TPP On The Cloud Joe Slagel Lecture topics Introduc5on to Cloud Compu5ng and Amazon Web Services Overview of TPP Cloud components Setup trial AWS and use of the new TPP Web Launcher for Amazon (TWA) Future

More information

RESEARCH DATA DEPOT AT PURDUE UNIVERSITY

RESEARCH DATA DEPOT AT PURDUE UNIVERSITY Preston Smith Director of Research Services RESEARCH DATA DEPOT AT PURDUE UNIVERSITY May 18, 2016 HTCONDOR WEEK 2016 Ran into Miron at a workshop recently.. Talked about data and the challenges of providing

More information

The Avalon Media System

The Avalon Media System The Avalon Media System A Next- Genera8on Solu8on for Media Management and Access Jon Dunn Mark Notess IU Digital Library Brown Bag Series 27 March 2013 About Us Jon Dunn Interim Assistant Dean for Library

More information

SQS, SWF, and SNS 7/24/17. References. Amazon Simple Queue Service(SQS)

SQS, SWF, and SNS 7/24/17. References. Amazon Simple Queue Service(SQS) SQS, SWF, and SNS Chapter 8 References All informa6on in this presenta6on was obtained from the following sources with all credit due to the listed authors: J. Baron, H. Baz, T. Bixler, B. Gaut, K. E.

More information

Red Hat Container Strategy Ahmed El-Rayess

Red Hat Container Strategy Ahmed El-Rayess Red Hat Container Strategy Ahmed El-Rayess I.T. Organiza,ons Under Pressure CONCRETE SHOES OF LEGACY AND RIGID PROCESSES CURRENT STATE Manual processes Inconsistent environments Dependency hell Legacy

More information

Open Data Kit. A set of open source tools to help organiza3ons collect, aggregate and visualize their rich data.

Open Data Kit. A set of open source tools to help organiza3ons collect, aggregate and visualize their rich data. Open Data Kit h8p://code.google.com/p/open- data- kit A set of open source tools to help organiza3ons collect, aggregate and visualize their rich data. Organiza.ons in developing regions inefficiently

More information

ESnet s (100G) SDN Testbed

ESnet s (100G) SDN Testbed ESnet s (100G) SDN Testbed Inder Monga and ESnet SDN team Interna:onal SDN Testbed, March 2015 Outline Testbeds in ESnet Mo:va:on: Building a Scalable SDN WAN testbed Hardware and Deployment Status First

More information

NFS 3/25/14. Overview. Intui>on. Disconnec>on. Challenges

NFS 3/25/14. Overview. Intui>on. Disconnec>on. Challenges NFS Overview Sharing files is useful Network file systems give users seamless integra>on of a shared file system with the local file system Many op>ons: NFS, SMB/CIFS, AFS, etc. Security an important considera>on

More information

Unified Management for Virtual Storage

Unified Management for Virtual Storage Unified Management for Virtual Storage Storage Virtualization Automated Information Supply Chains Contribute to the Information Explosion Zettabytes Information doubling every 18-24 months Storage growing

More information

Globus Online: File Transfer Made Easy!

Globus Online: File Transfer Made Easy! Globus Online: File Transfer Made Easy! Matteo Lanati matteo.lanati@lrz.de Initiative for Globus in Europe Leibniz Supercomputing Centre Outline Introduction and acknowledgments Motivation Demo session

More information

Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University

Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University Dataverse 4.0 & Beyond ì Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University 2 Data Science Team Data Cura/on & Stewardship Informa/on Scien/sts Researchers Sta/s/cal Innova/on

More information

Next Generation Science and Infrastructure Support

Next Generation Science and Infrastructure Support Next Generation Science and Infrastructure Support James Lowey Director Network & Computing Systems TGEN The Translational Genomics Research Institute (TGen) Non-profit Biomedical research institute Founded

More information

Building a Materials Data Facility (MDF)

Building a Materials Data Facility (MDF) Building a Materials Data Facility (MDF) Ben Blaiszik (blaiszik@uchicago.edu) Ian Foster (foster@uchicago.edu) Steve Tuecke, Kyle Chard, Rachana Ananthakrishnan, Jim Pruyne (UC) Kandace Turner-Jones, John

More information

The Fedlet: Real World Examples

The Fedlet: Real World Examples The Fedlet: Real World Examples Sun Iden(ty Management User Group 12 March 2009 Agenda BIT Systems Overview Federal Agency Architecture Iden>ty Federa>on Fedlet Introduc>on Enhancing Fedlet Capabili>es

More information

NFS. CSE/ISE 311: Systems Administra5on

NFS. CSE/ISE 311: Systems Administra5on NFS CSE/ISE 311: Systems Administra5on Sharing files is useful Overview Network file systems give users seamless integra8on of a shared file system with the local file system Many op8ons: NFS, SMB/CIFS,

More information

RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0

RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0 software development simplified RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0 Eric Westfall - Indiana University JASIG 2011 For those who don t know Kuali Rice consists of mul8ple sub-

More information

GPFS Experiences from the Argonne Leadership Computing Facility (ALCF) William (Bill) E. Allcock ALCF Director of Operations

GPFS Experiences from the Argonne Leadership Computing Facility (ALCF) William (Bill) E. Allcock ALCF Director of Operations GPFS Experiences from the Argonne Leadership Computing Facility (ALCF) William (Bill) E. Allcock ALCF Director of Operations Argonne National Laboratory Argonne National Laboratory is located on 1,500

More information

The Role of Privilege in Recent Breaches

The Role of Privilege in Recent Breaches Security Solutions Inc. The Role of Privilege in Recent Breaches Anthony Meyer Regional SE, Canada CyberArk Luc Gagne North America Sales Director IAM Concepts Medical Center About Primary Hospital & Level

More information

Managed So*ware Installa1on with Munki

Managed So*ware Installa1on with Munki Managed So*ware Installa1on with Munki Jon Rhoades St Vincent s Ins1tute & University of Melbourne jrhoades@svi.edu.au Managed Installa1on Why? What are we using now? Needs Installs Updates Apple Updates

More information

VMs at a Tier-1 site. EGEE 09, Sander Klous, Nikhef

VMs at a Tier-1 site. EGEE 09, Sander Klous, Nikhef VMs at a Tier-1 site EGEE 09, 21-09-2009 Sander Klous, Nikhef Contents Introduction Who are we? Motivation Why are we interested in VMs? What are we going to do with VMs? Status How do we approach this

More information

PBXact UC. March

PBXact UC. March PBXact UC March 24 2016 PBXACT UC Launch - Agenda Introduc2on PBXACT Key Features Logis2cs, Pricing, Timing Support Reseller Program Summary 2016 Sangoma Technologies 2 INTRODUCTION 2016 Sangoma Technologies

More information

7 Ways to Increase Your Produc2vity with Revolu2on R Enterprise 3.0. David Smith, REvolu2on Compu2ng

7 Ways to Increase Your Produc2vity with Revolu2on R Enterprise 3.0. David Smith, REvolu2on Compu2ng 7 Ways to Increase Your Produc2vity with Revolu2on R Enterprise 3.0 David Smith, REvolu2on Compu2ng REvolu2on Compu2ng: The R Company REvolu2on R Free, high- performance binary distribu2on of R REvolu2on

More information

How the Cloud is Changing Federated Iden4ty Requirements. Patrick Harding CTO, Ping March 1, 2010

How the Cloud is Changing Federated Iden4ty Requirements. Patrick Harding CTO, Ping March 1, 2010 How the Cloud is Changing Federated Iden4ty Requirements Patrick Harding CTO, Ping Iden3ty @pingcto March 1, 2010 http://www.flickr.com/photos/quinnanya/2690873096/ The Return of Timesharing http://www.flickr.com/photos/quinnanya/2690873096/

More information

Basic UNIX. Jon K. Lærdahl, Structural Bioinforma cs

Basic UNIX. Jon K. Lærdahl, Structural Bioinforma cs Basic UNIX Today s Programme Biological databases Brief introducon What is UNIX? Why should you learn UNIX? Bioinformacs Core Facility Seng up your laptops What about those of you that know Unix and Python

More information

DNSSEC Activities In North America: Comcast

DNSSEC Activities In North America: Comcast DNSSEC Activities In North America: Comcast ICANN 45 October 17, 2012 NATIONAL ENGINEERING & TECHNICAL OPERATIONS DNSSEC Deployment Status We began working on this in 2008 (see 4meline) We completed our

More information

Validating Microsoft Exchange 2010 on Cisco and NetApp FlexPod with the F5 BIG-IP System

Validating Microsoft Exchange 2010 on Cisco and NetApp FlexPod with the F5 BIG-IP System Validating Microsoft Exchange 2010 on Cisco and NetApp FlexPod with the F5 BIG-IP System As enterprises around the globe move to increasingly virtualized environments, they can use a Cisco and NetApp FlexPod

More information

IPv6 in the DREN III acquisi4on. Ron Broersma DREN Chief Engineer

IPv6 in the DREN III acquisi4on. Ron Broersma DREN Chief Engineer IPv6 in the DREN III acquisi4on Ron Broersma DREN Chief Engineer Background DREN = Defense Research and Engineering Network The DoD wide area network (WAN) suppor4ng the R&D, T&E, HPC, M&S, and other communi4es

More information

<Insert Picture Here> Linux: The Journey, Milestones, and What s Ahead Edward Screven, Chief Corporate Architect, Oracle

<Insert Picture Here> Linux: The Journey, Milestones, and What s Ahead Edward Screven, Chief Corporate Architect, Oracle Linux: The Journey, Milestones, and What s Ahead Edward Screven, Chief Corporate Architect, Oracle SAFE HARBOR STATEMENT The following is intended to outline our general product direction.

More information

Singularity in CMS. Over a million containers served

Singularity in CMS. Over a million containers served Singularity in CMS Over a million containers served Introduction The topic of containers is broad - and this is a 15 minute talk! I m filtering out a lot of relevant details, particularly why we are using

More information

UNIFY DATA AT MEMORY SPEED. Haoyuan (HY) Li, Alluxio Inc. VAULT Conference 2017

UNIFY DATA AT MEMORY SPEED. Haoyuan (HY) Li, Alluxio Inc. VAULT Conference 2017 UNIFY DATA AT MEMORY SPEED Haoyuan (HY) Li, CEO @ Alluxio Inc. VAULT Conference 2017 March 2017 HISTORY Started at UC Berkeley AMPLab In Summer 2012 Originally named as Tachyon Rebranded to Alluxio in

More information

Edinburgh (ECDF) Update

Edinburgh (ECDF) Update Edinburgh (ECDF) Update Wahid Bhimji On behalf of the ECDF Team HepSysMan,10 th June 2010 Edinburgh Setup Hardware upgrades Progress in last year Current Issues June-10 Hepsysman Wahid Bhimji - ECDF 1

More information

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Henry Le - Utegra8on Russell Hull - SAP

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Henry Le - Utegra8on Russell Hull - SAP Analyze Big Data Faster and Store it Cheaper Dominick Huang CenterPoint Energy Henry Le - Utegra8on Russell Hull - SAP ABOUT CENTERPOINT ENERGY, INC. Ø Ø Ø Ø Ø Ø Publicly traded on New York Stock Exchange

More information

NERSC Site Update. National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory. Richard Gerber

NERSC Site Update. National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory. Richard Gerber NERSC Site Update National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory Richard Gerber NERSC Senior Science Advisor High Performance Computing Department Head Cori

More information

$ whoami. Carrie Ganote. id Group: NCGAS National Center for Genome Analysis Support

$ whoami. Carrie Ganote. id Group: NCGAS National Center for Genome Analysis Support $ whoami Carrie Ganote $ id Group: NCGAS National Center for Genome Analysis Support Galaxy Deployment on Heterogeneous Hardware Who we are and our compu1ng environment How we set up Galaxy on mul1ple

More information

The OpenNebula Virtual Infrastructure Engine

The OpenNebula Virtual Infrastructure Engine EGEE 4th User Forum/OGF25 & OGF-Europe's 2nd International Event Catania, Italy Thursday 5 th, March 2009 The OpenNebula Virtual Infrastructure Engine Constantino Vázquez Blanco Distributed Systems Architecture

More information

January 2011 Joint ISACA/IIA Mee5ng

January 2011 Joint ISACA/IIA Mee5ng January 2011 Joint ISACA/IIA Mee5ng Panel Discussion - Cloud Compu5ng January 13, 2011 Agenda Learning Objec5ves Introduc5ons Defini5ons Discussion Resource Links Note: Electronic copies of this presenta2on

More information

GPFS for Life Sciences at NERSC

GPFS for Life Sciences at NERSC GPFS for Life Sciences at NERSC A NERSC & JGI collaborative effort Jason Hick, Rei Lee, Ravi Cheema, and Kjiersten Fagnan GPFS User Group meeting May 20, 2015-1 - Overview of Bioinformatics - 2 - A High-level

More information

CloudMan cloud clusters for everyone

CloudMan cloud clusters for everyone CloudMan cloud clusters for everyone Enis Afgan usecloudman.org This is accessibility! But only sometimes So, there are alternatives BUT WHAT IF YOU WANT YOUR OWN, QUICKLY The big picture A. Users in different

More information

EMC ISILON HARDWARE PLATFORM

EMC ISILON HARDWARE PLATFORM EMC ISILON HARDWARE PLATFORM Three flexible product lines that can be combined in a single file system tailored to specific business needs. S-SERIES Purpose-built for highly transactional & IOPSintensive

More information

High Performance and Cloud Computing (HPCC) for Bioinformatics

High Performance and Cloud Computing (HPCC) for Bioinformatics High Performance and Cloud Computing (HPCC) for Bioinformatics King Jordan Georgia Tech January 13, 2016 Adopted From BIOS-ICGEB HPCC for Bioinformatics 1 Outline High performance computing (HPC) Cloud

More information

CISL Operations and Yellowstone Update

CISL Operations and Yellowstone Update CISL Operations and Yellowstone Update CISL HPC Advisory Panel Meeting 17 October 2013 David Hart dhart@ucar.edu Operations and Services Division Computational and Information Systems Laboratory Much happening

More information

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013 Data Replication: Automated move and copy of data PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013 Claudio Cacciari c.cacciari@cineca.it Outline The issue

More information

A scalability comparison study of data management approaches for smart metering systems

A scalability comparison study of data management approaches for smart metering systems A scalability comparison study of data management approaches for smart metering systems Houssem Chihoub, Chris.ne Collet Grenoble INP houssem.chihoub@imag.fr Journées Plateformes Clermont Ferrand 6-7 octobre

More information

Clinical Research Professionals Educa3on Session Barb Greguson Alliance Sta3s3cal and Data Center

Clinical Research Professionals Educa3on Session Barb Greguson Alliance Sta3s3cal and Data Center Clinical Research Professionals Educa3on Session Barb Greguson Alliance Sta3s3cal and Data Center greguson.barbara@mayo.edu Alliance for Clinical Trials in Oncology Fall 2015 Group Mee3ng Presenta3on Objec3ves

More information