Galaxy: a community-driven platform for accessible, transparent, and reproducible data science
- Conrad Hoover
- 5 years ago
Transcription
1 Galaxy: a community-driven platform for accessible, transparent, and reproducible data science / #usegalaxy
2 A continuing crisis in genomics research: reproducibility
3 What is reproducibility? (for computational analyses) Reproducibility is not provenance, reusability/generalizability, or correctness. Reproducibility means that an analysis is described/captured in sufficient detail that it can be precisely reproduced (given the data). Yet most published analyses are not reproducible (see e.g. Ioannidis et al., /18 microarray experiments reproducible; Nekrutenko and Taylor 2012, 7/50 resequencing experiments reproducible). Commonly missing: software, versions, parameters, data.
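The four details that slide 3 lists as commonly missing (software, versions, parameters, data) can be captured in a small record per analysis step. The following is a minimal sketch in Python, not Galaxy's actual data model; `AnalysisStep`, `checksum`, `missing_details`, and all field names are hypothetical and chosen only for illustration.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class AnalysisStep:
    """Hypothetical record of one analysis step (not Galaxy's data model)."""
    software: str = ""
    version: str = ""
    # None means "not recorded"; an empty dict means "tool takes no parameters".
    parameters: Optional[dict] = None
    # Checksums identify the exact input data that was used.
    input_checksums: list = field(default_factory=list)


def checksum(data: bytes) -> str:
    """Return a SHA-256 hex digest so an input can be identified exactly."""
    return hashlib.sha256(data).hexdigest()


def missing_details(step: AnalysisStep) -> list:
    """List which of the four details are absent from the record."""
    missing = []
    if not step.software:
        missing.append("software")
    if not step.version:
        missing.append("version")
    if step.parameters is None:
        missing.append("parameters")
    if not step.input_checksums:
        missing.append("data")
    return missing
```

A step for which `missing_details` returns an empty list has at least the minimum information needed to attempt a rerun; it says nothing about whether the result is correct, which matches the distinction the slide draws.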
4 Galaxy: accessible analysis system
5 A free (for everyone) web service integrating a wealth of tools, compute resources, terabytes of reference data, and permanent storage. Open source software that makes integrating your own tools and data and customizing for your own site simple. An open, extensible platform for sharing tools, datatypes, workflows, ...
6 Galaxy's ideological goals: How best can data-intensive methods be made accessible to scientists? How best to facilitate transparent communication of computational analyses? How best to ensure that analyses are reproducible?
7 Galaxy's practical goals: How to arm researchers with access to powerful compute and the latest tools? How to build a community of tool developers? How to run Galaxy on any HPC?
8 Describe analysis tool behavior abstractly. Workflow system for complex analysis, constructed explicitly or automatically. Analysis environment automatically and transparently tracks details. Pervasive sharing, and publication of documents with integrated analysis.
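To make "describe analysis tool behavior abstractly" concrete: a tool can be represented as data (a command template plus declared parameters), from which the framework builds the actual command line. The sketch below is an illustration in Python under that assumption; it is not Galaxy's tool configuration format, and `TOOL`, `filter_tool`, and `build_command` are hypothetical names.

```python
import shlex
from string import Template

# Hypothetical tool description: a command template plus typed parameters.
TOOL = {
    "id": "example_filter",
    "version": "0.1",
    "command": "filter_tool --min-quality $min_quality --input $input --output $output",
    "params": {
        "min_quality": {"type": int, "default": 20},
        "input": {"type": str},
        "output": {"type": str},
    },
}


def build_command(tool: dict, values: dict) -> list:
    """Validate user values against the description and return an argv list."""
    resolved = {}
    for name, spec in tool["params"].items():
        if name in values:
            raw = values[name]
        elif "default" in spec:
            raw = spec["default"]
        else:
            raise ValueError(f"missing required parameter: {name}")
        # Coerce to the declared type, then quote so spaces survive splitting.
        resolved[name] = shlex.quote(str(spec["type"](raw)))
    return shlex.split(Template(tool["command"]).substitute(resolved))
```

Because the description is plain data, the same record can drive a generated web form, be stored alongside results to track exactly what was run, and be reused as a workflow step.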
9 Visualization and visual analytics
10 To grow Galaxy we needed to actively engage the community. Users: support the tools and workflows they want; engage in outreach and training; provide infrastructure to support communication (e.g. Biostar). Developers: make it easy to integrate new tools and to run and test locally; be as flexible as possible; avoid anything specific to a given scientific domain. Great value in being able to run anywhere: not building a gateway but a framework to build gateways.
11 The Galaxy ecosystem
12 More than 70 known public Galaxy servers. 15+ general servers. Domain-specific servers including: Ballaxy for structure-based computational biology; Cistrome for regulatory sequence analysis; Genomic Hyperbrowser for statistical integration of genomic data; GigaGalaxy for integrating workflows published in GigaScience; Pathogen Portal for comparative analysis of host response to pathogens; ... Dozens of large-scale private Galaxy instances.
13
14
15
16
17 Ways to use Galaxy: the public web service; install locally with many compute environments; deploy on a cloud using CloudMan, Atmosphere.
18 Galaxy can scale: for example, Galaxy main uses dedicated and shared resources. Systems: Galaxy Cluster (256 cores, 2 TB memory); Rodeo (128 cores, 1 TB memory); Corral/Stockyard (20 PB disk); Stampede (462,462 cores, 205 TB memory); Blacklight (4,096 cores, 32 TB memory); Trestles (10,368 cores, 20.7 TB memory). Sites: SDSC, San Diego; PSC, Pittsburgh; TACC, Austin. (Nate Coraor)
19 Bringing it all together: automate all the things! Unified Ansible playbook for Galaxy main, cloud, and local deployments.
20 Collaboration among Galaxy instances
21 Diagram labels: Galaxies on private clouds; Galaxies on public clouds; Galaxy Tool Shed; private Tool Sheds; private Galaxy installations. (Greg von Kuster, Dave Bouvier)
22 Repositories are owned by the contributor and can contain tools, workflows, etc. Backed by version control: a complete version history is retained for everything that passes through the Tool Shed. Galaxy instance admins can install tools directly from the Tool Shed using only a web UI. Support for recipes for installing the underlying software that tools depend on (also versioned).
23
24 Command-line tools to support tool developers: Test tools quickly without worrying about configuration files. Check tools for common bugs and best practices. Optimized publishing to the Tool Shed. Testbed for new dependency management: Homebrew and Homebrew-science.
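A check for "common bugs and best practices" can be pictured as a function that inspects a tool description and returns warnings. This is only a sketch in Python of the idea: the dictionary layout, the specific rules, and the name `lint_tool` are assumptions for illustration and do not reproduce the checks performed by the Galaxy project's own command-line tools.

```python
def lint_tool(tool: dict) -> list:
    """Return human-readable warnings for a hypothetical tool description."""
    warnings = []
    # Required metadata: without these a run cannot be reproduced later.
    for key in ("id", "version", "command"):
        if not tool.get(key):
            warnings.append(f"missing '{key}'")
    # Best practices: tools should ship tests and citations.
    if not tool.get("tests"):
        warnings.append("no tests defined")
    if not tool.get("citations"):
        warnings.append("no citations defined")
    # A common bug: identifiers containing whitespace.
    if " " in str(tool.get("id") or ""):
        warnings.append("id contains whitespace")
    return warnings
```

Returning a list of messages, rather than raising on the first problem, lets a developer see every issue in one pass before publishing.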
25 git[hub] centric development workflow
26 Tool citations, credit, and incentivization: embed DOIs in the tool configuration; Galaxy resolves them and provides a list of citations, with links, which can be exported for reference managers.
27 Summary: Galaxy is an (obsessively) open framework for making data analysis accessible and reproducible. Nearly everything in Galaxy is pluggable, allowing it to be customized for myriad purposes. By supporting and leveraging developers, the Galaxy community can collectively keep up with rapid changes in available tools.
28 The Core Galaxy Team. Engineering: Enis Afgan, Dannon Baker, Dan Blankenberg, Dave Bouvier, Nate Coraor, Martin Čech, John Chilton, Carl Eberhard, Sam Guerler, Nitesh Turaga. Support and outreach / Custodians: Dave Clements, Jennifer Jackson, James Taylor, Anton Nekrutenko, Jeremy Goecks. Supported by the NHGRI (HG005542, HG004909, HG005133, HG006620), NSF (DBI ), Penn State University, Johns Hopkins University, and the Pennsylvania Department of Public Health.
29 Extended team and other contributors: Björn Grüning (Uni Freiburg), Peter Cock (TJHI), Kyle Ellrott (UCSC), Eric Rasche (CPT), Nicola Soranzo (TGAC), Brad Chapman (HSPH), Nuwan Goonasekera (VeRSI), Yousef Kowsar (VLSCI). And many others who have contributed to the main Galaxy code, contributed tools to the Tool Shed, participated in discussions, attended the Galaxy conferences,
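The citation flow on slide 26 (DOIs embedded in a tool, turned into linked entries, exported for reference managers) can be sketched as follows. This is a simplified illustration in Python, not Galaxy's implementation: it performs no network lookup, so each entry carries only the DOI and its resolver link rather than full metadata. The function names are hypothetical, and the DOI in the usage note is a placeholder, not a real reference.

```python
def doi_to_url(doi: str) -> str:
    """Turn a bare DOI into its resolver link."""
    return "https://doi.org/" + doi.strip()


def to_bibtex_stub(key: str, doi: str) -> str:
    """Emit a minimal BibTeX entry carrying only the DOI and its link."""
    return (
        f"@misc{{{key},\n"
        f"  doi = {{{doi.strip()}}},\n"
        f"  url = {{{doi_to_url(doi)}}}\n"
        f"}}"
    )


def export_citations(tool_id: str, dois: list) -> str:
    """Build one BibTeX stub per DOI, keyed by tool id and position."""
    entries = [
        to_bibtex_stub(f"{tool_id}_{i}", doi)
        for i, doi in enumerate(dois, start=1)
    ]
    return "\n\n".join(entries)
```

For example, `export_citations("example_tool", ["10.1000/example.1"])` yields a single `@misc` entry keyed `example_tool_1`, which a reference manager that accepts BibTeX could import.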