Utilizing Container Technology to Streamline Data Science
|
|
- Gabriel Weaver
- 5 years ago
- Views:
Transcription
1 Utilizing Container Technology to Streamline Data Science National Laboratory Information Technology May 22, 2018 Dr. Amanda J. Minnich This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under contract DE-AC52-07NA Lawrence Livermore National Security, LLC
2 Who am I? Staff Research Scientist at LLNL Co Lead of ATOM s Data Driven Modeling Team Data Scientist for ~6 years MS and PhD in Computer Science from the University of New Mexico BA in Integrative Biology from UC Berkeley Docker and Kubernetes User/ Administrator for 10 months 2
3 The Rest of the Team Jonathan Allen Computer Scientist leading the machine learning effort on the ATOM project. Works on all aspects of Docker/ Kubernetes/ Data services Ted Stirm DevOps lead on Docker/ Kubernetes deployment Jeff Mast Developer, provides help with deployments and day today DevOps support Mike Goldman Head of LLNL Data Science Institute and advocate for containerized setup Jim Morton sys admin who set up server They are happy to talk about their specific areas of expertise to anyone interested in deploying a similar system! 3
4 Roadmap Motivation Requirements and Tools Containers (Docker) Kubernetes JupyterLab Data services Conclusion 4
5 MOTIVATION Why are we trying to find a new way to enable data science research at LLNL? 5
6 ATOM Consortium Public private consortium whose goal is to innovate the drug discovery pipeline Founding Members: LLNL GlaxoSmithKline Frederick National Lab (National Cancer Institute) UCSF Sit at a central office in San Francisco Bringing together people with very different backgrounds Unique group that imposes unique needs 6
7 Current drug discovery: long, costly, & high failure Is there a better way to get medicines to patients? Molecules Target Human relevant in vitro, in vivo experiments Demonstration of clinical effect and value Preclinical issues: Average time: 5.5 years 33% of total cost of medicine development Millions of molecules tested, 1000s made, and most fail Clinical success rates still only 12%, indicating poor translation in patients Source: 7
8 Patients are waiting A convergence of catalysts points to now as the time to change drug discovery The Opportunity: Ending Cancer as We KnowIt Whilebold,thegoaloftheCancer Moonshot is within our grasp. Today, society has the benefit of decades of scientific understanding and vast amounts of rich data just waiting to be transformed intosolutions. Science Translational Medicine 12 Aug 2015: Vol. 7, Issue 300, pp. 300ps17 DOI: /scitranslmed.aaa moonshot Mol. Inf. 2016,35, 3 14 The exponential increase of computation power enabled with exascalewill fuel a vast range of breakthroughs and accelerate discoveries in national security, medicine, earth sciences and many other fields. 8
9 ATOM Consortium Accelerating Therapeutics for Opportunities in Medicine Mission To accelerate the development of more effective therapies for patients High performance computing Vision How? Diverse biological data Emerging biotech capabilities A new starting point: Transform drug discovery from a slow, sequential, and high failure process into a rapid, integrated, and patient centric model 3 year goal Integrated precompetitive platform 1 year From target to patient ready in <1 year 9
10 The ATOM platform vision Moonshot goals In silico and complex in vitro models reduce reliance on nonhuman models Empirical testing is only a tool to validate and optimize computational predictions Effects can be predicted at the protein, cell, tumor, tissue, organ and organism level Regulatory files can be supported by in vitro data and in silico predictions 10
11 Accelerated drug discovery concept Vision of ATOM workflow in practice ATOM Workflow design synthesize Released to public after a 1 year member benefit Empiric al Patient specific data and samples input to workflow to develop new therapeutics simulate In Silico active learning assay PK Safety Efficacy Open source models and generated data Therapeutic Candidates Members use workflow for internal drug discovery Models of drug behavior in humans Commercialization by members for patient benefit 11
12 REQUIREMENTS AND TOOLS What are ATOM s needs, and what tools satisfy these needs? 12
13 Reproducibility is key ATOM hopes to convince the FDA to accept in silicomodels as a partial replacement for in vitro(bioassay) and in vivo (animal testing) assays for potential cancer drugs Any erroneous results could have serious effects on people s health This means we need to be able to trace the exact processing and modeling paths and reproduce any and all results Must save train/test dataframes, models, parameter metadata One thing that breaks reproducibility: software updates 13
14 However, data science tools change rapidly 14
15 These releases can contain critical new features 15
16 ATOM s end users Data Science Team: 2 Computer Scientists, 2 Statisticians, 1 Bioinformatician ~10 Computational Chemists/Biologists ~10 Pharmacology Researchers Those with Linux/ HPC experience don t necessarily have data domain knowledge Those with data domain knowledge are not familiar with terminals, slurm, ssh, etc. Team members from GSK rotate out after a year So having an easy to access development server is key 16
17 ATOM s data 300 GB of a variety of bioassay and animal toxicology data on ~2 million compounds from GSK Proprietary or sensitive data must only be stored on approved servers Absolutely no laptop storage Hoping to have other pharma partners join Data may remain sequestered from competing pharma members Data is messy and complex, so we want to access it using data services Also have to store and tag output data from models and simulations (~PBs) 17
18 Prior system at LLNL Current system: Livermore Computing (LC) Uses slurmto manage job deployment onto computing nodes Only access via ssh(no GUI access for non tech folks) No data services provided No sudo privileges Potentially long wait time for installation of new libraries Update of libraries with no preservation of old environments 18
19 Summary of Requirements Requirements Environment consistency Easy software updates Orchestration of resources (including GPUs) Role based access to mounted volumes Intuitive GUI Data services (SQL, nosql, metadata annotation) It was clear the existing setup would not support these requirements, so we had to look elsewhere for solutions. 19
20 Requirements and Tools Requirements Environment consistency Easy software updates Orchestration of resources (including GPUs) Role based access to mounted volumes Intuitive GUI Data services (SQL, nosql, metadata annotation) Tools Containers Containers Kubernetes Kubernetes JupyterLab Containers + RESTful API 20
21 Requirements and Tools Requirements Environment consistency Easy software updates Orchestration of resources (including GPUs) Role based access to mounted volumes Intuitive GUI Data services (SQL, nosql, metadata annotation) Tools Containers Containers Kubernetes Kubernetes JupyterLab Containers + RESTful API 21
22 Features of containers Containers allow you to: Package up code and dependencies together Create new environments with different software versions without needing server admin privileges Version a file system via tags, ensuring traceability of work and reproducibility of results Much more lightweight than VMs: can put lots of them on a single host operating system Underlying philosophy: if it works on my machine it will work on yours. 22
23 Containers vs. Virtual Machines VM Stack Container Stack Traits VM Container Virtualization Hardware Level Operating System Level Base Image Size GBs MBs Boot Times Seconds* <100ms* Resource Allocation Static Dynamic 23
24 What is docker? docker is a an open source project based on Linux containers. Basically acts as helpful tool for packing, shipping, and running applications withinthese containers. Adheres to the Open Container Initiative (OCI), which means it is supported by standard container orchestration tools like Kubernetes Caches image layers, saving huge amounts of time for developers Source: for data science 4901f35d7cf9 24
25 Docker terminology Dockerfile Text document with all the commands you would normally use to create an image Image a template/blueprint for creating containers Container Runtime instance of an image Tags Applied to Docker images in a repository. This is how to distinguish between images in a repository. (version numbers) Repository Collection of docker images with same name but different tags Registry Online system that hosts repositories 25
26 Docker registry We use Portusto host and access our registry Open source authorization service and user interface that hosts our private docker registry and handles user authentication and group management 26
27 Containers are ideal for two of ATOM s requirements 1. Environment consistency: Image tags allow tracking of different software version environments For example: atom app:deepchem1.2, atom app:deepchem1.3, atom app:deepchem Easy software updates: To install a new version of a library, simply update dockerfile, rebuild, and push image Allows installation of new software without having root on the remote server 27
28 Requirements and Tools Requirements Environment consistency Easy software updates Orchestration of resources (including GPUs) Role based access to mounted volumes Intuitive GUI Data services (SQL, nosql, metadata annotation) Tools Containers Containers Kubernetes Kubernetes JupyterLab Containers + RESTful API 28
29 Orchestrating containers Once you have your containers, you need to handle: User access Directory access Resource access Docker compose works for local development, but to coordinate many users and many resources, need a more heavy duty orchestrator We chose to use Kubernetes for this task 29
30 What is Kubernetes? Kubernetes is an open source container orchestrator, used to manage containerized workloads and services Combines simplicity of Platform as a Service (PaaS) with flexibility of Infrastructure as a Service (IaaS) Developed by Google, who has been running containers in production for over 10 years 30
31 We use Kubernetes to: Orchestrate the allocation of compute resources: CPU, GPU, and memory Manage access to container based data services Handle Role Based Access Control (RBAC) to various mounted volumes 31
32 Can mount external directories into container 32
33 kubectl Source: commands cheat sheet/ 33
34 Requirements and Tools Requirements Environment consistency Easy software updates Orchestration of resources (including GPUs) Role based access to mounted volumes Intuitive GUI Data services (SQL, nosql, metadata annotation) Tools Containers Containers Kubernetes Kubernetes JupyterLab Containers + RESTful API 34
35 Intuitive GUI Many of ATOM s team members do not have a background in computing However, they are the domain experts Need to be able to view raw data files, generate visualizations, and compute basic statistics Need access to a variety of Python/R libraries Would like to be able to share code/pages with each other Want it to be accessible through a web page, rather than an ssh session 35
36 JupyterLab JupyterLabrunning out of a container: Provides an interactive GUI via a URL Has access to the notebook environment containing the desired packages Has access to a terminal window to push/pull from Git Has text editor window for editing code or viewing files 36
37 Anaconda simplifies installation of data science libraries Anaconda is a freemium open source distribution of the Python and R programming languages for large scale data processing, predictive analytics, and scientific computing, that aims to simplify package management and deployment. We use Anaconda inside docker containers to package up desired libraries Helps to handle any dependency issues for installation of Python and R libraries 37
38 Bokeh allows for interactive plots using Python 38
39 Requirements and Tools Requirements Environment consistency Easy software updates Orchestration of resources (including GPUs) Role based access to mounted volumes Intuitive GUI Data services (SQL, nosql, metadata annotation) Tools Containers Containers Kubernetes Kubernetes JupyterLab Containers + RESTful API 39
40 Data Services Data services are required to organize both raw data and metadata annotations Also need to save model ready train/test datasets, serialized models, and simulation output Containers enable services to be deployed quickly and easily by the data scientists themselves Have data services that use SQL and nosqldatabases as back ends Accessible via a client that uses a RESTful interface Allow the end user to write database agnostic code 40
41 ATOM s data services In house datastore service Allows for association of complex metadata with any type of file mongodb Used as backend for datastore Initially helpful to have more unstructured database option MySQL Much of the GSK data is in CSV files, which fits well with relational framework Probably will expand to other relational dbs 41
42 Overall structure of data services Backend Services deployed via containers NoSQL SQL Object Store Application Services MLMR, CCR, CCDE, MLMT... Web Metadata Application Services Services Server APIs (secure REST interface) Machine Learning Apps Tensorflow,... Application Client APIs (Python, R, etc.) Large Scale Data Science Apps (HPC) Interactive Data Science Apps (Jupyter/ Browser) 42
43 Datastore Maintain controlled data access through user groups Supports basic versioning of data files Allows for direct reference to files to support data stored on physically separated drives Keyword search for files based on metadata tags Accessible both via online GUI for domain experts and a RESTful API for programmatic access 43
44 Enables metadata annotation and search User creates metadata JSON file describing the contents of the file Supports search and retrieval of data files based on metadata file, but not on contents of the file Provides an alternative to maintaining a giant README and pointers to directories, subfolders, and files Key benefit is storing and tracking programmatically created datasets that tend to lead to large heterogeneous file sets 44
45 Datastore uses buckets and collections Buckets Set access control and can be public or private Project or team based via authorization keys Can prevent downloads for certain buckets Can be thought of as high level directories Collections Group together and organize sets of data objects (or files) Useful when operating on a set of data files 45
46 Requirements and Tools Requirements Environment consistency Easy software updates Orchestration of resources (including GPUs) Role based access to mounted volumes Intuitive GUI Data services (SQL, nosql, metadata annotation) Tools Containers Containers Kubernetes Kubernetes JupyterLab Containers + RESTful API 46
47 CONCLUSION Lessons learned, successes, and future work 47
48 Requirements for success DevOps support Team members can train up, but there will be problems that require deep knowledge and more advanced skills Sys admin support Getting this setup to play nice with a server comes with its own set of challenges Team members willing to spend time acting as tech support Need people to help troubleshoot issues or no one will be willing to adopt this workflow 48
49 What our setup looks like: my perspective Get request via Jira, , Slack, Jabber, or in person Container exists? DEBUG Create configuration files for pod Create Dockerfile DEBUG Create and start pod on remote server as root Build image on build server or laptop TROUBLESHOOT Keep an eye on status and follow up with user Push to registry 49
50 Helpful resources Jira Gives users a central place to report issues and request help Separate development and application pages Confluence Documentation saves you from sending out the same instructions over and over Shell scripts Everyone knows this, but putting time into writing automation scripts is worth the effort 50
51 Challenges Had to learn Docker and Kubernetes We were facing a learning curve ourselves while trying to assist others Did not have dedicated DevOps/sys admin help for the first 6 months Supporting users with different levels of IT skills Other projects share (and occasionally break) the server Have to handle LLNL security concerns (certs are a pain) 51
52 Successes Have the entire ATOM Data Driven Modeling Team working out of containers Can get new team members up and running in a day Have generated ~1k deep learning and random forest models Have ~20 notebooks being shared with data intake, processing, curation, and visualization capabilities Domain experts have begun interacting with data through these notebooks Computational biologist is building a Data Explorer webpage with a mongodb backend Have ~10 domain experts putting data in datastore with curated metadata tags 52
53 Future plans Install kubespawnerfor JupyterHubso team members can launch containers without needing to run kubectl Model zoo: formalized pipeline for reading in model ready datasets and outputting serialized models with hyperparameters used and performance metrics calculated Active learning loop that incorporates the datastore Eventually would like to have data science cluster that is separate from the HPC systems, but can sit inside same network 53
54 Thanks for your time! Questions?
Continuous Integration and Delivery with Spinnaker
White Paper Continuous Integration and Delivery with Spinnaker The field of software functional testing is undergoing a major transformation. What used to be an onerous manual process took a big step forward
More informationImportant DevOps Technologies (3+2+3days) for Deployment
Important DevOps Technologies (3+2+3days) for Deployment DevOps is the blending of tasks performed by a company's application development and systems operations teams. The term DevOps is being used in
More informationContainerization Dockers / Mesospere. Arno Keller HPE
Containerization Dockers / Mesospere Arno Keller HPE What is the Container technology Hypervisor vs. Containers (Huis vs artement) A container doesn't "boot" an OS instead it loads the application and
More informationPython based Data Science on Cray Platforms Rob Vesse, Alex Heye, Mike Ringenburg - Cray Inc C O M P U T E S T O R E A N A L Y Z E
Python based Data Science on Cray Platforms Rob Vesse, Alex Heye, Mike Ringenburg - Cray Inc Overview Supported Technologies Cray PE Python Support Shifter Urika-XC Anaconda Python Spark Intel BigDL machine
More informationAdvanced Continuous Delivery Strategies for Containerized Applications Using DC/OS
Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS ContainerCon @ Open Source Summit North America 2017 Elizabeth K. Joseph @pleia2 1 Elizabeth K. Joseph, Developer Advocate
More informationWHITE PAPER. RedHat OpenShift Container Platform. Benefits: Abstract. 1.1 Introduction
WHITE PAPER RedHat OpenShift Container Platform Abstract Benefits: Applications are designed around smaller independent components called microservices. Elastic resources: Scale up or down quickly and
More informationA DEVOPS STATE OF MIND. Chris Van Tuin Chief Technologist, West
A DEVOPS STATE OF MIND Chris Van Tuin Chief Technologist, West cvantuin@redhat.com THE NEED FOR SPEED THE ACCELERATION OF APPLICATION DELIVERY FOR THE BUSINESS In short, software is eating the world. -
More informationGoing cloud-native with Kubernetes and Pivotal
Going cloud-native with Kubernetes and Pivotal A guide to Pivotal Container Service (PKS) by role Fast, low-risk enterprise-grade Kubernetes has arrived With Pivotal Container Service (PKS), organizations
More informationYOUR APPLICATION S JOURNEY TO THE CLOUD. What s the best way to get cloud native capabilities for your existing applications?
YOUR APPLICATION S JOURNEY TO THE CLOUD What s the best way to get cloud native capabilities for your existing applications? Introduction Moving applications to cloud is a priority for many IT organizations.
More informationA DEVOPS STATE OF MIND. Chris Van Tuin Chief Technologist, West
A DEVOPS STATE OF MIND Chris Van Tuin Chief Technologist, West cvantuin@redhat.com In short, software is eating the world. - Marc Andreessen, Wall Street Journal, August 2011 UBER, LYFT FALLOUT: TAXI
More informationDocker and Oracle Everything You Wanted To Know
Docker and Oracle Everything You Wanted To Know June, 2017 Umesh Tanna Principal Technology Sales Consultant Oracle Sales Consulting Centers(SCC) Bangalore Safe Harbor Statement The following is intended
More informationContinuous Integration and Deployment (CI/CD)
WHITEPAPER OCT 2015 Table of contents Chapter 1. Introduction... 3 Chapter 2. Continuous Integration... 4 Chapter 3. Continuous Deployment... 6 2 Chapter 1: Introduction Apcera Support Team October 2015
More informationTitle DC Automation: It s a MARVEL!
Title DC Automation: It s a MARVEL! Name Nikos D. Anagnostatos Position Network Consultant, Network Solutions Division Classification ISO 27001: Public Data Center Evolution 2 Space Hellas - All Rights
More informationRed Hat Atomic Details Dockah, Dockah, Dockah! Containerization as a shift of paradigm for the GNU/Linux OS
Red Hat Atomic Details Dockah, Dockah, Dockah! Containerization as a shift of paradigm for the GNU/Linux OS Daniel Riek Sr. Director Systems Design & Engineering In the beginning there was Stow... and
More informationAzure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region
Azure DevOps Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region What is DevOps? People. Process. Products. Build & Test Deploy DevOps is the union of people, process, and products to
More informationAzure Development Course
Azure Development Course About This Course This section provides a brief description of the course, audience, suggested prerequisites, and course objectives. COURSE DESCRIPTION This course is intended
More informationCisco CloudCenter Use Case Summary
Cisco CloudCenter Use Case Summary Overview IT organizations often use multiple clouds to match the best application and infrastructure services with their business needs. It makes sense to have the freedom
More informationFuncX: A Function Serving Platform for HPC. Ryan Chard 28 Jan 2019
FuncX: A Function Serving Platform for HPC Ryan Chard 28 Jan 2019 Outline - Motivation FuncX: FaaS for HPC Implementation status Preliminary applications - Machine learning inference Automating analysis
More information[Docker] Containerization
[Docker] Containerization ABCD-LMA Working Group Will Kinard October 12, 2017 WILL Kinard Infrastructure Architect Software Developer Startup Venture IC Husband Father Clemson University That s me. 2 The
More informationBright Cluster Manager: Using the NVIDIA NGC Deep Learning Containers
Bright Cluster Manager: Using the NVIDIA NGC Deep Learning Containers Technical White Paper Table of Contents Pre-requisites...1 Setup...2 Run PyTorch in Kubernetes...3 Run PyTorch in Singularity...4 Run
More informationFIVE REASONS YOU SHOULD RUN CONTAINERS ON BARE METAL, NOT VMS
WHITE PAPER FIVE REASONS YOU SHOULD RUN CONTAINERS ON BARE METAL, NOT VMS Over the past 15 years, server virtualization has become the preferred method of application deployment in the enterprise datacenter.
More informationLSST software stack and deployment on other architectures. William O Mullane for Andy Connolly with material from Owen Boberg
LSST software stack and deployment on other architectures William O Mullane for Andy Connolly with material from Owen Boberg Containers and Docker Packaged piece of software with complete file system it
More informationD360: Unlock the value of your scientific data Solving Informatics Problems for Translational Research
D360: Unlock the value of your scientific data Solving Informatics Problems for Translational Research Dr. Fabian Bös, Senior Application Scientist Certara Spain SL Martin-Kollar-Str. 17, 81829 Munich
More informationThe Materials Data Facility
The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials
More informationEASILY DEPLOY AND SCALE KUBERNETES WITH RANCHER
EASILY DEPLOY AND SCALE KUBERNETES WITH RANCHER 2 WHY KUBERNETES? Kubernetes is an open-source container orchestrator for deploying and managing containerized applications. Building on 15 years of experience
More informationLab 8: Building Microservices on Oracle Autonomous Transaction Processing Service
Lab 8: Building Microservices on Oracle Autonomous Transaction Processing Service Containers allow us to package apps along with all their dependencies and provide a lightweight run time environment that
More informationDevOps Workflow. From 0 to kube in 60 min. Christian Kniep, v Technical Account Manager, Docker Inc.
DevOps Workflow From 0 to kube in 60 min http://qnib.org/devops-workflow Christian Kniep, v2018-02-20 Technical Account Manager, Docker Inc. Motivation Iteration barriers Works on my Laptop! Why is DevOps
More informationGetting Started With Containers
DEVNET 2042 Getting Started With Containers Matt Johnson Developer Evangelist @mattdashj Cisco Spark How Questions? Use Cisco Spark to communicate with the speaker after the session 1. Find this session
More informationDevOps Course Content
DevOps Course Content 1. Introduction: Understanding Development Development SDLC using WaterFall & Agile Understanding Operations DevOps to the rescue What is DevOps DevOps SDLC Continuous Delivery model
More informationUsing DC/OS for Continuous Delivery
Using DC/OS for Continuous Delivery DevPulseCon 2017 Elizabeth K. Joseph, @pleia2 Mesosphere 1 Elizabeth K. Joseph, Developer Advocate, Mesosphere 15+ years working in open source communities 10+ years
More informationDevOps Technologies. for Deployment
DevOps Technologies for Deployment DevOps is the blending of tasks performed by a company's application development and systems operations teams. The term DevOps is being used in several ways. In its most
More informationRunning MarkLogic in Containers (Both Docker and Kubernetes)
Running MarkLogic in Containers (Both Docker and Kubernetes) Emma Liu Product Manager, MarkLogic Vitaly Korolev Staff QA Engineer, MarkLogic @vitaly_korolev 4 June 2018 MARKLOGIC CORPORATION Source: http://turnoff.us/image/en/tech-adoption.png
More informationRed Hat OpenShift Roadmap Q4 CY16 and H1 CY17 Releases. Lutz Lange Solution
Red Hat OpenShift Roadmap Q4 CY16 and H1 CY17 Releases Lutz Lange Solution Architect @AtomicContainer OpenShift Roadmap OpenShift Container Platform 3.2 Kubernetes 1.2 & Docker 1.9
More informationSunil Shah SECURE, FLEXIBLE CONTINUOUS DELIVERY PIPELINES WITH GITLAB AND DC/OS Mesosphere, Inc. All Rights Reserved.
Sunil Shah SECURE, FLEXIBLE CONTINUOUS DELIVERY PIPELINES WITH GITLAB AND DC/OS 1 Introduction MOBILE, SOCIAL & CLOUD ARE RAISING CUSTOMER EXPECTATIONS We need a way to deliver software so fast that our
More informationThe Path to GPU as a Service in Kubernetes Renaud Gaubert Lead Kubernetes Engineer
The Path to GPU as a Service in Kubernetes Renaud Gaubert , Lead Kubernetes Engineer May 03, 2018 RUNNING A GPU APPLICATION Customers using DL DL Application RHEL 7.3 CUDA 8.0 Driver 375
More information2018 Cisco and/or its affiliates. All rights reserved. Cisco Public
Cisco ACI App Center Fabrice Servais, Software Engineer, Data Center Networking, Cisco Cisco Spark How Questions? Use Cisco Spark to communicate with the speaker after the session 1. Find this session
More informationThink Small to Scale Big
Think Small to Scale Big Intro to Containers for the Datacenter Admin Pete Zerger Principal Program Manager, MVP pete.zerger@cireson.com Cireson Lee Berg Blog, e-mail address, title Company Pete Zerger
More informationDisclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme
CNA1612BU Deploying real-world workloads on Kubernetes and Pivotal Cloud Foundry VMworld 2017 Fred Melo, Director of Technology, Pivotal Merlin Glynn, Sr. Technical Product Manager, VMware Content: Not
More informationOnto Petaflops with Kubernetes
Onto Petaflops with Kubernetes Vishnu Kannan Google Inc. vishh@google.com Key Takeaways Kubernetes can manage hardware accelerators at Scale Kubernetes provides a playground for ML ML journey with Kubernetes
More informationChoosing the Right Container Infrastructure for Your Organization
WHITE PAPER Choosing the Right Container Infrastructure for Your Organization Container adoption is accelerating rapidly. Gartner predicts that by 2018 more than 50% of new workloads will be deployed into
More informationRENKU - Reproduce, Reuse, Recycle Research. Rok Roškar and the SDSC Renku team
RENKU - Reproduce, Reuse, Recycle Research Rok Roškar and the SDSC Renku team Renku-Reana workshop @ CERN 26.06.2018 Goals of Renku 1. Provide the means to create reproducible data science 2. Facilitate
More information/ Cloud Computing. Recitation 5 February 14th, 2017
15-319 / 15-619 Cloud Computing Recitation 5 February 14th, 2017 1 Overview Administrative issues Office Hours, Piazza guidelines Last week s reflection Project 2.1, OLI Unit 2 modules 5 and 6 This week
More informationBuild Scientific Computing Infrastructure with Rebar3 and Docker. Eric Sage
Build Scientific Computing Infrastructure with Rebar3 and Docker Eric Sage A scientific telecommunications network Hello, I d like an automated gene ontology please! Agenda - An example biological service
More informationDeep Learning mit PowerAI - Ein Überblick
Stephen Lutz Deep Learning mit PowerAI - Open Group Master Certified IT Specialist Technical Sales IBM Cognitive Infrastructure IBM Germany Ein Überblick Stephen.Lutz@de.ibm.com What s that? and what s
More information20532D: Developing Microsoft Azure Solutions
20532D: Developing Microsoft Azure Solutions Course Details Course Code: Duration: Notes: 20532D 5 days Elements of this syllabus are subject to change. About this course This course is intended for students
More informationovirt and Docker Integration
ovirt and Docker Integration October 2014 Federico Simoncelli Principal Software Engineer Red Hat 1 Agenda Deploying an Application (Old-Fashion and Docker) Ecosystem: Kubernetes and Project Atomic Current
More informationFROM VSTS TO AZURE DEVOPS
#DOH18 FROM VSTS TO AZURE DEVOPS People. Process. Products. Gaetano Paternò @tanopaterno info@gaetanopaterno.it 2 VSTS #DOH18 3 Azure DevOps Azure Boards (ex Work) Deliver value to your users faster using
More informationNVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI
NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI Overview Unparalleled Value Product Portfolio Software Platform From Desk to Data Center to Cloud Summary AI researchers depend on computing performance to gain
More informationWelcome to Docker Birthday # Docker Birthday events (list available at Docker.Party) RSVPs 600 mentors Big thanks to our global partners:
Docker Birthday #3 Welcome to Docker Birthday #3 2 120 Docker Birthday events (list available at Docker.Party) 7000+ RSVPs 600 mentors Big thanks to our global partners: Travel Planet 24 e-food.gr The
More informationS INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS
S8497 - INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS Chris Lamb CUDA and NGC Engineering, NVIDIA John Barco NGC Product Management, NVIDIA NVIDIA GPU Cloud (NGC) overview AGENDA Using NGC
More informationCONTAINERS AND MICROSERVICES WITH CONTRAIL
CONTAINERS AND MICROSERVICES WITH CONTRAIL Scott Sneddon Sree Sarva DP Ayyadevara Sr. Director Sr. Director Director Cloud and SDN Contrail Solutions Product Line Management This statement of direction
More informationTECHNICAL BRIEF. Scheduling and Orchestration of Heterogeneous Docker-Based IT Landscapes. January 2017 Version 2.0 For Public Use
TECHNICAL BRIEF Scheduling and Orchestration of Heterogeneous Docker-Based IT Landscapes January 2017 Version 2.0 For Public Use Table of Contents 1 Summary... 2 2 Introduction... 2 3 Stonebranch DevOps
More informationBringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security
Bringing OpenStack to the Enterprise An enterprise-class solution ensures you get the required performance, reliability, and security INTRODUCTION Organizations today frequently need to quickly get systems
More informationAmir Zipory Senior Solutions Architect, Redhat Israel, Greece & Cyprus
Amir Zipory Senior Solutions Architect, Redhat Israel, Greece & Cyprus amirz@redhat.com TODAY'S IT CHALLENGES IT is under tremendous pressure from the organization to enable growth Need to accelerate,
More informationKardia / Centrallix VM Appliance Quick Reference
Kardia / Centrallix VM Appliance Quick Reference Version 1.0 Beta 2 15-Mar-2011 (c) 2011 LightSys Technology Services, Inc. http://www.lightsys.org/ Redeeming Technology... For God's Kingdom. Overview...
More informationUsing and Developing with Azure. Joshua Drew
Using and Developing with Azure Joshua Drew Visual Studio Microsoft Azure X-Plat ASP.NET Visual Studio - Every App Our vision Every App Every Developer .NET and mobile development Desktop apps - WPF Universal
More informationRED HAT OPENSHIFT A FOUNDATION FOR SUCCESSFUL DIGITAL TRANSFORMATION
RED HAT OPENSHIFT A FOUNDATION FOR SUCCESSFUL DIGITAL TRANSFORMATION Stephanos D Bacon Product Portfolio Strategy, Application Platforms Stockholm, 13 September 2017 1 THE PATH TO DIGITAL LEADERSHIP IT
More informationContainer 2.0. Container: check! But what about persistent data, big data or fast data?!
@unterstein @joerg_schad @dcos @jaxdevops Container 2.0 Container: check! But what about persistent data, big data or fast data?! 1 Jörg Schad Distributed Systems Engineer @joerg_schad Johannes Unterstein
More informationGo Faster: Containers, Platforms and the Path to Better Software Development (Including Live Demo)
RED HAT DAYS VANCOUVER Go Faster: Containers, Platforms and the Path to Better Software Development (Including Live Demo) Paul Armstrong Principal Solutions Architect Gerald Nunn Senior Middleware Solutions
More informationContainer-Native Storage
Container-Native Storage Solving the Persistent Storage Challenge with GlusterFS Michael Adam Manager, Software Engineering José A. Rivera Senior Software Engineer 2017.09.11 WARNING The following presentation
More informationTurning Data Science into a reality with TIBCO Spotfire
Turning Data Science into a reality with TIBCO Spotfire Eduardo Gonzalez-Couto, Ph.D. Product Manager, PerkinElmer Informatics Basel, 3 rd November 2016 Safe Harbor Statement This document shows current
More informationTEN LAYERS OF CONTAINER SECURITY
TEN LAYERS OF CONTAINER SECURITY Tim Hunt Kirsten Newcomer May 2017 ABOUT YOU Are you using containers? What s your role? Security professionals Developers / Architects Infrastructure / Ops Who considers
More informationFusion Registry 9 SDMX Data and Metadata Management System
Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance
More informationTest Automation with Jenkins & TidalScale WaveRunner
Test Automation with Jenkins & TidalScale WaveRunner Software-Defined Servers Test Automation with Jenkins & TidalScale WaveRunner iv Contents 1 Test Automation with Jenkins & TidalScale WaveRunner Overview
More informationIntroduction to BioHPC
Introduction to BioHPC New User Training [web] [email] portal.biohpc.swmed.edu biohpc-help@utsouthwestern.edu 1 Updated for 2018-03-07 Overview Today we re going to cover: What is BioHPC? How do I access
More informationContainer in Production : Openshift 구축사례로 이해하는 PaaS. Jongjin Lim Specialist Solution Architect, AppDev
Container in Production : Openshift 구축사례로 이해하는 PaaS Jongjin Lim Specialist Solution Architect, AppDev jonlim@redhat.com Agenda Why Containers? Solution : Red Hat Openshift Container Platform Enterprise
More informationSTATE OF MODERN APPLICATIONS IN THE CLOUD
STATE OF MODERN APPLICATIONS IN THE CLOUD 2017 Introduction The Rise of Modern Applications What is the Modern Application? Today s leading enterprises are striving to deliver high performance, highly
More informationLINUX CONTAINERS. Where Enterprise Meets Embedded Operating Environments WHEN IT MATTERS, IT RUNS ON WIND RIVER
Where Enterprise Meets Embedded Operating Environments WHEN IT MATTERS, IT RUNS ON WIND RIVER EXECUTIVE SUMMARY Flexible and connected platforms are core components in leading computing fields, including
More informationWalkthrough OCCAM. Be on the lookout for this fellow: The callouts are ACTIONs for you to do!
Walkthrough OCCAM Be on the lookout for this fellow: The callouts are ACTIONs for you to do! When you see the check mark, compare your work to the marked element Objectives In this presentation you ll
More informationContainerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the Cloud SUBBU RAMA FUTURE OF PACKAGING APPLICATIONS Turns Discrete Computing Resources into a Virtual Supercomputer GPU Mem Mem GPU GPU Mem
More informationTechno Expert Solutions
Course Content of Microsoft Windows Azzure Developer: Course Outline Module 1: Overview of the Microsoft Azure Platform Microsoft Azure provides a collection of services that you can use as building blocks
More informationOpenshift: Key to modern DevOps
Azure days 28/02/2018 Openshift: Key to modern DevOps Jiří Kolář Solution Architect CZ/SK/CEE jkolar@redhat.com PROBLEM: DEVELOPERS I.T. OPERATIONS THE SOLUTION DEVELOPERS I.T. OPERATIONS GENERAL DISTRIBUTION
More informationData Centers and Cloud Computing
Data Centers and Cloud Computing CS677 Guest Lecture Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet
More informationKuberiter White Paper. Kubernetes. Cloud Provider Comparison Chart. Lawrence Manickam Kuberiter Inc
Kuberiter White Paper Kubernetes Cloud Provider Comparison Chart Lawrence Manickam Kuberiter Inc Oct 2018 Executive Summary Kubernetes (K8S) has become the de facto standard for Cloud Application Deployments.
More informationAccelerate at DevOps Speed With Openshift v3. Alessandro Vozza & Samuel Terburg Red Hat
Accelerate at DevOps Speed With Openshift v3 Alessandro Vozza & Samuel Terburg Red Hat IT (R)Evolution Red Hat Brings It All Together What is Kubernetes Open source container cluster manager Inspired by
More informationData Centers and Cloud Computing. Slides courtesy of Tim Wood
Data Centers and Cloud Computing Slides courtesy of Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet
More informationHybrid IT Managed Services
Hybrid IT Managed Services Working together on your digital journey Welcome to the new world of Hybrid IT Managed Services. Digital technology is rapidly changing and improving our world. As your organisation
More informationPowering Knowledge Discovery. Insights from big data with Linguamatics I2E
Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural
More informationHow to Keep UP Through Digital Transformation with Next-Generation App Development
How to Keep UP Through Digital Transformation with Next-Generation App Development Peter Sjoberg Jon Olby A Look Back, A Look Forward Dedicated, data structure dependent, inefficient, virtualized Infrastructure
More informationDisclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme
STO1794BE Evolution of vsan Vijay Ramachandran Sr.Director, Product Management Christos Karamanolis VMware Fellow & CTO of Storage & Availability #VMworld #STO1794BE Disclaimer This presentation may contain
More informationVerteego VDS Documentation
Verteego VDS Documentation Release 1.0 Verteego May 31, 2017 Installation 1 Getting started 3 2 Ansible 5 2.1 1. Install Ansible............................................. 5 2.2 2. Clone installation
More informationDevOps Tooling from AWS
DevOps Tooling from AWS What is DevOps? Improved Collaboration - the dropping of silos between teams allows greater collaboration and understanding of how the application is built and deployed. This allows
More informationBuilding innovative drug discovery alliances. Migrating to ChemAxon
Building innovative drug discovery alliances Migrating to ChemAxon Evotec AG, Migrating to ChemAxon, May 2011 Agenda Evotec Why migrate? Searching for Library Enumeration Replacement Migrating a small
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationCloud Computing with APL. Morten Kromberg, CXO, Dyalog
Cloud Computing with APL Morten Kromberg, CXO, Dyalog Cloud Computing Docker with APL Morten Kromberg, CXO, Dyalog 2 Cloud Computing: Definitions 3 Cloud Computing: Definitions Cloud Computing = "Using
More informationMicroservices. Chaos Kontrolle mit Kubernetes. Robert Kubis - Developer Advocate,
Microservices Chaos Kontrolle mit Kubernetes Robert Kubis - Developer Advocate, Google @hostirosti About me Robert Kubis Developer Advocate Google Cloud Platform London, UK hostirosti github.com/hostirosti
More informationTHE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel
THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel National Center for Supercomputing Applications University of Illinois
More informationIndustry-leading Application PaaS Platform
Industry-leading Application PaaS Platform Solutions Transactional Apps Digital Marketing LoB App Modernization Services Web Apps Web App for Containers API Apps Mobile Apps IDE Enterprise Integration
More informationData Centers and Cloud Computing. Data Centers
Data Centers and Cloud Computing Slides courtesy of Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet
More informationSão Paulo. August,
São Paulo August, 28 2018 Going Cloud Native with Cloud Foundry Luis Macedo Sr Platform Engineer, Pivotal @luis0macedo What is Cloud Native Platform? - A platform that delivers predictable deployment
More informationInstallation and setup guide of 1.1 demonstrator
Installation and setup guide of 1.1 demonstrator version 2.0, last modified: 2015-09-23 This document explains how to set up the INAETICS demonstrator. For this, we use a Vagrant-based setup that boots
More informationContainers, Serverless and Functions in a nutshell. Eugene Fedorenko
Containers, Serverless and Functions in a nutshell Eugene Fedorenko About me Eugene Fedorenko Senior Architect Flexagon adfpractice-fedor.blogspot.com @fisbudo Agenda Containers Microservices Docker Kubernetes
More informationSBB. Java User Group 27.9 & Tobias Denzler, Philipp Oser
OpenShift @ SBB Java User Group 27.9 & 25.10.17 Tobias Denzler, Philipp Oser Who we are Tobias Denzler Software Engineer at SBB IT Java & OpenShift enthusiast @tobiasdenzler Philipp Oser Architect at ELCA
More informationAllowing Users to Run Services at the OLCF with Kubernetes
Allowing Users to Run Services at the OLCF with Kubernetes Jason Kincl Senior HPC Systems Engineer Ryan Adamson Senior HPC Security Engineer This work was supported by the Oak Ridge Leadership Computing
More informationIndex. Bessel function, 51 Big data, 1. Cloud-based version-control system, 226 Containerization, 30 application, 32 virtualize processes, 30 31
Index A Amazon Web Services (AWS), 2 account creation, 2 EC2 instance creation, 9 Docker, 13 IP address, 12 key pair, 12 launch button, 11 security group, 11 stable Ubuntu server, 9 t2.micro type, 9 10
More informationInvestigating Containers for Future Services and User Application Support
Investigating Containers for Future Services and User Application Support JLAB CNI NLIT 2018 () Overview JLAB scope What is a container? Why are we interested? Platform-as-a-Service (PaaS) for orchestration
More informationebook ADVANCED LOAD BALANCING IN THE CLOUD 5 WAYS TO SIMPLIFY THE CHAOS
ebook ADVANCED LOAD BALANCING IN THE CLOUD 5 WAYS TO SIMPLIFY THE CHAOS Introduction Load balancing isn t just about managing traffic anymore. As your infrastructure expands to include applications in
More informationHOW TO STAND OUT IN DEVOPS
HOW TO STAND OUT IN DEVOPS Exam Guide for Linux Professional Institute DevOps Tools Engineer Certification www.lpi.org Contents 1. Why you need DevOps Tools certification 3 2. The type of experience you
More informationCon$nuous Deployment with Docker Andrew Aslinger. Oct
Con$nuous Deployment with Docker Andrew Aslinger Oct 9. 2014 Who is Andrew #1 So#ware / Systems Architect for OpenWhere Passion for UX, Big Data, and Cloud/DevOps Previously Designed and Implemented automated
More informationTaming the Multi-Cloud With Simplicity and Openness. Minh Dang Cisco Systems Vietnam 2018 January
Taming the Multi-Cloud With Simplicity and Openness Minh Dang Cisco Systems Vietnam 2018 January Multi-Cloud Challenge Growth in Applications and Infrastructure Many Operating Env COST Data Center Many
More information