Virtual Appliances and Education in FutureGrid. Dr. Renato Figueiredo ACIS Lab - University of Florida

Similar documents
HPC learning using Cloud infrastructure

Autonomic Condor Clouds. David Wolinsky ACIS P2P Group University of Florida

Large Scale Sky Computing Applications with Nimbus

Sky Computing on FutureGrid and Grid 5000 with Nimbus. Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France

Clouds: An Opportunity for Scientific Applications?

Building Effective CyberGIS: FutureGrid. Marlon Pierce, Geoffrey Fox Indiana University

Azure MapReduce. Thilina Gunarathne Salsa group, Indiana University

CINET Collaboration Report

Survey on Cloud Infrastructure Service: OpenStack Compute

COP Cloud Computing. Presented by: Sanketh Beerabbi University of Central Florida

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

Andrew Pullin, Senior Software Designer, School of Computer Science / x4338 / HP5165 Last Updated: September 26, 2016

Introduction to Cloudbreak

Leveraging the InCommon Federation to access the NSF TeraGrid

Introduction. Software Trends. Topics for Discussion. Grid Technology. GridForce:

Ioan Raicu. Everyone else. More information at: Background? What do you want to get out of this course?

José Fortes. Advanced Computing and Information Systems laboratory. and NSF Center for Autonomic Computing. HPC 2010 Cetraro

Jetstream: A science & engineering cloud

DBaaS (Oracle and Open Source)

Andrew Pullin, Senior Software Designer, School of Computer Science / x4338 / HP5165 Last Updated: October 05, 2015

OPENSTACK PRIVATE CLOUD WITH GITHUB

Alabama Supercomputer Center Alabama Research and Education Network

Developing Microsoft Azure Solutions

Vendor: HP. Exam Code: HP0-D31. Exam Name: Designing HP Data Center and Cloud Solutions. Version: Demo

VMs at a Tier-1 site. EGEE 09, Sander Klous, Nikhef

Universities Access IBM/Google Cloud Compute Cluster for NSF-Funded Research

Cloud Computing with Nimbus

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications

Internet2 and IPv6: A Status Update

Chapter in Contemporary HPC Architectures

NET+ INFRASTRUCTURE AND PLATFORM SERVICES PORTFOLIO STATUS & UPDATE

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

StratusLab Cloud Distribution Installation. Charles Loomis (CNRS/LAL) 3 July 2014

FuncX: A Function Serving Platform for HPC. Ryan Chard 28 Jan 2019

A Cloud Compu*ng Approach to On- Demand and Scalable CyberGIS Analy*cs

Presentation Title. Grid Computing Project Officer / Research Assistance. InfoComm Development Center (idec) & Department of Communication

OpenNebula on VMware: Cloud Reference Architecture

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

CloudMan cloud clusters for everyone

The OpenCirrus TM Project: A global Testbed for Cloud Computing R&D

Cloud Computing Introduction & Offerings from IBM

Cisco Enterprise Cloud Suite Overview Cisco and/or its affiliates. All rights reserved.

Lightweight Streaming-based Runtime for Cloud Computing. Shrideep Pallickara. Community Grids Lab, Indiana University

CIS : Computational Reproducibility

An Introduction To: Help Me Grow-LA. August 11, 2016

[MS10992]: Integrating On-Premises Core Infrastructure with Microsoft Azure

More AWS, Serverless Computing and Cloud Research

FutureGrid 101. Part 2: Ge*ng Started Craig Stewart

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

NOW! Manage ALL workloads virtual, physical and cloud from a single console!

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute

AT&T Flow Designer. Current Environment

6. Correspondence relationship between Educational goals and Course goals

Service Graph Design with Cisco Application Centric Infrastructure

Experiences with OracleVM 3.3

Where are you with your Cloud or Clouds? Simon Kaye Dr Cloud

CloudLab. Updated: 5/24/16

Improving Blade Economics with Virtualization

Think Small to Scale Big

Federating FutureGrid and GENI

Jetstream: Adding Cloud-based Computing to the National Cyberinfrastructure

.NET Library for Seamless Remote Execution of Supercomputing Software

The convergence of HPC and BigData

South African Science Gateways

What is Cloud Computing? Cloud computing is the dynamic delivery of IT resources and capabilities as a Service over the Internet.

An Introduction to GIS for developers

SLATE. Services Layer at the Edge. First Meeting of the National Research Platform Montana State University August 7-8, 2017

XSEDE Campus Bridging Tools Jim Ferguson National Institute for Computational Sciences

(Aws solution architect -Associate, ITIL, CEH, MCSE, Linux, CCNA, Open source expert) LinkedIn Profile:

FOUNDATIONS OF A CROSS-DISCIPLINARY PEDAGOGY FOR BIG DATA *

Red Hat Enterprise Virtualization (RHEV) Backups by SEP

Y790 Report for 2009 Fall and 2010 Spring Semesters

Chameleon Cloud Documentation. Release 2.0a

7 Things ISVs Must Know About Virtualization

Top 40 Cloud Computing Interview Questions

Infrastructure Clouds for Science and Education: Platform Tools

Azure Development Course

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

Federated Services for Scientists Thursday, December 9, p.m. EST

Table of Contents 1.1. Overview. Containers, Docker, Registries vsphere Integrated Containers Engine

INDIGO PAAS TUTORIAL. ! Marica Antonacci RIA INFN-Bari

A Container On a Virtual Machine On an HPC? Presentation to HPC Advisory Council. Perth, July 31-Aug 01, 2017

DevOps CICD for VNF a NetOps Approach

Resiliency Replication Appliance Installation Guide Version 7.2

Implementation of the Pacific Research Platform over Pacific Wave

Higher Education in Texas: Serving Texas Through Transformational Education, Research, Discovery & Impact

Jetstream Overview A national research and education cloud

BUILDING the VIRtUAL enterprise

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a)

ElasterStack 3.2 User Administration Guide - Advanced Zone

Automated Deployment of Private Cloud (EasyCloud)

Large Scale Computing Infrastructures

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

Science Computing Clouds.

Introducing IBM WebSphere CloudBurst Appliance and IBM WebSphere Application Server Hypervisor Edition

A User-level Secure Grid File System

Xen and CloudStack. Ewan Mellor. Director, Engineering, Open-source Cloud Platforms Citrix Systems

Build Converged Infrastructures With True Systems Management

ACET s e-research Activities

I D C T E C H N O L O G Y S P O T L I G H T. V i r t u a l and Cloud D a t a Center Management

Transcription:

Virtual Appliances and Education in FutureGrid Dr. Renato Figueiredo ACIS Lab - University of Florida

Background l Traditional ways of delivering hands-on training and education in parallel/distributed computing have non-trivial dependences on the environment Difficult to replicate same environment on different resources (e.g. HPC clusters, desktops) Difficult to cope with changes in the environment (e.g. software upgrades) l Virtualization technologies Remove key software dependences Allow packaging and replication of hands-on, executable educational environments Can be deployed, managed with cloud technologies

Overview l FutureGrid enables new approaches to education and training and opportunities to engage in outreach Cloud, virtualization and dynamic provisioning environment can adapt to the user, rather than expect user to adapt to the environment l FutureGrid education leverages the unique capabilities of the infrastructure and its software to: Reduce barriers to entry and engage new users Use of encapsulated environments ( appliances ) as a primary delivery mechanism of education/ training modules promoting reuse, replication, and sharing

Guiding principles l Fidelity: activities should use full-fledged, executable software: education/training modules Learn using the proper tools l Reproducibility: Creators of content should be able to install, configure, and test their modules once, and be assured of the same functional behavior regardless of where the module is deployed Incentive to invest effort in developing, testing and documenting new modules

Guiding principles l Deployability: Students and users should be able to deploy modules in a simple manner, and in a variety of resources Reduce barriers to entry; avoid dependences upon a particular infrastructure l Community-oriented: Modules should be simple to share, discover, reuse, and expand Create conditions for viral growth

Role of clouds and portal l Executable modules virtual appliances Deployable on FutureGrid resources Deployable on other cloud platforms, as well as virtualized desktops l Community sharing Web 2.0 portal, appliance image repositories An aggregation hub for executable modules and documentation

What is an appliance? l Hardware/software appliances TV receiver + computer + hard disk + Linux + user interface Computer + network interfaces + FreeBSD + user interface

Virtual appliance example LAMP image l Linux + Apache + MySQL + PHP A web server Another Web server copy instantiate Virtualization Layer Repeat

Educational appliances l A flexible, extensible platform for hands-on, lab-oriented education on FutureGrid l Support clustering of resources Virtual machines + virtual networking to create sandboxed modules Virtual Grid appliances: self-contained, pre-packaged execution environments Group VPNs: simple management of virtual clusters by students and educators

Virtual Networking l A single appliance encapsulates software and configuration l Cluster/Grid/Cloud computing Middleware expects a collection of machines, typically on a LAN (Local Area Network) Appliances need to communicate and coordinate with each other Each worker needs an IP address, uses TCP/ IP sockets

Virtual cluster appliances l Virtual appliance + virtual network Hadoop + Virtual Network copy instantiate A Hadoop worker Virtual network Virtual machine Repeat Another Hadoop worker

Grid appliance in a nutshell l Plug-and-play clusters with a preconfigured software environment Linux + (Hadoop, Condor, MPI, ) Scripts for zero-configuration l Hands-on examples, bootstrap infrastructure, and zero-configuration software you re off to a quick start

Virtual Network - GroupVPN l Setting up and managing typical VPNs can be daunting VPN server(s), key distribution, NAT traversal l GroupVPN makes it simple for users to create and manage virtual cluster VPNs l Key insights: Web 2.0 interface: create/manage user groups All the complexity of setting up and managing VPN links is automated

GroupVPN Web interface l You can request to join or create your own VPN group Determines who is allowed to connect to virtual network l You can request to join or create your own appliance group Determines priorities of users on resources owned by their groups

Deploying virtual clusters l Same image, different VPNs Hadoop + Virtual Network GroupVPN Credentials (from Web site) copy instantiate A Hadoop worker Virtual IP - DHCP 10.10.1.1 Group VPN Virtual machine Repeat Another Hadoop worker Virtual IP - DHCP 10.10.1.2

Cloud deployment approach l Generate virtual floppies Through GroupVPN and GroupAppliance Web interface l Deploy appliances image(s) FutureGrid (Nimbus/Eucalyptus), EC2 GUI or command line tools Use APIs to copy virtual floppy to image l Submit jobs; terminate VMs when done

Setting up classes l Classes are setup and managed using the FutureGrid portal l Project proposal: can be a class, workshop, short course, tutorial Needs to be approved by FutureGrid project to become active l Users can be added to a project Users create accounts using the portal Project leaders can authorize them to gain access to resources Students can then interactively use FG resources (e.g. to start VMs)

Use of FutureGrid in classes l Cloud computing/distributed systems classes U.of Florida, U. Central Florida, U. of Puerto Rico, Univ. of Piemonte Orientale (Italy), Univ. of Mostar (Croatia) l Distributed scientific computing Louisiana State University l Tutorials, workshops: Big Data for Science summer school A cloudy view on computing SC 11 tutorial Clouds for science Science Cloud Summer School

Cloud computing classes l Massimo Canonico, U. Piemonte Orientale l Difficulties to overcome: Hardware issues: find enough free physical machines able to host virtual machines Software issues: time to install/configure as many as possible different cloud platforms University was not able to provide me the necessary hardware and software support l Students started to play with FutureGrid After attending few lessons, they were able to start/stop virtual instance with several Cloud Computing platforms

Cloud computing classes l Students used Eucalyptus, OpenStack and Nimbus Half were not computer scientists. l As FutureGrid freely shares their physical machines and their cloud platforms, decided to freely share all materials of my class. Hands-out, configuration files and link to useful documentation are available https://portal.futuregrid.org/contrib/cloudcomputing-class

Cloud computing classes l Graduate-level Cloud computing for Data-Intensive Sciences (Judy Qiu, Fall 2010) Virtualization technologies and tools Infrastructure as a service Parallel programming (MPI, Hadoop) FutureGrid provided a set of software options that made it possible for students to work on different projects along the system stack.

Term Projects Higher Level Languages Cloud Platform Dryad/DryadLINQ #1 Matrix Multiplication (Swapnil,Amit,Pradnay) #2 PhyloD (Ratul,Adrija,Chengming) Iterative MapReduce #3 LDA (Changsi, Yang) #4 MemCache (Saliya, Yiming,Jerome) #5 Avro (Yuduo, Yuan, patanachai) #6 PageRank (Shuo-Huan,Parag) Cloud Infrastructu re Hypervisor/ Virtualizatio n Cloud Infrastructure #7 Nimbus, Eucalyptus (Stephen, Sonali, Shakeela) Cloud Storage #8 Cloud Storage Survey (Xiaoming, Nixiaogang) Virtualization #9 Hypervisor Performance Analysis Project (James, Andrew) (Slide courtesy of Judy Qiu)

Big Data for Science 300+ Students (200 on sites from 10 insbtutes; 100 online) July 26-30, 2010 NCSA Summer School Workshop IU MapReduce and UF Virtual Appliance technologies are supported by FutureGrid. hdp://salsahpc.indiana.edu/tutorial Washington University IBM Almaden Research Center Iowa State University of Minnesota Univ.Illinois at Chicago Michigan State Notre Dame Penn State Johns Hopkins University of California at Los Angeles Indiana University San Diego Supercomputer Center University of Texas at El Paso University of Arkansas University of Florida (Slide courtesy of Judy Qiu)

Demonstration l Deploying virtual appliance node on FutureGrid (Nimbus @ Alamo) l Connecting to virtual machine l Virtual networking l Running sample job

Demonstration l Pre-instantiated VM to save us time: cloud-client.sh --conf alamo.conf --run --name grid-appliance-2.05.03.gz --hours 24 l Connect to VM ssh root@vmip l Check virtual network interface ifconfig l Ping other VMs in the virtual cluster l Submit Condor job

Uploading and sharing images l APIs available to upload images, customize, save, and share images l Community education pages are available FutureGrid Web portal allows users to publish their own content Tutorials, presentations on Web portal; VMs on image repositories

Where to go from here? l Tutorials on FutureGrid and Grid appliance Web sites for various middleware stacks Condor, MPI, Hadoop l A community resource for educational virtual appliances Success hinges on users effectively getting involved If you are happy with the system, let others know! Contribute with your own content virtual appliance images, tutorials, etc

Questions? l More information: http://www.futuregrid.org http://grid-appliance.org l This document was developed with support from the National Science Foundation (NSF) under Grant No. 0910812 to Indiana University for "FutureGrid: An Experimental, High-Performance Grid Test-bed." Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF