Patagonia Cluster Project Research Cluster

Similar documents
Repositories for Partition Cloning

Partition Cast Modelling and Optimizing the Distribution of Large Data Sets in PC Clusters

Partition Repositories for Partition Cloning OS Independent Software Maintenance in Large Clusters of PCs

Optimizing the distribution of large data sets in theory and practice

Sysadmin Training. Virtual Unix Lab

Oracle VM. Getting Started Guide for Release 3.2

Getting Started Guide. Installation and Setup Instructions. For version Copyright 2009 Code 42 Software, Inc. All rights reserved

Symantec NetBackup PureDisk Compatibility Matrix Created August 26, 2010

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

Advanced Operating Systems

The power of centralized computing at your fingertips

Installing VMware vsphere 5.1 Components

Data Block. Data Block. Copy A B C D P HDD 0 HDD 1 HDD 2 HDD 3 HDD 4 HDD 0 HDD 1

Network Management Utility

ISTORAGE SERVER PRODUCT DATASHEET

Oracle Database 11g Direct NFS Client Oracle Open World - November 2007

Downloaded from various sources on the NET

Sun Certified System Administrator for the Solaris 10 OS Bootcamp

Motivation to Teach Network Hardware

High performance Linux clusters - design/configuration/maintenance

High Availability Overview Paper

HP Supporting the HP ProLiant Storage Server Product Family.

Introduction. Kevin Miles. Paul Henderson. Rick Stillings. Essex Scales. Director of Research Support. Systems Engineer.

Symantec Design of DP Solutions for UNIX using NBU 5.0. Download Full Version :

Improving the Network Interfaces for Gigabit Ethernet in Clusters of PCs by Protocol Speculation

Pinnacle3 Professional

VERITAS SANPoint Storage Appliance Overview of an Open Platform for the Implementation of Intelligent Storage Server

Linux Administration

The World s Fastest Backup Systems

Linux Compute Cluster in the German Automotive Industry

HPC File Systems and Storage. Irena Johnson University of Notre Dame Center for Research Computing

Technical Brief: Specifying a PC for Mascot

Pinnacle 3 SmartEnterprise

Three Generations of Linux Dr. Alexander Dunaevskiy

WLPC 2017 Maker Session

Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture

Certified Ubuntu Professional VS-1140

E3-E4 CM TECHNICAL IT INFRASTRUCTURE

NOW and the Killer Network David E. Culler

IOPStor: Storage Made Easy. Key Business Features. Key Business Solutions. IOPStor IOP5BI50T Network Attached Storage (NAS) Page 1 of 5

ArcExplorer -- Java Edition 9.0 System Requirements

On the Migration of the Scientific Code Dyana from SMPs to Clusters of PCs and on to the Grid

Shared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments

Parallel & Cluster Computing. cs 6260 professor: elise de doncker by: lina hussein

VMware Workstation 5 Lab. New Features and Capabilities: Multiple Snapshots, Teams, Clones, Video Capture and More

DocuShare Installation Guide

CyberStore DSS. Multi Award Winning. Broadberry. CyberStore DSS. Open-E DSS v7 based Storage Appliances. Powering these organisations

What's New in SUSE LINUX Enterprise Server 9

The DETER Testbed: Overview 25 August 2004

Configuring a Single Oracle ZFS Storage Appliance into an InfiniBand Fabric with Multiple Oracle Exadata Machines

Ensure that the server where you install the Primary Server software meets the following requirements: Item Requirements Additional Details

Acronis Backup & Recovery 11.5

USB2.0 IDE & LANDISK External Enclosure

Veeam Backup & Replication

Sun and Oracle. Kevin Ashby. Oracle Technical Account Manager. Mob:

Database Services at CERN with Oracle 10g RAC and ASM on Commodity HW

Performance Characterization of the Dell Flexible Computing On-Demand Desktop Streaming Solution

DeploymentManager - product introduction - Aug 30,2016 NEC Corporation

HP LeftHand SAN Solutions

Introduction to Hadoop. Owen O Malley Yahoo!, Grid Team

Development Environment Embedded Linux Primer Ch 1&2

Baremetal with Apache CloudStack

SOLO NETWORK (11) (21) (31) (41) (48) (51) (61)

Contents Overview of the Gateway Performance and Sizing Guide... 5 Primavera Gateway System Architecture... 7 Performance Considerations...

Netegrity SiteMinder 4.51 AuthMark Performance Details

1Z Oracle. Oracle Solaris 10 System Administrator Certified Professional, Part II

Parallel File Systems Compared

SMP and ccnuma Multiprocessor Systems. Sharing of Resources in Parallel and Distributed Computing Systems

Adapted from: TRENDS AND ATTRIBUTES OF HORIZONTAL AND VERTICAL COMPUTING ARCHITECTURES

DRaaS AllConnected Veeam Solutions Overview

NETWORK AUDIT FOR CHOPPY AND SONS, LLC

HP Advanced Server V5.1B-3 for UNIX. Overview

Installing Linux on JMU Computer-Science Department Removable Hard Drives for CS-450 and CS-550

PCIe 10G 5-Speed. Multi-Gigabit Network Card

Configuring and Managing Virtual Storage

Virtual Machines Disco and Xen (Lecture 10, cs262a) Ion Stoica & Ali Ghodsi UC Berkeley February 26, 2018

This presentation of uclinux-on-microblaze given

EI 338: Computer Systems Engineering (Operating Systems & Computer Architecture)

Experiences with OracleVM 3.3

IT ESSENTIALS V. 4.1 Module 5 Fundamental Operating Systems

Introduction SMB FTP DHCP. More than a NAS

CIS 1 Lecture Notes. Chapter 1 What is a computer?

Speculative Defragmentation Leading Gigabit Ethernet to True Zero-Copy Communication

StorageCraft OneXafe and Veeam 9.5

vsphere Installation and Setup Update 2 Modified on 10 JULY 2018 VMware vsphere 6.5 VMware ESXi 6.5 vcenter Server 6.5

Sun Fire V880 System Architecture. Sun Microsystems Product & Technology Group SE

STORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp

Why Study Multimedia? Operating Systems. Multimedia Resource Requirements. Continuous Media. Influences on Quality. An End-To-End Problem

6 x 3.5-inch SATA 6Gb/s, SATA 3Gb/s hard drive. 1. The standard system is shipped without HDD. 2. For the HDD compatibility list, please visit :

Lab 0 Getting Started. Sistemas Operativos: Práctica 0 1

CT LANforge WiFIRE Chromebook a/b/g/n WiFi Traffic Generator with 128 Virtual STA Interfaces

Design and Implementation of a Monitoring and Scheduling System for Multiple Linux PC Clusters*

SPECIFICATION FOR NETWORK ATTACHED STORAGE (NAS) TO BE FILLED BY BIDDER. NAS Controller Should be rack mounted with a form factor of not more than 2U

CS Operating Systems

Apple 9L OS X Support Essentials

WLM1200-RMTS User s Guide

SuccessMaker. System Requirements SuccessMaker v1.0

DeploymentManager. NEC Corporation 2nd IT Software Division/ IT Network Global Solutions Division. September 28, 2012

Geneva 10.0 System Requirements

STOREONCE OVERVIEW. Neil Fleming Mid-Market Storage Development Manager. Copyright 2010 Hewlett-Packard Development Company, L.P.

Transcription:

Patagonia Cluster Project Research Cluster Clusters of PCs Multi-Boot and Multi-Purpose? Christian Kurmann, Felix Rauch, Michela Taufer, Prof. Thomas M. Stricker Laboratory for Computer Systems ETHZ - Swiss Institute of Technology CH-8092 Zurich Eidgenössische Technische Hochschule Zürich Ecole polytechnique fédérale de Zurich Politecnico federale di Zurigo Swiss Federal Institute of Technology Zurich Distributed Supercomputer, Henri E. Bal, U.Amsterdam 2 Characteristics of a Research Cluster Education Cluster fast processors (1-4 per node) large memory (RAM and disk) scalable high performance network (Switches and Gigabit/s) e.g. Myrinet SAN, Giganet SAN, Swiss Tx-net, SCI usage pattern: development during the day experiments throughout the night Patagonia Cluster, CS Department, ETH Zürich 3 4 Characteristics of an Education Cluster Multimedia Collaborative Work large disks for multiple extensive software installations operating systems with standard and specialized configurations lots of room: LAN needed system security: software installation student data hardware (theft) usage pattern: during the day only 5 Multimedia PCs at the Lab for Computer systems 6

Characteristics MultimediaWorkstations large disks for large amount of software installations operating systems in maintained default configuration and user specific configurations lots of space: LAN needed Gigabit Ethernet or Switcherland (Multimedia) system security installation and user data usage pattern: predominantly during the day Multi-user/Multi-purpose-Cluster Observation: All three types of clusters have mostly the same requirements but totally different usage patterns. Thesis: - A single multi-boot cluster matches all needs! - OS and file system independent installation tool needed ÿ Widely the same requirements as in education! 7 8 Overview The Right Operating System Multiboot / Operating Systems Patagonia Project at DINFK/ETHZ Patagonia Security / Maintenance Installations with Cloning / Data Streaming OS-Image Repositories Evaluation Cloning / Repositories Alternatives to Multiboot Conclusions 9 Linux VS. ETH Oberon 10 System Software Common Problem Education: Windows NT German with file system security Windows NT English with file system security Oberon Solaris 2.7, same setup as Sun SPARC Cluster Research: Linux Windows NT English without file system security Oberon, development system 11 Maintenance of software installations is hard: Different operating systems or applications in Cluster Temporary installations: tests, experiments, courses Software rejuvenation to combat software rotting proc. Manual Install: days, Network Installs: hrs, Cloning: min 12

The Patagonia Project Multi-boot, multi-purpose Cluster for the Department of Computer Science at ETH Zurich 60 PCs (600-1000 MHz Dell, SCSI) 3 Computer Labs Windows NT, Linux, Oberon Patagonia Installation Replication through cloning Initial installation configuration of a master disk block wise copy of the master disk Cloning of single disk images / partitions boot a service OS block wise copy of the images over the network Used for Education and Research 13 14 Patagonia Installation II Patagonia Auto Configuration Partitioning: Boot-Partition Partitions for Windows NT 4.0 Education Partitions for Windows NT 4.0 Research Partitions for Linux Partitions for Oberon Spare partition (Solaris, Oracle...) Small Service Linux Partition 0.020 GB 2 x 2.0 GB 2.0 GB 1.5 GB 2 x 0.1 GB 1.0 GB 0.25 GB Configuration of machine specific parameters: IP number, Hostname, HostID, SystemID, SecurityID manual automatically through DHCP (with server) automatically through Ethernet MAC address (with static table) approx. 9 GB 15 DHCP = Dynamic Host Configuration Protocol MAC = Media Access Control ID - Ethernet board built-in 16 Security in Patagonia Boot-Manager: System Commander Goal: Security without interference for the users ÿ reached through three levels booting with System Commander separate and hide partitions lock partitions with Device Lock and resetting device IDs to C:\ (Windows NT) mount tables (UNIX) write-protected partitions plus RAM disk (Oberon) usage of access rights and clean authentication by a central server 17 18

Accounts for UNIX and Windows Efficient Sun-Server for: home directories over SMB (with Samba) resp. NFS (UNIX) from Sun-Server authentication through a specific Windows NT-Server (NT- Clients) resp. NIS Server (UNIX-Clients) Account generation on UNIX, automatic generation for NT via scripts Password synchronisation with commercial tool Passync Maintenance with CAT Cluster Administration Tool (CAT): Cluster information system (running OS, users, load..) Remote boot to selected OS Installation daemon (Dolly for reliable data streaming) SMB = Server Message Block Protocol NFS = Network File System NIS = Network Information Service (former yellow pages) 19 20 Analytic Model for Data Distribution Cloning through partition-cast can be done in many different ways: Factors: data compression disk bandwidth / network bandwidth utilization of I/O bus, CPUs, memory system network topology An analytic model [Rauch, EuroPar2000] helps to optimise for the best alternative. Active Nodes / Topologies Simple receiver for star topologies Simple Receiver Advanced cloning node for multi drop chains Multi-drop Chain Node cloning streams for general spanning tree Universal Active Node 21 22 Tools for Partition-Cast dd/nfs, built-in OS function and network file system based on UDP/IP - simple - permits star topology only Dolly, small application for streaming with cloning based on TCP/IP - reliable data streaming Dolly for reliable data casting on all spanning trees star (n-ary) 2-ary, 3-ary chain (unary) Experimental Setup Topologies: Star 3-ary spanning tree Multi-drop chain Fast Ethernet / Gigabit Ethernet Compressed / Uncompressed Images All experiments: Distribute 2 GByte to 1..15 clients 23 24

Scalability Data Streaming Results Optimal configuration derived from our model. In most cases, the multi-drop chain delivers perfect scalability (thanks to the Ethernet switch) and is better than any other spanning tree configuration. Complex multi-cast protocols cannot be implemented for the required throughput - but they are not required anyway. Resulting simple Streaming Tool Dolly transfers a 2 GB Windows NT installation image to 60 workstations in less than 5 minutes at a sustained transfer rate of 9 MB/s per node 25 26 Evolution of Software Installations Characteristics of Installations Software Installations are incremental and constantly changing Temporal diffs: New installations, Patches, Upgrades ÿ need ability to go back Spacial diffs: Localization, Personalization ÿ OS dependant, file system dependant, removal not clean Cloning needs no OS and file system knowledge but leads to intense image archives Solution: Store only changes in OS independent partition repository 27 28 Incremental Images in Repository Incremental Repository Results Problem of software maintenance in large clusters of PCs was analyzed. The proposed system is based on an storing and distributing entire partition images. Investigated block-wise differential partition repositories work well and optimally compress the archive. Solution is independent on file system and work with all operating systems, also future versions of current OSes. 29 30

Alternatives to Multiboot Micro Kernel with OS adaption layers Binary Translators, Emulators, VMs,JIT Softwindows for Macintosh, Java Virtual Machine Micro Kernel with OS adaption layers latest attempt by Steve Jobs: Mac OS X Low level virtual machines Disco Project, Stanford University, Prof. Mendel Rosenblum and Eduard Bugnion, SGI Cluster VMWare : appliance of technology on PCs, Linux and Windows NT 31 32 VMWare :Low Level Virtual Machines Conclusions 33 Successful installation and beginning of operation of Patagonia, a universal multi-purpose Cluster for Research, Education and Collaborative Work. Facilitation of maintenance and Installation through: small service operating system (Linux) fast, large disks (SCSI) switched network Multi-boot installations lead to great flexibility Cloning of entire software installations scales well Differential Repository optimally compresses sequences of OS installations 34 Published Papers F. Rauch, Ch. Kurmann, T. Stricker: Partition Repositories for Partition Cloning - OS Independent Software Maintenance in LargeClustersofPCs. Proceedings of the IEEE International Conference on Cluster Computing 2000, Chemnitz, Germany. F. Rauch, Ch. Kurmann, T. Stricker: Partition Cast - Modelling and Optimising the Distribution of Large Data Sets in PC Clusters. Distinguished paper published at European Conference on Parallel Computing, Euro-Par 2000, Munich, Germany. F. Rauch, Ch. Kurmann, B. M. Müller-Lagunez, T. Stricker Patagonia - A Dual Use Cluster of PCs for Computation and Education. Proc. of the second workshop on Cluster-Computing, March 1999, Karlsruhe, Germany. 35 Q&A CoPs (Clusters of PCs) Project-Homepage http://www.cs.inf.ethz.ch/cops Described tools including source code are available for download under the GNU general public license from the above URL. 36