Adaptive Cluster Computing using JavaSpaces
|
|
- Gertrude Cook
- 5 years ago
- Views:
Transcription
1 Adaptive Cluster Computing using JavaSpaces Jyoti Batheja and Manish Parashar The Applied Software Systems Lab. ECE Department, Rutgers University
2 Outline Background Introduction Related Work Summary of presented Framework Framework Framework Architecture Framework Operation Experimental Evaluation Conclusions & Future Work
3 Background
4 Introduction Motivation Traditional approach adopted for HPC Potential for leveraging existing networked resources for HPC Objective is to design and implement a framework that: Aggregates networked computing resources Non-intrusively exploits idle resources for Parallel/Distributed Computing Challenges that need to addressed by the Framework 1. Adaptability to Cluster Dynamics 2. System Configuration and Management Overhead 3. Heterogeneity 4. Security and Privacy concerns 5. Non intrusiveness
5 Related Work Adaptive Parallelism Job Level Parallelism Condor Piranha Cluster Based ATLAS ObjectSpace Web Based Charlotte Javelin ParaWeb
6 Features Summary of the Presented Work Implementation using Java as the core programming language to leverage portability across heterogeneous platforms Offloading user responsibilities to the Framework for Resource Availability event triggers Minimized configuration overhead by employing remote node configuration techniques Targeted Applications High Computational Complexity Massively Partitioned Problems Small Input and Output Sizes
7 Framework Architecture
8 Architectural Overview Master Module Worker Module Network Management Module Master Master Process Process Java Spaces Jini Java Space Infrastructure Space Space Remote Node Configuration Engine Worker Worker Process Process Rule Based Protocol Java Sockets Inference Engine Rule Based Protocol Java Native Interface Monitoring Agent Common Services PLATFORM INFRASTRUCTURE (OPERATING SYSTEM)
9 Architectural Components: Jini Infrastructure Master Module Master Process JavaSpaces Jini Java Space Infrastructure Space Worker Module Remote Node Config. Engine Worker Process Rule Base Protocol Java Sockets Network Management Module Inference Engine Rule Base Protocol Java Native Interface Monitoring Agent Jini Based Infrastructure Common Services PLATFORM INFRASTRUCTURE (OPERATING SYSTEM) 3. Client discovers lookup server through multicast discovery 4. Client performs lookup directly to know lookup server and receives server location Jini Client Jini Lookup Server 1. Service discovers lookup server through multicast discovery 2. Service registers with lookup Server using join Jini Server 5. Client directly accesses the server and executes code on the server Jini Client Jini Client
10 Architectural Components: Master-Worker Paradigm Master Module Master Process JavaSpaces Jini Java Space Infrastructure Space Worker Module Remote Node Config. Engine Worker Process Rule Base Protocol Java Sockets Network Management Module Inference Engine Rule Base Protocol Java Native Interface Monitoring Agent Master-Worker Interaction Diagram Common Services PLATFORM INFRASTRUCTURE (OPERATING SYSTEM) Worker 1 Writing Input Parameters Collects tasks Executes Returns results into space Worker 2 Master Creates tasks Loads into Space Awaits collection of results Collect Result Parameters Space Space Pool Pool of of Tasks Tasks Collects tasks Executes Returns results into space Worker n Legend: Writing Task Writing Results Collects tasks Executes Returns results into space
11 Architectural Components: Rule-Base Protocol UML Interaction Diagram for Rule-Base Protocol Master Module Master Process JavaSpaces Jini Java Space Infrastructure Common Services Space Worker Module Remote Node Config. Engine Worker Process Rule Base Protocol Java Sockets PLATFORM INFRASTRUCTURE (OPERATING SYSTEM) Network Management Module Inference Engine Rule Base Protocol Java Native Interface Monitoring Agent Inference Engine SNMP Server SNMP Client Worker Application 1. Server Listens for Client Connections Client connects and sends its I.P. Address 3. to Server Server assigns a Client I.D. 4. Server invokes SNMP Service for the Client Invokes Inference Engine 6. Decides signal for the Client 7. Sends signal to Client through the Server 8. Client sends signal to Application 10. Goto Step Computations are performed
12 Architectural Components: Worker Execution States Master Module Master Process JavaSpaces Jini Java Space Infrastructure Space Worker Module Remote Node Config. Engine Worker Process Rule Base Protocol Java Sockets Network Management Module Inference Engine Rule Base Protocol Java Native Interface Monitoring Agent Worker State Transition Diagram Common Services PLATFORM INFRASTRUCTURE (OPERATING SYSTEM) Stop Running Running Start Stop Stop Pause Resume Pause Pause
13 Experimental Evaluation
14 Experimental Evaluation Application Metrics Parallel Monte Carlo Simulation for Stock Option Pricing The problem domain is divided into 50 sub tasks, each comprising of 100 simulations Parallel Ray Tracing A 600X600-image plane is divided into rectangular slices of 25X600 thus creating 24 independent tasks Option Pricing Scheme Ray Tracing Scheme Scalability Medium High CPU Memory Adaptable depending High Requirements on number of simulations Dependency No No Jini Server Master Workstation P III, 256 MB RAM, 800 MHz Worker 1 P III, 64 MB RAM, 300 MHz Experimental Setup Worker 2 P III, 64 MB RAM, 300 MHz LAN SNMP Server Worker 3 P III, 64 MB RAM, 300 MHz P III, 64 MB RAM, 300 MHz Worker 12 P III, 64 MB RAM, 300 MHz
15 Experiment 1: Scalability Analysis (Option Pricing Scheme) Figure 1: Scalability Analysis for Option Pricing Scheme Time (ms) Number of Workers # of machines Max worker time Parallel Time Task Planning Task aggregation
16 Experiment 2: Adaptation Protocol Analysis (Option Pricing Scheme) Load Simulator 1: Stop Signal Load Simulator 2: Pause Signal Figure 1(a): Graph of CPU Usage on Worker Nodes for Option Pricing Scheme Figure 1(b): Graph of Worker Reaction Times for Option Pricing Scheme CPU Usage (%) Simulation Run (STOP) Class Loading Simulation Run (PAUSE) Signal Start Pause Start Stop Start Start Worker Signal 1061 Client Signal Stop Restart Pause Restart Time (ms) 0 Client Signal Worker Signal
17 Experiment 3: Dynamic Behavior Patterns under varying load conditions (Option Pricing Scheme) Figure 1(a): Graph of dynamics under load showing time measurements for Option Pricing Scheme Measurement Category Total Parallel time Task Planning and Aggregation Maximum Master Overhead Maximum Worker Time Time (ms) Without Loading Loading 3 Workers Loading 6 Workers Figure 1(b): Graph of dynamics under load showing task distribution for Option Pricing Scheme Number of Tasks Executed Worker 1 Worker 2 Worker 3 Worker 4 Worker 5 Worker 6 Worker 7 Worker 8 Worker 9 Worker 10 Worker 11 Worker 12 Worker Nodes Without Loading Loading 3 Workers Loading 6 Workers
18 Conclusions and Future Work
19 Conclusions Summary Good Scalability for coarse grained applications Idle workstations can be effectively used to compute large parallel jobs Framework provides support for global code deployment and configuration management Monitoring and reacting to System State enables us to minimize intrusiveness to machines within the cluster Future Directions Optimizations at the Application level Use of Transaction Management services to address Partial Failures Incorporating a distributed JavaSpace model to avoid a single point of resource contention or failure
20 Additional Slides
21 Summary of Related Cluster Based Parallel Computing Architectures Architecture Approach Key Features Strengths Weaknesses Condor Piranha Cluster based, Job level parallelism, Asynchronous offloading of work to idle resources on a request basis Cluster based, Adaptive parallelism, Centralized work distribution using the master worker paradigm Queuing mechanism, Check pointing and process migration, Remote procedure call and matchmaking Implementation of Linda Model using tuple spaces, Recycles idle resources based on user defined criteria. Survives partial failure, preserved local execution environment Survives partial failure, efficient cycle stealing Incompleteness of checkpointing files. Difficulty in linking checkpointing code with user code for third party software. Requires modifications to the Linda systems. Supports only Linda Programs
22 Summary of Related Cluster Based Parallel Computing Architectures (cont d) Architecture Approach Key Features Strengths Weaknesses ATLAS ObjectSpace /Anaconda Cluster based, Adaptive Parallelism, Hierarchical work stealing Cluster based, Job level parallelism at brokers Implementation based on Cilk. Emphasis on thread scheduling algorithm based on work stealing techniques. Runtime library for work stealing, thread management, marshalling of objects. Eager scheduling, automated/manual job replication, Job level parallelism at broker level. Fault Tolerance, Localized computations: more bandwidth, reduced latencies, and natural mapping to administrative domains. Improved scalability Periodic polling the health of participation hosts. Global file sharing leads to conflicts in access control. Heterogeneity concerns: conversion mechanisms for native libraries, platform specific Java interpreter. Fault tolerance is not supported among brokers.
23 Architectur e Summary of Related Web Based Parallel Computing Architectures Approach Key Features Strengths Weaknesses Charlotte Web based, Adaptive parallelism, Centralized work distribution using downloadable applets Abstraction of Distributed Shared Memory, data type mapping and use of atomic update protocol. Directory look up service for idle resource identification, embedded lightweight class server on local hosts Direct interapplet communication, Fault tolerance, Eager scheduling and Two Phase idempotent execution Centralized broker can be a potential bottleneck Javelin Web based, Adaptive parallelism, Centralized work distribution using downloadable applets Heterogeneous, secure execution environment, portability to include Linda and SPMD programming models Offline worker computation, Applet balancing scheme via broker registration Inter applet communication routed via initiating broker: potential bottleneck
24 Summary of Related Web Based Parallel Computing Architectures (cont d) Architecture Approach Key Features Strengths Weaknesses ParaWeb Web based, Adaptive parallelism, Centralized work distribution using scheduling servers Implementation of Java Parallel Runtime System to provide global shared memory and transparent thread management and Java Parallel Class Library to facilitate upload and download of execution code across the web Clients can act as workers JPRS requires modifying the Java Interpreter Open Issues Heterogeneity concerns Manual Management
25 Architectural Components: JavaSpace Service Features Coordination tool for gluing processes together into a distributed application Supports heterogeneous computing Supports Dynamic Client Pool Provides a storage repository for Java Objects Provides a simplistic API for access of the Java objects in the space write read take JavaSpaces TM Interaction: Source Java Web Site
26 Experiment 1: Scalability Analysis (Ray Tracing Scheme) Figure 2: Scalability Analysis for Ray Tracing Scheme Time (ms) Number of Workers # of machines Max Worker Time Parallel Time Task Planning Task Aggregation
27 Experiment 1: Scalability Analysis (Pre-fetching Scheme) Figure 3: Scalability Analysis for Pre-fetching Scheme based on Page Rank Time (ms) Number of Workers # of machines Max Worker Time Parallel Time Task Planning Task Aggregation
28 Experiment 2: Adaptation Protocol Analysis (Ray Tracing Scheme) Figure 2(a): Graph of CPU Usage on Worker Nodes for Ray Tracing Scheme Figure 2(b): Graph of Worker Reaction Times for Ray Tracing Scheme CPU Usage (%) Simulation Run (PAUSE) Simulation Run (STOP) Signal Restart Pause Restart Stop 1072 Start Start Stop Restart Pause Restart Worker Signal Client Signal Time (ms) Client Signal Worker Signal
29 Experiment 2: Adaptation Protocol Analysis (Pre-fetching scheme) Figure 3(a): Graph of CPU Usage on Worker Nodes for Pre-fetching Scheme Figure 3(b): Graph of Worker Reaction Times for Pre-fetching Scheme CPU Usage (%) Simulation Run (STOP) Class Loading Simulation Run (PAUSE) Signal Restart Pause Restart Stop Start Start Stop Restart Pause Restart Worker Signal Client Signal Time (ms) Client Signal Worker Signal
30 Experiment 3: Dynamic Behavior Patterns under varying load conditions (Ray Tracing Scheme) Figure 2(a): Graph of dynamics under load showing time measurements for Ray Tracing Scheme Measurement Category Total Parallel Time Task Planning and Aggregation Maximum Master Overhead Max Worker Time Time (ms) Without Loading Loading 1Worker Loading 2 Workers Figure 2(b): Graph of dynamics under load showing task distribution for Ray Tracing Scheme Number of Tasks Executed Worker 1 Worker 2 Worker 3 Worker 4 Worker Nodes Without Loading Loading 1Worker Loading 2 Workers
31 Experiment 3: Dynamic Behavior Patterns under varying load conditions (Pre-fetching Scheme) Figure 3(a): Graph of dynamics under load showing time measurements for Pre-fetching Scheme Measurement Category Parallel Time Task Planning and Aggregation Maximum Master Overhead Max Worker Time Time (ms) Without Loading Loading 1 Worker Loading 2 Workers Figure 3(b): Graph of dynamics under load showing task distribution for Pre-fetching Scheme Number of Tasks Executed Worker 1 Worker 2 Worker3 Worker 4 Worker Nodes Without Loading Loading 1 Worker Loading 2 Workers
A Framework for Adaptive Cluster Computing using JavaSpaces 1
A Framework for Adaptive Cluster Computing using JavaSpaces 1 Jyoti Batheja and Manish Parashar The Applied Software Systems Laboratory Department of Electrical and Computer Engineering, Rutgers, The State
More informationA Framework for Opportunistic Cluster Computing using JavaSpaces 1
A Framework for Opportunistic Cluster Computing using JavaSpaces 1 Jyoti Batheja and Manish Parashar Electrical and Computer Engineering, Rutgers University 94 Brett Road, Piscataway, NJ 08854 {jbatheja,
More informationPARALLEL PROGRAM EXECUTION SUPPORT IN THE JGRID SYSTEM
PARALLEL PROGRAM EXECUTION SUPPORT IN THE JGRID SYSTEM Szabolcs Pota 1, Gergely Sipos 2, Zoltan Juhasz 1,3 and Peter Kacsuk 2 1 Department of Information Systems, University of Veszprem, Hungary 2 Laboratory
More informationChapter 3. Design of Grid Scheduler. 3.1 Introduction
Chapter 3 Design of Grid Scheduler The scheduler component of the grid is responsible to prepare the job ques for grid resources. The research in design of grid schedulers has given various topologies
More informationECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective
ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective Part II: Data Center Software Architecture: Topic 3: Programming Models Piccolo: Building Fast, Distributed Programs
More informationOutline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems
Distributed Systems Outline Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems What Is A Distributed System? A collection of independent computers that appears
More informationone.world Towards a System Architecture for Pervasive Computing
Towards a System Architecture for Pervasive Computing Robert Grimm, Janet Davis, Ben Hendrickson, Eric Lemar, Tom Anderson, Brian Bershad, Gaetano Borriello, David Wetherall University of Washington Vision
More informationSEDA: An Architecture for Well-Conditioned, Scalable Internet Services
SEDA: An Architecture for Well-Conditioned, Scalable Internet Services Matt Welsh, David Culler, and Eric Brewer Computer Science Division University of California, Berkeley Operating Systems Principles
More informationONOS OVERVIEW. Architecture, Abstractions & Application
ONOS OVERVIEW Architecture, Abstractions & Application WHAT IS ONOS? Open Networking Operating System (ONOS) is an open source SDN network operating system (controller). Mission: to enable Service Providers
More informationMONitoring Agents using a Large Integrated Services Architecture. Iosif Legrand California Institute of Technology
MONitoring Agents using a Large Integrated s Architecture California Institute of Technology Distributed Dynamic s Architecture Hierarchical structure of loosely coupled services which are independent
More informationCondor and BOINC. Distributed and Volunteer Computing. Presented by Adam Bazinet
Condor and BOINC Distributed and Volunteer Computing Presented by Adam Bazinet Condor Developed at the University of Wisconsin-Madison Condor is aimed at High Throughput Computing (HTC) on collections
More informationDiffusing Your Mobile Apps: Extending In-Network Function Virtualisation to Mobile Function Offloading
Diffusing Your Mobile Apps: Extending In-Network Function Virtualisation to Mobile Function Offloading Mario Almeida, Liang Wang*, Jeremy Blackburn, Konstantina Papagiannaki, Jon Crowcroft* Telefonica
More informationLinda, JavaSpaces & Jini
ECE 451/566 - Introduction to Parallel and Distributed Computing Linda, JavaSpaces & Jini Manish Parashar parashar@ece.rutgers.edu Department of Electrical & Computer Engineering Rutgers University Linda
More informationCAS 703 Software Design
Dr. Ridha Khedri Department of Computing and Software, McMaster University Canada L8S 4L7, Hamilton, Ontario Acknowledgments: Material based on Software by Tao et al. (Chapters 9 and 10) (SOA) 1 Interaction
More informationAssignment 5. Georgia Koloniari
Assignment 5 Georgia Koloniari 2. "Peer-to-Peer Computing" 1. What is the definition of a p2p system given by the authors in sec 1? Compare it with at least one of the definitions surveyed in the last
More informationIOS: A Middleware for Decentralized Distributed Computing
IOS: A Middleware for Decentralized Distributed Computing Boleslaw Szymanski Kaoutar El Maghraoui, Carlos Varela Department of Computer Science Rensselaer Polytechnic Institute http://www.cs.rpi.edu/wwc
More informationScreen Saver Science: Realizing Distributed Parallel Computing with Jini and JavaSpaces
Screen Saver Science: Realizing Distributed Parallel Computing with Jini and JavaSpaces William L. George and Jacob Scott National Institute of Standards and Technology Information Technology Laboratory
More informationDatabase Server. 2. Allow client request to the database server (using SQL requests) over the network.
Database Server Introduction: Client/Server Systems is networked computing model Processes distributed between clients and servers. Client Workstation (usually a PC) that requests and uses a service Server
More informationApache Flink. Alessandro Margara
Apache Flink Alessandro Margara alessandro.margara@polimi.it http://home.deib.polimi.it/margara Recap: scenario Big Data Volume and velocity Process large volumes of data possibly produced at high rate
More informationThe Google File System (GFS)
1 The Google File System (GFS) CS60002: Distributed Systems Antonio Bruto da Costa Ph.D. Student, Formal Methods Lab, Dept. of Computer Sc. & Engg., Indian Institute of Technology Kharagpur 2 Design constraints
More informationDeep Dive Amazon Kinesis. Ian Meyers, Principal Solution Architect - Amazon Web Services
Deep Dive Amazon Kinesis Ian Meyers, Principal Solution Architect - Amazon Web Services Analytics Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure
More informationA Federated Grid Environment with Replication Services
A Federated Grid Environment with Replication Services Vivek Khurana, Max Berger & Michael Sobolewski SORCER Research Group, Texas Tech University Grids can be classified as computational grids, access
More informationTypes of Virtualization. Types of virtualization
Types of Virtualization Emulation VM emulates/simulates complete hardware Unmodified guest OS for a different PC can be run Bochs, VirtualPC for Mac, QEMU Full/native Virtualization VM simulates enough
More informationDistributed Scheduling for the Sombrero Single Address Space Distributed Operating System
Distributed Scheduling for the Sombrero Single Address Space Distributed Operating System Donald S. Miller Department of Computer Science and Engineering Arizona State University Tempe, AZ, USA Alan C.
More informationData Model Considerations for Radar Systems
WHITEPAPER Data Model Considerations for Radar Systems Executive Summary The market demands that today s radar systems be designed to keep up with a rapidly changing threat environment, adapt to new technologies,
More informationCycle Sharing Systems
Cycle Sharing Systems Jagadeesh Dyaberi Dependable Computing Systems Lab Purdue University 10/31/2005 1 Introduction Design of Program Security Communication Architecture Implementation Conclusion Outline
More informationMOHA: Many-Task Computing Framework on Hadoop
Apache: Big Data North America 2017 @ Miami MOHA: Many-Task Computing Framework on Hadoop Soonwook Hwang Korea Institute of Science and Technology Information May 18, 2017 Table of Contents Introduction
More informationDISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN. Chapter 1. Introduction
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN Chapter 1 Introduction Modified by: Dr. Ramzi Saifan Definition of a Distributed System (1) A distributed
More informationChapter Outline. Chapter 2 Distributed Information Systems Architecture. Distributed transactions (quick refresh) Layers of an information system
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 2 Distributed Information Systems Architecture Chapter Outline
More informationMulti-Channel Clustered Web Application Servers
Multi-Channel Clustered Web Application Servers Masters Thesis Proposal Progress American University in Cairo Proposed by Karim Sobh (kmsobh@aucegypt.edu) Supervised by Dr. Ahmed Sameh (sameh@aucegypt.edu)
More informationUsing peer to peer. Marco Danelutto Dept. Computer Science University of Pisa
Using peer to peer Marco Danelutto Dept. Computer Science University of Pisa Master Degree (Laurea Magistrale) in Computer Science and Networking Academic Year 2009-2010 Rationale Two common paradigms
More informationPAC485 Managing Datacenter Resources Using the VirtualCenter Distributed Resource Scheduler
PAC485 Managing Datacenter Resources Using the VirtualCenter Distributed Resource Scheduler Carl Waldspurger Principal Engineer, R&D This presentation may contain VMware confidential information. Copyright
More informationDistributed Systems COMP 212. Lecture 18 Othon Michail
Distributed Systems COMP 212 Lecture 18 Othon Michail Virtualisation & Cloud Computing 2/27 Protection rings It s all about protection rings in modern processors Hardware mechanism to protect data and
More informationFault Tolerance in K3. Ben Glickman, Amit Mehta, Josh Wheeler
Fault Tolerance in K3 Ben Glickman, Amit Mehta, Josh Wheeler Outline Background Motivation Detecting Membership Changes with Spread Modes of Fault Tolerance in K3 Demonstration Outline Background Motivation
More informationTITLE: PRE-REQUISITE THEORY. 1. Introduction to Hadoop. 2. Cluster. Implement sort algorithm and run it using HADOOP
TITLE: Implement sort algorithm and run it using HADOOP PRE-REQUISITE Preliminary knowledge of clusters and overview of Hadoop and its basic functionality. THEORY 1. Introduction to Hadoop The Apache Hadoop
More information05 Indirect Communication
05 Indirect Communication Group Communication Publish-Subscribe Coulouris 6 Message Queus Point-to-point communication Participants need to exist at the same time Establish communication Participants need
More informationECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective
ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective Part II: Data Center Software Architecture: Topic 1: Distributed File Systems GFS (The Google File System) 1 Filesystems
More informationA FRAMEWORK FOR THE DYNAMIC RECONFIGURATION OF SCIENTIFIC APPLICATIONS IN GRID ENVIRONMENTS
A FRAMEWORK FOR THE DYNAMIC RECONFIGURATION OF SCIENTIFIC APPLICATIONS IN GRID ENVIRONMENTS By Kaoutar El Maghraoui A Thesis Submitted to the Graduate Faculty of Rensselaer Polytechnic Institute in Partial
More informationMCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter One Introducing Windows Server 2008
MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # 70-643) Chapter One Introducing Windows Server 2008 Objectives Distinguish among the different Windows Server
More informationCloud Computing. Summary
Cloud Computing Lectures 2 and 3 Definition of Cloud Computing, Grid Architectures 2012-2013 Summary Definition of Cloud Computing (more complete). Grid Computing: Conceptual Architecture. Condor. 1 Cloud
More informationAnnouncements. me your survey: See the Announcements page. Today. Reading. Take a break around 10:15am. Ack: Some figures are from Coulouris
Announcements Email me your survey: See the Announcements page Today Conceptual overview of distributed systems System models Reading Today: Chapter 2 of Coulouris Next topic: client-side processing (HTML,
More informationFunctional Requirements for Grid Oriented Optical Networks
Functional Requirements for Grid Oriented Optical s Luca Valcarenghi Internal Workshop 4 on Photonic s and Technologies Scuola Superiore Sant Anna Pisa June 3-4, 2003 1 Motivations Grid networking connection
More informationJavaSpaces technology for distributed communication and collaboration. Chih-Yao Hsieh
JavaSpaces technology for distributed communication and collaboration Chih-Yao Hsieh Computer Science and Engineering University of Texas at Arlington chsieh@cse.uta.edu Abstract This paper will give an
More informationOracle 10g and IPv6 IPv6 Summit 11 December 2003
Oracle 10g and IPv6 IPv6 Summit 11 December 2003 Marshal Presser Principal Enterprise Architect Oracle Corporation Agenda Oracle Distributed Computing Role of Networking IPv6 Support Plans Early IPv6 Implementations
More information6.1 Multiprocessor Computing Environment
6 Parallel Computing 6.1 Multiprocessor Computing Environment The high-performance computing environment used in this book for optimization of very large building structures is the Origin 2000 multiprocessor,
More informationDeployment Planning Guide
Deployment Planning Guide Community 1.5.1 release The purpose of this document is to educate the user about the different strategies that can be adopted to optimize the usage of Jumbune on Hadoop and also
More informationDr Markus Hagenbuchner CSCI319. Distributed Systems Chapter 3 - Processes
Dr Markus Hagenbuchner markus@uow.edu.au CSCI319 Distributed Systems Chapter 3 - Processes CSCI319 Chapter 3 Page: 1 Processes Lecture notes based on the textbook by Tannenbaum Study objectives: 1. Understand
More informationDistributed File Systems II
Distributed File Systems II To do q Very-large scale: Google FS, Hadoop FS, BigTable q Next time: Naming things GFS A radically new environment NFS, etc. Independence Small Scale Variety of workloads Cooperation
More informationChapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.
Chapter 4:- Introduction to Grid and its Evolution Prepared By:- Assistant Professor SVBIT. Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies
More informationToday: More Case Studies DCOM
Today: More Case Studies DCOM Jini Lecture 24, page 1 DCOM Distributed Component Object Model Microsoft s object model (middleware) Lecture 24, page 2 DCOM: History Successor to COM Developed to support
More informationThe GridWay. approach for job Submission and Management on Grids. Outline. Motivation. The GridWay Framework. Resource Selection
The GridWay approach for job Submission and Management on Grids Eduardo Huedo Rubén S. Montero Ignacio M. Llorente Laboratorio de Computación Avanzada Centro de Astrobiología (INTA - CSIC) Associated to
More informationLarge-Scale GPU programming
Large-Scale GPU programming Tim Kaldewey Research Staff Member Database Technologies IBM Almaden Research Center tkaldew@us.ibm.com Assistant Adjunct Professor Computer and Information Science Dept. University
More informationExecutive Summary. iii
Executive Summary Operational and tactical military environments are composed of mobile nodes and dynamic situations resulting in a high degree of uncertainty. Communication links are often based on adhoc
More informationDynamic Fine Grain Scheduling of Pipeline Parallelism. Presented by: Ram Manohar Oruganti and Michael TeWinkle
Dynamic Fine Grain Scheduling of Pipeline Parallelism Presented by: Ram Manohar Oruganti and Michael TeWinkle Overview Introduction Motivation Scheduling Approaches GRAMPS scheduling method Evaluation
More informationApache Ignite TM - In- Memory Data Fabric Fast Data Meets Open Source
Apache Ignite TM - In- Memory Data Fabric Fast Data Meets Open Source DMITRIY SETRAKYAN Founder, PPMC https://ignite.apache.org @apacheignite @dsetrakyan Agenda About In- Memory Computing Apache Ignite
More informationChapter 5. The MapReduce Programming Model and Implementation
Chapter 5. The MapReduce Programming Model and Implementation - Traditional computing: data-to-computing (send data to computing) * Data stored in separate repository * Data brought into system for computing
More informationIntroduction to Distributed Systems. INF5040/9040 Autumn 2018 Lecturer: Eli Gjørven (ifi/uio)
Introduction to Distributed Systems INF5040/9040 Autumn 2018 Lecturer: Eli Gjørven (ifi/uio) August 28, 2018 Outline Definition of a distributed system Goals of a distributed system Implications of distributed
More informationChapter Outline. Chapter 2 Distributed Information Systems Architecture. Layers of an information system. Design strategies.
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 2 Distributed Information Systems Architecture Chapter Outline
More informationOverview SENTINET 3.1
Overview SENTINET 3.1 Overview 1 Contents Introduction... 2 Customer Benefits... 3 Development and Test... 3 Production and Operations... 4 Architecture... 5 Technology Stack... 7 Features Summary... 7
More informationTowards a codelet-based runtime for exascale computing. Chris Lauderdale ET International, Inc.
Towards a codelet-based runtime for exascale computing Chris Lauderdale ET International, Inc. What will be covered Slide 2 of 24 Problems & motivation Codelet runtime overview Codelets & complexes Dealing
More informationThe Google File System
The Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google* 정학수, 최주영 1 Outline Introduction Design Overview System Interactions Master Operation Fault Tolerance and Diagnosis Conclusions
More informationIntroduction. Distributed Systems IT332
Introduction Distributed Systems IT332 2 Outline Definition of A Distributed System Goals of Distributed Systems Types of Distributed Systems 3 Definition of A Distributed System A distributed systems
More informationDistributed Operating Systems
2 Distributed Operating Systems System Models, Processor Allocation, Distributed Scheduling, and Fault Tolerance Steve Goddard goddard@cse.unl.edu http://www.cse.unl.edu/~goddard/courses/csce855 System
More informationDynamic inter-core scheduling in Barrelfish
Dynamic inter-core scheduling in Barrelfish. avoiding contention with malleable domains Georgios Varisteas, Mats Brorsson, Karl-Filip Faxén November 25, 2011 Outline Introduction Scheduling & Programming
More informationInternet2 Meeting September 2005
End User Agents: extending the "intelligence" to the edge in Distributed Systems Internet2 Meeting California Institute of Technology 1 OUTLINE (Monitoring Agents using a Large, Integrated s Architecture)
More informationOpportunistic Application Flows in Sensor-based Pervasive Environments
Opportunistic Application Flows in Sensor-based Pervasive Environments Nanyan Jiang, Cristina Schmidt, Vincent Matossian, and Manish Parashar ICPS 2004 1 Outline Introduction to pervasive sensor-based
More informationModule 1 - Distributed System Architectures & Models
Module 1 - Distributed System Architectures & Models System Architecture Defines the structure of the system components identified functions of each component defined interrelationships and interactions
More informationServices. Service descriptions. Cisco HCS services
Service descriptions, page 1 Infrastructure Platform Automation Description, page 5 Infrastructure Manager Sync Introduction, page 5 Service descriptions After the installation of the Cisco HCM-F platform,
More informationChapter 18 Distributed Systems and Web Services
Chapter 18 Distributed Systems and Web Services Outline 18.1 Introduction 18.2 Distributed File Systems 18.2.1 Distributed File System Concepts 18.2.2 Network File System (NFS) 18.2.3 Andrew File System
More informationTOOLS FOR IMPROVING CROSS-PLATFORM SOFTWARE DEVELOPMENT
TOOLS FOR IMPROVING CROSS-PLATFORM SOFTWARE DEVELOPMENT Eric Kelmelis 28 March 2018 OVERVIEW BACKGROUND Evolution of processing hardware CROSS-PLATFORM KERNEL DEVELOPMENT Write once, target multiple hardware
More informationUnderstanding Feature and Network Services in Cisco Unified Serviceability
CHAPTER 10 Understanding Feature and Network Services in Cisco Unified Serviceability May 19, 2009 Cisco Unified Serviceability service management includes working with feature and network services and
More informationDistributed System. Gang Wu. Spring,2018
Distributed System Gang Wu Spring,2018 Lecture7:DFS What is DFS? A method of storing and accessing files base in a client/server architecture. A distributed file system is a client/server-based application
More informationCRFS: A Lightweight User-Level Filesystem for Generic Checkpoint/Restart
CRFS: A Lightweight User-Level Filesystem for Generic Checkpoint/Restart Xiangyong Ouyang, Raghunath Rajachandrasekar, Xavier Besseron, Hao Wang, Jian Huang, Dhabaleswar K. Panda Department of Computer
More informationKernel Level Speculative DSM
Motivation Main interest is performance, fault-tolerance, and correctness of distributed systems Present our ideas in the context of a DSM system We are developing tools that Improve performance Address
More informationHigh Throughput WAN Data Transfer with Hadoop-based Storage
High Throughput WAN Data Transfer with Hadoop-based Storage A Amin 2, B Bockelman 4, J Letts 1, T Levshina 3, T Martin 1, H Pi 1, I Sfiligoi 1, M Thomas 2, F Wuerthwein 1 1 University of California, San
More informationMotivation. Threads. Multithreaded Server Architecture. Thread of execution. Chapter 4
Motivation Threads Chapter 4 Most modern applications are multithreaded Threads run within application Multiple tasks with the application can be implemented by separate Update display Fetch data Spell
More informationHPX. High Performance ParalleX CCT Tech Talk Series. Hartmut Kaiser
HPX High Performance CCT Tech Talk Hartmut Kaiser (hkaiser@cct.lsu.edu) 2 What s HPX? Exemplar runtime system implementation Targeting conventional architectures (Linux based SMPs and clusters) Currently,
More informationDISTRIBUTED SYSTEMS [COMP9243] Lecture 3b: Distributed Shared Memory DISTRIBUTED SHARED MEMORY (DSM) DSM consists of two components:
SHARED ADDRESS SPACE DSM consists of two components: DISTRIBUTED SYSTEMS [COMP9243] ➀ Shared address space ➁ Replication and consistency of memory objects Lecture 3b: Distributed Shared Memory Shared address
More informationFRANCESCO CESARINI. presents ERLANG/OTP. Francesco Cesarini Erlang
FRANCESCO CESARINI presents Francesco Cesarini Erlang Solutions ERLANG/OTP @FrancescoC francesco@erlang-solutions.com www.erlang-solutions.com WHAT IS SCALABILITY? WHAT IS (MASSIVE) CONCURRENCY? WHAT
More informationDesign Patterns for the Cloud. MCSN - N. Tonellotto - Distributed Enabling Platforms 68
Design Patterns for the Cloud 68 based on Amazon Web Services Architecting for the Cloud: Best Practices Jinesh Varia http://media.amazonwebservices.com/aws_cloud_best_practices.pdf 69 Amazon Web Services
More informationHybrid Implementation of 3D Kirchhoff Migration
Hybrid Implementation of 3D Kirchhoff Migration Max Grossman, Mauricio Araya-Polo, Gladys Gonzalez GTC, San Jose March 19, 2013 Agenda 1. Motivation 2. The Problem at Hand 3. Solution Strategy 4. GPU Implementation
More informationMiddleware. Adapted from Alonso, Casati, Kuno, Machiraju Web Services Springer 2004
Middleware Adapted from Alonso, Casati, Kuno, Machiraju Web Services Springer 2004 Outline Web Services Goals Where do they come from? Understanding middleware Middleware as infrastructure Communication
More informationChapter 18: Database System Architectures.! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems!
Chapter 18: Database System Architectures! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Network Types 18.1 Centralized Systems! Run on a single computer system and
More informationPREGEL: A SYSTEM FOR LARGE- SCALE GRAPH PROCESSING
PREGEL: A SYSTEM FOR LARGE- SCALE GRAPH PROCESSING G. Malewicz, M. Austern, A. Bik, J. Dehnert, I. Horn, N. Leiser, G. Czajkowski Google, Inc. SIGMOD 2010 Presented by Ke Hong (some figures borrowed from
More informationBlackBerry Enterprise Server for IBM Lotus Domino Version: 5.0. Administration Guide
BlackBerry Enterprise Server for IBM Lotus Domino Version: 5.0 Administration Guide SWDT487521-636611-0528041049-001 Contents 1 Overview: BlackBerry Enterprise Server... 21 Getting started in your BlackBerry
More informationIntroduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduce Antonino Virgillito THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION Large-scale Computation Traditional solutions for computing large
More informationLecture 11 Hadoop & Spark
Lecture 11 Hadoop & Spark Dr. Wilson Rivera ICOM 6025: High Performance Computing Electrical and Computer Engineering Department University of Puerto Rico Outline Distributed File Systems Hadoop Ecosystem
More informationCluster-Based Scalable Network Services
Cluster-Based Scalable Network Services Suhas Uppalapati INFT 803 Oct 05 1999 (Source : Fox, Gribble, Chawathe, and Brewer, SOSP, 1997) Requirements for SNS Incremental scalability and overflow growth
More informationVoldemort. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation
Voldemort Smruti R. Sarangi Department of Computer Science Indian Institute of Technology New Delhi, India Smruti R. Sarangi Leader Election 1/29 Outline 1 2 3 Smruti R. Sarangi Leader Election 2/29 Data
More informationUnderstanding and Evaluating Kubernetes. Haseeb Tariq Anubhavnidhi Archie Abhashkumar
Understanding and Evaluating Kubernetes Haseeb Tariq Anubhavnidhi Archie Abhashkumar Agenda Overview of project Kubernetes background and overview Experiments Summary and Conclusion 1. Overview of Project
More informationDESIGN AND OVERHEAD ANALYSIS OF WORKFLOWS IN GRID
I J D M S C L Volume 6, o. 1, January-June 2015 DESIG AD OVERHEAD AALYSIS OF WORKFLOWS I GRID S. JAMUA 1, K. REKHA 2, AD R. KAHAVEL 3 ABSRAC Grid workflow execution is approached as a pure best effort
More informationAdvanced Distributed Systems
Course Plan and Department of Computer Science Indian Institute of Technology New Delhi, India Outline Plan 1 Plan 2 3 Message-Oriented Lectures - I Plan Lecture Topic 1 and Structure 2 Client Server,
More informationChapter 20: Database System Architectures
Chapter 20: Database System Architectures Chapter 20: Database System Architectures Centralized and Client-Server Systems Server System Architectures Parallel Systems Distributed Systems Network Types
More informationChapter 1: Distributed Information Systems
Chapter 1: Distributed Information Systems Contents - Chapter 1 Design of an information system Layers and tiers Bottom up design Top down design Architecture of an information system One tier Two tier
More informationDistributed Object Space Cluster Architecture for Search Engines
Distributed Object Space Cluster Architecture for Search Engines Ben Choi & Rohit Dhawan Computer Science, College of Science and Engineering Louisiana Tech University, LA 71272, USA pro@benchoi.org Abstract
More informationChapter 2 Distributed Information Systems Architecture
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 2 Distributed Information Systems Architecture Chapter Outline
More informationEMC RecoverPoint. EMC RecoverPoint Support
Support, page 1 Adding an Account, page 2 RecoverPoint Appliance Clusters, page 3 Replication Through Consistency Groups, page 4 Group Sets, page 22 System Tasks, page 24 Support protects storage array
More informationAnna Morajko.
Performance analysis and tuning of parallel/distributed applications Anna Morajko Anna.Morajko@uab.es 26 05 2008 Introduction Main research projects Develop techniques and tools for application performance
More informationLecture 1: January 22
CMPSCI 677 Distributed and Operating Systems Spring 2018 Lecture 1: January 22 Lecturer: Prashant Shenoy Scribe: Bin Wang 1.1 Introduction to the course The lecture started by outlining the administrative
More informationICONA Inter Cluster ONOS Network Application. CREATE-NET, CNIT/University of Rome Tor Vergata, Consortium GARR
ICONA Inter Cluster ONOS Network Application CREATE-NET, CNIT/University of Rome Tor Vergata, Consortium GARR The evolution of the Network Control Plane Standard networks: Control and Forwarding Planes
More information