Fault tolerance based on the Publishsubscribe Paradigm for the BonjourGrid Middleware
|
|
- Christian Rogers
- 5 years ago
- Views:
Transcription
1
2 University of Paris XIII INSTITUT GALILEE Laboratoire d Informatique de Paris Nord (LIPN) Université of Tunis École Supérieure des Sciences et Tehniques de Tunis Unité de Recherche UTIC Fault tolerance based on the Publishsubscribe Paradigm for the BonjourGrid Middleware Heithem ABBES, Christophe CERIN, Mohamed JEMNI and Walid SAAD Grid October 2010
3 Outline Introduction Objectives Design of BonjourGrid Integration of Boinc and Condor Fault tolerance approach Experimentation and validation Conclusion and future works 2
4 Introduction (1/3) P2P systems have allowed large improvements in the field of file sharing over Internet. 3
5 Introduction (1/3) P2P systems have allowed large improvements in the field of file sharing over Internet. Gnutella, Kazaa and Freenet 3
6 Introduction (1/3) P2P systems have allowed large improvements in the field of file sharing over Internet. Gnutella, Kazaa and Freenet Decentralized architecture No coordination between machines 3
7 Introduction (2/3) Grid computing : obtaining an infrastructure offering computing power for users applications. Coordination between machines during application execution. Centralized or hierarchical architectures (Globus, Glite, Condor). 4
8 Introduction (2/3) Grid computing : obtaining an infrastructure offering computing power for users applications. Coordination between machines during application execution. Centralized or hierarchical architectures (Globus, Glite, Condor). No scalability Complicated procedure of installation Complicated configuration phase for an ordinary user 4
9 Introduction (2/3) Grid computing : obtaining an infrastructure offering computing power for users applications. Coordination between machines during application execution. Centralized or hierarchical architectures (Globus, Glite, Condor). No scalability Complicated procedure of installation Complicated configuration phase for an ordinary user 4
10 Introduction (3/3) Desktop Grid led the community to build computing systems based on voluntary machines. Current systems use Master/Worker model 5
11 Introduction (3/3) Desktop Grid led the community to build computing systems based on voluntary machines. Current systems use Master/Worker model United Devices, BOINC, PLANETLAB, XtremWeb 5
12 Introduction (3/3) Desktop Grid led the community to build computing systems based on voluntary machines. Current systems use Master/Worker model United Devices, BOINC, PLANETLAB, XtremWeb Application domains Global climate prediction (BOINC) Search for extraterrestrial intelligence Cosmic rays study (XtremWeb). 5
13 Introduction (3/3) Desktop Grid led the community to build computing systems based on voluntary machines. Current systems use Master/Worker model United Devices, BOINC, PLANETLAB, XtremWeb Application domains Global climate prediction (BOINC) Search for extraterrestrial intelligence Cosmic rays study (XtremWeb). Demonstrate the potential of Desktop Grid 5
14 Introduction (3/3) Desktop Grid led the community to build computing systems based on voluntary machines. Current systems use Master/Worker model United Devices, BOINC, PLANETLAB, XtremWeb Application domains Global climate prediction (BOINC) Search for extraterrestrial intelligence Cosmic rays study (XtremWeb). Demonstrate the potential of Desktop Grid Suffer from being hardly scalable due to centralized control Rely on permanent administrative staff who guarantees the master operation 5
15 Objectives of BonjourGrid Design a multi-platform coordinators and fault tolerant system using existing desktop grid middleware Reduce the centralization factor: no static coordinator Benefit from the existing decentralized service discovery tools (Publish / Subscribe) Create coordinators on demand, automatically and without administrator intervention. Each coordinator selects machines to participate in the execution of a given application. 6
16 Design of BonjourGrid Coordinateur 1 Computing Element (CE) = 1 coordinator + N workers 7
17 Design of BonjourGrid Coordinateur 1 Computing Element (CE) = 1 coordinator + N workers 7
18 Design of BonjourGrid Coordinateur 1 Computing Element (CE) = 1 coordinator + N workers 7
19 Design of BonjourGrid Coordinateur 1 Computing Element (CE) = 1 coordinator + N workers 1 instance: 1 CE managed by a middleware 7
20 Design of BonjourGrid Coordinateur 1 Computing Element (CE) = 1 coordinator + N workers 1 instance: 1 CE managed by a middleware Controls and orchestrate multiple instances 7
21 Design of BonjourGrid Coordinateur 1 Computing Element (CE) = 1 coordinator + N workers 1 instance: 1 CE managed by a middleware Controls and orchestrate multiple instances Introduction of the concept of meta-grids 7
22 Design of BonjourGrid 8
23 Design of BonjourGrid A 8
24 Design of BonjourGrid A B 8
25 Design of BonjourGrid A C B 8
26 Design of BonjourGrid D A C B 8
27 Design of BonjourGrid D A C B 8
28 Design of BonjourGrid D A C B 8
29 Design of BonjourGrid D A C B 8
30 Design of BonjourGrid D A C B 8
31 Design of BonjourGrid D A C B 8
32 Design of BonjourGrid D A C B 8
33 Design of BonjourGrid D A C B 8
34 Design of BonjourGrid D A C B 8
35 Design of BonjourGrid D A C B 8
36 Design of BonjourGrid D A C B 8
37 Design of BonjourGrid D A C B A computing element for each user 8
38 Design of BonjourGrid D A C B A computing element for each user No static coordinator 8
39 Design of BonjourGrid D A C B A computing element for each user No static coordinator Each user can specify a middleware for his computing element 8
40 Components of BonjourGrid BonjourGrid is based on : A resource discovery protocol Fully decentralized A computing element Executes and handles the various tasks of an application (Condor, Boinc, XtremWeb) A global coordination protocol Manages and controls all resources, services and computing elements Does not depend on any specific machine or centralized element 9
41 Discovery protocol Based on Bonjour protocol Multicast IP network An implementation by Apple of ZeroConf protocol. Structured around three functionalities : Dynamic allocation of IP addresses without DHCP Resolution of names and IP addresses without DNS Services discovery without directory server Motivations Industrial protocol approved by Apple Different versions for the 3 OS (Windows, Linux, MaxOS) Linux and MacOS distributions integrate Bonjour Evolution of networks (10 Gb/s 10 * x Gb/s) => low risk of network congestion for multicast protocols 10
42 Computing element (CE) Each coordinator creates dynamically its CE CE = Coordinator + set of workers CE functionalities Allocates workers Submits and run tasks on workers Schedules and get results Computing systems XtremWeb, Condor or Boinc 11
43 Computing element (CE) Each coordinator creates dynamically its CE CE = Coordinator + set of workers CE functionalities Allocates workers Submits and run tasks on workers Schedules and get results Computing systems XtremWeb, Condor or Boinc 1 specific CE for each user 11
44 Coordination protocol Each machine can have one of the three states (Idle, Worker or Coordinator). A machine announces its state by publishing the specific service to this state : IdleService for idle state WorkerService for worker state CoordinatorService for coordinator state When machine state changes: it publishes the appropriate service to advertise this new state, after having deactivated the old one. Every machine can discover machines that are in a given state: A machine launches a discovery on a particular service instead of permanently receiving all new events. Restrict communication between machines. 12
45 Layered architecture 13
46 Layered architecture Publish/Subscribe 13
47 Layered architecture Connection to BonjourGrid Publish/Subscribe 13
48 Layered architecture Resources discovery Connection to BonjourGrid Publish/Subscribe 13
49 Layered architecture Resources discovery Resources characteristics Connection to BonjourGrid Publish/Subscribe 13
50 Layered architecture Establishment of CE network Resources discovery Resources characteristics Connection to BonjourGrid Publish/Subscribe 13
51 Layered architecture Establishment of CE network Resources discovery Connection to BonjourGrid Publish/Subscribe XtremWeb Resources characteristics 13
52 Layered architecture Establishment of CE network Resources discovery Connection to BonjourGrid Publish/Subscribe Condor XtremWeb Resources characteristics 13
53 Layered architecture Establishment of CE network Resources discovery Connection to BonjourGrid Publish/Subscribe Boinc Condor XtremWeb Resources characteristics 13
54 Layered architecture Deployment of a computing system Establishment of CE network Resources discovery Connection to BonjourGrid Publish/Subscribe Boinc Condor XtremWeb Resources characteristics 13
55 Integration of Boinc in BonjourGrid!"#$%&'()"%*'#"+!"#$"#%&'()*++,!"#$%&'(%)&*""+!"#$%&!"#$%&'()#*+,- Account,mail,Certificate!"#$%&'()''*&+, URL,ProjectName,NbreWorker URL,ProjectName ServiceType,HostName..!"#$"%&#'"($) IP,Hostname,CPU,Memory!"#$%&'(")!"#$%#&'%()%*+,!"#$"%&'$()$* IP,Hostname,CPU,Memory!"#$%&'(")!"#$"%&'&()*
56 Integration of Condor in BonjourGrid!"#"$%&'(#)*"++,!"#$%#&'()*+,,-!"#$"%&!"#"$%&'()#*$+ Host/IP Access levels security, Mail and Networks parameters!""#$%&&'()* IP,PoolName,ManagerName, CollectorName,DomainName,NbreWorker IP,PoolName,ManagerName, CollectorName,DomainName!"#$"%&#'"($) IP,Hostname,CPU,Memory!"#$%&'(") ServiceType,HostName..!"#$%#&'%()%*+,!"#$"%&'$()$* IP,Hostname,CPU,Memory!"#$%&'(")!"#$"%&'&()*
57 Fault tolerance approach Each computing system is responsible for : controlling and monitoring application tasks execution the fault-tolerance of its workers within a computing element The failure of workers is not the responsibility of BonjourGrid The failure of coordinators is in the charge of BonjourGrid 16
58 Fault tolerance approach Solution Create dynamically backup coordinators for each application, Provide k backup (k is a configuration setting that must be fixed before construction of the CE) for each application, using a passive replication
59 Fault tolerance approach Idle Worker Coordinator 18
60 Fault tolerance approach Idle Worker Coordinator Construction of a computing element and 2 backups of the coordinator 18
61 Fault tolerance approach Idle Worker Coordinator Construction of a computing element and 2 backups of the coordinator 18
62 Fault tolerance approach Idle Worker Coordinator Construction of a computing element and 2 backups of the coordinator 18
63 Fault tolerance approach Idle Worker Coordinator Construction of a computing element and 2 backups of the coordinator 18
64 Fault tolerance approach Idle Worker Coordinator Construction of a computing element and 2 backups of the coordinator The workers go back to idle state when the coordinator is disabled 18
65 Fault tolerance approach Idle Worker Coordinator Construction of a computing element and 2 backups of the coordinator The workers go back to idle state when the coordinator is disabled Problem: The coordinator has not yet completed the application 18
66 Fault tolerance approach Solution : Status field to distinguish between : Stop due to failure Status = 0 (application is in execution) Stop following the end of application Status = 1 (application finished) 19
67 Fault tolerance approach Solution : Status field to distinguish between : Stop due to failure Status = 0 (application is in execution) Stop following the end of application Status = 1 (application finished) Status Idle Worker Coordinator 19
68 Fault tolerance approach Solution : Status field to distinguish between : Stop due to failure Status = 0 (application is in execution) Stop following the end of application Status = 1 (application finished) Status Idle Worker Coordinator 19
69 Fault tolerance approach Solution : Status field to distinguish between : Stop due to failure Status = 0 (application is in execution) Stop following the end of application Status = 1 (application finished) Status Idle Worker Coordinator 19
70 Fault tolerance approach Solution : Status field to distinguish between : Stop due to failure Status = 0 (application is in execution) Stop following the end of application Status = 1 (application finished) Status Idle Worker Coordinator 19
71 Experimentations System evaluation based on a set of specific applications? a specific arrival pattern (Poisson s Law)? Workload model very close to the reality Feitelson and Lublin Inputs of the workload model Number of nodes (system size) Arrival time of applications Maximum number of parallel tasks Tasks execution times Application ID Arrived Time (s) Execution Time (s) Nbre of parallel tasks
72 Experimentations Emulation of a set of users and a set of applications 1 CE is dynamically created for each application Emulator Parameters : list of machines list of applications workload model Submit an application following the arrival pattern of applications in the workload Look for free machine on which a coordinator will start to initiate the application tasks execution The CE is released when application tasks finish 21
73 Experimentations Calculate: (end time of an application - submission time) Analyze the delay caused by the decentralization Analyze the behavior of BonjourGrid with : Boinc Condor Setup BonjourGrid : N machines (dynamic infrastructure) Boinc or Condor : 1 coordinator + N-1 workers (static infrastructure) 22
74 Experimentations - Boinc Setup : 130 applications (2 to 128 // tasks) 200 machines on Grid5000 (Orsay s node) Execution times vary from 1 to 500 seconds Results : With BonjourGrid, 60% of applications give a delay varying from 24 to 1277 s BonjourGrid gives execution times < Boinc when the tasks number is important Time (s) in logscale(2) Nbre Of // Tasks #Applications Turnaround time of BOINC Turnaround time of BonjourGrid Nbre of Tasks per App. 23
75 Experimentations - Condor Setup : 130 applications (2 to 128 // tasks) 200 machines on Grid5000 (Orsay s node) Execution times vary from 1 to 500 seconds Results : With BonjourGrid, 35% of applications generate a delay around 30 s BonjourGrid generates more important delays for applications which are preceded by applications with a large number (>100) of tasks Time (s) in logscale(2) Nbre Of // Tasks #Applications Turnaround time of Condor Turnaround time of BonjourGrid Nbre of Tasks per App. 24
76 Experimentations - Fault tolerance Using virtual machines to save the state of coordinators XEN virtualization system 10 applications, with parallel tasks ranging from 2 to 128 tasks 50 machines on the node of Nancy (Grid5000) Faults scenarios by injecting faults during the execution of applications Recovery Time Time of recovery of the coordinator Time to re-establish the connection workers 25
77 Fault tolerance framework!"#$%&'()*#!%+,-.$&-&/-(0#!""#$%&'("#) *"%&+,!"&$"#)!"#$"%&'()*+&'!"#$%&'()*+,'-.)&&%&/'012' Establishment of connection(4)!"#$%&&$ Worker Migrate Snapshot (2) Save Snapshot (1) Restore Snapshot (3)!"#$%&'()*# +)%,-.(/0 12&%/(2#!""#$%&'("#) *"%&+,!"&$"#)!"#$"%&'()*+&'!"#$%&'()*# +)%,-.(/0 1&-/(2&#!"#$"%&'()*+&'!"#$%&'( )"&*"+,-,%$(.,#/01",2(!"#$%&'(")*'!"#$%&'()*+,' -.)&&%&/'012'!"#$%&'(")*'!"#$%&'()*+,' -.,$/,*0'123'!"#$%&&$ Main coordinator!"#$%&&$ Backup coordinator
78 Experimentations - Fault tolerance BOINC Average delay of 197 sec Almost stable delay which does not depend on number of tasks Boinc allows the continuation of work after coordinator failure Time (s) in logscale(2) #Applications Turnaround time of BOINC Turnaround time of BOINC-FT Nbre of Tasks per App. Level of Fault Injection Nbre Of // Tasks 27
79 Experimentations - Fault tolerance Condor Average delay of 238 sec Condor recovers tasks that have not completed their executions Time (s) in logscale(2) #Applications Turnaround time of CONDOR Turnaround time of CONDOR-FT Nbre of Tasks per App. Level of Fault Injection Nbre Of // Tasks 28
80 Conclusion BonjourGrid: A novel approach for making a collaborative and decentralized Desktop Grid systems. Publish/Subscribe protocol Orchestrate the participants A computing system (Boinc, Condor or XtremWeb) for the execution level of an application. BonjourGrid makes a distributed control over resources and does not depend on a central element. BonjourGrid implements a Fault-tolerant mechanism for coordinators BonjourGrid favors collaborative execution and Meta- Grids orchestration 29
81 Future works Minimize the amount of information transferred between coordinators Include reservation rules based on history traces of the previous executions Integrate economic models Add a new layer for security issue 30
82 Our background is coordination, our future will be coordination of clouds!
83
84 Thanks. Any Questions?
Analysis of Peer-to-Peer Protocols Performance for Establishing a Decentralized Desktop Grid Middleware
Analysis of Peer-to-Peer Protocols Performance for Establishing a Decentralized Desktop Grid Middleware Heithem Abbes 1,2, Jean-Christophe Dubacq 2 1 Unité de Recherche UTIC ESSTT, Université de Tunis
More informationFault-Tolerance for PastryGrid Middleware
Fault-Tolerance for PastryGrid Middleware Christophe Cérin 1, Heithem Abbes 1,2, Mohamed Jemni 2, Yazid Missaoui 2 1 LIPN, Université de Paris XIII, CNRS UMR 7030, France 2 UTIC, ESSTT, Université de Tunis,
More informationFrom Grids to Clouds to PaaS: new research challenges
From Grids to Clouds to PaaS: new research challenges AOC Team Christophe Cérin 1 1 Université de Paris XIII, CNRS UMR 7030, France Inria - universités tunisiennes workshop 1 Christophe Cérin Sous thème
More informationAdvanced School in High Performance and GRID Computing November Introduction to Grid computing.
1967-14 Advanced School in High Performance and GRID Computing 3-14 November 2008 Introduction to Grid computing. TAFFONI Giuliano Osservatorio Astronomico di Trieste/INAF Via G.B. Tiepolo 11 34131 Trieste
More informationChapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.
Chapter 4:- Introduction to Grid and its Evolution Prepared By:- Assistant Professor SVBIT. Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies
More informationP2P technologies, PlanetLab, and their relevance to Grid work. Matei Ripeanu The University of Chicago
P2P technologies, PlanetLab, and their relevance to Grid work Matei Ripeanu The University of Chicago Why don t we build a huge supercomputer? Top500 supercomputer list over -0.70 time: -0.72 Zipf distribution:
More informationAssignment 5. Georgia Koloniari
Assignment 5 Georgia Koloniari 2. "Peer-to-Peer Computing" 1. What is the definition of a p2p system given by the authors in sec 1? Compare it with at least one of the definitions surveyed in the last
More informationSystematic Cooperation in P2P Grids
29th October 2008 Cyril Briquet Doctoral Dissertation in Computing Science Department of EE & CS (Montefiore Institute) University of Liège, Belgium Application class: Bags of Tasks Bag of Task = set of
More informationAudience(s) : IT Professionals Level : 300 Technology : Microsoft System Center Configuration Manager Delivery Method : Instructor-led (Classroom)
[MS10748]: Planning and Deploying System Center 2012 Configuration Length : 3 Days Audience(s) : IT Professionals Level : 300 Technology : System Center Configuration Delivery Method : Instructor-led (Classroom)
More informationTelecommunication Services Engineering Lab. Roch H. Glitho
1 Support Infrastructure Support infrastructure for application layer Why? Re-usability across application layer protocols Modularity (i.e. separation between application layer protocol specification /
More informationOutline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems
Distributed Systems Outline Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems What Is A Distributed System? A collection of independent computers that appears
More informationAdaptive Cluster Computing using JavaSpaces
Adaptive Cluster Computing using JavaSpaces Jyoti Batheja and Manish Parashar The Applied Software Systems Lab. ECE Department, Rutgers University Outline Background Introduction Related Work Summary of
More informationGrid Computing Middleware. Definitions & functions Middleware components Globus glite
Seminar Review 1 Topics Grid Computing Middleware Grid Resource Management Grid Computing Security Applications of SOA and Web Services Semantic Grid Grid & E-Science Grid Economics Cloud Computing 2 Grid
More informationOutline. INF3190:Distributed Systems - Examples. Last week: Definitions Transparencies Challenges&pitfalls Architecturalstyles
INF3190:Distributed Systems - Examples Thomas Plagemann & Roman Vitenberg Outline Last week: Definitions Transparencies Challenges&pitfalls Architecturalstyles Today: Examples Googel File System (Thomas)
More information30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy
Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Why the Grid? Science is becoming increasingly digital and needs to deal with increasing amounts of
More information[MS10992]: Integrating On-Premises Core Infrastructure with Microsoft Azure
[MS10992]: Integrating On-Premises Core Infrastructure with Microsoft Azure Length : 3 Days Audience(s) : IT Professionals Level : 300 Technology : Azure Delivery Method : Instructor-led (Classroom) Course
More informationto know how and when to apply which Microsoft technology. In many cases, you can combine multiple
Overview of Microsoft Virtualization Microsoft offers a number of virtualization technologies that administrators and infrastructure architects can use to create and administer a virtual environment. To
More informationIntroduction to Grid Computing
Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able
More informationWork Queue + Python. A Framework For Scalable Scientific Ensemble Applications
Work Queue + Python A Framework For Scalable Scientific Ensemble Applications Peter Bui, Dinesh Rajan, Badi Abdul-Wahid, Jesus Izaguirre, Douglas Thain University of Notre Dame Distributed Computing Examples
More informationDistributed Knowledge Organization and Peer-to-Peer Networks
Knowledge Organization and Peer-to-Peer Networks Klaus Wehrle Group Chair of Computer Science IV RWTH Aachen University http://ds.cs.rwth-aachen.de 1 Organization of Information Essential challenge in?
More informationARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer
ARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer computing platform Internal report Marko Niinimaki, Mohamed BenBelgacem, Nabil Abdennadher HEPIA, January 2010 1. Background and motivation
More informationUsing peer to peer. Marco Danelutto Dept. Computer Science University of Pisa
Using peer to peer Marco Danelutto Dept. Computer Science University of Pisa Master Degree (Laurea Magistrale) in Computer Science and Networking Academic Year 2009-2010 Rationale Two common paradigms
More informationGrid Scheduling Architectures with Globus
Grid Scheduling Architectures with Workshop on Scheduling WS 07 Cetraro, Italy July 28, 2007 Ignacio Martin Llorente Distributed Systems Architecture Group Universidad Complutense de Madrid 1/38 Contents
More informationvsan Disaster Recovery November 19, 2017
November 19, 2017 1 Table of Contents 1. Disaster Recovery 1.1.Overview 1.2.vSAN Stretched Clusters and Site Recovery Manager 1.3.vSAN Performance 1.4.Summary 2 1. Disaster Recovery According to the United
More informationA RESOURCE MANAGEMENT FRAMEWORK FOR INTERACTIVE GRIDS
A RESOURCE MANAGEMENT FRAMEWORK FOR INTERACTIVE GRIDS Raj Kumar, Vanish Talwar, Sujoy Basu Hewlett-Packard Labs 1501 Page Mill Road, MS 1181 Palo Alto, CA 94304 USA { raj.kumar,vanish.talwar,sujoy.basu}@hp.com
More informationDeploying virtualisation in a production grid
Deploying virtualisation in a production grid Stephen Childs Trinity College Dublin & Grid-Ireland TERENA NRENs and Grids workshop 2 nd September 2008 www.eu-egee.org EGEE and glite are registered trademarks
More information6. Peer-to-peer (P2P) networks I.
6. Peer-to-peer (P2P) networks I. PA159: Net-Centric Computing I. Eva Hladká Faculty of Informatics Masaryk University Autumn 2010 Eva Hladká (FI MU) 6. P2P networks I. Autumn 2010 1 / 46 Lecture Overview
More informationIntroduction to MapReduce
Introduction to MapReduce April 19, 2012 Jinoh Kim, Ph.D. Computer Science Department Lock Haven University of Pennsylvania Research Areas Datacenter Energy Management Exa-scale Computing Network Performance
More informationPlanning and Deploying System Center 2012 Configuration Manager
Planning and Deploying System Center 2012 Configuration Manager 10748C; 3 days, Instructor Led Course Description Get detailed instruction and hands-on practice planning and deploying Microsoft System
More informationOverview of Microsoft Virtualization
Overview of Microsoft Virtualization Microsoft offers a number of virtualization technologies that administrators and infrastructure architects can use to create and administer a virtual environment. To
More information10748: Planning and Deploying System Center 2012 Configuration Manager
Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com
More informationCondor and BOINC. Distributed and Volunteer Computing. Presented by Adam Bazinet
Condor and BOINC Distributed and Volunteer Computing Presented by Adam Bazinet Condor Developed at the University of Wisconsin-Madison Condor is aimed at High Throughput Computing (HTC) on collections
More informationITU Arab Forum on Future Networks: "Broadband Networks in the Era of App Economy", Tunis - Tunisia, Feb. 2017
Farouk Kamoun Professor Emeritus, ENSI President SESAME University Farouk.kamoun@sesame.com.tn ITU Arab Forum on Future Networks: "Broadband Networks in the Era of App Economy", Tunis - Tunisia, 21-22
More informationCSE 5306 Distributed Systems. Course Introduction
CSE 5306 Distributed Systems Course Introduction 1 Instructor and TA Dr. Donggang Liu @ CSE Web: http://ranger.uta.edu/~dliu Email: dliu@uta.edu Phone: 817-2720741 Office: ERB 555 Office hours: Tus/Ths
More informationVMWARE PROTECTION WITH DELL EMC NETWORKER 9
VMWARE PROTECTION WITH DELL EMC NETWORKER 9 Solution Overview ABSTRACT This white paper describes the integration of Dell EMC NetWorker with VMware vcenter. It also includes details on the NetWorker architecture
More information20533B: Implementing Microsoft Azure Infrastructure Solutions
20533B: Implementing Microsoft Azure Infrastructure Solutions Course Details Course Code: Duration: Notes: 20533B 5 days This course syllabus should be used to determine whether the course is appropriate
More informationData Protection Modernization: Meeting the Challenges of a Changing IT Landscape
Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape Tom Clark IBM Distinguished Engineer, Chief Architect Software 1 Data growth is continuing to explode Sensors & Devices
More informationGeorges Da Costa Introduction on Peer to Peer systems
Introduction on Peer to Peer systems Georges Da Costa dacosta@irit.fr Goal of this Lecture What can P2P do, not only as a buzzword What it can t do Shows some examples & algorithms A Survey and Comparison
More informationVeeam Availability Suite 9.5 Direct Restore to Microsoft Azure integration
Veeam Availability Suite 9.5 Direct Restore to Microsoft Azure integration Presenter: Mike Resseler Technical Evangelist Veeam Proven Partnerships A modern, high-performance solution for next generation
More informationIn this unit we are going to look at cloud computing. Cloud computing, also known as 'on-demand computing', is a kind of Internet-based computing,
In this unit we are going to look at cloud computing. Cloud computing, also known as 'on-demand computing', is a kind of Internet-based computing, where shared resources, data and information are provided
More informationDistributed Systems COMP 212. Lecture 18 Othon Michail
Distributed Systems COMP 212 Lecture 18 Othon Michail Virtualisation & Cloud Computing 2/27 Protection rings It s all about protection rings in modern processors Hardware mechanism to protect data and
More informationHow it can help your organisation
How it can help your organisation History Types of Virtualisation & Hypervisors Virtualisation Features Why Virtualisation? Virtualising Oracle Performance Licensing Support Cloud 1998 VMware founded by
More informationCisco Wide Area Bonjour Solution Overview
, page 1 Topology Overview, page 2 About the Cisco Application Policy Infrastructure Controller Enterprise Module (APIC-EM), page 5 The Cisco Wide Area Bonjour solution is based on a distributed and hierarchical
More informationWHITE PAPER SEPTEMBER VMWARE vsphere AND vsphere WITH OPERATIONS MANAGEMENT. Licensing, Pricing and Packaging
WHITE PAPER SEPTEMBER 2017 VMWARE vsphere AND vsphere WITH OPERATIONS MANAGEMENT Licensing, Pricing and Packaging Table of Contents Executive Summary 3 VMware vsphere with Operations Management Overview
More informationDistributed Systems Question Bank UNIT 1 Chapter 1 1. Define distributed systems. What are the significant issues of the distributed systems?
UNIT 1 Chapter 1 1. Define distributed systems. What are the significant issues of the distributed systems? 2. What are different application domains of distributed systems? Explain. 3. Discuss the different
More informationInteractive Analysis of Large Distributed Systems with Scalable Topology-based Visualization
Interactive Analysis of Large Distributed Systems with Scalable Topology-based Visualization Lucas M. Schnorr, Arnaud Legrand, and Jean-Marc Vincent e-mail : Firstname.Lastname@imag.fr Laboratoire d Informatique
More informationADAPTIVE AND DYNAMIC LOAD BALANCING METHODOLOGIES FOR DISTRIBUTED ENVIRONMENT
ADAPTIVE AND DYNAMIC LOAD BALANCING METHODOLOGIES FOR DISTRIBUTED ENVIRONMENT PhD Summary DOCTORATE OF PHILOSOPHY IN COMPUTER SCIENCE & ENGINEERING By Sandip Kumar Goyal (09-PhD-052) Under the Supervision
More informationBest Practices for Virtualizing Active Directory
Best Practices for Virtualizing Active Directory Breakout Session AP01 Chris Skinner Senior Technical Instructor,VMware, Inc. February 25, 2009 Disclaimer This session may contain product features that
More informationThe GridWay. approach for job Submission and Management on Grids. Outline. Motivation. The GridWay Framework. Resource Selection
The GridWay approach for job Submission and Management on Grids Eduardo Huedo Rubén S. Montero Ignacio M. Llorente Laboratorio de Computación Avanzada Centro de Astrobiología (INTA - CSIC) Associated to
More informationHéméra Inria Large Scale Initiative
Héméra Inria Large Scale Initiative https://www.grid5000.fr/hemera Christian Perez Avalon and many co-authors Motivations Scientific issues Large scale, volatile, complex systems Performance, fault tolerance,
More informationVirtualization for Desktop Grid Clients
Virtualization for Desktop Grid Clients Marosi Attila Csaba atisu@sztaki.hu BOINC Workshop 09, Barcelona, Spain, 23/10/2009 Using Virtual Machines in Desktop Grid Clients for Application Sandboxing! Joint
More informationBack To The Future - VMware Product Directions. Andre Kemp Sr. Product Marketing Manager Asia - Pacific
Back To The Future - VMware Product Directions Andre Kemp Sr. Product Marketing Manager Asia - Pacific Disclaimer This session contains product features that are currently under development. This session/overview
More informationCXS Citrix XenServer 6.0 Administration
Course Overview View Course Dates & Register Today Students will learn to effectively install, configure, administer, and troubleshoot XenServer 6.0. Students will also learn how to configure a Provisioning
More informationGrid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia
Grid services Dusan Vudragovic dusan@phy.bg.ac.yu Scientific Computing Laboratory Institute of Physics Belgrade, Serbia Sep. 19, 2008 www.eu-egee.org Set of basic Grid services Job submission/management
More informationFault tolerance in Grid and Grid 5000
Fault tolerance in Grid and Grid 5000 Franck Cappello INRIA Director of Grid 5000 fci@lri.fr Fault tolerance in Grid Grid 5000 Applications requiring Fault tolerance in Grid Domains (grid applications
More informationThe Intersection of Cloud & Solid State Storage
The Intersection of Cloud & Solid State Storage Val Bercovici Cloud Czar, NetApp Office of the CTO SNIA Cloud Storage Initiative SNIA Solid State Storage Initiative Cloud Backdrop Worldwide IT spending
More informationSZDG, ecom4com technology, EDGeS-EDGI in large P. Kacsuk MTA SZTAKI
SZDG, ecom4com technology, EDGeS-EDGI in large P. Kacsuk MTA SZTAKI The EDGI/EDGeS projects receive(d) Community research funding 1 Outline of the talk SZTAKI Desktop Grid (SZDG) SZDG technology: ecom4com
More informationPretaGov Australia SaaS Hosting with Fully Managed Services, Support and Maintenance
PretaGov Australia SaaS Hosting with Fully Managed Services, Support and Maintenance Introduction to PretaGov PretaGov operates a Government Community Plone CMS (content management system) SaaS Cloud exclusively
More informationNext-Generation Cloud Platform
Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology
More informationTowards a Security Model to Bridge Internet Desktop Grids and Service Grids
Towards a Security Model to Bridge Internet Desktop Grids and Service Grids Gabriel Caillat(1), Oleg Lodygensky(1), Etienne Urbah(1), Gilles Fedak(2), and Haiwu He(2) (1) Laboratoire de lʼaccelerateur
More informationDistributed Systems 26. Mobile Ad Hoc Mesh Networks
Distributed Systems 26. Mobile Ad Hoc Mesh Networks Paul Krzyzanowski pxk@cs.rutgers.edu 12/16/2011 1 Mesh Networks Mobile Ad-hoc networks, sensor networks, Decentralized networking No need for routers
More informationChapter 1: Introduction 1/29
Chapter 1: Introduction 1/29 What is a Distributed System? A distributed system is a collection of independent computers that appears to its users as a single coherent system. 2/29 Characteristics of a
More informationIntroducing VMware Validated Designs for Software-Defined Data Center
Introducing VMware Validated Designs for Software-Defined Data Center VMware Validated Design 4.0 VMware Validated Design for Software-Defined Data Center 4.0 You can find the most up-to-date technical
More informationDistributed Systems Principles and Paradigms. Chapter 01: Introduction
Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.20, steen@cs.vu.nl Chapter 01: Introduction Version: October 25, 2009 2 / 26 Contents Chapter
More informationAdvanced Scheduling Strategies and Grid Programming Environments
Advanced Scheduling Strategies and Grid Programming Environments Bruno Schulze 1, Geoffrey C Fox 2 1. National Laboratory for Scientific Computing (LNCC), Av. Getulio Vargas 333, Petropolis RJ, Brazil
More informationDecentralized Grid Management Model Based on Broker Overlay
Decentralized Grid Management Model Based on Broker Overlay Abdulrahman Azab 1, Hein Meling 1 1 Dept. of Computer Science and Electrical Engineering, Faculty of Science and Technology, University of Stavanger,
More informationDynamic Virtual Clusters in a Grid Site Manager
Dynamic Virtual Clusters in a Grid Site Manager Jeff Chase, David Irwin, Laura Grit, Justin Moore, Sara Sprenkle Department of Computer Science Duke University Dynamic Virtual Clusters Grid Services Grid
More informationIBM. Planning and Installation. IBM Workload Scheduler. Version 9 Release 4
IBM Workload Scheduler IBM Planning and Installation Version 9 Release 4 IBM Workload Scheduler IBM Planning and Installation Version 9 Release 4 Note Before using this information and the product it
More informationIntroducing VMware Validated Designs for Software-Defined Data Center
Introducing VMware Validated Designs for Software-Defined Data Center VMware Validated Design for Software-Defined Data Center 4.0 This document supports the version of each product listed and supports
More informationThe Google File System
The Google File System Sanjay Ghemawat, Howard Gobioff and Shun Tak Leung Google* Shivesh Kumar Sharma fl4164@wayne.edu Fall 2015 004395771 Overview Google file system is a scalable distributed file system
More informationMarch 10, Distributed Hash-based Lookup. for Peer-to-Peer Systems. Sandeep Shelke Shrirang Shirodkar MTech I CSE
for for March 10, 2006 Agenda for Peer-to-Peer Sytems Initial approaches to Their Limitations CAN - Applications of CAN Design Details Benefits for Distributed and a decentralized architecture No centralized
More informationTypes of Virtualization. Types of virtualization
Types of Virtualization Emulation VM emulates/simulates complete hardware Unmodified guest OS for a different PC can be run Bochs, VirtualPC for Mac, QEMU Full/native Virtualization VM simulates enough
More informationDesigning Windows Server 2008 Network and Applications Infrastructure
Designing Windows Server 2008 Network and Applications Infrastructure Course No. 6435B - 5 Days Instructor-led, Hands-on Introduction This five-day course will provide students with an understanding of
More informationJourney to the Cloud Next Generation Infrastructure for the future workforce.
Journey to the Cloud Next Generation Infrastructure for the future workforce. Steven J. Davis Managing Director Infrastructure Consulting ASEAN Accenture What is the Internet & Cloud The Internet traditionally
More informationMultiprocessor Scheduling. Multiprocessor Scheduling
Multiprocessor Scheduling Will consider only shared memory multiprocessor or multi-core CPU Salient features: One or more caches: cache affinity is important Semaphores/locks typically implemented as spin-locks:
More informationFlexRay International Workshop. Protocol Overview
FlexRay International Workshop 4 th March 2003 Detroit Protocol Overview Dr. Christopher Temple - Motorola FlexRay principles Provide a communication infrastructure for future generation highspeed control
More informationTips for creating a VDI backup plan
E-Guide Tips for creating a VDI backup plan Most virtual desktop infrastructure (VDI) products claim to simplify the backup process, but IT administrators are finding that backing up and protecting data
More informationGrid Computing Systems: A Survey and Taxonomy
Grid Computing Systems: A Survey and Taxonomy Material for this lecture from: A Survey and Taxonomy of Resource Management Systems for Grid Computing Systems, K. Krauter, R. Buyya, M. Maheswaran, CS Technical
More informationA High Availability Solution for GRID Services
A High Availability Solution for GRID Services Álvaro López García 1 Mirko Mariotti 2 Davide Salomoni 3 Leonello Servoli 12 1 INFN Sezione di Perugia 2 Physics Department University of Perugia 3 INFN CNAF
More informationConsolidate and Prepare for Cloud Efficiencies Oracle Database 12c Oracle Multitenant Option
Consolidate and Prepare for Cloud Efficiencies Oracle Database 12c Oracle Multitenant Option Eric Rudie Master Principal Sales Consultant Oracle Public Sector 27 September 2016 Safe Harbor Statement The
More informationHPC learning using Cloud infrastructure
HPC learning using Cloud infrastructure Florin MANAILA IT Architect florin.manaila@ro.ibm.com Cluj-Napoca 16 March, 2010 Agenda 1. Leveraging Cloud model 2. HPC on Cloud 3. Recent projects - FutureGRID
More informationg-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.
g-eclipse A Framework for Accessing Grid Infrastructures Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.cy) EGEE Training the Trainers May 6 th, 2009 Outline Grid Reality The Problem g-eclipse
More informationConsolidated Disaster Recovery. Paul Kangro Applied Technology Strategiest
Consolidated Disaster Recovery Paul Kangro Applied Technology Strategiest 2008.10.22 Presentation Overview About PlateSpin Technology Overview State of Disaster Recovery Introducing PlateSpin Forge Product
More informationGrid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms
Grid Computing 1 Resource sharing Elements of Grid Computing - Computers, data, storage, sensors, networks, - Sharing always conditional: issues of trust, policy, negotiation, payment, Coordinated problem
More informationDeploying System Center 2012 Configuration Manager Course 10748A; 3 Days
Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Deploying System Center 2012 Configuration Manager Course 10748A; 3 Days
More informationIOS: A Middleware for Decentralized Distributed Computing
IOS: A Middleware for Decentralized Distributed Computing Boleslaw Szymanski Kaoutar El Maghraoui, Carlos Varela Department of Computer Science Rensselaer Polytechnic Institute http://www.cs.rpi.edu/wwc
More informationDistributed Systems Principles and Paradigms
Distributed Systems Principles and Paradigms Chapter 01 (version September 5, 2007) Maarten van Steen Vrije Universiteit Amsterdam, Faculty of Science Dept. Mathematics and Computer Science Room R4.20.
More informationWhere are you with your Cloud or Clouds? Simon Kaye Dr Cloud
Where are you with your Cloud or Clouds? Simon Kaye Dr Cloud 15 th September, 2011 2 3 Cloud Computing definitions are varying, but a common set of attributes can be identified 4 Organizations need to
More informationVMware vsphere 6.5: Install, Configure, Manage (5 Days)
www.peaklearningllc.com VMware vsphere 6.5: Install, Configure, Manage (5 Days) Introduction This five-day course features intensive hands-on training that focuses on installing, configuring, and managing
More informationToward Energy-efficient and Fault-tolerant Consistent Hashing based Data Store. Wei Xie TTU CS Department Seminar, 3/7/2017
Toward Energy-efficient and Fault-tolerant Consistent Hashing based Data Store Wei Xie TTU CS Department Seminar, 3/7/2017 1 Outline General introduction Study 1: Elastic Consistent Hashing based Store
More informationChapter 18 Distributed Systems and Web Services
Chapter 18 Distributed Systems and Web Services Outline 18.1 Introduction 18.2 Distributed File Systems 18.2.1 Distributed File System Concepts 18.2.2 Network File System (NFS) 18.2.3 Andrew File System
More informationDistributed Systems Principles and Paradigms. Chapter 01: Introduction. Contents. Distributed System: Definition.
Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.20, steen@cs.vu.nl Chapter 01: Version: February 21, 2011 1 / 26 Contents Chapter 01: 02: Architectures
More informationIntroducing VMware Validated Designs for Software-Defined Data Center
Introducing VMware Validated Designs for Software-Defined Data Center VMware Validated Design for Software-Defined Data Center 3.0 This document supports the version of each product listed and supports
More informationChapter 3. Design of Grid Scheduler. 3.1 Introduction
Chapter 3 Design of Grid Scheduler The scheduler component of the grid is responsible to prepare the job ques for grid resources. The research in design of grid schedulers has given various topologies
More informationDeveloping Microsoft Azure Solutions
Course 20532C: Developing Microsoft Azure Solutions Course details Course Outline Module 1: OVERVIEW OF THE MICROSOFT AZURE PLATFORM This module reviews the services available in the Azure platform and
More informationGrid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen
Grid Computing 7700 Fall 2005 Lecture 5: Grid Architecture and Globus Gabrielle Allen allen@bit.csc.lsu.edu http://www.cct.lsu.edu/~gallen Concrete Example I have a source file Main.F on machine A, an
More informationCENTRAL TEXAS COLLEGE ITNW 2356 Designing a Network Directory Infrastructure. Semester Hours Credit: 3 INSTRUCTOR: OFFICE HOURS:
CENTRAL TEXAS COLLEGE ITNW 2356 Designing a Network Directory Infrastructure Semester Hours Credit: 3 INSTRUCTOR: OFFICE HOURS: I. INTRODUCTION A. Design, implement, and support a network directory infrastructure
More information[MS20533]: Implementing Microsoft Azure Infrastructure Solutions
[MS20533]: Implementing Microsoft Azure Infrastructure Solutions Length : 5 Days Audience(s) : IT Professionals Level : 300 Technology : Microsoft Products Delivery Method : Instructor-led (Classroom)
More informationUnit 8 Peer-to-Peer Networking
Unit 8 Peer-to-Peer Networking P2P Systems Use the vast resources of machines at the edge of the Internet to build a network that allows resource sharing without any central authority. Client/Server System
More informationCMS Tier-2 Program for user Analysis Computing on the Open Science Grid Frank Würthwein UCSD Goals & Status
CMS Tier-2 Program for user Analysis Computing on the Open Science Grid Frank Würthwein UCSD Goals & Status High Level Requirements for user analysis computing Code Development Environment Compile, run,
More information