1 Resource Management
|
|
- Jasmin Joseph
- 5 years ago
- Views:
Transcription
1 Resource Management 1
2 2 Resource Sharing Problem Locate resources What computers can I use. Which are the IP addresses? Allocate resources Get a free computer now, reserve one for tomorrow at noon Authenticate and access control (authorization) i Grid security lecture Prepare a resource for use Free 2MB of disk space, ask the other users to log out Use the resource Run a job (execute an application) Monitor the resource Is the computer still connected to the Internet? Did my application finish?
3 3 Resource Management Stack 5. Scheduler Optimum resource allocation 4. Resource brokers Map high-level onto local requests Resource matchmaking and negotiation 3. Resource co-allocators Allocate multiple resources simultaneously 2. Grid site resource manager Bridge over local managers Common Grid resource management language 1. Local resource managers Heterogeneous site autonomy Every local l system administrator i does what he wants and likes Scheduler Resource Broker Multiple Grid Site Resource Co-allocator Grid Site Resource Manager Local Resource Manager
4 4 1. Local Resource Management
5 5 What is a Resource? An entity that is to be shared E.g., computers, networks, storage, data, software, people Does not have to be a physical entity Defined in terms of interfaces, not devices ssh defines the access to a computer POSIX defines interface to a computer resources Open/close/read/write functions define access to a distributed ib t d file system o e.g., NFS, AFS, DFS Everything (or anything) is a resource
6 6 Compute Resources Single processor computers Home computers Parallel computers Symmetric multi-processors o Set of processors sharing a common main memory Distributed memory computers o Network of workstations CC-NUMA o Physically distributed, ib t d logically ll shared, cache coherent memory Batch queuing systems for controlled use
7 7 2. Grid Resource Management
8 8 Why it is Hard? Site autonomy No control over local administration If one site suddenly decides to suspend its daily production it cannot announce the entire Grid At maximum it can publish a local Web page Heterogeneous substrate Many different platforms E.g., queuing systems: LoadLeveler, LSF, PBS, SGE, Co-allocation Simultaneous access to independent resources Unreliability ZID sites should be available at 22:00, but diligent tutors shut them down
9 9 GRAM Bridge GRAM LL LSF PBS SGE Grid Resource Allocation Manager Grid applications use GRAM to execute processes on computers GRAM uses the local job queuing system to manage resources manage resources Set-up by the local system administrator
10 10 Grid Resource Allocation on Manager GRAM Defines resource layer protocols and APIs that enable clients to securely instantiate a Grid computational task Remote job submissions Relies on local resource management interfaces Load Leveler, LSF, PBS, SGE GSI enabled This is a new required and important Grid feature Wrapper over local resource management (job queuing) systems
11 11 GRAM Architecture GRAM Client Gatekeeper Authentication Globus Security Infrastructure MDS client API calls to locate resources GRAM client API calls to request resource allocation and process creation. Create MDS Site boundary Gram Reporter Request Job Manager Parse RSL Library Update MDS with resource state information Query current status of resource Local Resource Manager Monitor & control Process Process Process Allocate & create processes
12 12 Host Certificates Each Grid host runs a GRAM gatekeeper Each Grid host has a X.509 certificate Identity + Public key /etc/grid-security/hostcert.pem The certificate must be for a machine which has a consistent name in DNS To not be run on a computer using DHCP where a different name could be assigned to your computer Used for mutual authentication with every user
13 13 Mutual Authentication Protocol User authenticates to GRAM with a proxy certificate GRAM authenticates to the user with a host certificate Handshaking protocol Generate a random message Encrypt with the public key partner (taken from the X.509 certificate) Send the encrypted message to the partner The partner decrypts it and sends it back If the decrypted message matches the original random one, the handshaking h is completed Host X.509 User Proxy private key Proxy X.509 GRAM Gatekeeper Host private key
14 GRAM Architecture 14
15 15 globus-job-run p.u.. grid-proxy-init Your identity: /O=AustrianGrid/O=UIBK/OU=DPS/CN=Radu Prodan Enter GRID pass phrase for this identity: Creating proxy... Done Your proxy is valid until: Thu Feb 3 21:58: radu@petzeck:~$ globus-job-run gescher.vcpc.univie.ac.at /bin/echo "Hello Grid" Hello Grid
16 16 Limitation Full path to executable must be given Big drawback in a Grid environment How am I supposed to know the full home path on each remote site? I never logged in on most of the Grid machines Information service radu@petzeck:~$ globus-job-run gescher.vcpc.univie.ac.at echo "Hello Grid" GRAM Job failed because the executable does not exist (error code 5)
17 17 The globus-job-run Script globus-job-run u altix1.uibk.ac.at /bin/echo "Hello Grid" Hello Grid radu@petzeck:~$ time globus-job-run altix1.uibk.ac.at ac at /bin/echo "Hello Grid" Hello Grid real 0m4.523s user 0m0.630s sys 0m0.280s
18 18 GRAM and PBS time globus-job-run altix1.uibk.ac.at/jobmanager-pbs /bin/hostname/h altix1 real 0m18.425s user 0m0.580s0 0 sys 0m0.220s
19 19 GRAM and SGE time globus-job-run hc-ma.uibk.ac.at/jobmanager-sge /bin/hostname/h real 0m33.651s user 0m0.620s sys 0m0.230s0 0
20 20 Job Termination How to detect job termination? Push events Local manager automatically ti notifies GRAM through callbacks Not supported Pull events GRAM queries the local manager While(1) { getstate(); Sleep(sec); } Compromise accuracy versus busy waiting (sec = 0) No way to know if the application crashed Master Process Pull event Slave Process Batch Queuing System Application Push event
21 21 Grid Resource Management nt Latency Executing jobs on the Grid is prone to HUGE latencies 5 seconds mutual authentication 15 seconds job queuing system o It cannot do busy waiting o Ask every 5-10 seconds or so Grid applications must have executions an order of magnitude higher than 20 seconds Otherwise you waste most of your time waiting
22 22 Batch Job Submission Send a batch job to the Grid globus-job-submit altix1.uibk.ac.at/jobmanager- pbs /bi/h /bin/hostname Receive a URL handler that is unique on the Grid / Internet Use the URL to query the status of the job globus-job-status / ACTIVE DONE
23 23 Batch Job Submission petzeck:/home/radu>globus-job-submit / u u m altix1.uibk.ac.at/jobmanager-pbs /home/cb56/cb561004/a.outout petzeck:/home/radu>globus-job-status ACTIVE petzeck:/home/radu>globus-job-status status DONE
24 24 Job Output globus-job-get-output g upu Job_id job URL job_id [-f number ] out -err [file] -out Gets standard d output -err Gets standard error -f number gets the last number of lines from stdout / stderr
25 25 Cancel a Job globus-job-cancel job_url Cancels (kills) a job Changes the state to FAILED globus-job-clean job_url Remove all the the files associated with a job previously started using globus-job-submit All the information concerning the job is lost Must be used after globus-job-get-output
26 26 Job Cancellation and Cleanup globus-job-cancel ibk t:40001/22451/ / Are you sure you want to cancel the job now (Y/N)? Y Job canceled. NOTE: You still need to clean files associated with the job by running globus-job-clean <jobid> radu@pc6163-c703:~$ globus-job-clean WARNING: Cleaning a job means: - Kill the job if it still running, and - Remove the cached output on the remote resource Are you sure you want to cleanup the job now (Y/N)? Y Cleanup successful.
27 27 Executable Staging Copy the file you want to execute to the remote machine Of course, the executable has to be compiled for the remote architecture Not for the local one globus-job-run altix1.uibk.ac.at stage /bin/ls aout a.out altix1.jku.austriangrid.at
28 28 Cross Platform Executable Staging cb561004]$ globus-job-run agrid1.uibk.ac.at -stage /bin/ls /scratch/cb56/cb561004/.globus/.gass_cache /local/md5/cf/b fcf8d51c3cecf 89c864/md5/4b/fde7a8d6acbd9901f1e1211 d2ff114/data: /scratch/cb56/cb561004/.globus/.gass_caccac he/local/md5/cf/b fcf8d51c3ce cf89c864/md5/4b/fde7a8d6acbd9901f1e12 864/ 8d6 bd9901f d2ff114/data: cannot execute binary file
29 29 Resource Specification Language Grid job requests may be too complex to be specified using batch commands RSL = Common language for specifying i complex job requests GRAM service translates this common language into scheduler specific language g E.g., RSL into PBS RSL is based on a set of (attribute=value) pairs & (executable= /bin/echo ) (arguments= Hello Grid ) GRAM service understands a well defined set of GRAM service understands a well defined set of attributes
30 30 Hello Grid RSL Example (directory = /home/radu/) (executable = /bin/echo) (arguments = Hello Grid ) (stdout = std.out) (stderr = std.err) Save this in run.rsl
31 31 The globusrun Script time globusrun -r altix1.jku.austriangrid.at/jobmanager-pbs b -f run.rsl globus_gram_client_callback_allow successful GRAM Job submission successful GLOBUS_GRAM_PROTOCOL_JOB_STATE_PENDING GLOBUS_GRAM_PROTOCOL_JOB_STATE_ACTIVE GLOBUS_GRAM_PROTOCOL_JOB_STATE_DONE real 0m20.150s user 0m0.220s sys 0m0.010s010s
32 32 State Transition Diagram GRAM manages asynchronous state change Mm g y u g callbacks
33 33 Execution Attributes (executable = string) Program to run A file path (absolute or relative) URL for file staging o Copy the input file from the remote URL to local l file (directory = string) Directory in which to run (default is $HOME) (arguments = arg1 arg2 arg3...) List of string arguments to program (environment = (var1 val1) (var2 val2)) List of environment nm nt variable name/value pairs
34 34 Input / Output Attributes (stdin = string) Stdin for program A file path (absolute or relative) URL for stdin staging o Automatically get the input from the given URL (stdout = string) (stderr = string) Stdout / stderr for program A file path (absolute or relative) URL for stdout / stderr staging
35 35 Sample RSL & (executable = my_prog) (directory = /home/radu/my_dir) (arguments = std1.in std2.in std3.in) (environment = (JAVA_HOME /usr/java) (stdin (stdout (stderr (PATH /usr/java/bin) ) (LD_LIBRARY_PATH LIBRARY /usr/lib ) ) = std.in) = std.out) = std.err)
36 36 Execution Time Attributes (maxtime=integer) m Maximum wall clock or cpu runtime (GRAM s choice) in minutes (maxwalltime=integer) Maximum wall clock runtime in minutes Wallclock time = total execution time What you see when you look at the watch (maxcputime=integer) Maximum CPU runtime in minutes How much the process sits on the CPU If the execution time is exceeded the job is killed
37 37 Parallel Processing Attributes (count=integer) Number of processors to allocate on a parallel machine Default is 1 (hostcount=integer) number of nodes to distribute the count processes On SMP clusters Count / hostcount processes per node (count = 4) (hostcount = 2) 2 distinct nodes 2 threads per node
38 38 Memory Attributes (maxmemory=integer) Maximum amount of memory for each process in megabytes (minmemory=integer) integer) Minimum amount of memory for each process in megabytes It would be interested what happens when you try this
39 39 Job Type (jobtype=value) Predefined job types for easy use Single run a single instance of the program let the program start the other count-1 processes Multiple Start <count> instances of the program using the appropriate scheduling mechanism MPI Run the program using mpirun -np <count> Condor (a very advanced job queuing system) Start a <count> Condor processes running in standard universe
40 40 Sequential Process & (jobtype (count = 1) (directory (executable (arguments (stdout (stderr = single) = /home/radu/my_dir) dir) = my_prog) = std.in) = std.out) = std.err) (maxwalltime = 120) (maxcputime = 60)
41 41 OpenMP Applications & (jobtype = single) (count = 4) (directory = /home/radu/my_dir)/ / di (executable = my_prog) (arguments = std.in) (stdout = std.out) (stderr = std.err) (environment = (OMP_GET_NUM_THREADS 4) ) (maxwalltime = 120) (maxcputime = 120) Allocate count (4) processors Run a single instance of the program my_prog Let my_prog start the other count-1 (3) processes
42 42 MPI Applications & (jobtype = mpi) (count = 4) (directory = /home/radu/my_dir) (executable = my_prog) (arguments = std.in) (stdout = std.out) (stderr = std.err) (maxwalltime = 120) (maxcputime = 120) Automatically run all the count (4) my prog Automatically run all the count (4) my_prog instances using mpirun
43 43 Queuing System Attributes (project=string) Project (account) against which to charge (queue=string) Queue into which to submit job short, express, long,
44 44 RSL Substitutions RSL supports simple variable substitutions Substitutions are declared using a list of pairs (rslsubstitution=(sub1 (SUB1 val1)(sub2 val2)) A substitution st tut is invoked with $(SUB) Processing order: Within scope, processed left-to-right, Outer scope processed before inner scope Variable definition can reference previously defined d variables
45 45 File Staging & (executable = my_prog) (directory = /home/radu/my_dir) (arguments = std1.in std2.in std3.in) (rsl _ substitution = URL ) (file_stage_in _ = ($( $(URL)/my_in_file _ my_in_file _ )) ) (file_stage_out = ( $(URL)/my_out_file my_out_file ) ) (file_clean_up = my_in_file) (stdin = std.in) (stdout = std.out) (stderr = std.err)
46 46 Default RSL Substitutions GRAM defines a set of RSL substitutions before processing the job request Useful for portability Machine Information GLOBUS_HOST_MANUFACTURER _ GLOBUS_HOST_CPUTYPE GLOBUS_HOST_OSNAME GLOBUS_HOST_OSVERSION Paths to Globus GLOBUS_LOCATION Miscellaneous HOME LOGNAME GLOBUS_ID
47 47 Other Attributes dryrun Simulate what would happen in terms of resource allocation (do not run the job) Useful for testing whether my interface to GRAM is correct scratch_dir Specifies the location to create a scratch subdirectory A SCRATCH_DIRECTORY substitution will be filled with the name of the directory which is created stdout_position / stderr_position Specifies where in the file remote standard output / error streaming should be restarted from.
48 48 Fault Tolerance Attributes (save _ state=yes) saves job state information to a persistent file on disk (restart) Start a new job manager to manage an existing job Searches for a state file saved with save_state Recover from a GRAM crash proxy_timeout Specifies how long (in seconds) before the delegated X509 proxy expires the job manager should exit
49 49 Two-Phase Commit Some application need to be executed exactly once (no more, no less) e.g. Bank transfers You ask GRAM to execute something, but after the mutual authentication, the network crashes You get no PENDING / ACTIVE event Did GRAM receive the request or not? Two-phase commit protocol Preparation phase Commit phase (two_phase=<int>) <int> = seconds to wait before job times out
50 2PC Protocol 50
51 51 Fault Tolerant Example & (count=8) (hostcount= 2:ppn=4") (jobtype=mpi) (directory=/home/radu/apps/lapw0/src) (executable=../src/lapw0) (arguments=lapw0.def) (stdout=st.out) (stderr=st.err) (maxwalltime = 60) (maxcputime = 60) (save_state=yes) (two_phase = yes) (proxy_timeout t = 10)
52 52 Commodity Grid Kits Application Grid Portals Grid Applications Java CoG Kit High BiologyChemistry Energy Physics Earth Science Grid Portals CoG Portal Services Python CoG Kit dleware Grid Mid Grid Management Services Advanced CoG Services (File Management, Job Management, Security Management) SWIG Grid Primitives Execution & Security Grid prototyping Web Services GT3 OGSA Globus Toolkit GT2 GT4
53 53 CoG Abstraction Layers Nano Bio- Disaster materials Informatics Management Applications Portals Development Support CoG Gridfaces Layer CoG Data and Task Management Layer CoG Abstraction Layer CoG og GridI IDE CoG CoG CoG CoG CoG CoG CoG CoG CoG CoG CoG CoG CoG CoG GT3 Others GT4 GT2 OGSI WS-RF Condor Unicore SSH Avaki classic SETI
54 54 Java CoG Kit Java API to the Globus services Download from org/index php/table ~/.globus.cog.properties gp p usercert=/home/radu/.globus/usercert.pem userkey=/home/radu/.globus/userkey.pem / / / k proxy=/tmp/x509up_u104 p p_
55 55 Poll Event-based Job import org.globus.gram.*; import org.globus.gram.internal.*; l class GramPoll { public static void main(string[] args) { String rsl = "&(executable=\"/bin/ls\")(stdout=\"std.out\")"; GramJobRun job = new GramJobRun(rsl,"agrid1.uibk.ac.at"); ibk job.run(); while(job.getstatus() ()!= GRAMConstants.STATUS_ DONE && job.getstatus()!= GRAMConstants.STATUS_FAILED) try { Thread.sleep(1000); System.out.println(job.getStatusAsString()); } catch(interruptedexception p ex) { } } }
56 56 Push Event-based Job import org.globus.gram.internal.*; import org.globus.gram.*; g g class GramPush implements GramJobListener { private static boolean done = false; public GramPush(String rsl) throws Exception { GramJob job = new GramJob(rsl); job.addlistener(this); job.request("altix1.uibk.ac.at"); } public static void main(string[] args) throws Exception { String rsl = "&(executable=\"/bin/ls\")(stdout=\"std.out\")"; new GramPush(rsl); while(! done) { Thread.sleep(1000); } } } public void statuschanged(gramjob job) { if(job.getstatus() == GRAMConstants.STATUS_DONE job.getstatus() == GRAMConstants.STATUS_FAILED) done = true; }
57 57 3. Multiple Grid Site Resource Co-allocation
58 Grid Resource Management 58 Architecture t Resource Specification Broker + Language Scheduler RSL specialization Application Ground RSL Queries & Info Information Service Co-allocator Grid resource managers Local resource managers Simple ground RSL GRAM GRAM GRAM LSF PBS NQE
59 59 Resource Co-allocation Run an application on two different computers in different administration domains The computers are not under the control of the same local l batch queuing system Problem: The job may start on one system immediately And stays s in the queue for 1 day in the other I 1 d h fi b In 1 day the first system may be no longer free PBS LSF
60 Dynamically Updated Request 60 DUROC Online Co-allocator Run jobs on multiple sites Decomposes the job into multiple GRAM requests Rendez-vous barrier with allocating agent What happens if one site is available but the other busy? RSL 1 GRAM 1 One site waits at the barrier Job Extra RSL options to control barrier synchronization inter sub-job communication Procs. RSL DUROC RSL n GRAM n globus_duroc_runtime_barrier Job Procs.
61 61 Co-allocation Example + ( & (resourcemanagercontact="c703-pc450 c703 pc450.uibk.ac.at/jobmanager at/jobmanager-pbs") (jobtype = mpi) (label ="subjob 0") (environment =(GLOBUS_DUROC_SUBJOB_INDEX SUBJOB 0) ) (count = 1) (executable = /afs/zid1.uibk.ac.at/home/c703/c703246/teaching/mpi/a.out) (stdout = mpi1.out) (stderr = mpi1.err) ) ( & (resourcemanagercontact="c703-pc421.uibk.ac.at/jobmanager-pbs") (jobtype = mpi) (label ="subjob 1") (environment =(GLOBUS_DUROC_SUBJOB_INDEX 1) ) (count = 1) (executable = /afs/zid1.uibk.ac.at/home/c703/c703246/teaching/mpi/a.out) (td (stdout t = mpi2.out) (stderr = mpi2.err) )
62 62 Resource Management nt Attributes (resourcemanagercontact=string) g) (resourcemanagername=string) resource manager to which to submit a subjob (label=string) identifier for this subjob (subjobstarttype=value) alters the startup t barrier mechanism values are strict-barrier, loose-barrier, no-barrier (subjobcommstype=value) values are blocking-join and independent if value is set to independent, the subjob won t be seen from the other subjobs when doing inter-subjob communication
63 63 MPI on the Grid MPI successful for parallel programming Grid idea came from HPC community MPI applications that require Grid computing exist Incremental approach to running Grid applications Full portability of existing MPI applications F p y f g pp on the Grid
64 64 MPICH Architecture MPICH Architecture Us er The MPI interface ( def i ned by t he MPI st andar ds) The MPI CH l ayer (impl ements the MPI i nt er f ace) Abst r act Devi ce I nt er f ace ( ADI ) A Particular Platform MPP SMP Cl ust er
65 65 MPICH-G2 Implemented m as globus2 device in MPICH GSI-based authentication and authorization Executable and IO file staging Communication intra-machine: local MPI implementation inter-machine: TCP/IP P socket communication (Globus I/O, GridFTP) limitation: requires public IP addresses for all parallel computer nodes DUROC for resource allocation barrier at the end of MPI_InitI
66 MPICH-G2 Architecture 66
67 67 2-Site MPI Latency Program + ( & (resourcemanagercontact="c703-pc450 c703 pc450.uibk.ac.at/jobmanager at/jobmanager-pbs") (jobtype = mpi) (label ="subjob 0") (environment =(GLOBUS_DUROC_SUBJOB_INDEX SUBJOB 0) ) (count = 1) (executable = /afs/zid1.uibk.ac.at/home/c703/c703246/teaching/mpi/a.out) (stdout = mpi1.out) (stderr = mpi1.err) ) ( & (resourcemanagercontact="c703-pc421.uibk.ac.at/jobmanager-pbs") (jobtype = mpi) (label ="subjob 1") (environment =(GLOBUS_DUROC_SUBJOB_INDEX 1) ) (count = 1) (executable = /afs/zid1.uibk.ac.at/home/c703/c703246/teaching/mpi/a.out) (td (stdout t = mpi2.out) (stderr = mpi2.err) )
68 68 Best Effort Resource Allocation GRAM does best effort resource allocation No request for resource allocation is rejected Consequently, no level of service is guaranteed E.g., IP networks, time-sharing CPU schedulers, job queuing systems Problem Run Grid application on sites A and B A is immediately available and allocated B will be available only in 1 hour Advance reservation when and for how long a resource will be available and exclusively allocated Ensures that the application starts simultaneously on two sites Quality of service (QoS)
69 69 GARA General-purpose Architecture for Reservation and Allocation Integrate t QoS for different types of resources networks, CPUs, batch job schedulers, disks Mechanisms for advanced and immediate reservations for resources create, modify, cancel reservations RSL extensions Integration with DUROC for CPU reservations No Globus toolkit integration i
70 70 GARA RSL Example & (reservation-type=compute) p (start-time= ) (duration=3600) (percentcpu=100) & (reservation-type=network) type network) (start-time= ) (duration=3600) (endpoint-a= ) (endpoint-b= ) (directionality=unidirectional-ba) (bandwidth=150)
71 71 GRAM Summary Client MDS client API calls to locate resources MDS client API calls to get resource info MDS: Grid Index Info Server Site boundary GRAM client API calls to request resource allocation and process creation. Globus Security Infrastructure Gatekeeper Create GRAM client API state t change callbacks Job Manager Parse RSL Library MDS: Grid Resource Info Server Request Query current status of resource Local Resource Manager Monitor & control Process Process Process Allocate & create processes
72 72 Resource Management Stack Scheduler Resource Broker To come Multiple Grid Site Resource Co-allocator Grid Site Resource Manager Done Local Resource Manager
73 73 Bibliography GRAM DUROC MPICH-G2 GARA
Resource Sharing Problem
1 Resource Management Resource Sharing Problem Locate resources What computers can I use. Which are the IP addresses? Allocate resources Get a free computer now, reserve one for tomorrow at noon Authenticate
More informationGRAM: Grid Resource Allocation & Management
Copyright (c) 2002 University of Chicago and The University of Southern California. All Rights Reserved. This presentation is licensed for use under the terms of the Globus Toolkit Public License. See
More informationGrid Computing Fall 2005 Lecture 10 and 12: Globus V2. Gabrielle Allen
Grid Computing 7700 Fall 2005 Lecture 10 and 12: Globus V2 Gabrielle Allen allen@bit.csc.lsu.edu http://www.cct.lsu.edu/~gallen/ Globus 4 Primer Required Reading Coursework Essay: 4 pages Describe the
More informationGrid Compute Resources and Job Management
Grid Compute Resources and Job Management How do we access the grid? Command line with tools that you'll use Specialised applications Ex: Write a program to process images that sends data to run on the
More information30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy
Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Why the Grid? Science is becoming increasingly digital and needs to deal with increasing amounts of
More informationGrid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen
Grid Computing 7700 Fall 2005 Lecture 5: Grid Architecture and Globus Gabrielle Allen allen@bit.csc.lsu.edu http://www.cct.lsu.edu/~gallen Concrete Example I have a source file Main.F on machine A, an
More informationHistory of SURAgrid Deployment
All Hands Meeting: May 20, 2013 History of SURAgrid Deployment Steve Johnson Texas A&M University Copyright 2013, Steve Johnson, All Rights Reserved. Original Deployment Each job would send entire R binary
More informationGrid Compute Resources and Grid Job Management
Grid Compute Resources and Job Management March 24-25, 2007 Grid Job Management 1 Job and compute resource management! This module is about running jobs on remote compute resources March 24-25, 2007 Grid
More informationGlobus Toolkit 4 Execution Management. Alexandra Jimborean International School of Informatics Hagenberg, 2009
Globus Toolkit 4 Execution Management Alexandra Jimborean International School of Informatics Hagenberg, 2009 2 Agenda of the day Introduction to Globus Toolkit and GRAM Zoom In WS GRAM Usage Guide Architecture
More informationDay 1 : August (Thursday) An overview of Globus Toolkit 2.4
An Overview of Grid Computing Workshop Day 1 : August 05 2004 (Thursday) An overview of Globus Toolkit 2.4 By CDAC Experts Contact :vcvrao@cdacindia.com; betatest@cdacindia.com URL : http://www.cs.umn.edu/~vcvrao
More informationLayered Architecture
The Globus Toolkit : Introdution Dr Simon See Sun APSTC 09 June 2003 Jie Song, Grid Computing Specialist, Sun APSTC 2 Globus Toolkit TM An open source software toolkit addressing key technical problems
More informationGEMS: A Fault Tolerant Grid Job Management System
GEMS: A Fault Tolerant Grid Job Management System Sriram Satish Tadepalli Thesis submitted to the faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements
More informationGrid Architectural Models
Grid Architectural Models Computational Grids - A computational Grid aggregates the processing power from a distributed collection of systems - This type of Grid is primarily composed of low powered computers
More informationResource Specification Language (RSL)
(RSL) Shamjith K V System Software Development Group, CDAC, Bangalore. Common notation for exchange of information between components Syntax similar to MDS/LDAP filters RSL provides Resource requirements:
More informationResource Specification Language (RSL)
(RSL) Asvija B System Software Development Group, CDAC, Bangalore. Common notation for exchange of information between components Syntax similar to MDS/LDAP filters RSL provides Resource requirements:
More informationManaging MPICH-G2 Jobs with WebCom-G
Managing MPICH-G2 Jobs with WebCom-G Padraig J. O Dowd, Adarsh Patil and John P. Morrison Computer Science Dept., University College Cork, Ireland {p.odowd, adarsh, j.morrison}@cs.ucc.ie Abstract This
More informationGrid Computing Middleware. Definitions & functions Middleware components Globus glite
Seminar Review 1 Topics Grid Computing Middleware Grid Resource Management Grid Computing Security Applications of SOA and Web Services Semantic Grid Grid & E-Science Grid Economics Cloud Computing 2 Grid
More informationGrid Scheduling Architectures with Globus
Grid Scheduling Architectures with Workshop on Scheduling WS 07 Cetraro, Italy July 28, 2007 Ignacio Martin Llorente Distributed Systems Architecture Group Universidad Complutense de Madrid 1/38 Contents
More informationAgent Teamwork Research Assistant. Progress Report. Prepared by Solomon Lane
Agent Teamwork Research Assistant Progress Report Prepared by Solomon Lane December 2006 Introduction... 3 Environment Overview... 3 Globus Grid...3 PBS Clusters... 3 Grid/Cluster Integration... 4 MPICH-G2...
More informationCloud Computing. Up until now
Cloud Computing Lecture 4 and 5 Grid: 2012-2013 Introduction. Up until now Definition of Cloud Computing. Grid Computing: Schedulers: Condor SGE 1 Summary Core Grid: Toolkit Condor-G Grid: Conceptual Architecture
More informationArchitecture Proposal
Nordic Testbed for Wide Area Computing and Data Handling NORDUGRID-TECH-1 19/02/2002 Architecture Proposal M.Ellert, A.Konstantinov, B.Kónya, O.Smirnova, A.Wäänänen Introduction The document describes
More informationJOB SUBMISSION ON GRID
arxiv:physics/0701101v2 [physics.comp-ph] 12 Jan 2007 JOB SUBMISSION ON GRID An Users Introduction Rudra Banerjee ADVANCED COMPUTING LAB. Dept. of Physics, University of Pune March 13, 2018 Contents preface
More informationGrid Examples. Steve Gallo Center for Computational Research University at Buffalo
Grid Examples Steve Gallo Center for Computational Research University at Buffalo Examples COBALT (Computational Fluid Dynamics) Ercan Dumlupinar, Syracyse University Aerodynamic loads on helicopter rotors
More informationChapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.
Chapter 4:- Introduction to Grid and its Evolution Prepared By:- Assistant Professor SVBIT. Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies
More informationFirst evaluation of the Globus GRAM Service. Massimo Sgaravatto INFN Padova
First evaluation of the Globus GRAM Service Massimo Sgaravatto INFN Padova massimo.sgaravatto@pd.infn.it Draft version release 1.0.5 20 June 2000 1 Introduction...... 3 2 Running jobs... 3 2.1 Usage examples.
More informationGT-OGSA Grid Service Infrastructure
Introduction to GT3 Background The Grid Problem The Globus Approach OGSA & OGSI Globus Toolkit GT3 Architecture and Functionality: The Latest Refinement of the Globus Toolkit Core Base s User-Defined s
More informationInstallation and Administration
Introduction to GT3 Background The Grid Problem The Globus Approach OGSA & OGSI Globus Toolkit GT3 Architecture and Functionality: The Latest Refinement of the Globus Toolkit Core Base Services User-Defined
More informationUNICORE Globus: Interoperability of Grid Infrastructures
UNICORE : Interoperability of Grid Infrastructures Michael Rambadt Philipp Wieder Central Institute for Applied Mathematics (ZAM) Research Centre Juelich D 52425 Juelich, Germany Phone: +49 2461 612057
More informationGrid Computing Training Courseware v-1.0
-Testing Group, C-DAC Grid Computing Training Courseware Grid Computing Training Courseware v-1.0 Designed for Testing, Benchmarking & Performance Activities Document Title Grid Computing Training Courseware
More informationTutorial 4: Condor. John Watt, National e-science Centre
Tutorial 4: Condor John Watt, National e-science Centre Tutorials Timetable Week Day/Time Topic Staff 3 Fri 11am Introduction to Globus J.W. 4 Fri 11am Globus Development J.W. 5 Fri 11am Globus Development
More informationContent. MPIRUN Command Environment Variables LoadLeveler SUBMIT Command IBM Simple Scheduler. IBM PSSC Montpellier Customer Center
Content IBM PSSC Montpellier Customer Center MPIRUN Command Environment Variables LoadLeveler SUBMIT Command IBM Simple Scheduler Control System Service Node (SN) An IBM system-p 64-bit system Control
More informationAdvanced Job Submission on the Grid
Advanced Job Submission on the Grid Antun Balaz Scientific Computing Laboratory Institute of Physics Belgrade http://www.scl.rs/ 30 Nov 11 Dec 2009 www.eu-egee.org Scope User Interface Submit job Workload
More informationNUSGRID a computational grid at NUS
NUSGRID a computational grid at NUS Grace Foo (SVU/Academic Computing, Computer Centre) SVU is leading an initiative to set up a campus wide computational grid prototype at NUS. The initiative arose out
More informationOpenPBS Users Manual
How to Write a PBS Batch Script OpenPBS Users Manual PBS scripts are rather simple. An MPI example for user your-user-name: Example: MPI Code PBS -N a_name_for_my_parallel_job PBS -l nodes=7,walltime=1:00:00
More informationNew User Seminar: Part 2 (best practices)
New User Seminar: Part 2 (best practices) General Interest Seminar January 2015 Hugh Merz merz@sharcnet.ca Session Outline Submitting Jobs Minimizing queue waits Investigating jobs Checkpointing Efficiency
More informationNorduGrid Tutorial. Client Installation and Job Examples
NorduGrid Tutorial Client Installation and Job Examples Linux Clusters for Super Computing Conference Linköping, Sweden October 18, 2004 Arto Teräs arto.teras@csc.fi Steps to Start Using NorduGrid 1) Install
More informationUNIT IV PROGRAMMING MODEL. Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus
UNIT IV PROGRAMMING MODEL Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus Globus: One of the most influential Grid middleware projects is the Globus
More informationM. Roehrig, Sandia National Laboratories. Philipp Wieder, Research Centre Jülich Nov 2002
Category: INFORMATIONAL Grid Scheduling Dictionary WG (SD-WG) M. Roehrig, Sandia National Laboratories Wolfgang Ziegler, Fraunhofer-Institute for Algorithms and Scientific Computing Philipp Wieder, Research
More informationImplementation of Parallelization
Implementation of Parallelization OpenMP, PThreads and MPI Jascha Schewtschenko Institute of Cosmology and Gravitation, University of Portsmouth May 9, 2018 JAS (ICG, Portsmouth) Implementation of Parallelization
More informationThe University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager
The University of Oxford campus grid, expansion and integrating new partners Dr. David Wallom Technical Manager Outline Overview of OxGrid Self designed components Users Resources, adding new local or
More informationDistributed Memory Programming With MPI Computer Lab Exercises
Distributed Memory Programming With MPI Computer Lab Exercises Advanced Computational Science II John Burkardt Department of Scientific Computing Florida State University http://people.sc.fsu.edu/ jburkardt/classes/acs2
More informationExtensible Job Managers for Grid Computing
Extensible Job Managers for Grid Computing Paul D. Coddington Lici Lu Darren Webb Andrew L. Wendelborn Department of Computer Science University of Adelaide Adelaide, SA 5005, Australia Email: {paulc,andrew,darren}@cs.adelaide.edu.au
More informationARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer
ARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer computing platform Internal report Marko Niinimaki, Mohamed BenBelgacem, Nabil Abdennadher HEPIA, January 2010 1. Background and motivation
More informationGlobus GTK and Grid Services
Globus GTK and Grid Services Michael Rokitka SUNY@Buffalo CSE510B 9/2007 OGSA The Open Grid Services Architecture What are some key requirements of Grid computing? Interoperability: Critical due to nature
More informationCSC209 Review. Yeah! We made it!
CSC209 Review Yeah! We made it! 1 CSC209: Software tools Unix files and directories permissions utilities/commands Shell programming quoting wild cards files 2 ... and C programming... C basic syntax functions
More informationM/s. Managing distributed workloads. Language Reference Manual. Miranda Li (mjl2206) Benjamin Hanser (bwh2124) Mengdi Lin (ml3567)
1 M/s Managing distributed workloads Language Reference Manual Miranda Li (mjl2206) Benjamin Hanser (bwh2124) Mengdi Lin (ml3567) Table of Contents 1. Introduction 2. Lexical elements 2.1 Comments 2.2
More informationCloud Computing. Summary
Cloud Computing Lectures 2 and 3 Definition of Cloud Computing, Grid Architectures 2012-2013 Summary Definition of Cloud Computing (more complete). Grid Computing: Conceptual Architecture. Condor. 1 Cloud
More informationCSC209: Software tools. Unix files and directories permissions utilities/commands Shell programming quoting wild cards files
CSC209 Review CSC209: Software tools Unix files and directories permissions utilities/commands Shell programming quoting wild cards files ... and systems programming C basic syntax functions arrays structs
More informationCSC209: Software tools. Unix files and directories permissions utilities/commands Shell programming quoting wild cards files. Compiler vs.
CSC209 Review CSC209: Software tools Unix files and directories permissions utilities/commands Shell programming quoting wild cards files... and systems programming C basic syntax functions arrays structs
More informationBy Ian Foster. Zhifeng Yun
By Ian Foster Zhifeng Yun Outline Introduction Globus Architecture Globus Software Details Dev.Globus Community Summary Future Readings Introduction Globus Toolkit v4 is the work of many Globus Alliance
More informationGrid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007
Grid Programming: Concepts and Challenges Michael Rokitka SUNY@Buffalo CSE510B 10/2007 Issues Due to Heterogeneous Hardware level Environment Different architectures, chipsets, execution speeds Software
More informationAn Example Grid Middleware - The Globus Toolkit. MCSN N. Tonellotto Complements of Distributed Enabling Platforms
An Example Grid Middleware - The Globus Toolkit 1 Globus Toolkit A software toolkit addressing key technical problems in the development of Grid enabled tools, services, and applications Offer a modular
More informationPARALLEL PROGRAM EXECUTION SUPPORT IN THE JGRID SYSTEM
PARALLEL PROGRAM EXECUTION SUPPORT IN THE JGRID SYSTEM Szabolcs Pota 1, Gergely Sipos 2, Zoltan Juhasz 1,3 and Peter Kacsuk 2 1 Department of Information Systems, University of Veszprem, Hungary 2 Laboratory
More informationKnowledge Discovery Services and Tools on Grids
Knowledge Discovery Services and Tools on Grids DOMENICO TALIA DEIS University of Calabria ITALY talia@deis.unical.it Symposium ISMIS 2003, Maebashi City, Japan, Oct. 29, 2003 OUTLINE Introduction Grid
More informationGrid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia
Grid services Dusan Vudragovic dusan@phy.bg.ac.yu Scientific Computing Laboratory Institute of Physics Belgrade, Serbia Sep. 19, 2008 www.eu-egee.org Set of basic Grid services Job submission/management
More informationGridbus Portlets -- USER GUIDE -- GRIDBUS PORTLETS 1 1. GETTING STARTED 2 2. AUTHENTICATION 3 3. WORKING WITH PROJECTS 4
Gridbus Portlets -- USER GUIDE -- www.gridbus.org/broker GRIDBUS PORTLETS 1 1. GETTING STARTED 2 1.1. PREREQUISITES: 2 1.2. INSTALLATION: 2 2. AUTHENTICATION 3 3. WORKING WITH PROJECTS 4 3.1. CREATING
More informationProgramming with MPI
Programming with MPI p. 1/?? Programming with MPI Miscellaneous Guidelines Nick Maclaren nmm1@cam.ac.uk March 2010 Programming with MPI p. 2/?? Summary This is a miscellaneous set of practical points Over--simplifies
More informationMichigan Grid Research and Infrastructure Development (MGRID)
Michigan Grid Research and Infrastructure Development (MGRID) Abhijit Bose MGRID and Dept. of Electrical Engineering and Computer Science The University of Michigan Ann Arbor, MI 48109 abose@umich.edu
More informationGT 4.2.0: Community Scheduler Framework (CSF) System Administrator's Guide
GT 4.2.0: Community Scheduler Framework (CSF) System Administrator's Guide GT 4.2.0: Community Scheduler Framework (CSF) System Administrator's Guide Introduction This guide contains advanced configuration
More informationCommunication. Distributed Systems Santa Clara University 2016
Communication Distributed Systems Santa Clara University 2016 Protocol Stack Each layer has its own protocol Can make changes at one layer without changing layers above or below Use well defined interfaces
More informationProgramming with MPI
Programming with MPI p. 1/?? Programming with MPI Miscellaneous Guidelines Nick Maclaren Computing Service nmm1@cam.ac.uk, ext. 34761 March 2010 Programming with MPI p. 2/?? Summary This is a miscellaneous
More informationGlobus Toolkit Manoj Soni SENG, CDAC. 20 th & 21 th Nov 2008 GGOA Workshop 08 Bangalore
Globus Toolkit 4.0.7 Manoj Soni SENG, CDAC 1 What is Globus Toolkit? The Globus Toolkit is an open source software toolkit used for building Grid systems and applications. It is being developed by the
More informationDistributed Systems Exam 1 Review Paul Krzyzanowski. Rutgers University. Fall 2016
Distributed Systems 2015 Exam 1 Review Paul Krzyzanowski Rutgers University Fall 2016 1 Question 1 Why did the use of reference counting for remote objects prove to be impractical? Explain. It s not fault
More informationAssignment 3 GridNexus Job Submission. Authors: Grid Computing Course Team C. Ferner and B. Wilkinson February 5, 2010
I. Introduction Assignment 3 GridNexus Job Submission Authors: Grid Computing Course Team C. Ferner and B. Wilkinson February 5, 2010 In this assignment, you will use the two programs that you developed
More informationIntroduction to GT3. Overview. Installation Pre-requisites GT3.2. Overview of Installing GT3
Introduction to GT3 Background The Grid Problem The Globus Approach OGSA & OGSI Globus Toolkit GT3 Architecture and Functionality: The Latest Refinement of the Globus Toolkit Core Base Services User-Defined
More informationOutline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems
Distributed Systems Outline Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems What Is A Distributed System? A collection of independent computers that appears
More informationHow to install Condor-G
How to install Condor-G Tomasz Wlodek University of the Great State of Texas at Arlington Abstract: I describe the installation procedure for Condor-G Before you start: Go to http://www.cs.wisc.edu/condor/condorg/
More informationMultiprocessors 2007/2008
Multiprocessors 2007/2008 Abstractions of parallel machines Johan Lukkien 1 Overview Problem context Abstraction Operating system support Language / middleware support 2 Parallel processing Scope: several
More informationUsing the MyProxy Online Credential Repository
Using the MyProxy Online Credential Repository Jim Basney National Center for Supercomputing Applications University of Illinois jbasney@ncsa.uiuc.edu What is MyProxy? Independent Globus Toolkit add-on
More informationCredentials Management for Authentication in a Grid-Based E-Learning Platform
Credentials Management for Authentication in a Grid-Based E-Learning Platform Felicia Ionescu, Vlad Nae, Alexandru Gherega University Politehnica of Bucharest {fionescu, vnae, agherega}@tech.pub.ro Abstract
More informationChapter 3. Design of Grid Scheduler. 3.1 Introduction
Chapter 3 Design of Grid Scheduler The scheduler component of the grid is responsible to prepare the job ques for grid resources. The research in design of grid schedulers has given various topologies
More informationSimpleChubby: a simple distributed lock service
SimpleChubby: a simple distributed lock service Jing Pu, Mingyu Gao, Hang Qu 1 Introduction We implement a distributed lock service called SimpleChubby similar to the original Google Chubby lock service[1].
More informationOur new HPC-Cluster An overview
Our new HPC-Cluster An overview Christian Hagen Universität Regensburg Regensburg, 15.05.2009 Outline 1 Layout 2 Hardware 3 Software 4 Getting an account 5 Compiling 6 Queueing system 7 Parallelization
More informationCommunication. Overview
Communication Chapter 2 1 Overview Layered protocols Remote procedure call Remote object invocation Message-oriented communication Stream-oriented communication 2 Layered protocols Low-level layers Transport
More informationCS516 Programming Languages and Compilers II
CS516 Programming Languages and Compilers II Zheng Zhang Spring 2015 Mar 12 Parallelism and Shared Memory Hierarchy I Rutgers University Review: Classical Three-pass Compiler Front End IR Middle End IR
More informationLecture 9: MIMD Architectures
Lecture 9: MIMD Architectures Introduction and classification Symmetric multiprocessors NUMA architecture Clusters Zebo Peng, IDA, LiTH 1 Introduction A set of general purpose processors is connected together.
More informationGrid Middleware and Globus Toolkit Architecture
Grid Middleware and Globus Toolkit Architecture Lisa Childers Argonne National Laboratory University of Chicago 2 Overview Grid Middleware The problem: supporting Virtual Organizations equirements Capabilities
More informationAn introduction to checkpointing. for scientific applications
damien.francois@uclouvain.be UCL/CISM - FNRS/CÉCI An introduction to checkpointing for scientific applications November 2013 CISM/CÉCI training session What is checkpointing? Without checkpointing: $./count
More informationArchitecture of the WMS
Architecture of the WMS Dr. Giuliano Taffoni INFORMATION SYSTEMS UNIT Outline This presentation will cover the following arguments: Overview of WMS Architecture Job Description Language Overview WMProxy
More informationOperating Systems Overview. Chapter 2
Operating Systems Overview Chapter 2 Operating System A program that controls the execution of application programs An interface between the user and hardware Masks the details of the hardware Layers and
More informationglite Grid Services Overview
The EPIKH Project (Exchange Programme to advance e-infrastructure Know-How) glite Grid Services Overview Antonio Calanducci INFN Catania Joint GISELA/EPIKH School for Grid Site Administrators Valparaiso,
More informationCS 326: Operating Systems. Process Execution. Lecture 5
CS 326: Operating Systems Process Execution Lecture 5 Today s Schedule Process Creation Threads Limited Direct Execution Basic Scheduling 2/5/18 CS 326: Operating Systems 2 Today s Schedule Process Creation
More informationGrid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms
Grid Computing 1 Resource sharing Elements of Grid Computing - Computers, data, storage, sensors, networks, - Sharing always conditional: issues of trust, policy, negotiation, payment, Coordinated problem
More informationToday. Operating System Evolution. CSCI 4061 Introduction to Operating Systems. Gen 1: Mono-programming ( ) OS Evolution Unix Overview
Today CSCI 4061 Introduction to s Instructor: Abhishek Chandra OS Evolution Unix Overview Unix Structure Shells and Utilities Calls and APIs 2 Evolution How did the OS evolve? Dependent on hardware and
More informationSuperMike-II Launch Workshop. System Overview and Allocations
: System Overview and Allocations Dr Jim Lupo CCT Computational Enablement jalupo@cct.lsu.edu SuperMike-II: Serious Heterogeneous Computing Power System Hardware SuperMike provides 442 nodes, 221TB of
More informationIntroduction to PICO Parallel & Production Enviroment
Introduction to PICO Parallel & Production Enviroment Mirko Cestari m.cestari@cineca.it Alessandro Marani a.marani@cineca.it Domenico Guida d.guida@cineca.it Nicola Spallanzani n.spallanzani@cineca.it
More informationCluster Network Products
Cluster Network Products Cluster interconnects include, among others: Gigabit Ethernet Myrinet Quadrics InfiniBand 1 Interconnects in Top500 list 11/2009 2 Interconnects in Top500 list 11/2008 3 Cluster
More informationToday. Operating System Evolution. CSCI 4061 Introduction to Operating Systems. Gen 1: Mono-programming ( ) OS Evolution Unix Overview
Today CSCI 4061 Introduction to s Instructor: Abhishek Chandra OS Evolution Unix Overview Unix Structure Shells and Utilities Calls and APIs 2 Evolution How did the OS evolve? Generation 1: Mono-programming
More informationMultithreading and Interactive Programs
Multithreading and Interactive Programs CS160: User Interfaces John Canny. This time Multithreading for interactivity need and risks Some design patterns for multithreaded programs Debugging multithreaded
More information2 Model. 2.1 Introduction
2 Model 2.1 Introduction The mostly used attempt to define Grid computing [77] is through an analogy with the electric power evolution around 1910. The truly revolutionary development was not the discovery
More informationIntegrating SGE and Globus in a Heterogeneous HPC Environment
Integrating SGE and Globus in a Heterogeneous HPC Environment David McBride London e-science Centre, Department of Computing, Imperial College Presentation Outline Overview of Centre
More informationCloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase. Chen Zhang Hans De Sterck University of Waterloo
CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase Chen Zhang Hans De Sterck University of Waterloo Outline Introduction Motivation Related Work System Design Future Work Introduction
More informationProcesses and Threads. Processes and Threads. Processes (2) Processes (1)
Processes and Threads (Topic 2-1) 2 홍성수 Processes and Threads Question: What is a process and why is it useful? Why? With many things happening at once in a system, need some way of separating them all
More informationData Management 1. Grid data management. Different sources of data. Sensors Analytic equipment Measurement tools and devices
Data Management 1 Grid data management Different sources of data Sensors Analytic equipment Measurement tools and devices Need to discover patterns in data to create information Need mechanisms to deal
More informationIntroduction to Grid Computing
Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able
More informationThe MPI API s baseline requirements
LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY - LIGO - CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY Document Type LIGO-T990086-07- E 10/30/2000 The MPI API s baseline
More informationThe GridWay. approach for job Submission and Management on Grids. Outline. Motivation. The GridWay Framework. Resource Selection
The GridWay approach for job Submission and Management on Grids Eduardo Huedo Rubén S. Montero Ignacio M. Llorente Laboratorio de Computación Avanzada Centro de Astrobiología (INTA - CSIC) Associated to
More informationECE 574 Cluster Computing Lecture 4
ECE 574 Cluster Computing Lecture 4 Vince Weaver http://web.eece.maine.edu/~vweaver vincent.weaver@maine.edu 31 January 2017 Announcements Don t forget about homework #3 I ran HPCG benchmark on Haswell-EP
More informationIntroduction to Cluster Computing
Introduction to Cluster Computing Prabhaker Mateti Wright State University Dayton, Ohio, USA Overview High performance computing High throughput computing NOW, HPC, and HTC Parallel algorithms Software
More informationCPSC/ECE 3220 Fall 2017 Exam Give the definition (note: not the roles) for an operating system as stated in the textbook. (2 pts.
CPSC/ECE 3220 Fall 2017 Exam 1 Name: 1. Give the definition (note: not the roles) for an operating system as stated in the textbook. (2 pts.) Referee / Illusionist / Glue. Circle only one of R, I, or G.
More information