Day 11: Workflows with DAGMan
|
|
- Daniela Malone
- 6 years ago
- Views:
Transcription
1 Day 11: Workflows with DAGMan Suggested reading: Condor 7.7 Manual: Section 2.10: DAGMan Applications Chapter 9: condor_submit_dag 1
2 Turn In Homework 2
3 Homework Review 3
4 Workflows 4
5 Introduction to Workflow Series of related steps to complete a complex task Review slides NO NO Attend class OK? YES Write code OK? YES Print code Organize, manage, and make a process reliable Important in science, where repeatability is key 5
6 Workflow Components Workflows are essentially algorithmic! Steps Prerequisites and inputs Process (black box / white box) Outputs Connections Sequence Branching Parallelism Metadata: Resources, owners, timing, etc. 6
7 Workflow Example I Yale: C. Mason, S. Sanders, M. State 7
8 Workflow Example II Terrain data files 1 2 Terrain Preprocessor WRF Static Preprocessor Surface, terrestrial data files Run once per forecast region Radar data (Level III) 3 4 3D Model Data Interpolator (Initial Boundary Conditions) 88D Radar Re-mapper Radar data (Level II) NIDS Radar Re-mapper 5 NAM, RUC, GFS data Surface data, upper air mesonet data and wind profiler data ADAS 8 7 ADAM 11 9 ARPS to WRF Data Interpolator 10 ARPS Ensemble Generator 3D Model Data Interpolator (lateral Boundary Conditions) 15 IDV WRF WRF 12 WRF WRF WRF to ARPS Data Interpolator 13 Satellite data Repeat periodically for new data 6 Satellite Data Re-mapper Data mining: look for storm signature Static data Real time datainitialization Analysis Data Mining Forecast Visualization Visualization on users request LEAD Weather Forecasting ARPS Plotting Program 14 Triggered if a 5" storm is detected 8
9 Automated Workflows Ideally, we want to automate workflows Minimize wait times and (certain kinds of) errors Allow humans to concentrate on design and results Broad objectives: Capture whole workflow Define steps clearly Identify easy automation Copying files Changing data formats Running jobs! Balance costs vs. savings! 9
10 Workflows in CHTC 10
11 Directed Acyclic Graphs (DAGs) Abstract, formal definition of allowable workflows Terminology Step (typically, a job) = Node Connection is directed: Parent Child No loops (or cycles, hence acyclic) Each node may have 0 n children Each node may have 0 n parents must succeed before running Parent Child 11
12 Example DAG Shapes Disconnected A B C D Linear / Serial A B C D Diamond A B C D 12
13 A Real Scientific DAG datafind_l_1 tmplt_l1_1 tmplt_l1_2 tmplt_l1_3 tmplt_l1_4 tmplt_l1_5 tmplt_l1_6 datafind_l_2 insp_l1_1 insp_l1_2 insp_l1_3 insp_l1_4 insp_l1_5 insp_l1_6 datafind_l_3 tmplt_l1_7 tmplt_l1_8 tmplt_l1_9 trigbank_h2_1 trigbank_h2_2 trigbank_h1_1 trigbank_h1_2 trigbank_h1_3 datafind_h_1 trigbank_h1_4 tmplt_l1_11 tmplt_l1_10 insp_l1_7 insp_l1_8 insp_l1_9 insp_h1_1 insp_h1_2 insp_h1_3 insp_h1_4 insp_l1_11 insp_l1_10 datafind_h_2 trigbank_h1_5 trigbank_h1_6 trigbank_h2_4 trigbank_h2_3 inca_l1h1_3 inca_l1h1_1 trigbank_h1_8 datafind_h_3 trigbank_h1_7 insp_h1_5 insp_h1_6 trigbank_h2_6 datafind_h_4 insp_h1_8 insp_h1_7 inca_l1h1_2 insp_h2_1 insp_h2_2 trigbank_h2_5 trigbank_h2_7 datafind_h_5 inca_l1h1_4 inca_l1h2_1 datafind_h_6 trigbank_h2_9 insp_h2_4 insp_h2_3 insp_h2_9 insp_h2_5 insp_h2_6 datafind_h_7 trigbank_h2_8 inca_l1h2_2 inca_h1h2_1 insp_h2_8 insp_h2_7 inca_l1h1_5 inca_h1h2_2 inca_l1h1_6 Laser Interferometer Gravitational-wave Observatory (LIGO) 13
14 Condor DAGMan DAGMan: Directed Acyclic Graph Manager Organize Condor jobs into a DAG Condor handles all details of running workflow Submits individual jobs when appropriate Tracks overall workflow Can retry failed nodes and resume failed workflow Can limit amount of work done at once DAGs up to 1,000,000 nodes have been run! 14
15 prepare data check prereq.s skip node DAGMan Nodes I Pre-Script Job (Cluster) Post-Script clean up files check success 15
16 DAGMan Nodes II Order of execution 1. Pre-script on submit machine 2. Job(s) on pool 3. Post-script on submit machine Failure handling Pre-script exit 0: Skip job, run post-script (if any) Any job exit 0: Run post-script (if any) Last exit status determines success/failure of node Make sure scripts exit 0 upon success! Can skip job & post on given pre-script exit status 16
17 DAGMan Files 17
18 Basic DAGMan Submit File # Define nodes Comment JOB First first.sub JOB Analyze1 stats-1.sub JOB Analyze2 stats-2.sub JOB Sum collate.sub Declare node name and its Condor submit file Analyze1 First Sum Analyze2 Define pre/post SCRIPT PRE Sum verify-all.py 2 Define scripts node for connections nodes # Define connections PARENT First CHILD Analyze1 Analyze2 PARENT Analyze1 Analyze2 CHILD Sum 18
19 JOB name submit-file Define a Job One per node Defines node s name, unique within this DAG Associated with a Condor submit-file Job must yield 1 cluster; may have many processes JOB Collate collate.sub JOB Rjob3 run-r-3.sub 19
20 Define Dependencies PARENT parent1 p2 CHILD child1 c2 Defines the lines (dependencies) between nodes Parent and child names are node names (cf. JOB) EACH child depends on ALL parents PARENT p1 p2 p3 CHILD c1 c2 p1 p2 p3 c1 c2 20
21 Define Pre- and Post-Scripts SCRIPT PRE name executable arguments SCRIPT POST name executable arguments Scripts are always optional! Associated with given node name Optional arguments are passed to executable Place scripts in same directory as node s submit file Scripts run on the submit machine JOB First prepare.sub SCRIPT PRE First fetch-data.py SCRIPT POST Collate sum-stats.py
22 Logs in DAGMan DAGMan tracks progress via your log files All nodes (i.e., submit files) can use same log file Can be tricky for a person to decode Best DAGMan performance Each node may have own log file More like what you are used to Easier to read for a person Cannot use $(CLUSTER) or $(PROCESS), though! Can omit log statement entirely! DAGMan defaults to dagfile.nodes.log 22
23 DAGMan Commands 23
24 Submit a DAG condor_submit_dag dag-file DAGMan itself runs as a Condor job On the submit machine This command creates submit file and submits it File for submitting this DAG to Condor Log of DAGMan debugging messages Log of Condor library output Log of Condor library error messages Log of the life of condor_dagman itself : dagman.dag.condor.sub : dagman.dag.dagman.out : dagman.dag.lib.out : dagman.dag.lib.err : dagman.dag.dagman.log Submitting job(s). 1 job(s) submitted to cluster 65. condor_submit_dag -no_submit dag-file Just creates DAGMan submit file, if you are curious 24
25 Submit Options condor_submit_dag -maxjobs N dag-file Maximum number of jobs to submit at once Can help avoid overload on submit machine Can be limited further by administrator condor_submit_dag -maxpre N dag-file condor_submit_dag -maxpost N dag-file Limits pre- and post-scripts Again, helps avoid overload on submit machine All options are optional and can be combined 25
26 Monitor a DAG condor_q -dag Same command as always; same options available But: Organizes DAG jobs visually Not required to use -dag option! 65.0 cat 11/22 15: :11:23 R condor_dagman -f Random1 11/22 15: :00:00 I dag_2.py Random2 11/22 15: :00:00 I dag_2.py Other options: Watch log file(s) notifications on each job (maybe just on last?) Node status file (later slide) 26
27 condor_rm jobid Remove a DAG But, which job ID? Essentially: remove the condor_dagman job itself Same cluster printed by condor_submit_dag Removes all jobs (idle & running) within DAG 65.0 cat 11/22 15: :11:23 R condor_dagman -f Random1 11/22 15: :00:00 I dag_2.py Random2 11/22 15: :00:00 I dag_2.py 27
28 Job Recovery Rescue DAG created when DAG does not succeed Due to being removed, or After node fails, when all possible progress completes -rw-rw-r-- 1 cat cat -rw-r--r-- 1 cat cat -rw-rw-r-- 1 cat cat -rw-r--r-- 1 cat cat -rw-r--r-- 1 cat cat 512 Nov 23 10:26 sleep.dag 988 Nov 23 10:38 sleep.dag.condor.sub 517 Nov 23 10:40 sleep.dag.dagman.log Nov 23 10:40 sleep.dag.dagman.out 261 Nov 23 10:40 sleep.dag.rescue001 Resubmit the DAG to resume, using Rescue DAG Completed nodes are not rerun condor_submit_dag dagfile.rescuennn < condor_submit_dag dagfile
29 Status of DAG Nodes NODE_STATUS_FILE filename seconds Writes DAG status info to the given filename Overwrites file no more often than seconds apart JOB A STATUS_DONE () JOB B STATUS_DONE () JOB C STATUS_DONE () JOB D STATUS_DONE () JOB E STATUS_DONE () JOB F STATUS_SUBMITTED (not_idle) JOB G STATUS_SUBMITTED (idle) JOB H STATUS_UNREADY () 29
30 Homework 30
31 Homework Run a workflow! The queue simulator is back, but does its own loops If you have an alternate workflow that you would like to work on instead, talk to me 31
Condor and Workflows: An Introduction. Condor Week 2011
Condor and Workflows: An Introduction Condor Week 2011 Kent Wenger Condor Project Computer Sciences Department University of Wisconsin-Madison Outline > Introduction/motivation > Basic DAG concepts > Running
More informationAN INTRODUCTION TO WORKFLOWS WITH DAGMAN
AN INTRODUCTION TO WORKFLOWS WITH DAGMAN Presented by Lauren Michael HTCondor Week 2018 1 Covered In This Tutorial Why Create a Workflow? Describing workflows as directed acyclic graphs (DAGs) Workflow
More informationGrid Compute Resources and Grid Job Management
Grid Compute Resources and Job Management March 24-25, 2007 Grid Job Management 1 Job and compute resource management! This module is about running jobs on remote compute resources March 24-25, 2007 Grid
More informationDAGMan workflow. Kumaran Baskaran
DAGMan workflow Kumaran Baskaran NMRbox summer workshop June 26-29,2017 what is a workflow? Laboratory Publication Workflow management Softwares Workflow management Data Softwares Workflow management
More informationGrid Compute Resources and Job Management
Grid Compute Resources and Job Management How do we access the grid? Command line with tools that you'll use Specialised applications Ex: Write a program to process images that sends data to run on the
More informationHTCONDOR USER TUTORIAL. Greg Thain Center for High Throughput Computing University of Wisconsin Madison
HTCONDOR USER TUTORIAL Greg Thain Center for High Throughput Computing University of Wisconsin Madison gthain@cs.wisc.edu 2015 Internet2 HTCondor User Tutorial CONTENTS Overview Basic job submission How
More informationDay 9: Introduction to CHTC
Day 9: Introduction to CHTC Suggested reading: Condor 7.7 Manual: http://www.cs.wisc.edu/condor/manual/v7.7/ Chapter 1: Overview Chapter 2: Users Manual (at most, 2.1 2.7) 1 Turn In Homework 2 Homework
More informationHistory of SURAgrid Deployment
All Hands Meeting: May 20, 2013 History of SURAgrid Deployment Steve Johnson Texas A&M University Copyright 2013, Steve Johnson, All Rights Reserved. Original Deployment Each job would send entire R binary
More informationUsing the INSPIRAL program to search for gravitational waves from low-mass binary inspiral
INSTITUTE OF PHYSICS PUBLISHING Class. Quantum Grav. 22 (2005) S1097 S1107 CLASSICAL AND QUANTUM GRAVITY doi:10.1088/0264-9381/22/18/s24 Using the INSPIRAL program to search for gravitational waves from
More informationCloud Computing. Up until now
Cloud Computing Lectures 3 and 4 Grid Schedulers: Condor, Sun Grid Engine 2012-2013 Introduction. Up until now Definition of Cloud Computing. Grid Computing: Schedulers: Condor architecture. 1 Summary
More informationGetting Started with OSG Connect ~ an Interactive Tutorial ~
Getting Started with OSG Connect ~ an Interactive Tutorial ~ Emelie Harstad , Mats Rynge , Lincoln Bryant , Suchandra Thapa ,
More informationTechniques for Efficient Execution of Large-Scale Scientific Workflows in Distributed Environments
Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 11-14-2014 Techniques for Efficient Execution of Large-Scale Scientific Workflows
More informationIntroduction to Grid Computing
Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able
More information! " # " $ $ % & '(()
!"# " $ $ % &'(() First These slides are available from: http://www.cs.wisc.edu/~roy/italy-condor/ 2 This Morning s Condor Topics * +&, & - *.&- *. & * && - * + $ 3 Part One Matchmaking: Finding Machines
More informationHortonworks Data Platform
Hortonworks Data Platform Workflow Management (August 31, 2017) docs.hortonworks.com Hortonworks Data Platform: Workflow Management Copyright 2012-2017 Hortonworks, Inc. Some rights reserved. The Hortonworks
More informationProject Blackbird. U"lizing Condor and HTC to address archiving online courses at Clemson on a weekly basis. Sam Hoover
Project Blackbird U"lizing Condor and HTC to address archiving online courses at Clemson on a weekly basis Sam Hoover shoover@clemson.edu 1 Project Blackbird Blackboard at Clemson End of Semester archives
More informationBuilding the International Data Placement Lab. Greg Thain
Building the International Data Placement Lab Greg Thain What is the IDPL? Overview How we built it Examples of use 2 Who is the IDPL? Phil Papadapolus -- UCSD Miron Livny -- Wisconsin Collaborators in
More informationManaging large-scale workflows with Pegasus
Funded by the National Science Foundation under the OCI SDCI program, grant #0722019 Managing large-scale workflows with Pegasus Karan Vahi ( vahi@isi.edu) Collaborative Computing Group USC Information
More informationTreeSearch User Guide
TreeSearch User Guide Version 0.9 Derrick Stolee University of Nebraska-Lincoln s-dstolee1@math.unl.edu March 30, 2011 Abstract The TreeSearch library abstracts the structure of a search tree in order
More informationPegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute
Pegasus Workflow Management System Gideon Juve USC Informa3on Sciences Ins3tute Scientific Workflows Orchestrate complex, multi-stage scientific computations Often expressed as directed acyclic graphs
More informationXSEDE High Throughput Computing Use Cases
XSEDE High Throughput Computing Use Cases 31 May 2013 Version 0.3 XSEDE HTC Use Cases Page 1 XSEDE HTC Use Cases Page 2 Table of Contents A. Document History B. Document Scope C. High Throughput Computing
More informationSwiftrun i. Swiftrun
i Swiftrun ii Contents 1 Introduction 1 2 Running older Swift releases 1 2.1 sites.xml........................................................ 1 2.2 tc.data.........................................................
More informationBMRB is a member of the wwpdb. BMRB collaborates with
BMRB is a member of the wwpdb BMRB collaborates with Using Condor behind the scenes to provide a public CS-Rosetta server at the BioMagResBank Funded By: ! Biological Magnetic Resonance Data Bank What
More informationPegasus. Pegasus Workflow Management System. Mats Rynge
Pegasus Pegasus Workflow Management System Mats Rynge rynge@isi.edu https://pegasus.isi.edu Automate Why workflows? Recover Automates complex, multi-stage processing pipelines Enables parallel, distributed
More informationAN INTRODUCTION TO USING
AN INTRODUCTION TO USING Todd Tannenbaum June 6, 2017 HTCondor Week 2017 1 Covered In This Tutorial What is HTCondor? Running a Job with HTCondor How HTCondor Matches and Runs Jobs - pause for questions
More informationCloud Computing. Summary
Cloud Computing Lectures 2 and 3 Definition of Cloud Computing, Grid Architectures 2012-2013 Summary Definition of Cloud Computing (more complete). Grid Computing: Conceptual Architecture. Condor. 1 Cloud
More informationDirected Acyclic Graphs & Topological Sort
Directed Acyclic Graphs & Topological Sort CS6: Introduction to Data Structures & Algorithms Spring 08 Outline Directed Acyclic Graphs Topological Sort Hand-simulation Pseudo-code Runtime analysis 4 Directed
More informationA Fully Automated Faulttolerant. Distributed Video Processing and Off site Replication
A Fully Automated Faulttolerant System for Distributed Video Processing and Off site Replication George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison June 2004 What is the talk about?
More informationChapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.
Chapter 4:- Introduction to Grid and its Evolution Prepared By:- Assistant Professor SVBIT. Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies
More information! " #$%! &%& ' ( $ $ ) $*+ $
! " #$%! &%& ' ( $ $ ) $*+ $ + 2 &,-)%./01 ) 2 $ & $ $ ) 340 %(%% 3 &,-)%./01 ) 2 $& $ $ ) 34 0 %(%% $ $ %% $ ) 5 67 89 5 & % % %$ %%)( % % ( %$ ) ( '$ :!"#$%%&%'&( )!)&(!&( *+,& )- &*./ &*( ' 0&/ 1&2
More informationHIGH-THROUGHPUT COMPUTING AND YOUR RESEARCH
HIGH-THROUGHPUT COMPUTING AND YOUR RESEARCH Christina Koch, Research Computing Facilitator Center for High Throughput Computing STAT679, October 29, 2018 1 About Me I work for the Center for High Throughput
More informationPegasus. Automate, recover, and debug scientific computations. Mats Rynge https://pegasus.isi.edu
Pegasus Automate, recover, and debug scientific computations. Mats Rynge rynge@isi.edu https://pegasus.isi.edu Why Pegasus? Automates complex, multi-stage processing pipelines Automate Enables parallel,
More informationPerformance Evaluation of the Karma Provenance Framework for Scientific Workflows
Performance Evaluation of the Karma Provenance Framework for Scientific Workflows Yogesh L. Simmhan, Beth Plale, Dennis Gannon, and Suresh Marru Indiana University, Computer Science Department, Bloomington,
More informationarxiv:gr-qc/ v1 19 May 2005
Using the inspiral program to search for gravitational waves from low-mass binary inspiral arxiv:gr-qc/0505102v1 19 May 2005 Duncan A. Brown (for the LIGO Scientific Collaboration) University of Wisconsin
More informationAutomating Real-time Seismic Analysis
Automating Real-time Seismic Analysis Through Streaming and High Throughput Workflows Rafael Ferreira da Silva, Ph.D. http://pegasus.isi.edu Do we need seismic analysis? Pegasus http://pegasus.isi.edu
More informationIntroduction & Motivation Problem Statement Proposed Work Evaluation Conclusions Future Work
Introduction & Motivation Problem Statement Proposed Work Evaluation Conclusions Future Work Introduction & Motivation Problem Statement Proposed Work Evaluation Conclusions Future Work Today (2014):
More informationHTCondor Essentials. Index
HTCondor Essentials 31.10.2017 Index Login How to submit a job in the HTCondor pool Why the -name option? Submitting a job Checking status of submitted jobs Getting id and other info about a job
More informationProcesses. CS 475, Spring 2018 Concurrent & Distributed Systems
Processes CS 475, Spring 2018 Concurrent & Distributed Systems Review: Abstractions 2 Review: Concurrency & Parallelism 4 different things: T1 T2 T3 T4 Concurrency: (1 processor) Time T1 T2 T3 T4 T1 T1
More informationPegasus. Automate, recover, and debug scientific computations. Mats Rynge
Pegasus Automate, recover, and debug scientific computations. Mats Rynge rynge@isi.edu https://pegasus.isi.edu Why Pegasus? Automates complex, multi-stage processing pipelines Automate Enables parallel,
More informationPegasus. Automate, recover, and debug scientific computations. Rafael Ferreira da Silva.
Pegasus Automate, recover, and debug scientific computations. Rafael Ferreira da Silva http://pegasus.isi.edu Experiment Timeline Scientific Problem Earth Science, Astronomy, Neuroinformatics, Bioinformatics,
More informationCS 2505 Computer Organization I Test 1
Instructions: Print your name in the space provided below. This examination is closed book and closed notes, aside from the permitted one-page formula sheet. No calculators or other computing devices may
More informationTutorial 4: Condor. John Watt, National e-science Centre
Tutorial 4: Condor John Watt, National e-science Centre Tutorials Timetable Week Day/Time Topic Staff 3 Fri 11am Introduction to Globus J.W. 4 Fri 11am Globus Development J.W. 5 Fri 11am Globus Development
More informationStat Analysis Tool. Filtering Summarizing Aggregating. of Grid-Stat, Point-Stat, & Wavelet-Stat output
Stat Analysis Tool Filtering Summarizing Aggregating of Grid-Stat, Point-Stat, & Wavelet-Stat output Presenter: Tara Jensen What can Stat Analysis do for you? Can I get Q: Overall statistics for all gridded
More informationDistributed Computation Models
Distributed Computation Models SWE 622, Spring 2017 Distributed Software Engineering Some slides ack: Jeff Dean HW4 Recap https://b.socrative.com/ Class: SWE622 2 Review Replicating state machines Case
More informationEnabling Large-scale Scientific Workflows on Petascale Resources Using MPI Master/Worker
Enabling Large-scale Scientific Workflows on Petascale Resources Using MPI Master/Worker Mats Rynge 1 rynge@isi.edu Gideon Juve 1 gideon@isi.edu Karan Vahi 1 vahi@isi.edu Scott Callaghan 2 scottcal@usc.edu
More informationSnakemake overview. Thomas Cokelaer. Nov 9th 2017 Snakemake and Sequana overview. Institut Pasteur
Snakemake overview Thomas Cokelaer Institut Pasteur Nov 9th 2017 Snakemake and Sequana overview Many bioinformatic pipeline frameworks available A review of bioinformatic pipeline frameworks. Jeremy Leipzig
More informationAccelerating Science with High Throughput Computing (HTC)
Accelerating Science with High Throughput Computing (HTC) Miron Livny Morgridge Institute Of Research Center for High Throughput Computing Computer Sciences Department University of Wisconsin-Madison The
More informationIntroduction to HTCondor
Introduction to HTCondor Kenyi Hurtado June 17, 2016 1 Covered in this presentation What is HTCondor? How to run a job (and multiple ones) Monitoring your queue 2 What is HTCondor? A specialized workload
More informationArchitecture of the WMS
Architecture of the WMS Dr. Giuliano Taffoni INFORMATION SYSTEMS UNIT Outline This presentation will cover the following arguments: Overview of WMS Architecture Job Description Language Overview WMProxy
More informationThe MPI API s baseline requirements
LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY - LIGO - CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY Document Type LIGO-T990086-01- E 01/26/2000 The MPI API s baseline
More informationARPS-3dvar Program. 1. Introduction
1. Introduction The ARPS variational data assimilation program 3dvar is designed to assimilate the observation data to ARPS model. The program takes an ARPS forecast as background, and surface data, multi-level
More information2/26/2017. For instance, consider running Word Count across 20 splits
Based on the slides of prof. Pietro Michiardi Hadoop Internals https://github.com/michiard/disc-cloud-course/raw/master/hadoop/hadoop.pdf Job: execution of a MapReduce application across a data set Task:
More informationCSE 417: Algorithms and Computational Complexity. 3.1 Basic Definitions and Applications. Goals. Chapter 3. Winter 2012 Graphs and Graph Algorithms
Chapter 3 CSE 417: Algorithms and Computational Complexity Graphs Reading: 3.1-3.6 Winter 2012 Graphs and Graph Algorithms Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.
More informationITNPBD7 Cluster Computing Spring Using Condor
The aim of this practical is to work through the Condor examples demonstrated in the lectures and adapt them to alternative tasks. Before we start, you will need to map a network drive to \\wsv.cs.stir.ac.uk\datasets
More informationCS 2505 Computer Organization I Test 1
Instructions: Print your name in the space provided below. This examination is closed book and closed notes, aside from the permitted one-page formula sheet. No calculators or other computing devices may
More informationAzure Certification BootCamp for Exam (Developer)
Azure Certification BootCamp for Exam 70-532 (Developer) Course Duration: 5 Days Course Authored by CloudThat Description Microsoft Azure is a cloud computing platform and infrastructure created for building,
More informationUnderstanding WebSphere Business Monitor Failed Events Manager
IBM Software Group Understanding WebSphere Business Monitor Failed Events Manager Sridhar Edam(sedam@us.ibm.com) Staff Software Engineer 17 June 2010 WebSphere Support Technical Exchange Agenda Overview
More informationc y l c a high level introduction CSPP/IMAPP Users Group Meeting SSEC, Madison, Wisconsin, June
a high level introduction CSPP/IMAPP Users Group Meeting SSEC, Madison, Wisconsin, June 27 2017 Adapted from original slides by Hilary J Oliver, NIWA what s cylc? a workflow engine to construct complex,
More informationOutline. ASP 2012 Grid School
Distributed Storage Rob Quick Indiana University Slides courtesy of Derek Weitzel University of Nebraska Lincoln Outline Storage Patterns in Grid Applications Storage
More information$Id: asg4-shell-tree.mm,v :36: $
cmps012b 2002q2 Assignment 4 Shell and Tree Structure page 1 $Id: asg4-shell-tree.mm,v 323.32 2002-05-08 15:36:09-07 - - $ 1. Overview A data structure that is useful in many applications is the Tree.
More informationZooKeeper. Table of contents
by Table of contents 1 ZooKeeper: A Distributed Coordination Service for Distributed Applications... 2 1.1 Design Goals... 2 1.2 Data model and the hierarchical namespace... 3 1.3 Nodes and ephemeral nodes...
More informationMEA 716 WRF Homework, Data Case Spring WRF Real-Data Case Study Simulation
MEA 716 WRF Homework, Data Case Spring 2016 Assigned: Tuesday 2/2/16 Due: Tuesday, 2/9/16 WRF Real-Data Case Study Simulation This assignment takes students through the process of obtaining initial and
More informationScientific Workflows and Cloud Computing. Gideon Juve USC Information Sciences Institute
Scientific Workflows and Cloud Computing Gideon Juve USC Information Sciences Institute gideon@isi.edu Scientific Workflows Loosely-coupled parallel applications Expressed as directed acyclic graphs (DAGs)
More informationA Virtual Comet. HTCondor Week 2017 May Edgar Fajardo On behalf of OSG Software and Technology
A Virtual Comet HTCondor Week 2017 May 3 2017 Edgar Fajardo On behalf of OSG Software and Technology 1 Working in Comet What my friends think I do What Instagram thinks I do What my boss thinks I do 2
More informationDMTN-003: Description of v1.0 of the Alert Production Simulator
DMTN-003: Description of v1.0 of the Alert Production Simulator Release 1.0 Stephen Pietrowicz 2015-12-07 Contents 1 Systems 3 2 Software packages 5 3 Workflow 7 3.1 Status...................................................
More informationThe MPI API s baseline requirements
LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORY - LIGO - CALIFORNIA INSTITUTE OF TECHNOLOGY MASSACHUSETTS INSTITUTE OF TECHNOLOGY Document Type LIGO-T990086-00- E 09/14/1999 The MPI API s baseline
More informationCMSC421: Principles of Operating Systems
CMSC421: Principles of Operating Systems Nilanjan Banerjee Assistant Professor, University of Maryland Baltimore County nilanb@umbc.edu http://www.csee.umbc.edu/~nilanb/teaching/421/ Principles of Operating
More informationParalleliza(on Challenges for Ensemble Data Assimila(on
Paralleliza(on Challenges for Ensemble Data Assimila(on Helen Kershaw Institute for Mathematics Applied to Geophysics, National Center for Atmospheric Research Email: hkershaw@ucar.edu What am I going
More informationData Protection Guide
SnapCenter Software 4.0 Data Protection Guide For Microsoft Exchange Server March 2018 215-12936_C0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to read the SnapCenter Data Protection
More informationA Practical Approach for a Workflow Management System
A Practical Approach for a Workflow Management System Simone Pellegrini, Francesco Giacomini, Antonia Ghiselli INFN Cnaf Viale B. Pichat, 6/2 40127 Bologna {simone.pellegrini francesco.giacomini antonia.ghiselli}@cnaf.infn.it
More informationQUIZ. How could we disable the automatic creation of copyconstructors
QUIZ How could we disable the automatic creation of copyconstructors pre-c++11? What syntax feature did C++11 introduce to make the disabling clearer and more permanent? Give a code example. Ch. 14: Inheritance
More informationLING 408/508: Computational Techniques for Linguists. Lecture 5
LING 408/508: Computational Techniques for Linguists Lecture 5 Last Time Installing Ubuntu 18.04 LTS on top of VirtualBox Your Homework 2: did everyone succeed? Ubuntu VirtualBox Host OS: MacOS or Windows
More informationParallel Programming & Cluster Computing High Throughput Computing
Parallel Programming & Cluster Computing High Throughput Computing Henry Neeman, University of Oklahoma Charlie Peck, Earlham College Tuesday October 11 2011 Outline What is High Throughput Computing?
More informationCREAM-WMS Integration
-WMS Integration Luigi Zangrando (zangrando@pd.infn.it) Moreno Marzolla (marzolla@pd.infn.it) JRA1 IT-CZ Cluster Meeting, Rome, 12 13/9/2005 1 Preliminaries We had a meeting in PD between Francesco G.,
More informationIntroduction to Discovery.
Introduction to Discovery http://discovery.dartmouth.edu The Discovery Cluster 2 Agenda What is a cluster and why use it Overview of computer hardware in cluster Help Available to Discovery Users Logging
More informationEMC SourceOne for Microsoft SharePoint Version 6.7
EMC SourceOne for Microsoft SharePoint Version 6.7 Administration Guide P/N 300-012-746 REV A01 EMC Corporation Corporate Headquarters: Hopkinton, MA 01748-9103 1-508-435-1000 www.emc.com Copyright 2011
More informationAdvanced Job Submission on the Grid
Advanced Job Submission on the Grid Antun Balaz Scientific Computing Laboratory Institute of Physics Belgrade http://www.scl.rs/ 30 Nov 11 Dec 2009 www.eu-egee.org Scope User Interface Submit job Workload
More informationParent Portal. Registration and Login
EPISD Parent Portal Registration and Login Parent Portal Overview Register Online 1. Step-by-step tutorial 2. Register together Login Objectives 1. View Report Card and Attendance 2. Email teacher as introduction
More informationChapter 6: Distributed Systems: The Web. Fall 2012 Sini Ruohomaa Slides joint work with Jussi Kangasharju et al.
Chapter 6: Distributed Systems: The Web Fall 2012 Sini Ruohomaa Slides joint work with Jussi Kangasharju et al. Chapter Outline Web as a distributed system Basic web architecture Content delivery networks
More informationExporting Phones. Using Phone Export CHAPTER
CHAPTER 9 You can use the export utility to merge records from multiple Cisco Unified Communications Manager servers onto one Cisco Unified Communications Manager server. Use this procedure to move records
More informationReport on integration of E-VLBI System
Express Production Real-time e-vlbi Service EXPReS is funded by the European Commission (DG-INFSO), Sixth Framework Programme, Contract #026642 Report on integration of E-VLBI System Title: Report on integration
More informationDay 15: Science Code in Python
Day 15: Science Code in Python 1 Turn In Homework 2 Homework Review 3 Science Code in Python? 4 Custom Code vs. Off-the-Shelf Trade-offs Costs (your time vs. your $$$) Your time (coding vs. learning) Control
More informationSection 1: Tools. Contents CS162. January 19, Make More details about Make Git Commands to know... 3
CS162 January 19, 2017 Contents 1 Make 2 1.1 More details about Make.................................... 2 2 Git 3 2.1 Commands to know....................................... 3 3 GDB: The GNU Debugger
More informationPresented by: Jon Wedell BioMagResBank
HTCondor Tutorial Presented by: Jon Wedell BioMagResBank wedell@bmrb.wisc.edu Background During this tutorial we will walk through submitting several jobs to the HTCondor workload management system. We
More informationBuilding a DeepDive Application Infrastructure
Building a DeepDive Application Infrastructure Ian Ross, University of Wisconsin-Madison Center for High Through Computing iross@cs.wisc.edu Key Questions Can a machine reading system construct a literature-based
More informationShell and Utility Commands
Table of contents 1 Shell Commands... 2 2 Utility Commands... 3 1 Shell Commands 1.1 fs Invokes any FsShell command from within a Pig script or the Grunt shell. 1.1.1 Syntax fs subcommand subcommand_parameters
More informationMapping Abstract Complex Workflows onto Grid Environments
Mapping Abstract Complex Workflows onto Grid Environments Ewa Deelman, James Blythe, Yolanda Gil, Carl Kesselman, Gaurang Mehta, Karan Vahi Information Sciences Institute University of Southern California
More informationPowerful Scheduling made easy Run scheduled jobs in a unattended environment to increase: Throughput, Accuracy, Efficiency.
Powerful Scheduling made easy Run scheduled jobs in a unattended environment to increase: Throughput, Accuracy, Efficiency. ANY Job that can be: Submitted to Batch, Run Interactively on the AS/400 can
More informationPractice of Software Development: Dynamic scheduler for scientific simulations
Practice of Software Development: Dynamic scheduler for scientific simulations @ SimLab EA Teilchen STEINBUCH CENTRE FOR COMPUTING - SCC KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum
More information<Insert Picture Here> Get the best out of Oracle Scheduler: Learn how you can leverage Scheduler for enterprise scheduling
1 Get the best out of Oracle Scheduler: Learn how you can leverage Scheduler for enterprise scheduling Vira Goorah (vira.goorah@oracle.com) Oracle Principal Product Manager Agenda
More informationSkybot Scheduler Release Notes
Skybot Scheduler Release Notes Following is a list of the new features and enhancements included in each release of Skybot Scheduler. Skybot Scheduler 3.5 Skybot Scheduler 3.5 (May 19, 2014 update) Informatica
More informationECE 550D Fundamentals of Computer Systems and Engineering. Fall 2017
ECE 550D Fundamentals of Computer Systems and Engineering Fall 2017 The Operating System (OS) Prof. John Board Duke University Slides are derived from work by Profs. Tyler Bletsch and Andrew Hilton (Duke)
More informationFeatures & Functionalities
Features & Functionalities Release 3.0 www.capture-experts.com Import FEATURES Processing TIF CSV EML Text Clean-up Email HTML ZIP TXT Merge Documents Convert to TIF PST RTF PPT XLS Text Recognition Barcode
More informationHow to install Condor-G
How to install Condor-G Tomasz Wlodek University of the Great State of Texas at Arlington Abstract: I describe the installation procedure for Condor-G Before you start: Go to http://www.cs.wisc.edu/condor/condorg/
More informationBuild a MODRAT model by defining a hydrologic schematic
v. 11.0 WMS 11.0 Tutorial Build a MODRAT model by defining a hydrologic schematic Objectives Learn how to define a basic MODRAT model using the hydrologic schematic tree in WMS by building a tree and defining
More informationDirected Graphical Models (Bayes Nets) (9/4/13)
STA561: Probabilistic machine learning Directed Graphical Models (Bayes Nets) (9/4/13) Lecturer: Barbara Engelhardt Scribes: Richard (Fangjian) Guo, Yan Chen, Siyang Wang, Huayang Cui 1 Introduction For
More informationProject 1 Balanced binary
CMSC262 DS/Alg Applied Blaheta Project 1 Balanced binary Due: 7 September 2017 You saw basic binary search trees in 162, and may remember that their weakness is that in the worst case they behave like
More informationWelcome to ExtremeZ-IP.
Welcome to ExtremeZ-IP. This guide provides the essential steps for setting up ExtremeZ-IP. For detailed File and Print Server configuration instructions, see the ExtremeZ-IP Users Manual PDF. You can
More informationSetup InstrucIons. If you need help with the setup, please put a red sicky note at the top of your laptop.
Setup InstrucIons Please complete these steps for the June 26 th workshop before the lessons start at 1:00 PM: h;p://hcc.unl.edu/june-workshop-setup#weekfour And make sure you can log into Crane. OS-specific
More informationSNAG: SDN-managed Network Architecture for GridFTP Transfers
SNAG: SDN-managed Network Architecture for GridFTP Transfers Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian Bockelman, Garhan Attebury and David Swanson Dept. of Computer Science & Engineering,
More information