DAGMan workflow. Kumaran Baskaran

Size: px
Start display at page:

Download "DAGMan workflow. Kumaran Baskaran"

Transcription

1 DAGMan workflow Kumaran Baskaran NMRbox summer workshop June 26-29,2017

2 what is a workflow?

3 Laboratory Publication

4

5 Workflow management

6 Softwares Workflow management

7 Data Softwares Workflow management

8

9 Manual Train

10 Manual Train Automated Train

11 Introduction DAGMan (Directed Acyclic Graph Manager) is a metascheduler for HTCondor Manages dependencies between jobs DAG can be used to represent workflow Nodes (vertices) represent steps and edges (arcs) identify the dependencies. Maintains event logs and generates rescue DAG in case of one or more failed jobs

12 Workflow and Automation Workflow can be automated if it doesn t require any user interaction Goal of the workflow is to capture every step and automation is a special case of it Workflow management is necessary for reproducible computing

13 Workflow Data Step 1 Step 2 Step 3 Result

14 NMR Workflow FID FT Peak picking Peak assign CS list

15 NMR Workflow FID Format conversion FT Format conversion Peak picking Peak assign CS list

16 NMR Workflow FID Varian to pipe FT Pipe to Sparky Peak picking Peak assign CS list

17 NMR Workflow NMRpipe NMRFAM-Sparky FID Varian to pipe FT Pipe to Sparky Peak picking Peak assign CS list

18 Back bone assignment NMRpipe NMRFAM-Sparky FID CBCACONH Varian to pipe FT Pipe to Sparky Peak picking Peak assign CS list NMRpipe NMRFAM-Sparky FID HNCBCA Varian to pipe FT Pipe to Sparky Peak picking Peak assign CS list NMRpipe NMRFAM-Sparky FID NHSQC Varian to pipe FT Pipe to Sparky Peak picking Peak assign CS list

19 Back bone assignment NMRpipe NMRFAM-Sparky FID CBCACONH Varian to pipe FT Pipe to Sparky Peak picking Peak assign NMRpipe NMRFAM-Sparky FID HNCBCA Varian to pipe FT Pipe to Sparky Peak picking Peak assign CS list NMRpipe NMRFAM-Sparky FID NHSQC Varian to pipe FT Pipe to Sparky Peak picking Peak assign

20 Side chain assignment NMRpipe NMRFAM-Sparky FID CCONH Varian to pipe FT Pipe to Sparky Peak picking Peak assign NMRpipe NMRFAM-Sparky FID HBHACONH Varian to pipe FT Pipe to Sparky Peak picking Peak assign CS list NMRpipe NMRFAM-Sparky FID. Varian to pipe FT Pipe to Sparky Peak picking Peak assign

21 NMR Workflow Back bone CS list All atom CS list Side chain CS list MD NMRpipe NMRFAM-Sparky FID NOESY Varian to pipe FT Pipe to Sparky Peak picking Peak assign Distance restraints

22 DAGMan One Node within a DAG [optional] PRE script HRCondor Job(s) [optional] POST script

23

24 Example JOB A A.condor JOB B B.condor JOB C C.condor JOB D D.condor PARENT A CHILD B C PARENT B C CHILD D B A D C

25 Example Key words HTCondor job description files JOB A A.condor JOB B B.condor JOB C C.condor JOB D D.condor PARENT A CHILD B C PARENT B C CHILD D B A D C Job dependencies

26 Complex workflow

27 An error in the workflow

28 Complex workflow

29 Rescue DAG

30 Job done!

31 NMR Example File: NMRWorkflow.dag FID Job Varian2Pipe Varian2Pipe.sub Job FT FT.sub Job Pipe2Sparky Pipe2Sparky.sub Job Sparky Sparky.sub PARENT Varian2Pine CHILD FT PARENT FT CHILD Pine2Sparky PARENT Pine2Sparky CHILD Sparky NMRpipe script to convert Bruker/Varian to pipe NMRpipe script to FT NMRpipe script to convert pipe to UCSF NMRFAM-Sparky

32 File: Varian2Pipe.sub NMR Example universe = local notify_user = kbaskaran@bmrb.wisc.edu notification = Error getenv = True executable = <path>/varian2pipe.com arguments = $(OPTIONAL) log = logs/varian2pipe.log output = logs/varian2pipe.out error = logs/varian2pipe.err copy_to_spool = False priority = 0 queue

33 File:NMRWorkflow.dag NMR Example Job Varian2Pipe(CBCACONH) cbcaconhvarian2pipe.sub Job FT(CBCACONH) cbcaconhft.sub Job Pipe2Sparky(CBCACONH) cbcaconhpipe2sparky.sub Job Varian2Pipe(HNCACB) hncacbvarian2pipe.sub Job FT(HNCACB) hncacbft.sub Job Pipe2Sparky(HNCACB) hncacbpipe2sparky.sub Job Varian2Pipe(NHSQC) nhsqcvarian2pipe.sub Job FT(NHSQC) nhsqcft.sub Job Pipe2Sparky(NHSQC) nhsqcpipe2sparky.sub Job Sparky Sparky.sub VARS Sparky OPTIONAL="/home/nmrbox/kbaskaran/Desktop/ubiquitin/Original_data/CBCACONH.fid/CBCACONH.ucsf / home/nmrbox/kbaskaran/desktop/ubiquitin/original_data/hncacb.fid/hncacb.ucsf /home/nmrbox/kbaskaran/desktop/ ubiquitin/original_data/nhsqc.fid/nhsqc.ucsf" PARENT Varian2Pipe(CBCACONH) CHILD FT(CBCACONH) PARENT FT(CBCACONH) CHILD Pipe2Sparky(CBCACONH) PARENT Varian2Pipe(HNCACB) CHILD FT(HNCACB) PARENT FT(HNCACB) CHILD Pipe2Sparky(HNCACB) PARENT Varian2Pipe(NHSQC) CHILD FT(NHSQC) PARENT FT(NHSQC) CHILD Pipe2Sparky(NHSQC) PARENT Pipe2Sparky(CBCACONH) Pipe2Sparky(HNCACB) Pipe2Sparky(NHSQC) CHILD Sparky

34 File:NMRWorkflow.dag NMR Example DOT dag.dot UPDATE Job Varian2Pipe(CBCACONH) cbcaconhvarian2pipe.sub Job FT(CBCACONH) cbcaconhft.sub Job Pipe2Sparky(CBCACONH) cbcaconhpipe2sparky.sub Job Varian2Pipe(HNCACB) hncacbvarian2pipe.sub Job FT(HNCACB) hncacbft.sub Job Pipe2Sparky(HNCACB) hncacbpipe2sparky.sub Job Varian2Pipe(NHSQC) nhsqcvarian2pipe.sub Job FT(NHSQC) nhsqcft.sub Job Pipe2Sparky(NHSQC) nhsqcpipe2sparky.sub Job Sparky Sparky.sub VARS Sparky OPTIONAL="/home/nmrbox/kbaskaran/Desktop/ubiquitin/Original_data/CBCACONH.fid/CBCACONH.ucsf / home/nmrbox/kbaskaran/desktop/ubiquitin/original_data/hncacb.fid/hncacb.ucsf /home/nmrbox/kbaskaran/desktop/ ubiquitin/original_data/nhsqc.fid/nhsqc.ucsf" PARENT Varian2Pipe(CBCACONH) CHILD FT(CBCACONH) PARENT FT(CBCACONH) CHILD Pipe2Sparky(CBCACONH) PARENT Varian2Pipe(HNCACB) CHILD FT(HNCACB) PARENT FT(HNCACB) CHILD Pipe2Sparky(HNCACB) PARENT Varian2Pipe(NHSQC) CHILD FT(NHSQC) PARENT FT(NHSQC) CHILD Pipe2Sparky(NHSQC) PARENT Pipe2Sparky(CBCACONH) Pipe2Sparky(HNCACB) Pipe2Sparky(NHSQC) CHILD Sparky

35 File:NMRWorkflow.dag NMR Example DOT dag.dot UPDATE Job Varian2Pipe(CBCACONH) cbcaconhvarian2pipe.sub Job FT(CBCACONH) cbcaconhft.sub Job Pipe2Sparky(CBCACONH) cbcaconhpipe2sparky.sub dot -Tps dag.dot -o dag.ps Job Varian2Pipe(HNCACB) hncacbvarian2pipe.sub Job FT(HNCACB) hncacbft.sub will produce dag image Job Pipe2Sparky(HNCACB) hncacbpipe2sparky.sub Job Varian2Pipe(NHSQC) nhsqcvarian2pipe.sub Job FT(NHSQC) nhsqcft.sub Job Pipe2Sparky(NHSQC) nhsqcpipe2sparky.sub Job Sparky Sparky.sub VARS Sparky OPTIONAL="/home/nmrbox/kbaskaran/Desktop/ubiquitin/Original_data/CBCACONH.fid/CBCACONH.ucsf / home/nmrbox/kbaskaran/desktop/ubiquitin/original_data/hncacb.fid/hncacb.ucsf /home/nmrbox/kbaskaran/desktop/ ubiquitin/original_data/nhsqc.fid/nhsqc.ucsf" PARENT Varian2Pipe(CBCACONH) CHILD FT(CBCACONH) PARENT FT(CBCACONH) CHILD Pipe2Sparky(CBCACONH) PARENT Varian2Pipe(HNCACB) CHILD FT(HNCACB) PARENT FT(HNCACB) CHILD Pipe2Sparky(HNCACB) PARENT Varian2Pipe(NHSQC) CHILD FT(NHSQC) PARENT FT(NHSQC) CHILD Pipe2Sparky(NHSQC) PARENT Pipe2Sparky(CBCACONH) Pipe2Sparky(HNCACB) Pipe2Sparky(NHSQC) CHILD Sparky

36 DAGMan commands Submit a dag: condor_submit_dag NMRWorkflow.dag Check the condor jobs : condor_q Delete a job : condor_rm <job id> Generate a DAG image : dot -Tps dag.dot -o dag.ps

37 NMR Example DAG image (all job successful)

38 NMR Example DAG image (some running and one error)

39 NMR Example DAG image (completed with an error)

40 Rescue DAG A rescue DAG is automatically generated whenever one or more jobs fail User may correct the error and submit the rescue DAG using the following command condor_submit_dag -doresucefrom <rescue id> <original DAG file>

41 NMR Example DAG image (Rescue DAG) won't run

42 Advantages of DAGMan Captures every step with extensive logging Easily scaleable from personal condor to high performance cloud based clusters basically it is the same DAG you submit to different condor pool Rescue DAGs are quite useful for debugging Easy to recover from an intermediate stage

43 Disadvantages of DAGMan No GUI, user need to edit the DAG file to make changes Since it is based on acyclic architecture, handling a recursion is not straight forward

44 Current issues There are no well defined steps in structure determination using NMR There are multiple ways to achieve the goal Users may prefer their own choice of softwares tools and protocols It s not easy to define an end point for a particular step Example : In Sparky peak picking is a non trivial process. User may add/delete peaks at anytime. Similarly user may correct an assignment manually anytime

45 Future plan A generic DAG for most common workflow and users may add flavors to it Create a workflow library for different tasks workflow for peak assign workflow for refinement PRE and POST script options could be used to validate the data before and after every job

46 Tutorial Data : 3 time domain data sets available in summer_workshop/dagman/ubiquitin/original_data CBCA(CO)NH HNCACB NHSQC NMRPipe scripts already in the folders Workflow : Condor submit files and a DAG file is available in summer_workshop/dagman/ubiquitin/workflow

47 Tutorial Go to workflow' folder and open the NMRWorkflow.dag in a test editor This contains job definitions and dependencies. Open any one of the.sub file using a text editor This contains condor job descriptions

48 Submitting a DAG Open terminal and go summer_workshop/ DAGMan/ubiquitin/workflow Command tp submit a DAG condor_submit_dag NMRWorkflow.dag

49 Submitting a DAG

50 Checking jobs

51 Generating a DAG image

52 Generating a DAG image

53 Opens Sparky session at the end

54 Rescue dag Rename one of the experimental data folder to create an error Submit the dag and one of the spectra will fail Now you can correct the error and run a rescue DAG to complete only missing jobs

55 Creating an error

56 Submitting DAG with an error to overwrite logs

57 Check the DAG image

58 DAG image Other jobs are running Error

59 Wait for the DAG to complete and generate DAG image

60 DAG completed with an error

61 Rescue file generated

62 Correct the error and run the rescue DAG

63 Correct the error and run the rescue DAG

64 Check the DAG image

65 Done!

From FID to 2D: Processing HSQC Data Using NMRPipe Macro Methods

From FID to 2D: Processing HSQC Data Using NMRPipe Macro Methods The data recorded during a NMR experiment are stored in the form of a digitized free induction decay (FID), which is in the time domain. In order for us to gain information regarding the chemical environment

More information

AN INTRODUCTION TO WORKFLOWS WITH DAGMAN

AN INTRODUCTION TO WORKFLOWS WITH DAGMAN AN INTRODUCTION TO WORKFLOWS WITH DAGMAN Presented by Lauren Michael HTCondor Week 2018 1 Covered In This Tutorial Why Create a Workflow? Describing workflows as directed acyclic graphs (DAGs) Workflow

More information

Grid Compute Resources and Grid Job Management

Grid Compute Resources and Grid Job Management Grid Compute Resources and Job Management March 24-25, 2007 Grid Job Management 1 Job and compute resource management! This module is about running jobs on remote compute resources March 24-25, 2007 Grid

More information

Condor and Workflows: An Introduction. Condor Week 2011

Condor and Workflows: An Introduction. Condor Week 2011 Condor and Workflows: An Introduction Condor Week 2011 Kent Wenger Condor Project Computer Sciences Department University of Wisconsin-Madison Outline > Introduction/motivation > Basic DAG concepts > Running

More information

Day 11: Workflows with DAGMan

Day 11: Workflows with DAGMan Day 11: Workflows with DAGMan Suggested reading: Condor 7.7 Manual: http://www.cs.wisc.edu/condor/manual/v7.7/ Section 2.10: DAGMan Applications Chapter 9: condor_submit_dag 1 Turn In Homework 2 Homework

More information

HTCONDOR USER TUTORIAL. Greg Thain Center for High Throughput Computing University of Wisconsin Madison

HTCONDOR USER TUTORIAL. Greg Thain Center for High Throughput Computing University of Wisconsin Madison HTCONDOR USER TUTORIAL Greg Thain Center for High Throughput Computing University of Wisconsin Madison gthain@cs.wisc.edu 2015 Internet2 HTCondor User Tutorial CONTENTS Overview Basic job submission How

More information

Goals of tutorial. Introduce NMRbox platform

Goals of tutorial. Introduce NMRbox platform Introduce NMRbox platform Goals of tutorial Showcase NMRbox with NUS tools A dozen different NUS processing tools installed and configured more coming. Demonstrate potential of NMRbox Now that the platform

More information

Introduction to the Graphical User Interface of TREND

Introduction to the Graphical User Interface of TREND Introduction to the Graphical User Interface of TREND Description: Both GUI and command-line versions of TREND are available. The arguments and usages of the GUI panels are briefly introduced below. Detailed

More information

Grid Compute Resources and Job Management

Grid Compute Resources and Job Management Grid Compute Resources and Job Management How do we access the grid? Command line with tools that you'll use Specialised applications Ex: Write a program to process images that sends data to run on the

More information

Goals of tutorial. Showcase NMRbox with NUS tools A dozen different NUS processing tools installed and configured more coming.

Goals of tutorial. Showcase NMRbox with NUS tools A dozen different NUS processing tools installed and configured more coming. Introduce NMRbox platform Goals of tutorial Showcase NMRbox with NUS tools A dozen different NUS processing tools installed and configured more coming. Demonstrate potential of NMRbox Now that the platform

More information

SMART NOTEBOOK: A semi-automated approach for the rapid assignment of triple resonance NMR spectra. Valerie Booth Brian Sykes

SMART NOTEBOOK: A semi-automated approach for the rapid assignment of triple resonance NMR spectra. Valerie Booth Brian Sykes SMART NOTEBOOK: A semi-automated approach for the rapid assignment of triple resonance NMR spectra. Programmer: Consultants: Robert Boyko Carolyn Slupsky Valerie Booth Brian Sykes Protein Engineering Network

More information

Cloud Computing. Up until now

Cloud Computing. Up until now Cloud Computing Lectures 3 and 4 Grid Schedulers: Condor, Sun Grid Engine 2012-2013 Introduction. Up until now Definition of Cloud Computing. Grid Computing: Schedulers: Condor architecture. 1 Summary

More information

BMRB is a member of the wwpdb. BMRB collaborates with

BMRB is a member of the wwpdb. BMRB collaborates with BMRB is a member of the wwpdb BMRB collaborates with Using Condor behind the scenes to provide a public CS-Rosetta server at the BioMagResBank Funded By: ! Biological Magnetic Resonance Data Bank What

More information

Getting Started with OSG Connect ~ an Interactive Tutorial ~

Getting Started with OSG Connect ~ an Interactive Tutorial ~ Getting Started with OSG Connect ~ an Interactive Tutorial ~ Emelie Harstad , Mats Rynge , Lincoln Bryant , Suchandra Thapa ,

More information

Project Blackbird. U"lizing Condor and HTC to address archiving online courses at Clemson on a weekly basis. Sam Hoover

Project Blackbird. Ulizing Condor and HTC to address archiving online courses at Clemson on a weekly basis. Sam Hoover Project Blackbird U"lizing Condor and HTC to address archiving online courses at Clemson on a weekly basis Sam Hoover shoover@clemson.edu 1 Project Blackbird Blackboard at Clemson End of Semester archives

More information

History of SURAgrid Deployment

History of SURAgrid Deployment All Hands Meeting: May 20, 2013 History of SURAgrid Deployment Steve Johnson Texas A&M University Copyright 2013, Steve Johnson, All Rights Reserved. Original Deployment Each job would send entire R binary

More information

Tutorial 4: Condor. John Watt, National e-science Centre

Tutorial 4: Condor. John Watt, National e-science Centre Tutorial 4: Condor John Watt, National e-science Centre Tutorials Timetable Week Day/Time Topic Staff 3 Fri 11am Introduction to Globus J.W. 4 Fri 11am Globus Development J.W. 5 Fri 11am Globus Development

More information

Building the International Data Placement Lab. Greg Thain

Building the International Data Placement Lab. Greg Thain Building the International Data Placement Lab Greg Thain What is the IDPL? Overview How we built it Examples of use 2 Who is the IDPL? Phil Papadapolus -- UCSD Miron Livny -- Wisconsin Collaborators in

More information

* need 4-6GB memory... depending on the size of protein and spectrum data

* need 4-6GB memory... depending on the size of protein and spectrum data How to use Filt_Robot 1-1. Before to use Filt_Robot [requirements] * TCL/Tk8.4 or newer * csh * spectrum data in NMRView format (see the NMRPipe macros) currently limited type of spectrum data can be used

More information

! " # " $ $ % & '(()

!  #  $ $ % & '(() !"# " $ $ % &'(() First These slides are available from: http://www.cs.wisc.edu/~roy/italy-condor/ 2 This Morning s Condor Topics * +&, & - *.&- *. & * && - * + $ 3 Part One Matchmaking: Finding Machines

More information

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute Pegasus Workflow Management System Gideon Juve USC Informa3on Sciences Ins3tute Scientific Workflows Orchestrate complex, multi-stage scientific computations Often expressed as directed acyclic graphs

More information

CONNJUR Workflow Builder: Tutorial. Where to download:

CONNJUR Workflow Builder: Tutorial. Where to download: CONNJUR Workflow Builder: Tutorial Part I: Introduction 1.1 About CONNJUR and CONNJUR Workflow Builder (CWB) The CONNJUR Project is to develop an open source integration environment for biomolecular NMR

More information

Presented by: Jon Wedell BioMagResBank

Presented by: Jon Wedell BioMagResBank HTCondor Tutorial Presented by: Jon Wedell BioMagResBank wedell@bmrb.wisc.edu Background During this tutorial we will walk through submitting several jobs to the HTCondor workload management system. We

More information

Introduction to HTCondor

Introduction to HTCondor Introduction to HTCondor Kenyi Hurtado June 17, 2016 1 Covered in this presentation What is HTCondor? How to run a job (and multiple ones) Monitoring your queue 2 What is HTCondor? A specialized workload

More information

Quick Start Guide to nmrpipe Processing of 15 N-HSQC data

Quick Start Guide to nmrpipe Processing of 15 N-HSQC data Quick Start Guide to nmrpipe Processing of 15 N-HSQC data Data conversion: We make the basic assumption here that the data set was recorded on an INOVA-class or later model of NMR instrument running VNMRJ.

More information

User Tools and Languages for Graph-based Grid Workflows

User Tools and Languages for Graph-based Grid Workflows User Tools and Languages for Graph-based Grid Workflows User Tools and Languages for Graph-based Grid Workflows Global Grid Forum 10 Berlin, Germany Grid Workflow Workshop Andreas Hoheisel (andreas.hoheisel@first.fraunhofer.de)

More information

Pegasus WMS Automated Data Management in Shared and Nonshared Environments

Pegasus WMS Automated Data Management in Shared and Nonshared Environments Pegasus WMS Automated Data Management in Shared and Nonshared Environments Mats Rynge USC Information Sciences Institute Pegasus Workflow Management System NSF funded project and developed

More information

Rowland NMR Toolkit (RNMRTK) Overview

Rowland NMR Toolkit (RNMRTK) Overview Rowland NMR Toolkit (RNMRTK) Overview Originally developed as a platform for developing new NMR data processing methods Now widely used as a general processing platform Provides Rich set of apodization

More information

BOSCO Architecture. Derek Weitzel University of Nebraska Lincoln

BOSCO Architecture. Derek Weitzel University of Nebraska Lincoln BOSCO Architecture Derek Weitzel University of Nebraska Lincoln Goals We want an easy to use method for users to do computational research It should be easy to install, use, and maintain It should be simple

More information

HTCondor Essentials. Index

HTCondor Essentials. Index HTCondor Essentials 31.10.2017 Index Login How to submit a job in the HTCondor pool Why the -name option? Submitting a job Checking status of submitted jobs Getting id and other info about a job

More information

Pegasus. Pegasus Workflow Management System. Mats Rynge

Pegasus. Pegasus Workflow Management System. Mats Rynge Pegasus Pegasus Workflow Management System Mats Rynge rynge@isi.edu https://pegasus.isi.edu Automate Why workflows? Recover Automates complex, multi-stage processing pipelines Enables parallel, distributed

More information

Scientific Workflows and Cloud Computing. Gideon Juve USC Information Sciences Institute

Scientific Workflows and Cloud Computing. Gideon Juve USC Information Sciences Institute Scientific Workflows and Cloud Computing Gideon Juve USC Information Sciences Institute gideon@isi.edu Scientific Workflows Loosely-coupled parallel applications Expressed as directed acyclic graphs (DAGs)

More information

Getting Started with Serial and Parallel MATLAB on bwgrid

Getting Started with Serial and Parallel MATLAB on bwgrid Getting Started with Serial and Parallel MATLAB on bwgrid CONFIGURATION Download either bwgrid.remote.r2014b.zip (Windows) or bwgrid.remote.r2014b.tar (Linux/Mac) For Windows users, unzip the download

More information

! " #$%! &%& ' ( $ $ ) $*+ $

!  #$%! &%& ' ( $ $ ) $*+ $ ! " #$%! &%& ' ( $ $ ) $*+ $ + 2 &,-)%./01 ) 2 $ & $ $ ) 340 %(%% 3 &,-)%./01 ) 2 $& $ $ ) 34 0 %(%% $ $ %% $ ) 5 67 89 5 & % % %$ %%)( % % ( %$ ) ( '$ :!"#$%%&%'&( )!)&(!&( *+,& )- &*./ &*( ' 0&/ 1&2

More information

Cloud Computing. Summary

Cloud Computing. Summary Cloud Computing Lectures 2 and 3 Definition of Cloud Computing, Grid Architectures 2012-2013 Summary Definition of Cloud Computing (more complete). Grid Computing: Conceptual Architecture. Condor. 1 Cloud

More information

HIFI NMR : part1 automated backbone assignments using 3D->2D

HIFI NMR : part1 automated backbone assignments using 3D->2D HIFI NMR : part automated backbone assignments using 3D->2D Marco Tonelli National Magnetic Resonance Facility At Madison NMRFAM Recording multidimensional experiments is time costly In conventional multidimensional

More information

Pegasus. Automate, recover, and debug scientific computations. Mats Rynge

Pegasus. Automate, recover, and debug scientific computations. Mats Rynge Pegasus Automate, recover, and debug scientific computations. Mats Rynge rynge@isi.edu https://pegasus.isi.edu Why Pegasus? Automates complex, multi-stage processing pipelines Automate Enables parallel,

More information

AixViPMaP towards an open simulation platform for microstructure modelling

AixViPMaP towards an open simulation platform for microstructure modelling AixViPMaP towards an open simulation platform for microstructure modelling Aachen, den 7. September 2018 Lukas Koschmieder, ICME Group, Steel Institute IEHK, RWTH Aachen University ICME Cloud Service Web-based

More information

Changing landscape of computing at BNL

Changing landscape of computing at BNL Changing landscape of computing at BNL Shared Pool and New Users and Tools HTCondor Week May 2018 William Strecker-Kellogg Shared Pool Merging 6 HTCondor Pools into 1 2 What? Current Situation

More information

Unix/Linux Basics. Cpt S 223, Fall 2007 Copyright: Washington State University

Unix/Linux Basics. Cpt S 223, Fall 2007 Copyright: Washington State University Unix/Linux Basics 1 Some basics to remember Everything is case sensitive Eg., you can have two different files of the same name but different case in the same folder Console-driven (same as terminal )

More information

AN INTRODUCTION TO USING

AN INTRODUCTION TO USING AN INTRODUCTION TO USING Todd Tannenbaum June 6, 2017 HTCondor Week 2017 1 Covered In This Tutorial What is HTCondor? Running a Job with HTCondor How HTCondor Matches and Runs Jobs - pause for questions

More information

Condor-G: HTCondor for grid submission. Jaime Frey (UW-Madison), Jeff Dost (UCSD)

Condor-G: HTCondor for grid submission. Jaime Frey (UW-Madison), Jeff Dost (UCSD) Condor-G: HTCondor for grid submission Jaime Frey (UW-Madison), Jeff Dost (UCSD) Acknowledgement These slides are heavily based on the presentation Jaime Frey gave at UCSD in Feb 2011 http://www.t2.ucsd.edu/twiki2/bin/view/main/glideinfactory1111

More information

What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018

What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018 What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018 Todd Tannenbaum Center for High Throughput Computing Department of Computer Sciences University of Wisconsin-Madison

More information

Table of Contents. Table of Contents Job Manager for remote execution of QuantumATK scripts. A single remote machine

Table of Contents. Table of Contents Job Manager for remote execution of QuantumATK scripts. A single remote machine Table of Contents Table of Contents Job Manager for remote execution of QuantumATK scripts A single remote machine Settings Environment Resources Notifications Diagnostics Save and test the new machine

More information

Working with Basic Linux. Daniel Balagué

Working with Basic Linux. Daniel Balagué Working with Basic Linux Daniel Balagué How Linux Works? Everything in Linux is either a file or a process. A process is an executing program identified with a PID number. It runs in short or long duration

More information

ITNPBD7 Cluster Computing Spring Using Condor

ITNPBD7 Cluster Computing Spring Using Condor The aim of this practical is to work through the Condor examples demonstrated in the lectures and adapt them to alternative tasks. Before we start, you will need to map a network drive to \\wsv.cs.stir.ac.uk\datasets

More information

Lab 2: Threads and Processes

Lab 2: Threads and Processes CS333: Operating Systems Lab Lab 2: Threads and Processes Goal The goal of this lab is to get you comfortable with writing basic multi-process / multi-threaded applications, and understanding their performance.

More information

Dynamic Workflows for Grid Applications

Dynamic Workflows for Grid Applications Dynamic Workflows for Grid Applications Dynamic Workflows for Grid Applications Fraunhofer Resource Grid Fraunhofer Institute for Computer Architecture and Software Technology Berlin Germany Andreas Hoheisel

More information

Automating Real-time Seismic Analysis

Automating Real-time Seismic Analysis Automating Real-time Seismic Analysis Through Streaming and High Throughput Workflows Rafael Ferreira da Silva, Ph.D. http://pegasus.isi.edu Do we need seismic analysis? Pegasus http://pegasus.isi.edu

More information

ArcGIS Enterprise: Advanced Topics in Administration. Thomas Edghill & Moginraj Mohandas

ArcGIS Enterprise: Advanced Topics in Administration. Thomas Edghill & Moginraj Mohandas ArcGIS Enterprise: Advanced Topics in Administration Thomas Edghill & Moginraj Mohandas Outline Overview: Base ArcGIS Enterprise Deployment - Key Components - Administrator Endpoints Advanced Workflows:

More information

NMR INSTRUMENT INSTRUCTIONS: Safety and Sample Preparation

NMR INSTRUMENT INSTRUCTIONS: Safety and Sample Preparation NMR INSTRUMENT INSTRUCTIONS: Safety and Sample Preparation The GVSU chemistry department owns 2 NMR spectrometers. A JEOL Eclipse 300 MHz and a Varian Inova 400 MHz. Due to their strong, constant magnetic

More information

STORK: Making Data Placement a First Class Citizen in the Grid

STORK: Making Data Placement a First Class Citizen in the Grid STORK: Making Data Placement a First Class Citizen in the Grid Tevfik Kosar University of Wisconsin-Madison May 25 th, 2004 CERN Need to move data around.. TB PB TB PB While doing this.. Locate the data

More information

Introduction to UNIX/Linux

Introduction to UNIX/Linux Introduction to UNIX/Linux Biochemistry Boot Camp 2018 Session #3 Nick Fitzkee nfitzkee@chemistry.msstate.edu Operating system (OS) Some terms Command-line interface (CLI) Graphical user interface (GUI)

More information

TreeSearch User Guide

TreeSearch User Guide TreeSearch User Guide Version 0.9 Derrick Stolee University of Nebraska-Lincoln s-dstolee1@math.unl.edu March 30, 2011 Abstract The TreeSearch library abstracts the structure of a search tree in order

More information

HPC Resources at Lehigh. Steve Anthony March 22, 2012

HPC Resources at Lehigh. Steve Anthony March 22, 2012 HPC Resources at Lehigh Steve Anthony March 22, 2012 HPC at Lehigh: Resources What's Available? Service Level Basic Service Level E-1 Service Level E-2 Leaf and Condor Pool Altair Trits, Cuda0, Inferno,

More information

AvePoint Cloud Backup. Release Notes

AvePoint Cloud Backup. Release Notes AvePoint Cloud Backup Release Notes Table of Contents Table of Contents... 2 AvePoint Cloud Backup 1.1.1... 3... 3... 3 AvePoint Cloud Backup 1.1.0... 5... 5... 5 AvePoint Cloud Backup 1.0.4... 6... 6...

More information

Quick Guide to Boardingware: Parent Account

Quick Guide to Boardingware: Parent Account Quick Guide to Boardingware: Parent Account This guide will cover: Connecting to your child Approving your child s leave request Approving your child s leave request via email Viewing your child s live

More information

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science T. Maeno, K. De, A. Klimentov, P. Nilsson, D. Oleynik, S. Panitkin, A. Petrosyan, J. Schovancova, A. Vaniachine,

More information

Contents. Add a Form Element to a Group Box Add a Field to a Form... 22

Contents. Add a Form Element to a Group Box Add a Field to a Form... 22 Workflow Design Guide Version 17 November 2017 Contents About This Guide... 7 Workflows and Forms Overview... 7 Security Permissions for Workflows and Forms... 8 Search for a Workflow Design, Workflow

More information

Oracle. Human Capital Management Cloud Creating and Administering Analytics and Reports for HCM. Release 13 (update 18A)

Oracle. Human Capital Management Cloud Creating and Administering Analytics and Reports for HCM. Release 13 (update 18A) Oracle Human Capital Management Cloud Creating and Administering Analytics and Reports for HCM Release 13 (update 18A) Release 13 (update 18A) Part Number E92271-02 Copyright 2011-2018, Oracle and/or its

More information

Introduction to Linux and Supercomputers

Introduction to Linux and Supercomputers Introduction to Linux and Supercomputers Doug Crabill Senior Academic IT Specialist Department of Statistics Purdue University dgc@purdue.edu What you will learn How to log into a Linux Supercomputer Basics

More information

Managing large-scale workflows with Pegasus

Managing large-scale workflows with Pegasus Funded by the National Science Foundation under the OCI SDCI program, grant #0722019 Managing large-scale workflows with Pegasus Karan Vahi ( vahi@isi.edu) Collaborative Computing Group USC Information

More information

Unix File System. Learning command-line navigation of the file system is essential for efficient system usage

Unix File System. Learning command-line navigation of the file system is essential for efficient system usage ULI101 Week 02 Week Overview Unix file system File types and file naming Basic file system commands: pwd,cd,ls,mkdir,rmdir,mv,cp,rm man pages Text editing Common file utilities: cat,more,less,touch,file,find

More information

Li Yu. University of Notre Dame

Li Yu. University of Notre Dame Li Yu University of Notre Dame 1 Distributed systems are hard to use! An abstraction is a regular structure that can be efficiently scaled up to large problem sizes. We have implemented abstractions such

More information

Grid Mashups. Gluing grids together with Condor and BOINC

Grid Mashups. Gluing grids together with Condor and BOINC Grid Mashups Gluing grids together with Condor and BOINC, Artyom Sharov, Assaf Schuster, Dan Geiger Technion Israel Institute of Technology 1 Problem... 2 Problem... 3 Problem... 4 Parallelization From

More information

XSEDE High Throughput Computing Use Cases

XSEDE High Throughput Computing Use Cases XSEDE High Throughput Computing Use Cases 31 May 2013 Version 0.3 XSEDE HTC Use Cases Page 1 XSEDE HTC Use Cases Page 2 Table of Contents A. Document History B. Document Scope C. High Throughput Computing

More information

Metropolitan Community College, Kansas City. CMS Editor Training. mcckc.edu website content management

Metropolitan Community College, Kansas City. CMS Editor Training. mcckc.edu website content management mcckc.edu website content management MCC Web Applications 7/27/2012 1 Table of Contents About OU Campus... 2 About mcckc.edu... 2 Navigation & Content Layout... 2 Logging In... 3 Website CMS Login... 3

More information

BOSCO Architecture. Derek Weitzel University of Nebraska Lincoln

BOSCO Architecture. Derek Weitzel University of Nebraska Lincoln BOSCO Architecture Derek Weitzel University of Nebraska Lincoln Goals We want an easy to use method for users to do computational research It should be easy to install, use, and maintain It should be simple

More information

Pegasus. Automate, recover, and debug scientific computations. Rafael Ferreira da Silva.

Pegasus. Automate, recover, and debug scientific computations. Rafael Ferreira da Silva. Pegasus Automate, recover, and debug scientific computations. Rafael Ferreira da Silva http://pegasus.isi.edu Experiment Timeline Scientific Problem Earth Science, Astronomy, Neuroinformatics, Bioinformatics,

More information

Gergely Sipos MTA SZTAKI

Gergely Sipos MTA SZTAKI Application development on EGEE with P-GRADE Portal Gergely Sipos MTA SZTAKI sipos@sztaki.hu EGEE Training and Induction EGEE Application Porting Support www.lpds.sztaki.hu/gasuc www.portal.p-grade.hu

More information

Contents. Properties: Field Area Fields Add a Table to a Form... 23

Contents. Properties: Field Area Fields Add a Table to a Form... 23 Workflow Design Guide Version 18 February 2018 Contents About This Guide... 7 Workflows and Forms Overview... 7 Security Permissions for Workflows and Forms... 8 Search for a Workflow Design, Workflow

More information

COMPUTER SCIENCE TRIPOS

COMPUTER SCIENCE TRIPOS CST.2009.4.1 COMPUTER SCIENCE TRIPOS Part IB Tuesday 2 June 2009 1.30 to 4.30 COMPUTER SCIENCE Paper 4 Answer five questions. Submit the answers in five separate bundles, each with its own cover sheet.

More information

Introduction to Condor. Jari Varje

Introduction to Condor. Jari Varje Introduction to Condor Jari Varje 25. 27.4.2016 Outline Basics Condor overview Submitting a job Monitoring jobs Parallel jobs Advanced topics Host requirements Running MATLAB jobs Checkpointing Case study:

More information

Sorting and Searching

Sorting and Searching Sorting and Searching Lecture 2: Priority Queues, Heaps, and Heapsort Lecture 2: Priority Queues, Heaps, and Heapsort Sorting and Searching 1 / 24 Priority Queue: Motivating Example 3 jobs have been submitted

More information

Data Protection Guide

Data Protection Guide SnapCenter Software 4.0 Data Protection Guide For Microsoft Exchange Server March 2018 215-12936_C0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to read the SnapCenter Data Protection

More information

The HTCondor CacheD. Derek Weitzel, Brian Bockelman University of Nebraska Lincoln

The HTCondor CacheD. Derek Weitzel, Brian Bockelman University of Nebraska Lincoln The HTCondor CacheD Derek Weitzel, Brian Bockelman University of Nebraska Lincoln Today s Talk Today s talk summarizes work for my a part of my PhD Dissertation Also, this work has been accepted to PDPTA

More information

HTCondor overview. by Igor Sfiligoi, Jeff Dost (UCSD)

HTCondor overview. by Igor Sfiligoi, Jeff Dost (UCSD) HTCondor overview by Igor Sfiligoi, Jeff Dost (UCSD) Acknowledgement These slides are heavily based on the presentation Todd Tannenbaum gave at CERN in Feb 2011 https://indico.cern.ch/event/124982/timetable/#20110214.detailed

More information

Using Smedge with After Effects

Using Smedge with After Effects Smedge Using Smedge with After Effects Smedge 2018 2004-2017 Überware Table of Contents INTRODUCTION 3 CONFIGURING SMEDGEENGINE TO RENDER AFTER EFFECTS 4 PREPARING YOUR AFTER EFFECTS PROJECT FOR RENDERING

More information

Network Rendering in AccuRender Plan the Master and the Slaves

Network Rendering in AccuRender Plan the Master and the Slaves Network Rendering in AccuRender 4 AccuRender can render single frames in a distributed way over the network. This means you can use many computers that have a copy of AutoCAD and AccuRender 4 installed

More information

NMRbox Release Notes

NMRbox Release Notes NMRbox Release.0 (December, 0) NMRbox Release Notes IMPORTANT: Please Cite and Acknowledge NMRbox in your publications! See https://nmrbox.org/acknowledge-us for details. Software: Version.0 of the NMRbox

More information

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007 Grid Programming: Concepts and Challenges Michael Rokitka SUNY@Buffalo CSE510B 10/2007 Issues Due to Heterogeneous Hardware level Environment Different architectures, chipsets, execution speeds Software

More information

Sorting and Searching

Sorting and Searching Sorting and Searching Lecture 2: Priority Queues, Heaps, and Heapsort Lecture 2: Priority Queues, Heaps, and Heapsort Sorting and Searching 1 / 24 Priority Queue: Motivating Example 3 jobs have been submitted

More information

Title Page. Guide. to Migrating a NetVault Smart Client to a Backup Server

Title Page. Guide. to Migrating a NetVault Smart Client to a Backup Server Title Page Guide to Migrating a NetVault Smart Client to a Backup Server Copyrights Guide to Migrating a NetVault Smart Client to a Backup Server Software Copyright 2008 BakBone Software Documentation

More information

Adding Distribution Settings to a Job Profile (CLUI)

Adding Distribution Settings to a Job Profile (CLUI) CHAPTER 7 Adding Distribution Settings to a Job Profile (CLUI) Revised: October 10, 2008, Distribution tabs tell the Cisco MXE 3000 what to do with output files once encoding is complete. This section

More information

Techniques for Efficient Execution of Large-Scale Scientific Workflows in Distributed Environments

Techniques for Efficient Execution of Large-Scale Scientific Workflows in Distributed Environments Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 11-14-2014 Techniques for Efficient Execution of Large-Scale Scientific Workflows

More information

A Framework for Space and Time Efficient Scheduling of Parallelism

A Framework for Space and Time Efficient Scheduling of Parallelism A Framework for Space and Time Efficient Scheduling of Parallelism Girija J. Narlikar Guy E. Blelloch December 996 CMU-CS-96-97 School of Computer Science Carnegie Mellon University Pittsburgh, PA 523

More information

NMR Assignments using NMRView I: Introduction to NMRView

NMR Assignments using NMRView I: Introduction to NMRView NMR Assignments using NMRView I: Introduction to NMRView In your home directory, you should have a subdirectory called nmrview. The contents of that directory should be as shown below (subdirectories are

More information

Week Overview. Unix file system File types and file naming Basic file system commands: pwd,cd,ls,mkdir,rmdir,mv,cp,rm man pages

Week Overview. Unix file system File types and file naming Basic file system commands: pwd,cd,ls,mkdir,rmdir,mv,cp,rm man pages ULI101 Week 02 Week Overview Unix file system File types and file naming Basic file system commands: pwd,cd,ls,mkdir,rmdir,mv,cp,rm man pages Text editing Common file utilities: cat,more,less,touch,file,find

More information

Intermediate Code Generation

Intermediate Code Generation Intermediate Code Generation In the analysis-synthesis model of a compiler, the front end analyzes a source program and creates an intermediate representation, from which the back end generates target

More information

SPINSv3.0 Users Guide

SPINSv3.0 Users Guide SPINSv3.0 Users Guide Michael Baran mcbaran@cabm.rutgers.edu Table of Contents 1. Introduction 1-1 What is SPINS 1-2 Files SPINS Tracks 1-3 How SPINS Stores Data 1-4 SPINS Schema 1-5 SPINS Data Processing

More information

CMPSCI 250: Introduction to Computation. Lecture #22: Graphs, Paths, and Trees David Mix Barrington 12 March 2014

CMPSCI 250: Introduction to Computation. Lecture #22: Graphs, Paths, and Trees David Mix Barrington 12 March 2014 CMPSCI 250: Introduction to Computation Lecture #22: Graphs, Paths, and Trees David Mix Barrington 12 March 2014 Graphs, Paths, and Trees Graph Definitions Paths and the Path Predicate Cycles, Directed

More information

Transaction Intermediate. Bill Wiseman Shelby Murrell

Transaction Intermediate. Bill Wiseman Shelby Murrell Transaction Intermediate Bill Wiseman Shelby Murrell 1 Course Objectives Gain familiarity with the intermediate use of Winshuttle Studio Transaction Adding functionality to Winshuttle Studio Transaction

More information

Vector Issue Tracker and License Manager - Administrator s Guide. Configuring and Maintaining Vector Issue Tracker and License Manager

Vector Issue Tracker and License Manager - Administrator s Guide. Configuring and Maintaining Vector Issue Tracker and License Manager Vector Issue Tracker and License Manager - Administrator s Guide Configuring and Maintaining Vector Issue Tracker and License Manager Copyright Vector Networks Limited, MetaQuest Software Inc. and NetSupport

More information

Ge#ng Started with the RCE. Len Wisniewski

Ge#ng Started with the RCE. Len Wisniewski Ge#ng Started with the RCE Len Wisniewski First thing to do Sign up for an RCE account Send e- mail to support@help.hmdc.harvard.edu requesdng an RCE account IQSS system administrator will send you a quesdonnaire

More information

Std: XI CHAPTER-3 LINUX

Std: XI CHAPTER-3 LINUX Commands: General format: Command Option Argument Command: ls - Lists the contents of a file. Option: Begins with minus sign (-) ls a Lists including the hidden files. Argument refers to the name of a

More information

If you have suggestions or requests please ask!

If you have suggestions or requests please ask! Tutorial Nonuniform Sampling Software nus-tool v0.3 Presenter Adam Schuyler A. Introduction 1. What is nus-tool? The nus-tool software package is a sample schedule utility that provides functions to generate

More information

CS 215 Fundamentals of Programming II Spring 2019 Very Basic UNIX

CS 215 Fundamentals of Programming II Spring 2019 Very Basic UNIX CS 215 Fundamentals of Programming II Spring 2019 Very Basic UNIX This handout very briefly describes how to use Unix and how to use the Linux server and client machines in the EECS labs that dual boot

More information

INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad

INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad -500 043 COMPUTER SCIENCE AND ENGINEERING TUTORIAL QUESTION BANK Course Name : LINUX PROGRAMMING Course Code : A70511 (R15) Class

More information

Oracle. Human Capital Management Cloud Creating and Administering Analytics and Reports for HCM. Release 13 (update 18B)

Oracle. Human Capital Management Cloud Creating and Administering Analytics and Reports for HCM. Release 13 (update 18B) Oracle Human Capital Management Cloud Creating and Administering Analytics and Reports for HCM Release 13 (update 18B) Release 13 (update 18B) Part Number E94260-01 Copyright 2011-2018, Oracle and/or its

More information

SummaCut & S Class Series Cutter Integration Note

SummaCut & S Class Series Cutter Integration Note Tutorial SummaCut & S Class Series Cutter Integration Note Software version: Asanti 3.0 Document version: November 27, 2017 This document explains how drive the Summa S Class Series cutters. The steps

More information