Workflow Scheduling Algorithms in Grid Computing

Similar documents
An Exploration of Multi-Objective Scientific Workflow Scheduling in Cloud

Workflow Scheduling Using Heuristics Based Ant Colony Optimization

Reference Point Based Evolutionary Approach for Workflow Grid Scheduling

Hybrid Bee Ant Colony Algorithm for Effective Load Balancing And Job Scheduling In Cloud Computing

Efficient Load Balancing and Fault tolerance Mechanism for Cloud Environment

QoS Guided Min-Mean Task Scheduling Algorithm for Scheduling Dr.G.K.Kamalam

IJSER. features of some popular technologies such as grid

Bio-Inspired Techniques for the Efficient Migration of Virtual Machine for Load Balancing In Cloud Computing

Fuzzy Inspired Hybrid Genetic Approach to Optimize Travelling Salesman Problem

Particle Swarm Optimization Approach with Parameter-wise Hill-climbing Heuristic for Task Allocation of Workflow Applications on the Cloud

Scheduling of Independent Tasks in Cloud Computing Using Modified Genetic Algorithm (FUZZY LOGIC)

New Optimal Load Allocation for Scheduling Divisible Data Grid Applications

A QoS Load Balancing Scheduling Algorithm in Cloud Environment

ADAPTIVE AND DYNAMIC LOAD BALANCING METHODOLOGIES FOR DISTRIBUTED ENVIRONMENT

An Energy Efficient and Delay Aware Data Collection Protocol in Heterogeneous Wireless Sensor Networks A Review

LOW AND HIGH LEVEL HYBRIDIZATION OF ANT COLONY SYSTEM AND GENETIC ALGORITHM FOR JOB SCHEDULING IN GRID COMPUTING

Tasks Scheduling using Ant Colony Optimization

International Journal of Current Trends in Engineering & Technology Volume: 02, Issue: 01 (JAN-FAB 2016)

Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Balancing Strategy in Cloud Computing

Scheduling Scientific Workflows using Imperialist Competitive Algorithm

Efficient Technique for Allocation of Processing Elements to Virtual Machines in Cloud Environment

Framework for Preventing Deadlock : A Resource Co-allocation Issue in Grid Environment

Efficient Task Scheduling Algorithms for Cloud Computing Environment

Optimization of Makespan and Mean Flow Time for Job Shop Scheduling Problem FT06 Using ACO

A Review of Multiprocessor Directed Acyclic Graph (DAG) Scheduling Algorithms

QADR with Energy Consumption for DIA in Cloud

2. Literature review. 3. Model development

Modified Hierarchical Load Balancing Algorithm for Scheduling in Grid Computing

A study of hybridizing Population based Meta heuristics

AN APPROACH FOR LOAD BALANCING FOR SIMULATION IN HETEROGENEOUS DISTRIBUTED SYSTEMS USING SIMULATION DATA MINING

Study on GA-based matching method of railway vehicle wheels

Grid Scheduling Strategy using GA (GSSGA)

FAULT-TOLERANCE AWARE MULTI OBJECTIVE SCHEDULING ALGORITHM FOR TASK SCHEDULING IN COMPUTATIONAL GRID

Survey on Dynamic Resource Allocation Scheduler in Cloud Computing

A New Approach to Ant Colony to Load Balancing in Cloud Computing Environment

WORKFLOW ENGINE FOR CLOUDS

Online Optimization of VM Deployment in IaaS Cloud

Research on Heterogeneous Communication Network for Power Distribution Automation

Index. Encoding representation 330 Enterprise grid 39 Enterprise Grids 5

ENHANCED BEE COLONY ALGORITHM FOR SOLVING TRAVELLING SALESPERSON PROBLEM

A Modified Black hole-based Task Scheduling Technique for Cloud Computing Environment

Workflow scheduling algorithms for hard-deadline constrained cloud environments

Provide a Method of Scheduling In Computational Grid Using Imperialist Competitive Algorithm

Level Based Task Prioritization Scheduling for Small Workflows in Cloud Environment

LOAD BALANCING IN CLOUD COMPUTING USING ANT COLONY OPTIMIZATION

International Journal of Scientific & Engineering Research Volume 9, Issue 3, March-2018 ISSN

Various Strategies of Load Balancing Techniques and Challenges in Distributed Systems

An Optimized Virtual Machine Migration Algorithm for Energy Efficient Data Centers

Task Scheduling Using Probabilistic Ant Colony Heuristics

ACCELERATING THE ANT COLONY OPTIMIZATION

A SURVEY ON CLOUD RESOURCE ALLOCATION STRATEGIES

A hybrid algorithm for grid task scheduling problem

Optimization of Benchmark Functions Using Artificial Bee Colony (ABC) Algorithm

ScienceDirect. Differential Search Algorithm for Multiobjective Problems

Associate Professor, Aditya Engineering College, Surampalem, India 3, 4. Department of CSE, Adikavi Nannaya University, Rajahmundry, India

Hybrid Task Scheduling Method for Cloud Computing by Genetic and PSO Algorithms

A Hybrid Genetic Algorithm for the Distributed Permutation Flowshop Scheduling Problem Yan Li 1, a*, Zhigang Chen 2, b

Figure 1: Virtualization

Performance Analysis of Min-Min, Max-Min and Artificial Bee Colony Load Balancing Algorithms in Cloud Computing.

A Partial Critical Path Based Approach for Grid Workflow Scheduling

Nowadays data-intensive applications play a

HJSA: A HIERARCHICAL JOB SCHEDULING ALGORITHM FOR COST OPTIMIZATION IN CLOUD COMPUTING ENVIRONMENT

Keywords: Load balancing, Honey bee Algorithm, Execution time, response time, cost evaluation.

CLOUD WORKFLOW SCHEDULING BASED ON STANDARD DEVIATION OF PREDICTIVE RESOURCE AVAILABILITY

Scheduling Mixed-Model Assembly Lines with Cost Objectives by a Hybrid Algorithm

An Experimental Cloud Resource Broker System for Virtual Application Control with VM Allocation Scheme

A Tool for Comparing Resource-Constrained Project Scheduling Problem Algorithms

Enhanced Bee Colony Algorithm for Efficient Load Balancing and Scheduling in Cloud

Grid Scheduler. Grid Information Service. Local Resource Manager L l Resource Manager. Single CPU (Time Shared Allocation) (Space Shared Allocation)

A Particle Swarm Optimization Algorithm for Solving Flexible Job-Shop Scheduling Problem

A Novel Hybrid Self Organizing Migrating Algorithm with Mutation for Global Optimization

Energy Efficiency Using Load Balancing in Cloud Data Centers: Proposed Methodology

Minimal Test Cost Feature Selection with Positive Region Constraint

A Comparative Study of Various Scheduling Algorithms in Cloud Computing

Enhanced SLA-Tree Algorithm to Support Incremental Tree Building

Task Scheduling Algorithm in Cloud Computing based on Power Factor

Clustering based Max-Min Scheduling in Cloud Environment

Dynamic task scheduling in cloud computing based on Naïve Bayesian classifier

Solving Travelling Salesmen Problem using Ant Colony Optimization Algorithm

An Approach to Mapping Scientific Workflow in Cloud Computing data centers to Minimize Costs of Workflow Execution

Review On Data Replication with QoS and Energy Consumption for Data Intensive Applications in Cloud Computing

A Static Tasks Assignment For Grid Computing

Solving the Graph Bisection Problem with Imperialist Competitive Algorithm

An Active Resource Management System for Computational Grid*

A Study on Load Balancing in Cloud Computing * Parveen Kumar,* Er.Mandeep Kaur Guru kashi University, Talwandi Sabo

Hybrid Improved Max Min Ant Algorithm for Load Balancing in Cloud

A Heuristic Based Load Balancing Algorithm

Double Threshold Based Load Balancing Approach by Using VM Migration for the Cloud Computing Environment

Optimized Coverage and Efficient Load Balancing Algorithm for WSNs-A Survey P.Gowtham 1, P.Vivek Karthick 2

In cloud computing, IaaS approach is to

CHAOTIC ANT SYSTEM OPTIMIZATION FOR PATH PLANNING OF THE MOBILE ROBOTS

SCIENTIFIC WORKFLOW SCHEDULING IN CLOUD COMPUTING ENVIRONMENT: A SURVEY

K-MEANS BASED CONSENSUS CLUSTERING (KCC) A FRAMEWORK FOR DATASETS

The Study of Genetic Algorithm-based Task Scheduling for Cloud Computing

A SOLUTION FOR REPLICA CONSISTENCY MAINTENANCE IN UNSTRUCTURED PEER-TO-PEER NETWORKS

An Improved Heft Algorithm Using Multi- Criterian Resource Factors

An Improved Task Scheduling Algorithm based on Max-min for Cloud Computing

Comparative Analysis of Anonymization Techniques

Research on Full-text Retrieval based on Lucene in Enterprise Content Management System Lixin Xu 1, a, XiaoLin Fu 2, b, Chunhua Zhang 1, c

Comparative Analysis of Swarm Intelligence Optimization Techniques for Cloud Scheduling

Transcription:

Workflow s in Grid Computing Neha Bhardwaj CSE Department UIET Kurukshetra University, Kurukshetra Haryana, INDIA bhardwaj.mylife@gmail.com Abstract Grid computing is a process of aggregate the functionality of different geographically resources and provide services to the user. is most popular research area in grid computing for achieving high performance. In scheduling, tasks are assigning to resources and maintain the execution. Dependencies constraints are need to preserve for workflow scheduling in grid computing. In this paper, surveyed various workflow scheduling algorithms. Lot of research has been done in the area workflow scheduling but still there is a requirement of optimization techniques like bat optimization, water drop optimization to the workflow scheduling. Keywords Grid ; QoS; Workflow ; Intelligent Water Drop ; Bat. I. INTRODUCTION Grid computing is like a distributed system with non-interactive workloads having large number of files. Grid computing is highly dispersed with heterogeneous resources than cluster computing. Unlike cluster computing, nodes of grid computing are having different task or application to perform. Grid is an environment for solving the problem, which is submitted by the one more user without knowing the location of resources[1]. Grid scheduling is process of assigning the task to resources grid and achieving the high performance. Grid workflow is set of tasks which are distributed on the resources for achieving the goal. Data intensive application such e-science, high energy physics, chemistry etc. uses the grids to manage, share and process large sets of data [1]. Grid having various challenges in security [2], implementation models [3], access control [4], resource management and workflow management. In workflow, the tasks are set of sub tasks and having the dependency among them. Hence there are various challenges faced in grid workflows. Some of the challenges are [5]: Grid having various challenges like accessibility, availability etc. Workflow tasks distributed on various resources for completion. Some tasks of workflow need to be executed in parallel and concurrent manner. Grid workflows create, manage and execute various grid applications with high efficiency. Grid workflow is highly dynamic, heterogeneous and distribution in nature therefore it is very difficult to solve the problem in grid environment. Grid workflow scheduling algorithms based on DAG (Directed Acyclic Graph). Scheduler assigns appropriate resources to workflow tasks for execution to be completed and satisfy objective functions applied by the users. The problem of mapping the workflow tasks to the distributed and heterogeneous resources is belongs to class of problems which is known as NP-Hard problems [6]. For such workflow problems, there is no algorithm which provides the optimal solutions within the polynomial time. Many heuristic and meta-heuristic based algorithms have been proposed for the implementation of workflow in grid environment.is very difficult to solve the problem in grid environment. Grid workflow scheduling algorithms based on DAG (Directed Acyclic Graph) [6]. A. Grid workflow Tasks Type Grid workflow have two types of task [7]: Metatasks: The result of metatasks is not affected by the sequence of metatask because they are dependent but there is no dependent relation among metatasks. The objective of scheduling algorithm for metatask is makespan. Dependent tasks: The tasks having the some relation like data, temporal relation. So, the order of the task cannot be changed. B. Phase The scheduling process of workflow having 3 phases [7]: Matching phase: Resources are selected which satisfy the requirement of tasks. The minimal requirement of tasks defined in terms of static information like memory, architecture etc. ISSN : 2229-3345 Vol. 6 No. 05 May 2015 271

phase: In this phase, the resources are selected for the sequence of task by considering the constraints and rules imposed by the users. s are used for getting the optimal solution due to NP problem. Execution phase: The task is assigned to selected resources and is executed. Some management and administration are considered. II. WORKFLOW SCHEDULING Workflow is set of task, subtasks having dependency among them. Workflow scheduling algorithm takes input of workflow model which provide the set of task(i.e workflow). There are two types of workflow model [6]: 1. Deterministic model: The input tasks and dependency among them is known in advance. 2. Non-deterministic model: The input task and dependency is known at run time. The deterministic type of algorithm is represented in the form DAG. define number jobs in workflow and edges between jobs such as (T i, T j ) edge. T i is a parent of T j hence T i must be executed before execution of T j [8]. Figure 1. DAG Representation of Workflow. In a workflow, task execution is decided on the basis of dependencies among them and executed after completion of parent node. As shown in fig. 1 T 3 is executed after completion of T 0 and T 1. The related task like T 2 and T 3 are assigned to same service node. The priorities are used for solving the conflict among tasks. The two dimensional string to represent the scheduling result of fig. 1 is shown in fig. 2 [8]. Figure 2. Schedule Plan III. EXISTING WORKFLOW SCHEDULING ALGORITHM There are various algorithm was developed in the field of grid computing. Some of the algorithms are: 1. Maria Arsuaga-Rios and Miguel A. Vega-Rodriguez [9] presented a Multiobjective Brain Strom (MOBSA) based on the brain storming in which humans processed the job in order to optimize the job scheduling problem in grid. MOBSA is based on two objectives: Execution time and execution cost. 2. Mansoure Yaghoobi et al. [10] proposed a game theory based approach for minimizing the time and cost. The brokers act as a players and players compete to maximize the profits. 3. P. Mathiyalagan et al. [11] proposed an intelligent water drop algorithm along with ACO algorithm. Intelligent water drop is used to find out the resources according to job requirement and routing information. 4. Xi Li et al. [12] defined a concept of Satisfaction Degree of workflow which is used to represent the probability of workflow to be completed before its deadline. 5. Sunita Bansal and Chittarajan Hota [13] proposed an Efficient on Heterogeneous Computing System (EAHCS) which manage the load across the machines and reduce the makespan. ISSN : 2229-3345 Vol. 6 No. 05 May 2015 272

6. Chaokun Yan et al. [14] proposed a Reliability Enhanced Grid Workflow with budget constraint which can maximize the reliability. 7. Fabio Coutinho et.al [15] defined an energy efficient model and HGreen heuristic which assign the heaviest workflow tasks to energy efficient resources. 8. Dengpan Yin and Tevfik Kosar [16] proposed A-star based data-aware workflow scheduling algorithm. s allow overlap of data placement and task execution and due to this turnaround time and time complexity are decreased. This algorithm extended to the co-scheduling problem. 9. Zhang Wenpeng and Liu Hongzhao [17] proposed algorithm with combination of advantages of genetic algorithm, clonal algorithm and simulated annealing. 10. Wei-Neng Chen and Jun Zhang [18] proposed Ant Colony Optimization algorithm for scheduling the tasks with various parameters. Seven new heuristic was developed for the ACO. Adaptive scheme also developed to enable artificial ants to one of heuristic on the basis pheromone values. 11. Long Hao et al. [19] proposed a Deadlock Segment Leveling (DSL) a novel heuristic which divide the workflow application into segments and further segments are partitioned into groups. 12. Kassian Plankensteiner et al. [20] proposed resubmission impact a fault tolerance heuristic for scientific workflows. is based on task replication and task resubmission. Approach Brain Storm (Swarm algorithm) [9] Game theory Approach [10] Ant Colony with intelligent water drop [11] Satisfaction Degree of workflow(dsdw) [12] Multiobjective Brain Storm (MOBS A) Genetic and Random Ant Colony and Intelligent Water Drop satisfaction Enhanced Table 1. Survey of Workflow Year of Publicati on Type of 2014 Meta- 2013 or Meta- 2013 Meta- Objectives Reduce Execution time and Economic Cost Reduce Time and cost Reduce makespan, improve system utilization and load balancing 2011 Reduce Total Execution Time Future Work objectives. Different choice of algorithm can be used in the proposed Approach. Try optimized more by using different approaches. In scheduling, can be considered reliability, performance of grid resources. Task with maximum completion time assign to fastest machine [13] Budget Constraints [14] HGreen [15] A-star [16] algorithm Efficient on Heterogeneous Computing System(EAHCS) Reliability Enhanced Grid Workflow with Budget Constraint HGreen heuristic and Energy Efficient Model Data-Aware Workflow based on A-star 2011 improved Makespan, System Utilization 2011 QoS based heuristic and Load balancing Better Reliability and Adaptability 2011 Reduced Power Consumption 2011 Minimized turnaround time Communication cost can considered for further work. Performance impact of budget distribution and define improved budget distribution strategy. Explore new heuristics and introduce new execution scenarios. heuristic for more improvement in the algorithm. ISSN : 2229-3345 Vol. 6 No. 05 May 2015 273

Genetic, Clonal Selection, Simulated Annealing [17] Genetic Clonal Annealing (GCAA ) 2010 Meta- Load Balancing GCAA can apply with different features. Implementation of method on real grid. 7 s : Reliability greedy, Time greedy, Suggested, Suggest Budget, Time/Cost, Overall Performance [18] Ant Colony System (ACS) 2009 QoS based Reliability, Time, Cost heuristic and make it more optimized. Segment Leveling [19] Segment leveling 2009 Cost Optimization and Synchronization heuristic so that it become more synchronized and cost optimized. Resubmission Imapct for Fault tolerance combination of Task Replication and Task Resubmission.[20] HEFT with Task Replication 2009 Without information of about resource failure, Resubmission impact wastes on average 42% less than other approaches without changing in task rate and overall performance. Try reduce waste of resources by introducing new methods. IV. PROBLEM FORMULATION Intelligent drop optimization technique is applied with ant colony algorithm to reduce makespan and improve load balance [11]. There is need to apply this optimization technique in such way other constraints like availability, budget constraints etc. are also improved. Bat optimization [21] is also need to apply in scheduling problem of grid environment. Bat optimization algorithm is applied in cloud computing [22] which is an metaheuristic algorithm. V. CONCLUSION In this paper, surveyed various workflow scheduling algorithm. The algorithm was explained and tabulated them on the basis of scheduling approach, type of scheduling, year, objectives and future work. Lots of research carried out but there are still requirement of improvement or need to be optimized. As concluded that there is need to explore intelligent drop optimization and bat optimization in the scheduling problem of grid environment. ISSN : 2229-3345 Vol. 6 No. 05 May 2015 274

REFERENCES [1] J. Yu, and R. Buyya. "A taxonomy of scientific workflow systems for grid computing." ACM Sigmod Record vol. 34,no. 3, pp. 44-49, 2005. [2] E. Cody, R. Sharman, R. H. Rao, and Shambhu Upadhyaya. "Security in grid computing: a review and synthesis." Decision Support Systems, ScienceDirect, vol. 44, no. 4, pp. 749-764, 2005. [3] M. Parashar, J. C.Browne, "Conceptual and implementation models for the grids." in Proceedings of the IEEE, vol. 93, No. 3, 2005. [4] G. Zhang and M. Parashar. "Dynamic context-aware access control for grid applications." in IEEE 4 th International Workshop on Grid Computing, pp. 101-108, 2003. [5] S. Benedict and V. Vasudevan. " of scientific workflows using niched pareto ga for grids." in IEEE International Conference on Service Operations and Logistics, and Informatics, pp. 908-912, 2006. [6] J. Yu, R. Buyya, and K. Ramamohanarao. "Workflow scheduling algorithms for grid computing." Metaheuristics For in Distributed Computing Environments, Springer Berlin Heidelberg, pp. 173-214, 2008. [7] H. Jin, ed. Grid and Cooperative Computing-GCC 2004 Workshops: GCC 2004 International Workshops, IGKG, SGT, GISS, AAC- GEVO, and VVS, Wuhan, China, October 21-24, 2004. Vol. 3252. Springer Science & Business Media, 2004. [8] L. Xiang-ning, Y. Guang-ping, and G. Lin-ying. "Design of a dynamic scheduling engine of grid workflow." in IEEE 3 rd International Conference on Communication Software and Networks, pp. 258-262, 2011. [9] M. Arsuaga-Rios, and M. A. Vega-Rodriguez. "Cost optimization based on brain storming for grid scheduling." in IEEE 4 th International Conference on Innovative Computing Technology, pp. 31-36, 2014. [10] M. Yaghoobi, A. Fanian, H. Khajemohammadi, and T. Aaron Gulliver. "A non-cooperative game theory approach to optimize workflow scheduling in grid computing." in 2013 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, pp. 108-113, 2013. [11] P. Mathiyalagan, S. N. Sivanandam, and K. S. Saranya. "Hybridization of modified ant colony optimization and intelligent water drops algorithm for job scheduling in computational grid." ICTACT Journal on Soft Computing, vol. 4, no. 1, 2013. [12] X. Li, Z. Hu, and C. Yan. "A deadline satisfaction enhanced workflow scheduling algorithm." in 19 th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, pp. 55-61, 2011. [13] S. Bansal, and C. Hota. "Efficient algorithm on heterogeneous computing system." in International Conference on Recent Trends in Information Systems, pp. 57-61, 2011. [14] C. Yan, Z. Hu, X. Li, Z. Hu, and P. Xiao. "Reliability enhanced grid workflow scheduling algorithm with budget constraint." in IEEE International Conference on Information and Automation, pp. 629-633, 2011. [15] F. Coutinho, L. Alfredo V. de Carvalho, and R. Santana. "A workflow scheduling algorithm for optimizing energy-efficient grid resources usage." in IEEE 9 th International Conference on Dependable, Autonomic and Secure Computing, pp. 642-649, 2011. [16] D. Yin, and T. Kosar. "A data-aware workflow scheduling algorithm for heterogeneous distributed systems." in International Conference on High Performance Computing and Simulation, pp. 114-120, 2011. [17] Z. Wenpeng, and L. Hongzhao. "A load balancing method based on genetic clonal annealing strategy in grid environments." in International Conference on Educational and Network Technology, pp. 549-552, 2010. [18] W.-N. Chen, and J. Zhang. "An ant colony optimization approach to a grid workflow scheduling problem with various qos requirements." in IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, vol. 39, no. 1, 29-43, 2009. [19] H. Long, D. Rui-Hua, and H. Ming-He. "Cost-effective heuristics for grid workflow scheduling based on segment leveling." in IEEE WASE International Conference on Information Engineering, vol. 1, pp. 496-501, 2009. [20] K. Plankensteiner, R. Prodan, and T. Fahringer. "A new fault tolerance heuristic for scientific workflows in highly distributed environments based on resubmission impact." in 5 th IEEE International Conference on e-science, pp. 313-320, 2009. [21] J. Krause, J. Cordeiro, R. S. Parpinelli, and H. S. Lopes. "A survey of swarm algorithms applied to discrete optimization problems." Swarm Intelligence and Bio-inspired Computation: Theory and Applications, Elsevier Science & Technology Books, pp. 169-191, 2013. [22] S. Raghavan, C. Marimuthu, P. Sarwesh, and K. Chandrasekaran. "Bat algorithm for scheduling workflow applications in cloud." in International Conference on Electronic Design, Computer Networks & Automated Verification, pp. 139-144, 2015. ISSN : 2229-3345 Vol. 6 No. 05 May 2015 275