A First Step to the Evaluation of SimGrid in the Context of a Real Application. Abdou Guermouche

Size: px

Start display at page:

Download "A First Step to the Evaluation of SimGrid in the Context of a Real Application. Abdou Guermouche"

Gwendoline Peters
5 years ago
Views:

1 A First Step to the Evaluation of SimGrid in the Context of a Real Application Abdou Guermouche Hélène Renard 19th International Heterogeneity in Computing Workshop April 19, 2010 École polytechnique universitaire de Nice-Sophia Antipolis i3s Université Bordeaux 1 LaBRI

2 Abdou Guermouche SimGrid vs Real-Life 2 Plan of presentation 1. Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

3 Framework Plan of presentation Abdou Guermouche SimGrid vs Real-Life 3 1. Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

4 Abdou Guermouche SimGrid vs Real-Life 4 Framework Plan of presentation Data redistribution algorithms 1. Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

Abdou Guermouche SimGrid vs Real-Life 5 Framework Data redistribution algorithms Data redistribution algorithms : context Target platforms: distributed heterogeneous platforms (network of

5 Abdou Guermouche SimGrid vs Real-Life 5 Framework Data redistribution algorithms Data redistribution algorithms : context Target platforms: distributed heterogeneous platforms (network of workstations, clusters of clusters, grids, etc.) 1. Various sources of load imbalance : application requirements / platform. 2. The data must be redistributed to achieve a better load balancing. 3. No discussion of the mechanism of load balancing we consider it as given.

6 Abdou Guermouche SimGrid vs Real-Life 6 Framework Data redistribution algorithms Data redistribution algorithms : context The algorithm operates on a wide array of rectangular sample data: The array is split in vertical slices; This geometric constraint recommends that processors must be organized as a virtual ring: Each processor only communicates twice (once with each neighbor). x i 1,j x i,j 1 x i,j x i,j+1 x i+1,j P i 1 P i P i+1 Figure: Communication scheme.

7 Abdou Guermouche SimGrid vs Real-Life 7 Framework Data redistribution algorithms Redistribution problem for heterogeneous bidirectional rings Definition A redistribution is light if each processor initially owns all data that it will send during the execution of the algorithm. Minimize τ subject to S i,i+1 0 S i,i 1 0 S i,i+1 + S i,i 1 S i+1,i S i 1,i = δ i S i,i+1 c i,i+1 + S i,i 1 c i,i 1 τ S i+1,i c i+1,i + S i 1,i c i 1,i τ 1 i n 1 i n 1 i n 1 i n 1 i n (1) To lead to... We can use the solution of System 1 safely.

8 Abdou Guermouche SimGrid vs Real-Life 8 Framework Plan of presentation Heat propagation 1. Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

9 Abdou Guermouche SimGrid vs Real-Life 9 Laplace equation Framework Heat propagation Context A metal plate to which is applied a source of heat from the edges. The heat will spread within plate. The temperature at the edges is kept constant, the heat distribution in the plate tends to a stationary state. Heat source Laplace equation : 2 f x f y 2 = 0 Heat source

10 Abdou Guermouche SimGrid vs Real-Life 10 Laplace equation : Framework Heat propagation 2 f x + 2 f 2 y = 0 2 Resolution : 1. Approximating the solution discretization grid n 2 points Heat source Heat source 2. Using finite differences on the Laplace equation, this is equivalent to iteratively solve the following equation: 4x i,j (x i 1,j + x i+1,j + x i,j 1 + x i,j+1 ) = 0

11 Abdou Guermouche SimGrid vs Real-Life 11 Laplace equation: Framework Heat propagation 2 f x + 2 f 2 y = 0 2 xi,j 1 xi 1,j xi,j xi,j+1 Same pattern of communication as the ring of processors xi+1,j Pi 1 Pi Pi+1 Figure: Communication scheme. Communication only with immediate neighbors. 3. Solving a linear system Jacobi, since it is of the form: Ax = b, with A and x as... x 1,1... x 1, = b x n,n 1 x n,n

12 Real-life and simulation Plan of presentation Abdou Guermouche SimGrid vs Real-Life Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

13 Abdou Guermouche SimGrid vs Real-Life 13 Real-life and simulation Plan of presentation 1. Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

load balancing and data redistribution on two different

14 Abdou Guermouche SimGrid vs Real-Life 14 Real-life and simulation Goal : Compare the behavior of algorithms for load balancing and data redistribution on two different platforms : Grid 5000 SimGrid Figure: SimGrid Figure: Grid 5000

15 Abdou Guermouche SimGrid vs Real-Life 15 Real-life and simulation The master and the workers Cluster 1... Cluster 3 Monitor Master node (scheduler)... regular communications redistribution/state information monitoring... Cluster 2 Figure: Experimental scheme: the master and the workers. This organization is used in both the simulated and real-life context. The difference comes from the monitor which is given by SimGrid in the simulated context.

16 Abdou Guermouche SimGrid vs Real-Life 15 Real-life and simulation The master and the workers Cluster 1... Cluster 3 Monitor Master node (scheduler)... regular communications redistribution/state information monitoring... Cluster 2 Figure: Experimental scheme: the master and the workers. Master: Gather the results of the measurements. Call the redistribution algorithms when needed.

17 Abdou Guermouche SimGrid vs Real-Life 15 Real-life and simulation The master and the workers Cluster 1... Cluster 3 Monitor Master node (scheduler)... regular communications redistribution/state information monitoring... Cluster 2 Figure: Experimental scheme: the master and the workers. Monitor: Modify (using wrekavoc) the characteristics of the platform.

18 Abdou Guermouche SimGrid vs Real-Life 15 Real-life and simulation The master and the workers Cluster 1... Cluster 3 Monitor Master node (scheduler)... regular communications redistribution/state information monitoring... Cluster 2 Slaves: Figure: Experimental scheme: the master and the workers. Do all the computations and communications. Exchange data for redistribution according to the results of the master.

19 Abdou Guermouche SimGrid vs Real-Life 16 Real-life and simulation Plan of presentation Wrekavoc 1. Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

20 Abdou Guermouche SimGrid vs Real-Life 17 Real-life and simulation Wrekavoc Wrekavoc, in the center of both platforms 1. in our context, Wrekavoc is used to control CPU and network capabilities; of randomly chosen resources; in order to study the behavior of the application. Modify CPU speed Modify Memory available Node Daemon Modify Network bandwith & Latency Figure: Wrekavoc in pictures

21 Abdou Guermouche SimGrid vs Real-Life 18 Real-life and simulation Wrekavoc 1. Real and simulated execution: Retrieve through measurements: processor speed network latency inbound bandwidth Differences: Real execution: the modification of the characteristics of the platform are done using wrekavoc, Simulated execution: the modification of the characteristics of the platfrom is a built-in functionnality of SimGrid.

22 Experimental results Plan of presentation Abdou Guermouche SimGrid vs Real-Life Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

23 Experimental results Time for iteration (in seconds) Real-life Simgrid Iteration number Time for iteration (in seconds) Real-life Simgrid Iteration number (a) No platform variation. (b) With platform variation (3 platform variations, once every 29 iterations). Figure: Time needed (in seconds) for each iteration on the real-life and the simulated platform: one site platform. Abdou Guermouche SimGrid vs Real-Life 20

24 Experimental results Time for iteration (in seconds) Real-life Simgrid Iteration number Time for iteration (in seconds) Real-life Simgrid Iteration number (a) No platform variation. (b) With platform variation (3 platform variations, once every 29 iterations). Figure: Time needed (in seconds) for each iteration on the real-life and the simulated platform: two sites platform. Abdou Guermouche SimGrid vs Real-Life 21

25 Experimental results Time for iteration (in seconds) Real-life Simgrid Iteration number Time for iteration (in seconds) Real-life Simgrid Iteration number (a) No platform variation. (b) With platform variation (3 platform variations, once every 29 iterations). Figure: Time needed (in seconds) for each iteration on the real-life and the simulated platform: five sites platform. Abdou Guermouche SimGrid vs Real-Life 22

26 Experimental results Time for iteration (in seconds) Real-life Simgrid Iteration number Time for iteration (in seconds) Real-life Simgrid Iteration number (a) No platform variation. (b) With platform variation (3 platform variations, once every 29 iterations). Figure: Time needed (in seconds) for each iteration on the real-life and the simulated platform: two sites platform. Each iteration is three time more costly than a regular one. Abdou Guermouche SimGrid vs Real-Life 23

27 Conclusion Plan of presentation Abdou Guermouche SimGrid vs Real-Life Framework Data redistribution algorithms Heat propagation 2. Real-life and simulation Wrekavoc 3. Experimental results 4. Conclusion

28 Abdou Guermouche SimGrid vs Real-Life 25 Conclusion Conclusion 1. Two versions of the same application: the propagation of heat Simulated implementation on top of SimGrid. Real-life implementation running on the Grid 5000 platform. Using wrekavoc to control the characteristics of the platform. Use the same platform characteristics over time in the two contexts. 2. The observed behavior for the simulated case is very close to that of a real execution. 3. A first step for validation of SimGrid in the context of complex applications.

Modalis. A First Step to the Evaluation of SimGrid in the Context of a Real Application. Abdou Guermouche and Hélène Renard, May 5, 2010

Modalis. A First Step to the Evaluation of SimGrid in the Context of a Real Application. Abdou Guermouche and Hélène Renard, May 5, 2010 A First Step to the Evaluation of SimGrid in the Context of a Real Application Abdou Guermouche and Hélène Renard, LaBRI/Univ Bordeaux 1 I3S/École polytechnique universitaire de Nice-Sophia Antipolis May