An efficient multilevel master-slave model for distributed parallel computation
|
|
- Martha McGee
- 6 years ago
- Views:
Transcription
1 An efficient multilevel master-slave model for distributed parallel computation Hsin-Chu Chen,W Alvin Lim,<*) and Nazir A. Warsi^ W Army Center of Research in Information Sciences, Dept. of Computer and Information Sciences, Clark Atlanta University, 223 James P. Brawley Dr. SW, Atlanta, GA hchen@diamond.cau.edu (2) Department of Computer Science and Engineering, Auburn University, 107 Dunstan Hall, Auburn, AL lim@eng.auburn.edu Abstract The master-slave (MS) parallel computing model is one of the most widely used model in a networked computing environment due to its ease of implementation. This model, however, suffers from the disadvantages of the sequential generation of slave processes and heavy communication overheads imposed on the master processor. To overcome this problem, we present in this paper an efficient multilevel master-slave (MMS) scheme which is especially useful for solving decomposable large-scale problems such as structure mechanics or dynamics problems with rotational symmetry, on networked workstations. Our MMS model implements the MS model at multiple levels and generates processes using a special class of tree structures, allowing parallel creation of slave processes. It also improves performance in the distribution of initial data and merging of computed results to and from slave processes. We shall describe the generation of processes using different MMS structures to generate a prescribed number of processes and to broadcast global data to all processes. We then present the implementation of the optimal MMS model via PVM on a networked computer system consisting of workstations for a plate-bending problem that is discretized using the finite strip method The performance of our numerical experiments employing this MMS model is reported to demonstrate its efficiency. This scheme can be applied equally well to other types of problems that can be decomposed using the Fourier decomposition or circular decomposition, no matter whether the physical problem is discretized by the boundary element method or finite element method.
2 440 Boundary Element Technology 1 Introduction The master-slave (MS) parallel programming model [Geis93, GeSS87] is currently one of the most widely used model in networked computing environment in which a collection of heterogeneous or homogeneous computers are connected by a network to serve as a large parallel virtual machine. In this model the initiating process, referred to as the master process, is responsible for spawning (generating) all other processes, referred to as slave processes, to perform tasks assigned to them. Although the tasks assigned to the slaves processes can be different, a typical application of this model is that all slaves perform a set of similar tasks (with different set of data). In such cases, usually only two programs are necessary no matter how many slaves there are. One of the two programs is for the master and the other is for all the slaves. The interprocess communications are usually handled through some form of message passing between the master and the slaves. The advantage of the MS model for parallel computation is simplicity in terms of implementation. It suffers, however, from the disadvantage of heavy communication overheads since all slaves communicate only with the master. There is no direct communications among slaves. This disadvantage is especially apparent when the master needs to collect data from all slaves in order to perform an accumulated sum of them. To overcome the drawbacks of this MS model, an approach that implements the MS model at multiple levels as a complete tree structure is presented in [ChBy95]. In this paper, we develop a more efficient approach for the multilevel master-slave (MMS) scheme, which is useful for solving large-scale master-slave problems, e.g., plate-bending problems arising in the area of structural analysis or groundwater modeling that can be handled by the finite strip method [PuSc90, Cheu68] or heat conduction and wave problems that are solved by a fast Poisson solver [Stra86, pp.457] 2 Multilevel Master-Slave Trees The MMS scheme has been demonstrated to be more efficient than the MS model for the problem presented in [ChBy95] and seems to have a great potential for other applications. In this paper, we further investigate the potential of this approach and present our new results. In particular, we present and analyze a special class of unbalanced m-ary tree structure for the MMS approach. It should be mentioned that the generation of our unbalanced tree structure is not arbitrary. Instead, it is generated based on certain rules to be described in this section. Since there can be many master processes in an MMS model, we refer the process at the root of the tree to as the initial master process. We assume that each process can directly generate at most ra slaves, and any single master process cannot simultaneously generate more than one process
3 Boundary Element Technology 441 Figure 1: Spatial representation of an unbalanced tree at any given time which is the case for a machine with only one processing unit. All distinct processes are assumed to execute on different processors, one process per processor. Note also that processes are created as time advances. Therefore, they are dynamic although processors are static. The time spent in generating a new process is assumed to be fixed and taken to be unity in this paper for convenience. We shall refer processes generated at time t = k, k 0, 1, 2,, to as step-k processes. Now, we are in a position to present our proposed unbalanced m-ary tree models for the MMS scheme. To begin with, wefirstconsider the special case when m 2 and then generalize it to the general case of an arbitrary m, m > 1. Note that when TTi 1, the resulting configuration is simply a pipeline of processes, a degenerate tree with no branches, which is of no interest to us since the generation of processes in this case becomes strictly sequential. Due to a close relation between the Fibonacci sequence and the number of processes generated at each time step, this class of unbalanced m-ary tress will be referred to as Fibonacci m-ary trees in this paper. 2.1 Fibonacci Binary Tree When m = 2, each process can generate at most two processes and the resulting configuration is usually an unbalanced binary tree for time t = k > 1. Shown in Figure 1 is the spatial representation of such an example generated in five time steps where the number next to each of the arrows represents the step number at which the process pointed to by the arrow is generated and the depth denotes the level number. A temporal representation of this tree is shown in Figure 2 where the depth denotes the step number instead of the level. It is not necessary to display the step numbers as shown on the links in this representation. We show them in this figure only for the sake of emphasis. We now use this example to explain the generation procedure. First, the initial master process (A) must enroll itself to the networked computing system and is considered as having been generated at time t 0 (step 0). This process then spends one time unit (from time t = 0 to t = I) to generate process B (step 1). Recall that A cannot generate B and C
4 442 Boundary Element Technology Figure 2: Temporal representation of an unbalanced tree at the same time by our assumption. Once B have been generated, both A and B are ready for generating their own slave, C and D, respectively. The generation of C and D can be completed in one time unit since the generation of C by A is entirely independent of the generation of D by B, by our assumption that different processes execute on different processors. This is done at step 2. Now, process A has already generated two slaves, B and C, and, therefore, stops generating new slaves. Accordingly, only three new slaves are generated at step 3. They are processes E, F, and G generated by B, C, and D, respectively. By the same argument, it is not difficult to see that the five leaves in this figure are the only processes that can be generated at the next step. Similar to processes B and C, these five processes can be generated in parallel since their immediate masters are all distinct. In general, the generation of processes continues until a prescribed number of time steps (or a fixed number of processes) has been reached. Let fn be the number of processes generated according to this procedure at time step n (from time t = n- l t o t = n) and Fn be the total number of processes in the tree at time t = n. It is easy to see that fn follows the interesting Fibonacci sequence: 1, 1, 2, 3, 5, 8,..., i.e., with the initial conditions A = A-i+A_2, n = l, 2,, (1) /o = 1 and /_i = 0. (2) The explicit expression for /n, given (2), is well-known [Tuck84, pp.282]: f _ 1 [,1 + An+l (1-An+ll '"-Vl[^2^ ^ 2 > [ As mentioned earlier, we shall refer this type of unbalanced binary trees to as Fibonacci binary trees due to this recurrence relation, in order to distinguish them from other unbalanced binary trees that do not satisfy this recurrence relation. Since Fn is simply the sum of all processes generated
5 Boundary Element Technology 443 Figure 3: A Fibonacci tree of depth 4 with ra = oo (temporal representation). from step 0 up to step n, we have Fn ]Cfc=o f* which follows the following recurrence relation F^i=2^-F^2, ^ = 0, 1,... (3) with the initial conditions FQ = 1 and F_i = F_2 = 0. This relation enables us to determine iteratively the maximum number of steps required to generate a prescribed number of processes for our MMS scheme. To close this subsection, we stress that all the processes generated at a given time step can be accomplished in parallel since their immediate masters are all distinct, a clear advantage of this approach over the traditional MS model, which spawns processes sequentially. For example, to spawn 11 processes (excluding the initial master), this MMS model takes only four time steps as clearly seen in Figure 2, instead of 11 steps as required by the MS model. 2.2 Minimum-generation-time Tree Model (ra = oo) Obviously, as ra increases, each process is allowed to have more slaves. In the extreme case of m oo, there is no constraint on the number of slaves a process can generate. Given a total number of processes to be generated, this extreme case takes the smallest number of steps (the smallest amount of time) to generate a Fibonacci tree, possibly incomplete. This model is, thus, referred to as the minimum-generation-time (MGT) tree model. Figure 3 shows an example of the Fibonacci tree with ra = oo, generated using 5 time steps, A: = 0, 1,, 4. It deserves mentioning that every complete Fibonacci ra-ary tree with TV < ra is a MGT tree. From our generation rules, it is not difficult to see that 2*~* additional slaves can be generated at time step A:, k > 1, making the total number of processes in the tree equal to 2* in the MGT tree model. This appears, in some sense, to be similar to the creation of hypercube nodes [Leig92, 392]. However, the configuration of this model is different from the hypercube structure, as seen in Figure 3. The recurrence relation of / for MGT can
6 444 Boundary Element Technology be expressed as n-l /n = /* = Z>' n = l, 2,.-. (4) k<n k=0 with /o = 1 since every process generated before time t = n can generate an additional slave at t n for all n > 0 and fk = 0 for any A: < 0. It is trivial to see that Fn+i=2Fn, n = 0, 1, (5) with FQ = 1. The MGT model has some very desirable features. Not only does it spend the least time in generating a Fibonacci tree when the total number of processes is prescribed, but it requires the least time (among all possible ra) to flood data from the root to all other processes and to accumulate results (the reduction operation) computed by each process back to the root, which occurs very often in scientific computations. For each of the flood and reduction operations, it takes only log2(fw) steps to accomplish the task where FN is the total number of processes in the tree, assuming FN is a power of 2. 3 Applications The MMS programming paradigm presented in this paper not only can save time spent in spawning processes in a distributed computing environment, as compared with the standard MS model, but is suitable for distributing global data to all processes and for performing reduction operations, e.g., the accumulated sum of a series of numerical values (scalars or vectors). This programming paradigm is useful for solving discretized physical problems (or partial differential equations) that use some form of Fourier decomposition techniques, such as those involved in the fast Poisson solver, fast biharmonic solver, circular decomposition method, and finite strip method. In each of these schemes, three stages are involved. First, a single problem is decomposed into a number of smaller and independent subproblems. In this stage global data (data required by all processes), normally read from an input file or generated by the initial master processes, need to be broadcast to all processes and process-specific data must be distributed to each involved process, except those that can be generated by the involved process itself. In the second stage, the subproblems are solved in parallel, one problem on one processor in principle, without the need of interprocess communications. Once the solutions to all subproblems are obtained, they are combined/accumulated to yield the final result. Parallel reduction operations are involved in this final stage. In the following, we present an application of our proposed MGT MMS model to the finite strip method for solving a special class of plate-bending problems on a networked system consisting of SUN4 workstations. Before
7 Boundary Element Technology ft Figure 4: A rectangular plate discretized into (n 1) strips presenting the physical problem for our experiments, we briefly describe this approach. The finite strip method [Cheu68] is a highly parallel approach with large-grain parallelism. It is a special type of finite element analysis that approximates the true solution of the displacementfieldusing a combination of continuous harmonic functions that satisfy the boundary conditions in one direction and piecewise interpolation polynomials in the other. The method decomposes a single problem into m subproblems if m harmonic functions are employed in the approximation. In other words, each harmonic term will result in a linear subsystem to solve and the three stages mentioned above are all involved in the analysis. With the MGT model, runtime reconfiguration is not necessary in this approach. 3.1 Problem Statement The physical problem we consider for our experiments is the static analysis of a 48 ft by 32 ft rectangular Mindlin plate (0.5 ft thick), simply supported on all four edges. The plate is subject to a 2Q-kip concentrated load acting downward in the z-direction at (z,y) = (12ft,8ft], as shown in Figure 4. The material of the plate is assumed to be isotropic with a Young's modulus equal to kips/ft* and a Poisson ratio of This problem is taken from [Chen94]. The mathematical modeling of the Mindlin plate problems and the formulation of the algebraic equations using thefinitestrip method can be found in [Mind51, BeHi76] or in [Chen94]. We discretize the plate into 64, 128,..., 2048 strips. Eight harmonic (Fourier) terms are employed to approximate the true solution (displacementfield)for each discretization. 3.2 Implementations and Experiments Our experiments consist of three different implementations: one for sequential execution and two for parallel execution. The two implementations for parallel execution are the standard MS model and our new MGT MMS model, as shown in Figures 5 and 6, respectively. We use the software package PVM3 (Parallel Virtual Machine, Version 3.3.9) [Geis93] to implement interprocess communications. In the standard MS model, we only
8 446 Boundary Element Technology Figure 5: MS model with eight processes (temporal representation) Figure 6: MGT MMS model with eight processes (temporal representation) need two computer programs: one for the master and the other for all the slaves. In the MGT model, we employ four computer programs, which differ mainly in the way the interprocess communication is handled. All processes generated at the same time step share the same code. This, of course, is not a requirement. There are a great number of varieties. For example, one may develop a separate program for a process, or combine the two programs for the intermediate processes into a single program as an alternative. Extensive code sharing among processes, however, usually adds a certain degree of difficulty to the development of the code and may not always be preferable. Our preliminary experimental results are shown in Tables 1, where the CPU time includes both the user and system CPU time in seconds spent in the entire analysis, including the generation of the strip stiffness matrices, assemblage of the linear subsystems, solution to each subsystem, and the calculation of displacements. It should be noted that each timing result represents the best timing observed in a series of six consecutive executions of the same code on the same data. The speedups, defined to be the ratio of the time spent in the sequential code run on one workstation to that spent in the parallel code executed on the networked system configured with eight workstations, for each individual discretization are shown in Table 2. As seen from these two tables, both the MS and MGT (MMS) models yield pretty good performance, as compared with the sequential execution. It is also clear that the MGT model improves the performance obtained by the standard MS scheme, as expected. The improvement is more than 13% in all cases. This is mainly due to the better exploitation of parallelism by the
9 Boundary Element Technology 447 Table 1: Performance of MS and MMS models (CPU time in seconds) No. of strips Sequential CPU time in seconds Traditional MS MGT (MMS) Table 2: Performance of MS and MMS models (individual speedup) No. of strips Sequential Individual speedup Traditional MS MGT (MMS) MGT than by the MS model. 4 Conclusions The master-slave parallel programming model has been widely used in a networked computing environment. This model, however, suffers from the disadvantages that the master alone is responsible for spawning all the slave processes and the heavy communication overhead and delays when all slaves attempt to simultaneously communicate with the single master. In this paper we have presented a more efficient approach for the multilevel masterslave (MMS) model that implements the MS model at multiple levels. The MMS model is useful for large-scale master-slave problems, because it not only allows for parallel creation of slave processes but has the ability to improve performance in flooding global data to and merging results from slave processes. To demonstrate the effectiveness of our proposed approach, we have presented an application of the MGT model to the finite strip method for
10 448 Boundary Element Technology solving a particular class of plate-bending problems that arise in the area of structural analysis. The results of our experiments show that the MGT model improves the performance of parallel computation. 5 Acknowledgment This work was supported in part by the Army Research Laboratory under Grant No. DAAL D065 and in part by the National Science Foundation under grant CCR References [BeHi76] P.R. Benson and E. Hinton, A thick finite strip solution for static, free vibration and stability problems, Int. J. for Numer. Meth. in Eng., 10 (1976), pp [Chen94] H.-C. Chen, Increasing parallelism in the finite strip formulation: static analysis, International Journal on Neural, Parallel & Scientific Computations, Vol. 2, No. 3 (1994), pp [ChBy95] H.-C. Chen and V. Byreddy, Solving plate bending problems using finite strips on network workstations, submitted to Computers & Structures. [Cheu68] Y.K. Cheung, The finite strip method in the analysis of elastic plates with two opposite simply supported ends, Proc. Inst. Civ. Eng., 40(1968), pp [EGSM94] G. Eisenhauer, W. Gu, K. Schwan, and N. Mallavarupu, Falcon - Toward Interactive Parallel Programs: The On-line Steering of a Molecular Dynamics Application, Proceedings of High-Performance Distributed Computing, August [Geis93] A. Geist, et al, PVM3 User's Guide and Reference Manual, ORNL/TM-12187, Oak Ridge National Laboratory. [GeSS87] E.F. Gehringer, D.P. Siewiorek, Z. Segall, Parallel Processing: The CM Experience, Digital Press, MA, [Leig92] F.T. Leighton, Introduction to Parallel Algorithms and Architectures: Arrays. Trees.Hypercubes, Morgan Kaufmann, CA, [Mind51] R.D. Mindlin, Influence of rotatory inertia and shear on flexural motions of isotropic, elastic plates, J. of Applied Mechanics, 18 (1951), pp [PuSc90] J. A. Puckett and R. J. Schmidt, Finite strip method for groundwater modeling in a parallel computing environment, Eng. Comput., 7 (1990), pp [Stra86] G. Strang, Introduction to Applied Mathematics, Wellesley-Cambridge, MA, [Tuck84] A. Tucher, Applied Combinatorics (2nd ed.), John Wiley & Sons, NY, 1984.
Second-order shape optimization of a steel bridge
Computer Aided Optimum Design of Structures 67 Second-order shape optimization of a steel bridge A.F.M. Azevedo, A. Adao da Fonseca Faculty of Engineering, University of Porto, Porto, Portugal Email: alvaro@fe.up.pt,
More informationThe Dynamic Response of an Euler-Bernoulli Beam on an Elastic Foundation by Finite Element Analysis using the Exact Stiffness Matrix
Journal of Physics: Conference Series The Dynamic Response of an Euler-Bernoulli Beam on an Elastic Foundation by Finite Element Analysis using the Exact Stiffness Matrix To cite this article: Jeong Soo
More informationApplication of Finite Volume Method for Structural Analysis
Application of Finite Volume Method for Structural Analysis Saeed-Reza Sabbagh-Yazdi and Milad Bayatlou Associate Professor, Civil Engineering Department of KNToosi University of Technology, PostGraduate
More informationTransactions on Information and Communications Technologies vol 9, 1995 WIT Press, ISSN
Finite difference and finite element analyses using a cluster of workstations K.P. Wang, J.C. Bruch, Jr. Department of Mechanical and Environmental Engineering, q/ca/z/brm'a, 5Wa jbw6wa CW 937% Abstract
More informationAN APPROACH FOR LOAD BALANCING FOR SIMULATION IN HETEROGENEOUS DISTRIBUTED SYSTEMS USING SIMULATION DATA MINING
AN APPROACH FOR LOAD BALANCING FOR SIMULATION IN HETEROGENEOUS DISTRIBUTED SYSTEMS USING SIMULATION DATA MINING Irina Bernst, Patrick Bouillon, Jörg Frochte *, Christof Kaufmann Dept. of Electrical Engineering
More informationExample 24 Spring-back
Example 24 Spring-back Summary The spring-back simulation of sheet metal bent into a hat-shape is studied. The problem is one of the famous tests from the Numisheet 93. As spring-back is generally a quasi-static
More information1314. Estimation of mode shapes expanded from incomplete measurements
34. Estimation of mode shapes expanded from incomplete measurements Sang-Kyu Rim, Hee-Chang Eun, Eun-Taik Lee 3 Department of Architectural Engineering, Kangwon National University, Samcheok, Korea Corresponding
More informationStatic analysis of eolicblade through finite element method and OOP C++
International Conference on Control, Engineering & Information echnology (CEI 4) Proceedings Copyright IPCO-4 ISSN 56-568 Static analysis of eolicblade through finite element method and OOP C++ MateusDantas,
More informationAn explicit feature control approach in structural topology optimization
th World Congress on Structural and Multidisciplinary Optimisation 07 th -2 th, June 205, Sydney Australia An explicit feature control approach in structural topology optimization Weisheng Zhang, Xu Guo
More informationFOUNDATION IN OVERCONSOLIDATED CLAY
1 FOUNDATION IN OVERCONSOLIDATED CLAY In this chapter a first application of PLAXIS 3D is considered, namely the settlement of a foundation in clay. This is the first step in becoming familiar with the
More informationA MATLAB Toolbox for Distributed and Parallel Processing
A MATLAB Toolbox for Distributed and Parallel Processing S. Pawletta a, W. Drewelow a, P. Duenow a, T. Pawletta b and M. Suesse a a Institute of Automatic Control, Department of Electrical Engineering,
More informationROTATIONAL DEPENDENCE OF THE SUPERCONVERGENT PATCH RECOVERY AND ITS REMEDY FOR 4-NODE ISOPARAMETRIC QUADRILATERAL ELEMENTS
COMMUNICATIONS IN NUMERICAL METHODS IN ENGINEERING Commun. Numer. Meth. Engng, 15, 493±499 (1999) ROTATIONAL DEPENDENCE OF THE SUPERCONVERGENT PATCH RECOVERY AND ITS REMEDY FOR 4-NODE ISOPARAMETRIC QUADRILATERAL
More informationTruss structural configuration optimization using the linear extended interior penalty function method
ANZIAM J. 46 (E) pp.c1311 C1326, 2006 C1311 Truss structural configuration optimization using the linear extended interior penalty function method Wahyu Kuntjoro Jamaluddin Mahmud (Received 25 October
More informationModule 1: Introduction to Finite Element Analysis. Lecture 4: Steps in Finite Element Analysis
25 Module 1: Introduction to Finite Element Analysis Lecture 4: Steps in Finite Element Analysis 1.4.1 Loading Conditions There are multiple loading conditions which may be applied to a system. The load
More informationPath Planning. Marcello Restelli. Dipartimento di Elettronica e Informazione Politecnico di Milano tel:
Marcello Restelli Dipartimento di Elettronica e Informazione Politecnico di Milano email: restelli@elet.polimi.it tel: 02 2399 3470 Path Planning Robotica for Computer Engineering students A.A. 2006/2007
More informationBeams. Lesson Objectives:
Beams Lesson Objectives: 1) Derive the member local stiffness values for two-dimensional beam members. 2) Assemble the local stiffness matrix into global coordinates. 3) Assemble the structural stiffness
More informationSETTLEMENT OF A CIRCULAR FOOTING ON SAND
1 SETTLEMENT OF A CIRCULAR FOOTING ON SAND In this chapter a first application is considered, namely the settlement of a circular foundation footing on sand. This is the first step in becoming familiar
More informationModelling Flat Spring Performance Using FEA
Modelling Flat Spring Performance Using FEA Blessing O Fatola, Patrick Keogh and Ben Hicks Department of Mechanical Engineering, University of Corresponding author bf223@bath.ac.uk Abstract. This paper
More informationThe Application of EXCEL in Teaching Finite Element Analysis to Final Year Engineering Students.
The Application of EXCEL in Teaching Finite Element Analysis to Final Year Engineering Students. Kian Teh and Laurie Morgan Curtin University of Technology Abstract. Many commercial programs exist for
More informationHomework # 2 Due: October 6. Programming Multiprocessors: Parallelism, Communication, and Synchronization
ECE669: Parallel Computer Architecture Fall 2 Handout #2 Homework # 2 Due: October 6 Programming Multiprocessors: Parallelism, Communication, and Synchronization 1 Introduction When developing multiprocessor
More informationcourse outline basic principles of numerical analysis, intro FEM
idealization, equilibrium, solutions, interpretation of results types of numerical engineering problems continuous vs discrete systems direct stiffness approach differential & variational formulation introduction
More informationDevelopment of a particle difference scheme for dynamic crack propagation problem
Development of a particle difference scheme for dynamic crack propagation problem *Young-Choel Yoon 1) and Kyeong-Hwan Kim ) 1) Dept. of Civil Engineering, Myongji College, Seoul 10-776, Korea ) Dept.
More informationA METHOD TO MODELIZE THE OVERALL STIFFNESS OF A BUILDING IN A STICK MODEL FITTED TO A 3D MODEL
A METHOD TO MODELIE THE OVERALL STIFFNESS OF A BUILDING IN A STICK MODEL FITTED TO A 3D MODEL Marc LEBELLE 1 SUMMARY The aseismic design of a building using the spectral analysis of a stick model presents
More informationOptimal Design of Steel Columns with Axial Load Using Artificial Neural Networks
2017 2nd International Conference on Applied Mechanics and Mechatronics Engineering (AMME 2017) ISBN: 978-1-60595-521-6 Optimal Design of Steel Columns with Axial Load Using Artificial Neural Networks
More informationLevel-set and ALE Based Topology Optimization Using Nonlinear Programming
10 th World Congress on Structural and Multidisciplinary Optimization May 19-24, 2013, Orlando, Florida, USA Level-set and ALE Based Topology Optimization Using Nonlinear Programming Shintaro Yamasaki
More informationCrashbox Tutorial. In this tutorial the focus is on modeling a Formula Student Racecar Crashbox with HyperCrash 12.0
Crashbox Tutorial In this tutorial the focus is on modeling a Formula Student Racecar Crashbox with HyperCrash 12.0 (Written by Moritz Guenther, student at Altair Engineering GmbH) 1 HyperMesh* 1. Start
More informationFinite Element Method. Chapter 7. Practical considerations in FEM modeling
Finite Element Method Chapter 7 Practical considerations in FEM modeling Finite Element Modeling General Consideration The following are some of the difficult tasks (or decisions) that face the engineer
More informationStress due to surface load
Stress due to surface load To analyze problems such as compressibility of soils, bearing capacity of foundations, stability of embankments, and lateral pressure on earth retaining structures, we need to
More informationy(b)-- Y[a,b]y(a). EQUATIONS ON AN INTEL HYPERCUBE*
SIAM J. ScI. STAT. COMPUT. Vol. 12, No. 6, pp. 1480-1485, November 1991 ()1991 Society for Industrial and Applied Mathematics 015 SOLUTION OF LINEAR SYSTEMS OF ORDINARY DIFFERENTIAL EQUATIONS ON AN INTEL
More informationFB-MULTIPIER vs ADINA VALIDATION MODELING
FB-MULTIPIER vs ADINA VALIDATION MODELING 1. INTRODUCTION 1.1 Purpose of FB-MultiPier Validation testing Performing validation of structural analysis software delineates the capabilities and limitations
More informationUSES AND ABUSES OF AMDAHL'S LAW *
USES AND ABUSES OF AMDAHL'S LAW * S. Krishnaprasad Mathematical, Computing, and Information Sciences Jacksonville State University Jacksonville, AL 36265 Email: skp@jsucc.jsu.edu ABSTRACT Amdahl's law
More informationMATLAB. Advanced Mathematics and Mechanics Applications Using. Third Edition. David Halpern University of Alabama CHAPMAN & HALL/CRC
Advanced Mathematics and Mechanics Applications Using MATLAB Third Edition Howard B. Wilson University of Alabama Louis H. Turcotte Rose-Hulman Institute of Technology David Halpern University of Alabama
More informationHomework # 1 Due: Feb 23. Multicore Programming: An Introduction
C O N D I T I O N S C O N D I T I O N S Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science 6.86: Parallel Computing Spring 21, Agarwal Handout #5 Homework #
More information17. SEISMIC ANALYSIS MODELING TO SATISFY BUILDING CODES
17. SEISMIC ANALYSIS MODELING TO SATISFY BUILDING CODES The Current Building Codes Use the Terminology: Principal Direction without a Unique Definition 17.1 INTRODUCTION { XE "Building Codes" }Currently
More informationCMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC. Guest Lecturer: Sukhyun Song (original slides by Alan Sussman)
CMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC Guest Lecturer: Sukhyun Song (original slides by Alan Sussman) Parallel Programming with Message Passing and Directives 2 MPI + OpenMP Some applications can
More informationEuropean Hyperworks Technology Conference 2010 Mesh and orientation dependance of FE models for dynamic simulations.
European Hyperworks Technology Conference 2010 Mesh and orientation dependance of FE models for dynamic simulations. Sébastien ROTH Jennifer OUDRY Hossein SHAKOURZADEH Influence of mesh density on a finite
More informationAn Approximate Method for Permuting Frame with Repeated Lattice Structure to Equivalent Beam
The Open Ocean Engineering Journal, 2011, 4, 55-59 55 Open Access An Approximate Method for Permuting Frame with Repeated Lattice Structure to Equivalent Beam H.I. Park a, * and C.G. Park b a Department
More informationAnalytical Modeling of Parallel Systems. To accompany the text ``Introduction to Parallel Computing'', Addison Wesley, 2003.
Analytical Modeling of Parallel Systems To accompany the text ``Introduction to Parallel Computing'', Addison Wesley, 2003. Topic Overview Sources of Overhead in Parallel Programs Performance Metrics for
More informationTOPOLOGY OPTIMIZATION OF WING RIBS IN CESSNA CITATION
TOPOLOGY OPTIMIZATION OF WING RIBS IN CESSNA CITATION [1],Sathiyavani S [2], Arun K K [3] 1,2 student, 3 Assistant professor Kumaraguru College of technology, Coimbatore Abstract Structural design optimization
More informationScienceDirect. Vibration Response Prediction of the Printed Circuit Boards using Experimentally Validated Finite Element Model
Available online at www.sciencedirect.com ScienceDirect Procedia Engineering 144 (2016 ) 576 583 12th International Conference on Vibration Problems, ICOVP 2015 Vibration Response Prediction of the Printed
More informationParallel Auction Algorithm for Linear Assignment Problem
Parallel Auction Algorithm for Linear Assignment Problem Xin Jin 1 Introduction The (linear) assignment problem is one of classic combinatorial optimization problems, first appearing in the studies on
More informationThe Level Set Method applied to Structural Topology Optimization
The Level Set Method applied to Structural Topology Optimization Dr Peter Dunning 22-Jan-2013 Structural Optimization Sizing Optimization Shape Optimization Increasing: No. design variables Opportunity
More informationSpeed-up of Parallel Processing of Divisible Loads on k-dimensional Meshes and Tori
The Computer Journal, 46(6, c British Computer Society 2003; all rights reserved Speed-up of Parallel Processing of Divisible Loads on k-dimensional Meshes Tori KEQIN LI Department of Computer Science,
More informationParallelization Strategy
COSC 6374 Parallel Computation Algorithm structure Spring 2008 Parallelization Strategy Finding Concurrency Structure the problem to expose exploitable concurrency Algorithm Structure Supporting Structure
More informationSolve Non-Linear Parabolic Partial Differential Equation by Spline Collocation Method
Solve Non-Linear Parabolic Partial Differential Equation by Spline Collocation Method P.B. Choksi 1 and A.K. Pathak 2 1 Research Scholar, Rai University,Ahemdabad. Email:pinalchoksey@gmail.com 2 H.O.D.
More informationA STRUCTURAL OPTIMIZATION METHODOLOGY USING THE INDEPENDENCE AXIOM
Proceedings of ICAD Cambridge, MA June -3, ICAD A STRUCTURAL OPTIMIZATION METHODOLOGY USING THE INDEPENDENCE AXIOM Kwang Won Lee leekw3@yahoo.com Research Center Daewoo Motor Company 99 Cheongchon-Dong
More informationMODELING MIXED BOUNDARY PROBLEMS WITH THE COMPLEX VARIABLE BOUNDARY ELEMENT METHOD (CVBEM) USING MATLAB AND MATHEMATICA
A. N. Johnson et al., Int. J. Comp. Meth. and Exp. Meas., Vol. 3, No. 3 (2015) 269 278 MODELING MIXED BOUNDARY PROBLEMS WITH THE COMPLEX VARIABLE BOUNDARY ELEMENT METHOD (CVBEM) USING MATLAB AND MATHEMATICA
More information1. Carlos A. Felippa, Introduction to Finite Element Methods,
Chapter Finite Element Methods In this chapter we will consider how one can model the deformation of solid objects under the influence of external (and possibly internal) forces. As we shall see, the coupled
More informationVisualization of errors of finite element solutions P. Beckers, H.G. Zhong, Ph. Andry Aerospace Department, University of Liege, B-4000 Liege, Belgium
Visualization of errors of finite element solutions P. Beckers, H.G. Zhong, Ph. Andry Aerospace Department, University of Liege, B-4000 Liege, Belgium Abstract The aim of this paper is to show how to use
More informationFast Fuzzy Clustering of Infrared Images. 2. brfcm
Fast Fuzzy Clustering of Infrared Images Steven Eschrich, Jingwei Ke, Lawrence O. Hall and Dmitry B. Goldgof Department of Computer Science and Engineering, ENB 118 University of South Florida 4202 E.
More informationCost Models for Query Processing Strategies in the Active Data Repository
Cost Models for Query rocessing Strategies in the Active Data Repository Chialin Chang Institute for Advanced Computer Studies and Department of Computer Science University of Maryland, College ark 272
More informationDevelopment of an Integrated Computational Simulation Method for Fluid Driven Structure Movement and Acoustics
Development of an Integrated Computational Simulation Method for Fluid Driven Structure Movement and Acoustics I. Pantle Fachgebiet Strömungsmaschinen Karlsruher Institut für Technologie KIT Motivation
More informationTopology Optimization of Multiple Load Case Structures
Topology Optimization of Multiple Load Case Structures Rafael Santos Iwamura Exectuive Aviation Engineering Department EMBRAER S.A. rafael.iwamura@embraer.com.br Alfredo Rocha de Faria Department of Mechanical
More informationWired and Wireless Parallel Simulation of Fluid Flow Problem on Heterogeneous Network Cluster
www.ijcsi.org 81 Wired and Wireless Parallel Simulation of Fluid Flow Problem on Heterogeneous Network Cluster Fariha Quratulain Baloch, SE Development Switching, PTCL, Hyderabad, Sindh, Pakistan Mahera
More informationISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 2, Issue 3, September 2012
Mitigation Curves for Determination of Relief Holes to Mitigate Concentration Factor in Thin Plates Loaded Axially for Different Discontinuities Shubhrata Nagpal, S.Sanyal, Nitin Jain Abstract In many
More informationSDC. Engineering Analysis with COSMOSWorks. Paul M. Kurowski Ph.D., P.Eng. SolidWorks 2003 / COSMOSWorks 2003
Engineering Analysis with COSMOSWorks SolidWorks 2003 / COSMOSWorks 2003 Paul M. Kurowski Ph.D., P.Eng. SDC PUBLICATIONS Design Generator, Inc. Schroff Development Corporation www.schroff.com www.schroff-europe.com
More informationFinite Element Analysis of Dynamic Flapper Valve Stresses
Purdue University Purdue e-pubs International Compressor Engineering Conference School of Mechanical Engineering 2000 Finite Element Analysis of Dynamic Flapper Valve Stresses J. R. Lenz Tecumseh Products
More informationStandard dimension optimization of steel frames
Computer Aided Optimum Design in Engineering IX 157 Standard dimension optimization of steel frames U. Klanšek & S. Kravanja University of Maribor, Faculty of Civil Engineering, Slovenia Abstract This
More informationEffect of Mesh Size of Finite Element Method in Modal Analysis for Periodic Symmetric Struts Support
Effect of Mesh Size of Finite Element Method in Modal Analysis for Periodic Symmetric Struts Support Weibing Liu 1,a, *Mamtimin Geni 1,b, and Lie Yu 2 1 (School of Mechanical Engineering, Xinjiang University,
More informationForce density method for simultaneous optimization of geometry and topology of spatial trusses
25-28th September, 217, Hamburg, Germany Annette Bögle, Manfred Grohmann (eds.) Force density method for simultaneous optimization of geometry and topology of spatial trusses Kazuki Hayashi*, Makoto Ohsaki
More informationMatrix Multiplication on an Experimental Parallel System With Hybrid Architecture
Matrix Multiplication on an Experimental Parallel System With Hybrid Architecture SOTIRIOS G. ZIAVRAS and CONSTANTINE N. MANIKOPOULOS Department of Electrical and Computer Engineering New Jersey Institute
More informationEffectiveness of Element Free Galerkin Method over FEM
Effectiveness of Element Free Galerkin Method over FEM Remya C R 1, Suji P 2 1 M Tech Student, Dept. of Civil Engineering, Sri Vellappaly Natesan College of Engineering, Pallickal P O, Mavelikara, Kerala,
More informationFINITE ELEMENT ANALYSIS OF A COMPOSITE CATAMARAN
NAFEMS WORLD CONGRESS 2013, SALZBURG, AUSTRIA FINITE ELEMENT ANALYSIS OF A COMPOSITE CATAMARAN Dr. C. Lequesne, Dr. M. Bruyneel (LMS Samtech, Belgium); Ir. R. Van Vlodorp (Aerofleet, Belgium). Dr. C. Lequesne,
More informationA NEW MILP APPROACH FOR THE FACILITY LAYOUT DESIGN PROBLEM WITH RECTANGULAR AND L/T SHAPED DEPARTMENTS
A NEW MILP APPROACH FOR THE FACILITY LAYOUT DESIGN PROBLEM WITH RECTANGULAR AND L/T SHAPED DEPARTMENTS Yossi Bukchin Michal Tzur Dept. of Industrial Engineering, Tel Aviv University, ISRAEL Abstract In
More information2: Static analysis of a plate
2: Static analysis of a plate Topics covered Project description Using SolidWorks Simulation interface Linear static analysis with solid elements Finding reaction forces Controlling discretization errors
More informationInfluence of mesh density on a finite element model under dynamic loading Sébastien ROTH
Influence of mesh density on a finite element model under dynamic loading Sébastien ROTH Jennifer OUDRY Influence of mesh density on a finite element model under dynamic loading 1. Introduction 2. Theoretical
More informationTopology Optimization of Two Linear Elastic Bodies in Unilateral Contact
2 nd International Conference on Engineering Optimization September 6-9, 2010, Lisbon, Portugal Topology Optimization of Two Linear Elastic Bodies in Unilateral Contact Niclas Strömberg Department of Mechanical
More informationCONTACT STATE AND STRESS ANALYSIS IN A KEY JOINT BY FEM
PERJODICA POLYTECHNICA SER. ME CH. ENG. VOL. 36, NO. 1, PP. -15-60 (1992) CONTACT STATE AND STRESS ANALYSIS IN A KEY JOINT BY FEM K. VARADI and D. M. VERGHESE Institute of Machine Design Technical University,
More informationFinite Element Specialists and Engineering Consultants
Finite Element Specialists and Engineering Consultants Limit Analysis Using Finite Element Techniques Seminar for the Advanced Structural Engineering Module College of Engineering, Mathematics & Physical
More informationParallel Combinatorial Search on Computer Cluster: Sam Loyd s Puzzle
Parallel Combinatorial Search on Computer Cluster: Sam Loyd s Puzzle Plamenka Borovska Abstract: The paper investigates the efficiency of parallel branch-and-bound search on multicomputer cluster for the
More informationData Partitioning. Figure 1-31: Communication Topologies. Regular Partitions
Data In single-program multiple-data (SPMD) parallel programs, global data is partitioned, with a portion of the data assigned to each processing node. Issues relevant to choosing a partitioning strategy
More information3D Nearest-Nodes Finite Element Method for Solid Continuum Analysis
Adv. Theor. Appl. Mech., Vol. 1, 2008, no. 3, 131-139 3D Nearest-Nodes Finite Element Method for Solid Continuum Analysis Yunhua Luo Department of Mechanical & Manufacturing Engineering, University of
More informationDETC APPROXIMATE MOTION SYNTHESIS OF SPHERICAL KINEMATIC CHAINS
Proceedings of the ASME 2007 International Design Engineering Technical Conferences & Computers and Information in Engineering Conference IDETC/CIE 2007 September 4-7, 2007, Las Vegas, Nevada, USA DETC2007-34372
More informationPATCH TEST OF HEXAHEDRAL ELEMENT
Annual Report of ADVENTURE Project ADV-99- (999) PATCH TEST OF HEXAHEDRAL ELEMENT Yoshikazu ISHIHARA * and Hirohisa NOGUCHI * * Mitsubishi Research Institute, Inc. e-mail: y-ishi@mri.co.jp * Department
More informationCOMPARISON OF THE ASSUMED SHAPE AND FINITE ELEMENT METHODS FOR A VIBRATING BRACE-PLATE SYSTEM
Proceedings of The Canadian Society for Mechanical Engineering Forum CSME FORUM June 7-9,, Victoria, British Columbia, Canada COMPARISON OF THE ASSUMED SHAPE AND FINITE ELEMENT METHODS FOR A VIBRATING
More informationLearning Module 8 Shape Optimization
Learning Module 8 Shape Optimization What is a Learning Module? Title Page Guide A Learning Module (LM) is a structured, concise, and self-sufficient learning resource. An LM provides the learner with
More informationContents. Preface xvii Acknowledgments. CHAPTER 1 Introduction to Parallel Computing 1. CHAPTER 2 Parallel Programming Platforms 11
Preface xvii Acknowledgments xix CHAPTER 1 Introduction to Parallel Computing 1 1.1 Motivating Parallelism 2 1.1.1 The Computational Power Argument from Transistors to FLOPS 2 1.1.2 The Memory/Disk Speed
More informationThe Mechanics of Composites Collection Material Minds Software A Product of Materials Sciences Corporation
The Mechanics of Composites Collection Material Minds Software A Product of Materials Sciences Corporation aterial inds Software The Materials Minds Philosophy Engineers are smart people that want the
More informationINTERIOR POINT METHOD BASED CONTACT ALGORITHM FOR STRUCTURAL ANALYSIS OF ELECTRONIC DEVICE MODELS
11th World Congress on Computational Mechanics (WCCM XI) 5th European Conference on Computational Mechanics (ECCM V) 6th European Conference on Computational Fluid Dynamics (ECFD VI) E. Oñate, J. Oliver
More informationThe p-sized partitioning algorithm for fast computation of factorials of numbers
J Supercomput (2006) 38:73 82 DOI 10.1007/s11227-006-7285-5 The p-sized partitioning algorithm for fast computation of factorials of numbers Ahmet Ugur Henry Thompson C Science + Business Media, LLC 2006
More informationDesign Optimization of Building Structures Using a Metamodeling Method
Proceedings of the 3 rd International Conference on Civil, Structural and Transportation Engineering (ICCSTE'18) Niagara Falls, Canada June 10 12, 2018 Paper No. 137 DOI: 10.11159/iccste18.137 Design Optimization
More informationAnalysis of Fluid-Structure Interaction Effects of Liquid-Filled Container under Drop Testing
Kasetsart J. (Nat. Sci.) 42 : 165-176 (2008) Analysis of Fluid-Structure Interaction Effects of Liquid-Filled Container under Drop Testing Chakrit Suvanjumrat*, Tumrong Puttapitukporn and Satjarthip Thusneyapan
More informationA mesh refinement technique for the boundary element method based on local error analysis
A mesh refinement technique for the boundary element method based on local error analysis J. J. Rodriguez
More informationEffect of Mesh Size of Finite Element Analysis in Modal Analysis for Periodic Symmetric Struts Support
Key Engineering Materials Online: 2011-01-20 ISSN: 1662-9795, Vols. 462-463, pp 1008-1012 doi:10.4028/www.scientific.net/kem.462-463.1008 2011 Trans Tech Publications, Switzerland Effect of Mesh Size of
More informationA meshfree weak-strong form method
A meshfree weak-strong form method G. R. & Y. T. GU' 'centre for Advanced Computations in Engineering Science (ACES) Dept. of Mechanical Engineering, National University of Singapore 2~~~ Fellow, Singapore-MIT
More informationCHAPTER 4. Numerical Models. descriptions of the boundary conditions, element types, validation, and the force
CHAPTER 4 Numerical Models This chapter presents the development of numerical models for sandwich beams/plates subjected to four-point bending and the hydromat test system. Detailed descriptions of the
More informationInternational Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS)
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
More informationChapter 3 Analysis of Original Steel Post
Chapter 3. Analysis of original steel post 35 Chapter 3 Analysis of Original Steel Post This type of post is a real functioning structure. It is in service throughout the rail network of Spain as part
More informationThree-dimensional nondestructive evaluation of cylindrical objects (pipe) using an infrared camera coupled to a 3D scanner
Three-dimensional nondestructive evaluation of cylindrical objects (pipe) using an infrared camera coupled to a 3D scanner F. B. Djupkep Dizeu, S. Hesabi, D. Laurendeau, A. Bendada Computer Vision and
More informationA COMPARISON OF MESHES WITH STATIC BUSES AND HALF-DUPLEX WRAP-AROUNDS. and. and
Parallel Processing Letters c World Scientific Publishing Company A COMPARISON OF MESHES WITH STATIC BUSES AND HALF-DUPLEX WRAP-AROUNDS DANNY KRIZANC Department of Computer Science, University of Rochester
More informationParallelization Strategy
COSC 335 Software Design Parallel Design Patterns (II) Spring 2008 Parallelization Strategy Finding Concurrency Structure the problem to expose exploitable concurrency Algorithm Structure Supporting Structure
More informationMesh Models (Chapter 8)
Mesh Models (Chapter 8) 1. Overview of Mesh and Related models. a. Diameter: The linear array is O n, which is large. ThemeshasdiameterO n, which is significantly smaller. b. The size of the diameter is
More informationComputer Life (CPL) ISSN: Finite Element Analysis of Bearing Box on SolidWorks
Computer Life (CPL) ISSN: 1819-4818 Delivering Quality Science to the World Finite Element Analysis of Bearing Box on SolidWorks Chenling Zheng 1, a, Hang Li 1, b and Jianyong Li 1, c 1 Shandong University
More information9 FREE VIBRATION AND EARTHQUAKE ANALYSIS OF A BUILDING
TUTORIAL MANUAL 9 FREE VIBRATION AND EARTHQUAKE ANALYSIS OF A BUILDING This example demonstrates the natural frequency of a long five-storey building when subjected to free vibration and earthquake loading.
More informationChapter 7 Practical Considerations in Modeling. Chapter 7 Practical Considerations in Modeling
CIVL 7/8117 1/43 Chapter 7 Learning Objectives To present concepts that should be considered when modeling for a situation by the finite element method, such as aspect ratio, symmetry, natural subdivisions,
More informationENGINEERING TRIPOS PART IIA FINITE ELEMENT METHOD
ENGINEERING TRIPOS PART IIA LOCATION: DPO EXPERIMENT 3D7 FINITE ELEMENT METHOD Those who have performed the 3C7 experiment should bring the write-up along to this laboratory Objectives Show that the accuracy
More informationCurrent Status of Isogeometric Analysis in LS-DYNA
Current Status of Isogeometric Analysis in LS-DYNA Stefan Hartmann Developer Forum, September 24 th, 2013, Filderstadt, Germany Development in cooperation with: D.J. Benson: Professor of Applied Mechanics,
More informationGuidelines for proper use of Plate elements
Guidelines for proper use of Plate elements In structural analysis using finite element method, the analysis model is created by dividing the entire structure into finite elements. This procedure is known
More informationSet No. 1 IV B.Tech. I Semester Regular Examinations, November 2010 FINITE ELEMENT METHODS (Mechanical Engineering) Time: 3 Hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks
More informationA Novel Image Transform Based on Potential field Source Reverse for Image Analysis
A Novel Image Transform Based on Potential field Source Reverse for Image Analysis X. D. ZHUANG 1,2 and N. E. MASTORAKIS 1,3 1. WSEAS Headquarters, Agiou Ioannou Theologou 17-23, 15773, Zografou, Athens,
More information