9. Efficient Many-Body Algorithms: Barnes-Hut and Fast Multipole

Size: px
Start display at page:

Download "9. Efficient Many-Body Algorithms: Barnes-Hut and Fast Multipole"

Transcription

1 9. Efficient Many-Body Algorithms: Barnes-Hut and Fast Multipole Perlen der Informatik II, Hans-Joachim Bungartz page 1 of 47

2 9.1. Simulation on the Molecular Level Hierarchy of Models Different points of view for simulating human beings: issue level of resolution model basis (e.g.!) global increase in countries, regions population dynamics population local increase in villages, individuals population dynamics population man circulations, organs system simulator blood circulation pump/channels/valves network simulator heart blood cells continuum mechanics cell macro molecules continuum mechanics macro molecules atoms molecular dynamics atoms electrons or finer quantum mechanics Perlen der Informatik II, Hans-Joachim Bungartz page 2 of 47

3 Scales an Important Issue length scales in simulations: from 10 9 m (atoms) to m (galaxy clusters) time scales in simulations: from s to s mass scales in simulations: from g (atoms) to g (galaxies) obviously impossible to take all scales into acount in an explicit and simultaneous way first molecular dynamics simulations reported in 1957 Perlen der Informatik II, Hans-Joachim Bungartz page 3 of 47

4 Simulation on the Molecular Level Physical Model Mathematical Model Long-Range Potentials Applications for Micro and Nano Simulations Lab-on-a-chip, used in brewing technology (Siemens) Flow through a nanotube (where the assumptions of continuum mechanics are no longer valid) Perlen der Informatik II, Hans-Joachim Bungartz page 4 of 47

5 Applications for Micro and Nano Simulations Material science: hexagonal crystal grid of Bornitrid Protein simulation: actin, important component of muscles (overlay of macromolecular model with electron density obtained by X-ray crystallography (brown) and simulation (blue)) Perlen der Informatik II, Hans-Joachim Bungartz page 5 of 47

6 Applications for Micro and Nano Simulations Protein simulation: human haemoglobin (light blue and purple: alpha chains; red and green: beta chains; yellow, black, and dark blue: docked stabilizers or potential docking positions for oxygen) Perlen der Informatik II, Hans-Joachim Bungartz page 6 of 47

7 A Prominent Recent Example: Gordon-Bell-Prize 2005 (most important annual supercomputing award) phenomenon studied: solidification processes in Tantalum and Uranium method: 3D molecular dynamics, up to 524,000,000 atoms simulated machine: IBM Blue Gene/L, 131,072 processors (world s #1 in November 2005) performance: more than 101 TeraFlops (almost 30% of the peak performance) Perlen der Informatik II, Hans-Joachim Bungartz page 7 of 47

8 9.2. Molecular Dynamics the Physical Model Classical Molecular Dynamics Quantum mechanics approximation classical Molecular Dynamics classical Molecular Dynamics is based on Newton s equations of motion molecules are modelled as particles; simplest case: point masses there are interactions between molecules multibody potential functions describe the potential energy of the system; the velocities of the molecules (kinetic energy) are a composition of the Brownian motion (high velocities, no macroscopic movement), flow velocity (for fluids) Perlen der Informatik II, Hans-Joachim Bungartz page 8 of 47

9 Fundamental Interactions Classification of the fundamental interactions: strong nuclear force electromagnetic force weak nuclear force gravity interaction potential energy total potential of N particles is the sum of multibody potentials: U := 0<i<N U 1(r i ) + 0<i<N i<j<n U 2(r i, r j ) + 0<i<N i<j<n j<k<n U 3(r i, r j, r k ) +... there are ( N n ) = N! n!(n n)! O(Nn ) n-body potentials U n, particulary N one-body and 1 2N(N 1) two-body potentials force F = gradu r j r i r k O Perlen der Informatik II, Hans-Joachim Bungartz page 9 of 47

10 Lennard Jones Potential r ij r i r j O a widespread family of intermolecular two-body potentials ( ( ) ( ) n ( ) ) m Lennard Jones potential: U LJ rij = αɛ σ r ij σ r ij with n > m and α = 1 n m n n 1 n m m m continuous and differentiable (C ), since r ij > 0 Perlen der Informatik II, Hans-Joachim Bungartz page 10 of 47

11 LJ 12-6 potential ( ( ) ( ) 12 ( ) ) 6 U LJ rij = 4ɛ σ r ij σ r ij m = 6: van der Waals attraction (van der Waals potential) n = 12: Pauli repulsion (softsphere potential): heuristic application: simulation of inert gases (e.g. Argon) force between 2 molecules: ( ) F ij = U(r ij) 12 ( ) ) 6 r ij = 24ɛ σ r ij (2 r ij σ r ij very fast fade away short range (m = 6 > 3 = d dimension) Perlen der Informatik II, Hans-Joachim Bungartz page 11 of 47

12 9.3. Molecular Dynamics the Mathematical Model System of ODE resulting force acting on a molecule: F i = j i acceleration of a molecule (Newton s 2nd law): F r i = U( r i, r j ) i j i F ij j i r = = ij (1) m i m i m i system of dn coupled ordinary differential equations of 2nd order transferable (as compared to Hamilton formalism) to 2dN coupled ordinary differential equations of 1st order (N: number of molecules, d: dimension), e.g. independent variables q := r and p with p i := m i r i p i = F i Perlen der Informatik II, Hans-Joachim Bungartz page 12 of 47 F ij (2a) (2b)

13 Boundary Conditions Initial Value Problem: position of the molecules and velocities have to be given; initial configuration e.g.: molecules in crystal lattice (body-/face-centered cell) initial velocity random direction absolute value dependent of the temperature (normal distribution or uniform), e.g. 3 P N i=1 mv 2 2 Nk BT = 1 2 v 0 := q 3kB T m i with v i := v 0 resp. v 0 := 3T t time discretisation: t := t 0 + i t time integration procedure Perlen der Informatik II, Hans-Joachim Bungartz page 13 of 47

14 Short-Range Interactions dim. red. Lennard Jones 12 6 Potential 0.5 dim. red. Lennard Jones 12 6 Force U * 0.2 F * r * F ij r * Force matrix/interaction-graph - F 12 F 13 F 14 F 15 F 12 - F 23 F 24 F 25 F 13 F 23 - F 34 F 35 F 14 F 24 F 34 - F 45 F 15 F 25 F 35 F 45 - fast decay of force contributions with increasing distance dense force matrix with O(n 2 ), mostly very small, entries Perlen der Informatik II, Hans-Joachim Bungartz page 14 of 47

15 Short-Range Interactions dim. red. finites Lennard Jones 12 6 Potential (rc=2) 0.5 dim. red. finite Lennard Jones 12 6 Force (rc=2) U * 0.2 F * r * F ij r * Force matrix/interaction-graph - F 12 F 13 F 14 0 F 12-0 F 24 F 25 F F 34 0 F 14 F 24 F 34 - F 45 0 F 25 0 F 45 - cut-off radius leads to a reduction of the computational effort sparse force matrix with O(n) entries Perlen der Informatik II, Hans-Joachim Bungartz page 15 of 47

16 Nucleation Nucleation process of supersaturated Argon t = t = t = t = nucleation process for an oversaturated Argon vapour at 0.97 Mol/l and 80k the simulation program automatically detects clusters (droplets) Perlen der Informatik II, Hans-Joachim Bungartz page 16 of 47

17 Clusters in a supersaturated Argon vapor, 80 K, 0.97 Mol/l f 1 (x) = x f 2 (x) = x f 3 (x) = x cluster size > 20 cluster size > 30 cluster size > 40 f 1 (x) f 2 (x) f 3 (x) number of clusters timesteps (10 3 ) counting and grouping clusters of certain size ranges, a statistic can be generated the growth of the clusters (slope) is known as nucleation rate and important for macroscopic simulations Perlen der Informatik II, Hans-Joachim Bungartz page 17 of 47

18 9.4. Numerical Methods for Long-Range Potentials Numerical Methods for Long-Range Potentials so far: focus on short-range potentials such as Lennard-Jones resulting mutual interactions are restricted to particles in some local neighbourhood facilitates numerical treatment and algorithmic organization: no quadratic complexity induced by an each-with-each behaviour now: tackle long-range potentials, too examples: Coulomb or gravitation potential interactions between remote particles must not be neglected simple cut-off not possible nevertheless need for approaches that avoid quadratic complexity Perlen der Informatik II, Hans-Joachim Bungartz page 18 of 47

19 what is long-range? intuitively: potential function U(r) does not decrease rapidly with increasing r formally (one possibility): for d > 2, potentials not decreasing faster than r d for increasing r (criterion: integrability over R d ) typical potentials in applications have both a short-range part (to be dealt with according to the previous sections) and a long-range part, represented as two additive components: U(r) := U short + U long Perlen der Informatik II, Hans-Joachim Bungartz page 19 of 47

20 Tree-Based Methods based on integral representation of the potential Φ(x) = 1 1 ϱ(y) 4πɛ 0 y x dy hierarchical decompositions of the domain of simulation adaptive approximation of the particle distribution widespread scheme: octrees allow for separation of near-field and far-field influences log-linear or even linear complexity can be obtained Perlen der Informatik II, Hans-Joachim Bungartz page 20 of 47

21 high flexibility with respect to more general potentials (as needed for special applications, such as biomolecular problems) advantageous especially for heterogeneous particle distributions (frequent in astrophysics, relevant also for molecular dynamics) examples: panel clustering Barnes-Hut method (fast) multipole methods Perlen der Informatik II, Hans-Joachim Bungartz page 21 of 47

22 Series Expansion of the Potential general (integral) representation of the potential: Φ(x) = G(x, y)ϱ(y)dy (general kernel G, particle density ϱ(y), and domain Ω) Ω Taylor expansion of the kernel G (if sufficiently smooth apart from the singularity in x = y) in y around y 0 : G(x, y) = j 1 p 1 j! G 0,j(x, y 0 )(y y 0 ) j + R p (x, y) (multi-index j = (j 1, j 2, j 3 ), j! = j 1!j 2!j 3!, G k,j (x, y) mixed (k, j)-th derivative (k-th w.r.t. x, j-th w.r.t. y), remainder R p (x, y)) Perlen der Informatik II, Hans-Joachim Bungartz page 22 of 47

23 leads to expansion (and approximation) of the potential: Φ(x) j 1 p 1 j! M j(ω, y 0 )G 0,j (x, y 0 ) with the so-called moments M j (Ω, y 0 ) := Ω ϱ(y)(y y 0 ) j dy Perlen der Informatik II, Hans-Joachim Bungartz page 23 of 47

24 Subdivision of the Domain separation of near-field and far-field for given x: Ω = Ω near Ω far, Ω near Ω far = decomposition of the far-field into disjoint, convex subdomains: Ω far = i Ω far i note: this decomposition depends on x, i.e. it is done for each particle position x (efficient derivation possible from one hierarchical tree structure) each Ω far i has an associated point y i 0 how to choose the subdivision? diam x y i 0 sup y y y Ωfar i 0 i := x y0 i θ for some suitable constant 0 < θ < 1 Perlen der Informatik II, Hans-Joachim Bungartz page 24 of 47

25 resulting approximation for Φ(x): Φ(x) = = = ϱ(y)g(x, y)dy Ω ϱ(y)g(x, y)dy + ϱ(y)g(x, y)dy Ω near Ω far ϱ(y)g(x, y)dy + ϱ(y)g(x, y)dy Ω near i Ω far i Ω near ϱ(y)g(x, y)dy + i j 1 p 1 j! M j(ω far i, y i 0)G 0,j (x, y i 0) Perlen der Informatik II, Hans-Joachim Bungartz page 25 of 47

26 Error Estimates error characteristics for one fix particle position x Ω: local relative approximation error for one Ω far i can be shown to be of order O(θ p+1 ) global relative approximation error (summation over whole far-field) can be shown to be of order O(θ p+1 ) this clarifies the role of θ: allows to control the global approximation error in x geometric requirement to the far-field subdivision: the closer Ω far i is located to x, the smaller it has to be to fulfil the θ-condition hence: a typical level of detail the closer, the higher resolved cf. terrain representation in flight simulators Perlen der Informatik II, Hans-Joachim Bungartz page 26 of 47

27 Tree Structures central question: how can we construct all these necessary separations of near-fields and far-fields and subdivisions of far-fields in an efficient way? idea: recursive decomposition of Ω (a square in 2D, a cube in 3D without loss of generality) in cells of different size, terminating the subdivision process if a cell is either empty or contains just one particle concepts: kd-tree: alternate subdivision in coordinate direction (x, y, and z), such that the separation produces two subdomains that roughly contain the same number of particles each quadtree (2D) or octree (3D): subdivision into four congruent subsquares or eight congruent subcubes, respectively the following algorithms (Barnes-Hut etc.) use the octree approach Perlen der Informatik II, Hans-Joachim Bungartz page 27 of 47

28 kd-trees Example Perlen der Informatik II, Hans-Joachim Bungartz page 28 of 47

29 Quadtrees and Octrees Examples Perlen der Informatik II, Hans-Joachim Bungartz page 29 of 47

30 Recursive Computation of the Far-Field starting point: create the octree corresponding to the set of particles each node of the octree represents a subdomain of Ω or one cell for each cell i, define some y0 i (the centre point or the centre of gravity of all particles contained, e.g.) for doing the Taylor expansion for each cell i, let the parameter diam just denote the diameter of the smallest surrounding sphere, e.g. objective: for each particle position x, use as few cells as possible (i.e. as big cells as possible) for fulfilling the diam-θ rule hence: start from root node, check diam x y i 0 θ, stop if fulfilled (no need for further subdivision) and proceed if not yet fulfilled Perlen der Informatik II, Hans-Joachim Bungartz page 30 of 47

31 note that for each x, we typically get a different subdivision but note also that all these subdivisions are just subtrees of our constructed octree Perlen der Informatik II, Hans-Joachim Bungartz page 31 of 47

32 Recursive Computation of the Moments now: use this subdivision for the efficient calculation of the local moments M j (Ω far i, y i 0 ) direct (numerical) integration or direct summation are not efficient therefore: use hierarchical tree structure to calculate all moments for all cells in one run and store them crucial property for that: M j (Ω 1 Ω 2, y 0 ) = M j (Ω 1, y 0 ) + M j (Ω 2, y 0 ), if the point of expansion y 0 is the same in the (standard) case of different y0 1 and y 0 2, there are simple conversion formulas: M j (Ω 1, ŷ 0 ) = ( ) j (y 0 ŷ 0 ) j k M k (Ω 1, y 0 ) k k j (k j component-wise, multiplicative binomial coefficients) Perlen der Informatik II, Hans-Joachim Bungartz page 32 of 47

33 this allows for a bottom-up calculation of the moments from the leaves to the root in the leaves: if no particle present: zero if one particle of mass m there in x: m(x y i 0 )j Perlen der Informatik II, Hans-Joachim Bungartz page 33 of 47

34 Using these Building Blocks still to be done for a numerical routine: how to construct the tree, starting from a given set of particles? how to store the tree? how to choose cells and expansion points? how to determine far-field and near-field? several algorithmic variants to be discussed in the following Perlen der Informatik II, Hans-Joachim Bungartz page 34 of 47

35 The Barnes-Hut Method the so-called Barnes-Hut method is the oldest, simplest, and most widespread hierarchical tree-based approach, dates back to 1986 original (and still main) target applications: astrophysics (high particle numbers and a typically very heterogeneous density distribution) particle-particle interaction via gravitation potential (or modifications): U(r ij ) = G Grav m i m j r ij uses octrees: leaves of the tree represent empty cells or one particle, inner nodes represent clusters of particles or pseudo particles main underlying idea: gravitation (or other force) induced by many particles in one remote cell can be approximated by the influence induced by one pseudo particle of the accumulated mass in the cell s centre of gravity Perlen der Informatik II, Hans-Joachim Bungartz page 35 of 47

36 three main steps: construction of the octree: refinement, until just one or no particle per cell (top-down recursion) calculation of pseudo particles: mass is just sum of the cell s particles masses, associated point is the mass-weighted average of the particles positions (bottom-up recursion) calculation of forces (N incomplete top-down traversals for N particles) Perlen der Informatik II, Hans-Joachim Bungartz page 36 of 47

37 Calculation of the Forces remember: to be done for each particle assume a particle with position x Ω start in root node proceed to son nodes, until the θ rule: diam r θ is fulfilled, where r denotes the distance of the corresponding pseudo particle to x then, add the resulting interaction influence to the overall result implicit separation into near-field and far-field: near-field: all leaves reached during this traversal (single-particle influence) far-field: all inner nodes where the process stops, i.e. cells representing pseudo particles Perlen der Informatik II, Hans-Joachim Bungartz page 37 of 47

38 several variants concerning the determination of the parameter diam Barnes-Hut method can be interpreted as a special case of the Taylor approximation discussed above for p = 0 (for the calculation of the pseudo particles, we just sum up masses, i.e. zero-th moments) Perlen der Informatik II, Hans-Joachim Bungartz page 38 of 47

39 Example of the Tree Construction and Force Calculation x i x i Perlen der Informatik II, Hans-Joachim Bungartz page 39 of 47

40 Accuracy and Complexity accuracy: depends on control parameter θ the smaller we choose θ, the larger gets the near-field and the smaller, hence, gets the error resulting from the corresponding far-field approximation however, slow O(θ) convergence due to p = 0 complexity: increases for decreasing θ for roughly homogeneous particle distributions: number of active cells bounded by C log N/θ 3 for some constant C and N particles overall cost of order O(θ 3 N log N) for limit case θ 0: method degenerates to the quadratic complexity of the original each-with-each approach without far-field approximation Perlen der Informatik II, Hans-Joachim Bungartz page 40 of 47

41 Some Remarks on the Implementation use a standard data structure for each (pseudo) particle standard tree implementation with pointers: each node contains one (pseudo) particle at most subdomain corresponding to a node/cell can be either stored explicitly or calculated on-the-fly during a tree traversal four (2D) or eight (3D) pointers from a node to its sons linearisation is possible, too (i.e. no need for pointers) tree traversal typically by recursion pre-order (top-down) post-order (bottom-up) tree construction: successive insertion of particles starting from the empty tree (i.e. the octree consisting of the root node only) refine, when necessary (i.e. two particles in one node) Perlen der Informatik II, Hans-Joachim Bungartz page 41 of 47

42 Some Remarks on the Implementation (2) calculation of pseudo particles: bottom-up (post-order) traversal sum of masses, weighted average of positions as described before calculation of forces: outer loop: traversal visiting each leaf (to get the far-field approximation for each particle) inner loop: top-down (pre-order) traversal until the θ-rule is fulfilled parameter diam on-the-fly time integration: essentially as before, now as another tree traversal motion: either via constructing a new octree or via modifying the existing octree (generally preferred) Perlen der Informatik II, Hans-Joachim Bungartz page 42 of 47

43 The Fast-Multipole Method Main Idea idea of Barnes-Hut: replace particle-particle interactions by interactions between particles and pseudo particles effect is an improved complexity: O(N log N) instead of O(N 2 ) for almost homogeneous particle distributions, and still a significant effect otherwise next step now: avoid calculation of interactions with remote pseudo particles on the origin side for each particle on the effect side individually instead, combine particles to pseudo particles on the effect side also and calculate interactions of pseudo particles with remote pseudo particles, only cells representing pseudo particles with their contained particles are called clusters from such a clucter-cluster interaction, the particle-cluster interactions can be derived for all of the cluster s particles if done properly, this fast multipole method provides a linear O(N) complexity Perlen der Informatik II, Hans-Joachim Bungartz page 43 of 47

44 Fundamentals starting point again: Taylor expansion of the kernel G(x, y), but now in x and y around x l 0 Ω l and y i 0 Ω i: G(x, y) = k 1 p j 1 p 1 k!j! (x x l 0) k (y y i 0) j G k,j (x l 0, y i 0) + ˆR p (x, y) leads to expansion (and approximation) of the potential (here: the part due to the subdomain Ω i ): Φ i (x) k 1 p 1 k! (x x 0) l k (moments defined as before) j 1 p 1 j! G k,j(x l 0, y i 0)M j (Ω i, y i 0) Perlen der Informatik II, Hans-Joachim Bungartz page 44 of 47

45 hence: interaction between Ω i and Ω l is calculated once and can, afterwards, be used for deriving the particle-cluster interactions for all x Ω l (using suitable transformation rules) this principle is now applied hierarchically or recursively: first, calculate interactions between large clusters these are inherited to the son level, where now the resulting interactions are determined finally, get particle-x interactions in the leaves Perlen der Informatik II, Hans-Joachim Bungartz page 45 of 47

46 Barnes-Hut vs. Fast Multipole Comparison x i 0 y x l 0 i 0 y Perlen der Informatik II, Hans-Joachim Bungartz page 46 of 47

47 Subdivision instead of a recursive subdivision of Ω, we have to do it for Ω Ω now same octree structure for origin and effect side typically, same midpoints : x i 0 = y i 0 Ω i (again, geometrical midpoint or centre of gravity) now, also a double θ-criterion: x x l 0 x l 0 y i 0 θ and y y i 0 x l 0 y i 0 θ x Ω l, y Ω i hierarchy can be used for an efficient calculation of the interactions (for details, we refer to the literature) accuracy: as before, an O(θ p+1 )-characteristics can be shown for the relative error parallelisation: only slightly more complicated than for Barnes-Hut Perlen der Informatik II, Hans-Joachim Bungartz page 47 of 47

CSC 391/691: GPU Programming Fall N-Body Problem. Copyright 2011 Samuel S. Cho

CSC 391/691: GPU Programming Fall N-Body Problem. Copyright 2011 Samuel S. Cho CSC 391/691: GPU Programming Fall 2011 N-Body Problem Copyright 2011 Samuel S. Cho Introduction Many physical phenomena can be simulated with a particle system where each particle interacts with all other

More information

8. Hardware-Aware Numerics. Approaching supercomputing...

8. Hardware-Aware Numerics. Approaching supercomputing... Approaching supercomputing... Numerisches Programmieren, Hans-Joachim Bungartz page 1 of 48 8.1. Hardware-Awareness Introduction Since numerical algorithms are ubiquitous, they have to run on a broad spectrum

More information

8. Hardware-Aware Numerics. Approaching supercomputing...

8. Hardware-Aware Numerics. Approaching supercomputing... Approaching supercomputing... Numerisches Programmieren, Hans-Joachim Bungartz page 1 of 22 8.1. Hardware-Awareness Introduction Since numerical algorithms are ubiquitous, they have to run on a broad spectrum

More information

CS 542G: Barnes-Hut. Robert Bridson. November 5, 2008

CS 542G: Barnes-Hut. Robert Bridson. November 5, 2008 CS 54G: Barnes-Hut Robert Bridson November 5, 008 The Gravitational Potential While it s perfectly possible to derive all the following at the level of forces, it can be both convenient and useful to re-express

More information

A Kernel-independent Adaptive Fast Multipole Method

A Kernel-independent Adaptive Fast Multipole Method A Kernel-independent Adaptive Fast Multipole Method Lexing Ying Caltech Joint work with George Biros and Denis Zorin Problem Statement Given G an elliptic PDE kernel, e.g. {x i } points in {φ i } charges

More information

ExaFMM. Fast multipole method software aiming for exascale systems. User's Manual. Rio Yokota, L. A. Barba. November Revision 1

ExaFMM. Fast multipole method software aiming for exascale systems. User's Manual. Rio Yokota, L. A. Barba. November Revision 1 ExaFMM Fast multipole method software aiming for exascale systems User's Manual Rio Yokota, L. A. Barba November 2011 --- Revision 1 ExaFMM User's Manual i Revision History Name Date Notes Rio Yokota,

More information

1. Mathematical Modelling

1. Mathematical Modelling 1. describe a given problem with some mathematical formalism in order to get a formal and precise description see fundamental properties due to the abstraction allow a systematic treatment and, thus, solution

More information

Spatial Data Structures

Spatial Data Structures CSCI 420 Computer Graphics Lecture 17 Spatial Data Structures Jernej Barbic University of Southern California Hierarchical Bounding Volumes Regular Grids Octrees BSP Trees [Angel Ch. 8] 1 Ray Tracing Acceleration

More information

Spatial Data Structures

Spatial Data Structures CSCI 480 Computer Graphics Lecture 7 Spatial Data Structures Hierarchical Bounding Volumes Regular Grids BSP Trees [Ch. 0.] March 8, 0 Jernej Barbic University of Southern California http://www-bcf.usc.edu/~jbarbic/cs480-s/

More information

Fast methods for N-body problems

Fast methods for N-body problems CME342- Parallel Methods in Numerical Analysis Fast methods for N-body problems May 21, 2014 Lecture 13 Outline Motivation: N-body problem is O(N^2) Data structures: quad tree and oct tree (ADT?) The Barnes-Hut

More information

Intermediate Parallel Programming & Cluster Computing

Intermediate Parallel Programming & Cluster Computing High Performance Computing Modernization Program (HPCMP) Summer 2011 Puerto Rico Workshop on Intermediate Parallel Programming & Cluster Computing in conjunction with the National Computational Science

More information

Ray Tracing Acceleration Data Structures

Ray Tracing Acceleration Data Structures Ray Tracing Acceleration Data Structures Sumair Ahmed October 29, 2009 Ray Tracing is very time-consuming because of the ray-object intersection calculations. With the brute force method, each ray has

More information

Spatial Data Structures

Spatial Data Structures 15-462 Computer Graphics I Lecture 17 Spatial Data Structures Hierarchical Bounding Volumes Regular Grids Octrees BSP Trees Constructive Solid Geometry (CSG) April 1, 2003 [Angel 9.10] Frank Pfenning Carnegie

More information

Using Adaptive Sparse Grids to Solve High-Dimensional Dynamic Models. J. Brumm & S. Scheidegger BFI, University of Chicago, Nov.

Using Adaptive Sparse Grids to Solve High-Dimensional Dynamic Models. J. Brumm & S. Scheidegger BFI, University of Chicago, Nov. Using Adaptive Sparse Grids to Solve High-Dimensional Dynamic Models J. Brumm & S. Scheidegger, Nov. 1st 2013 Outline I.) From Full (Cartesian) Grids to Sparse Grids II.) Adaptive Sparse Grids III.) Time

More information

1.2 Numerical Solutions of Flow Problems

1.2 Numerical Solutions of Flow Problems 1.2 Numerical Solutions of Flow Problems DIFFERENTIAL EQUATIONS OF MOTION FOR A SIMPLIFIED FLOW PROBLEM Continuity equation for incompressible flow: 0 Momentum (Navier-Stokes) equations for a Newtonian

More information

Adaptive-Mesh-Refinement Pattern

Adaptive-Mesh-Refinement Pattern Adaptive-Mesh-Refinement Pattern I. Problem Data-parallelism is exposed on a geometric mesh structure (either irregular or regular), where each point iteratively communicates with nearby neighboring points

More information

WETTING PROPERTIES OF STRUCTURED INTERFACES COMPOSED OF SURFACE-ATTACHED SPHERICAL NANOPARTICLES

WETTING PROPERTIES OF STRUCTURED INTERFACES COMPOSED OF SURFACE-ATTACHED SPHERICAL NANOPARTICLES November 20, 2018 WETTING PROPERTIES OF STRUCTURED INTERFACES COMPOSED OF SURFACE-ATTACHED SPHERICAL NANOPARTICLES Bishal Bhattarai and Nikolai V. Priezjev Department of Mechanical and Materials Engineering

More information

Spatial Data Structures

Spatial Data Structures 15-462 Computer Graphics I Lecture 17 Spatial Data Structures Hierarchical Bounding Volumes Regular Grids Octrees BSP Trees Constructive Solid Geometry (CSG) March 28, 2002 [Angel 8.9] Frank Pfenning Carnegie

More information

Lecture 4: Locality and parallelism in simulation I

Lecture 4: Locality and parallelism in simulation I Lecture 4: Locality and parallelism in simulation I David Bindel 6 Sep 2011 Logistics Distributed memory machines Each node has local memory... and no direct access to memory on other nodes Nodes communicate

More information

PSE Molekulardynamik

PSE Molekulardynamik OpenMP, Multicore Architectures, PAPI 21.12.2012 Outline Schedule Presentations: Worksheet 4 OpenMP Multicore Architectures PAPI Membrane, Crystallization Preparation: Worksheet 5 2 Schedule (big meetings)

More information

Spatial Data Structures

Spatial Data Structures Spatial Data Structures Hierarchical Bounding Volumes Regular Grids Octrees BSP Trees Constructive Solid Geometry (CSG) [Angel 9.10] Outline Ray tracing review what rays matter? Ray tracing speedup faster

More information

Level Set Method in a Finite Element Setting

Level Set Method in a Finite Element Setting Level Set Method in a Finite Element Setting John Shopple University of California, San Diego November 6, 2007 Outline 1 Level Set Method 2 Solute-Solvent Model 3 Reinitialization 4 Conclusion Types of

More information

CHAPTER 3. Elementary Fluid Dynamics

CHAPTER 3. Elementary Fluid Dynamics CHAPTER 3. Elementary Fluid Dynamics - Understanding the physics of fluid in motion - Derivation of the Bernoulli equation from Newton s second law Basic Assumptions of fluid stream, unless a specific

More information

CSG obj. oper3. obj1 obj2 obj3. obj5. obj4

CSG obj. oper3. obj1 obj2 obj3. obj5. obj4 Solid Modeling Solid: Boundary + Interior Volume occupied by geometry Solid representation schemes Constructive Solid Geometry (CSG) Boundary representations (B-reps) Space-partition representations Operations

More information

Partitioning and Divide-and-Conquer Strategies

Partitioning and Divide-and-Conquer Strategies Chapter 4 Partitioning and Divide-and-Conquer Strategies Slides for Parallel Programming Techniques & Applications Using Networked Workstations & Parallel Computers 2nd ed., by B. Wilkinson & M. Allen,

More information

Molecular Surface Representation

Molecular Surface Representation Molecular Surface Representation Applications to docking 1 Motivation Studies of protein folding (Lee and Richards). Prediction of biomolecular recognition. Detection of drug binding cavities. Molecular

More information

Announcements. Written Assignment2 is out, due March 8 Graded Programming Assignment2 next Tuesday

Announcements. Written Assignment2 is out, due March 8 Graded Programming Assignment2 next Tuesday Announcements Written Assignment2 is out, due March 8 Graded Programming Assignment2 next Tuesday 1 Spatial Data Structures Hierarchical Bounding Volumes Grids Octrees BSP Trees 11/7/02 Speeding Up Computations

More information

The jello cube. Undeformed cube. Deformed cube

The jello cube. Undeformed cube. Deformed cube The Jello Cube Assignment 1, CSCI 520 Jernej Barbic, USC Undeformed cube The jello cube Deformed cube The jello cube is elastic, Can be bent, stretched, squeezed,, Without external forces, it eventually

More information

Instance-based Learning CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2015

Instance-based Learning CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2015 Instance-based Learning CE-717: Machine Learning Sharif University of Technology M. Soleymani Fall 2015 Outline Non-parametric approach Unsupervised: Non-parametric density estimation Parzen Windows K-Nearest

More information

Cloth Simulation. Tanja Munz. Master of Science Computer Animation and Visual Effects. CGI Techniques Report

Cloth Simulation. Tanja Munz. Master of Science Computer Animation and Visual Effects. CGI Techniques Report Cloth Simulation CGI Techniques Report Tanja Munz Master of Science Computer Animation and Visual Effects 21st November, 2014 Abstract Cloth simulation is a wide and popular area of research. First papers

More information

Fast Multipole Method on the GPU

Fast Multipole Method on the GPU Fast Multipole Method on the GPU with application to the Adaptive Vortex Method University of Bristol, Bristol, United Kingdom. 1 Introduction Particle methods Highly parallel Computational intensive Numerical

More information

HPC Algorithms and Applications

HPC Algorithms and Applications HPC Algorithms and Applications Dwarf #5 Structured Grids Michael Bader Winter 2012/2013 Dwarf #5 Structured Grids, Winter 2012/2013 1 Dwarf #5 Structured Grids 1. dense linear algebra 2. sparse linear

More information

2.7 Cloth Animation. Jacobs University Visualization and Computer Graphics Lab : Advanced Graphics - Chapter 2 123

2.7 Cloth Animation. Jacobs University Visualization and Computer Graphics Lab : Advanced Graphics - Chapter 2 123 2.7 Cloth Animation 320491: Advanced Graphics - Chapter 2 123 Example: Cloth draping Image Michael Kass 320491: Advanced Graphics - Chapter 2 124 Cloth using mass-spring model Network of masses and springs

More information

Transactions on Modelling and Simulation vol 20, 1998 WIT Press, ISSN X

Transactions on Modelling and Simulation vol 20, 1998 WIT Press,   ISSN X Parallel indirect multipole BEM analysis of Stokes flow in a multiply connected domain M.S. Ingber*, A.A. Mammoli* & J.S. Warsa* "Department of Mechanical Engineering, University of New Mexico, Albuquerque,

More information

The Jello Cube Assignment 1, CSCI 520. Jernej Barbic, USC

The Jello Cube Assignment 1, CSCI 520. Jernej Barbic, USC The Jello Cube Assignment 1, CSCI 520 Jernej Barbic, USC 1 The jello cube Undeformed cube Deformed cube The jello cube is elastic, Can be bent, stretched, squeezed,, Without external forces, it eventually

More information

Space Filling Curves and Hierarchical Basis. Klaus Speer

Space Filling Curves and Hierarchical Basis. Klaus Speer Space Filling Curves and Hierarchical Basis Klaus Speer Abstract Real world phenomena can be best described using differential equations. After linearisation we have to deal with huge linear systems of

More information

Numerical Methods for (Time-Dependent) HJ PDEs

Numerical Methods for (Time-Dependent) HJ PDEs Numerical Methods for (Time-Dependent) HJ PDEs Ian Mitchell Department of Computer Science The University of British Columbia research supported by National Science and Engineering Research Council of

More information

Driven Cavity Example

Driven Cavity Example BMAppendixI.qxd 11/14/12 6:55 PM Page I-1 I CFD Driven Cavity Example I.1 Problem One of the classic benchmarks in CFD is the driven cavity problem. Consider steady, incompressible, viscous flow in a square

More information

Partitioning and Divide-and-Conquer Strategies

Partitioning and Divide-and-Conquer Strategies Partitioning and Divide-and-Conquer Strategies Chapter 4 slides4-1 Partitioning Partitioning simply divides the problem into parts. Divide and Conquer Characterized by dividing problem into subproblems

More information

Bumptrees for Efficient Function, Constraint, and Classification Learning

Bumptrees for Efficient Function, Constraint, and Classification Learning umptrees for Efficient Function, Constraint, and Classification Learning Stephen M. Omohundro International Computer Science Institute 1947 Center Street, Suite 600 erkeley, California 94704 Abstract A

More information

Ordinary differential equations solving methods

Ordinary differential equations solving methods Radim Hošek, A07237 radhost@students.zcu.cz Ordinary differential equations solving methods Problem: y = y2 (1) y = x y (2) y = sin ( + y 2 ) (3) Where it is possible we try to solve the equations analytically,

More information

ELEC Dr Reji Mathew Electrical Engineering UNSW

ELEC Dr Reji Mathew Electrical Engineering UNSW ELEC 4622 Dr Reji Mathew Electrical Engineering UNSW Review of Motion Modelling and Estimation Introduction to Motion Modelling & Estimation Forward Motion Backward Motion Block Motion Estimation Motion

More information

Don t just read it; fight it! Ask your own questions, look for your own examples, discover your own proofs. Is the hypothesis necessary?

Don t just read it; fight it! Ask your own questions, look for your own examples, discover your own proofs. Is the hypothesis necessary? Don t just read it; fight it! Ask your own questions, look for your own examples, discover your own proofs. Is the hypothesis necessary? Is the converse true? What happens in the classical special case?

More information

A General Greedy Approximation Algorithm with Applications

A General Greedy Approximation Algorithm with Applications A General Greedy Approximation Algorithm with Applications Tong Zhang IBM T.J. Watson Research Center Yorktown Heights, NY 10598 tzhang@watson.ibm.com Abstract Greedy approximation algorithms have been

More information

Efficient computation of source magnetic scalar potential

Efficient computation of source magnetic scalar potential Adv. Radio Sci., 4, 59 63, 2006 Author(s) 2006. This work is licensed under a Creative Commons License. Advances in Radio Science Efficient computation of source magnetic scalar potential W. Hafla, A.

More information

SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND

SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND Student Submission for the 5 th OpenFOAM User Conference 2017, Wiesbaden - Germany: SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND TESSA UROIĆ Faculty of Mechanical Engineering and Naval Architecture, Ivana

More information

Conquer Strategies. Partitioning and Divide-and- Partitioning Strategies + + Using separate send()s and recv()s. Slave

Conquer Strategies. Partitioning and Divide-and- Partitioning Strategies + + Using separate send()s and recv()s. Slave Partitioning and Divide-and- Conquer Strategies Partitioning Strategies Partitioning simply divides the problem into parts Example - Adding a sequence of numbers We might consider dividing the sequence

More information

FORSCHUNGSZENTRUM JÜLICH GmbH Jülich Supercomputing Centre D Jülich, Tel. (02461)

FORSCHUNGSZENTRUM JÜLICH GmbH Jülich Supercomputing Centre D Jülich, Tel. (02461) FORSCHUNGSZENTRUM JÜLICH GmbH Jülich Supercomputing Centre D-52425 Jülich, Tel. (02461) 61-6402 Technical Report Benchmark of fast Coulomb Solvers for open and periodic boundary conditions Sebastian Krumscheid

More information

The Fast Multipole Method on NVIDIA GPUs and Multicore Processors

The Fast Multipole Method on NVIDIA GPUs and Multicore Processors The Fast Multipole Method on NVIDIA GPUs and Multicore Processors Toru Takahashi, a Cris Cecka, b Eric Darve c a b c Department of Mechanical Science and Engineering, Nagoya University Institute for Applied

More information

1 Proximity via Graph Spanners

1 Proximity via Graph Spanners CS273: Algorithms for Structure Handout # 11 and Motion in Biology Stanford University Tuesday, 4 May 2003 Lecture #11: 4 May 2004 Topics: Proximity via Graph Spanners Geometric Models of Molecules, I

More information

Multi-Domain Pattern. I. Problem. II. Driving Forces. III. Solution

Multi-Domain Pattern. I. Problem. II. Driving Forces. III. Solution Multi-Domain Pattern I. Problem The problem represents computations characterized by an underlying system of mathematical equations, often simulating behaviors of physical objects through discrete time

More information

Parallel and Distributed Systems Lab.

Parallel and Distributed Systems Lab. Parallel and Distributed Systems Lab. Department of Computer Sciences Purdue University. Jie Chi, Ronaldo Ferreira, Ananth Grama, Tzvetan Horozov, Ioannis Ioannidis, Mehmet Koyuturk, Shan Lei, Robert Light,

More information

Particle-based simulations in Astrophysics

Particle-based simulations in Astrophysics Particle-based simulations in Astrophysics Jun Makino Particle Simulator Research Team, AICS/ Earth-Life Science Institute(ELSI), Tokyo Institute of Technology Feb 28, 2013 3rd AICS International Symposium

More information

CS205b/CME306. Lecture 9

CS205b/CME306. Lecture 9 CS205b/CME306 Lecture 9 1 Convection Supplementary Reading: Osher and Fedkiw, Sections 3.3 and 3.5; Leveque, Sections 6.7, 8.3, 10.2, 10.4. For a reference on Newton polynomial interpolation via divided

More information

Continuum-Microscopic Models

Continuum-Microscopic Models Scientific Computing and Numerical Analysis Seminar October 1, 2010 Outline Heterogeneous Multiscale Method Adaptive Mesh ad Algorithm Refinement Equation-Free Method Incorporates two scales (length, time

More information

ON THE VELOCITY OF A WEIGHTED CYLINDER DOWN AN INCLINED PLANE

ON THE VELOCITY OF A WEIGHTED CYLINDER DOWN AN INCLINED PLANE ON THE VELOCITY OF A WEIGHTED CYLINDER DOWN AN INCLINED PLANE Raghav Grover and Aneesh Agarwal RG (Grade 12 High School), AA (Grade 11 High School) Department of Physics, The Doon School, Dehradun. raghav.503.2019@doonschool.com,

More information

A Low Level Introduction to High Dimensional Sparse Grids

A Low Level Introduction to High Dimensional Sparse Grids A Low Level Introduction to High Dimensional Sparse Grids http://people.sc.fsu.edu/ jburkardt/presentations/sandia 2007.pdf... John 1 Clayton Webster 2 1 Virginia Tech 2 Sandia National Laboratory. 21

More information

Algorithms. Algorithms GEOMETRIC APPLICATIONS OF BSTS. 1d range search line segment intersection kd trees interval search trees rectangle intersection

Algorithms. Algorithms GEOMETRIC APPLICATIONS OF BSTS. 1d range search line segment intersection kd trees interval search trees rectangle intersection Algorithms ROBERT SEDGEWICK KEVIN WAYNE GEOMETRIC APPLICATIONS OF BSTS Algorithms F O U R T H E D I T I O N ROBERT SEDGEWICK KEVIN WAYNE 1d range search line segment intersection kd trees interval search

More information

Partitioning and Divide-and- Conquer Strategies

Partitioning and Divide-and- Conquer Strategies Partitioning and Divide-and- Conquer Strategies Partitioning Strategies Partitioning simply divides the problem into parts Example - Adding a sequence of numbers We might consider dividing the sequence

More information

Algorithms. Algorithms GEOMETRIC APPLICATIONS OF BSTS. 1d range search line segment intersection kd trees interval search trees rectangle intersection

Algorithms. Algorithms GEOMETRIC APPLICATIONS OF BSTS. 1d range search line segment intersection kd trees interval search trees rectangle intersection Algorithms ROBERT SEDGEWICK KEVIN WAYNE GEOMETRIC APPLICATIONS OF BSTS Algorithms F O U R T H E D I T I O N ROBERT SEDGEWICK KEVIN WAYNE 1d range search line segment intersection kd trees interval search

More information

Intersection Acceleration

Intersection Acceleration Advanced Computer Graphics Intersection Acceleration Matthias Teschner Computer Science Department University of Freiburg Outline introduction bounding volume hierarchies uniform grids kd-trees octrees

More information

Scientific Applications. Chao Sun

Scientific Applications. Chao Sun Large Scale Multiprocessors And Scientific Applications Zhou Li Chao Sun Contents Introduction Interprocessor Communication: The Critical Performance Issue Characteristics of Scientific Applications Synchronization:

More information

Surfaces and Integral Curves

Surfaces and Integral Curves MODULE 1: MATHEMATICAL PRELIMINARIES 16 Lecture 3 Surfaces and Integral Curves In Lecture 3, we recall some geometrical concepts that are essential for understanding the nature of solutions of partial

More information

Algorithms. Algorithms GEOMETRIC APPLICATIONS OF BSTS. 1d range search line segment intersection kd trees interval search trees rectangle intersection

Algorithms. Algorithms GEOMETRIC APPLICATIONS OF BSTS. 1d range search line segment intersection kd trees interval search trees rectangle intersection Algorithms ROBERT SEDGEWICK KEVIN WAYNE GEOMETRIC APPLICATIONS OF BSTS Algorithms F O U R T H E D I T I O N ROBERT SEDGEWICK KEVIN WAYNE 1d range search line segment intersection kd trees interval search

More information

ECE276B: Planning & Learning in Robotics Lecture 5: Configuration Space

ECE276B: Planning & Learning in Robotics Lecture 5: Configuration Space ECE276B: Planning & Learning in Robotics Lecture 5: Configuration Space Lecturer: Nikolay Atanasov: natanasov@ucsd.edu Teaching Assistants: Tianyu Wang: tiw161@eng.ucsd.edu Yongxi Lu: yol070@eng.ucsd.edu

More information

Introduction to Computer Graphics. Animation (2) May 26, 2016 Kenshi Takayama

Introduction to Computer Graphics. Animation (2) May 26, 2016 Kenshi Takayama Introduction to Computer Graphics Animation (2) May 26, 2016 Kenshi Takayama Physically-based deformations 2 Simple example: single mass & spring in 1D Mass m, position x, spring coefficient k, rest length

More information

Bounded-Error Compression of Particle Data from Hierarchical Approximate Methods

Bounded-Error Compression of Particle Data from Hierarchical Approximate Methods Bounded-Error Compression of Particle Data from Hierarchical Approximate Methods Dow-Yung Yang and Ananth Grama ( Primary Author is a Student) Computer Science Department, Purdue University, West Lafayette,

More information

FEMLAB Exercise 1 for ChE366

FEMLAB Exercise 1 for ChE366 FEMLAB Exercise 1 for ChE366 Problem statement Consider a spherical particle of radius r s moving with constant velocity U in an infinitely long cylinder of radius R that contains a Newtonian fluid. Let

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction ME 475: Computer-Aided Design of Structures 1-1 CHAPTER 1 Introduction 1.1 Analysis versus Design 1.2 Basic Steps in Analysis 1.3 What is the Finite Element Method? 1.4 Geometrical Representation, Discretization

More information

Solid Modeling. Thomas Funkhouser Princeton University C0S 426, Fall Represent solid interiors of objects

Solid Modeling. Thomas Funkhouser Princeton University C0S 426, Fall Represent solid interiors of objects Solid Modeling Thomas Funkhouser Princeton University C0S 426, Fall 2000 Solid Modeling Represent solid interiors of objects Surface may not be described explicitly Visible Human (National Library of Medicine)

More information

Columbus State Community College Mathematics Department Public Syllabus. Course and Number: MATH 1172 Engineering Mathematics A

Columbus State Community College Mathematics Department Public Syllabus. Course and Number: MATH 1172 Engineering Mathematics A Columbus State Community College Mathematics Department Public Syllabus Course and Number: MATH 1172 Engineering Mathematics A CREDITS: 5 CLASS HOURS PER WEEK: 5 PREREQUISITES: MATH 1151 with a C or higher

More information

3D vector fields. Contents. Introduction 3D vector field topology Representation of particle lines. 3D LIC Combining different techniques

3D vector fields. Contents. Introduction 3D vector field topology Representation of particle lines. 3D LIC Combining different techniques 3D vector fields Scientific Visualization (Part 9) PD Dr.-Ing. Peter Hastreiter Contents Introduction 3D vector field topology Representation of particle lines Path lines Ribbons Balls Tubes Stream tetrahedra

More information

CFD MODELING FOR PNEUMATIC CONVEYING

CFD MODELING FOR PNEUMATIC CONVEYING CFD MODELING FOR PNEUMATIC CONVEYING Arvind Kumar 1, D.R. Kaushal 2, Navneet Kumar 3 1 Associate Professor YMCAUST, Faridabad 2 Associate Professor, IIT, Delhi 3 Research Scholar IIT, Delhi e-mail: arvindeem@yahoo.co.in

More information

Organizing Spatial Data

Organizing Spatial Data Organizing Spatial Data Spatial data records include a sense of location as an attribute. Typically location is represented by coordinate data (in 2D or 3D). 1 If we are to search spatial data using the

More information

PTE 519 Lecture Note Finite Difference Approximation (Model)

PTE 519 Lecture Note Finite Difference Approximation (Model) PTE 519 Lecture Note 3 3.0 Finite Difference Approximation (Model) In this section of the lecture material, the focus is to define the terminology and to summarize the basic facts. The basic idea of any

More information

A Singular Example for the Averaged Mean Curvature Flow

A Singular Example for the Averaged Mean Curvature Flow To appear in Experimental Mathematics Preprint Vol. No. () pp. 3 7 February 9, A Singular Example for the Averaged Mean Curvature Flow Uwe F. Mayer Abstract An embedded curve is presented which under numerical

More information

Fluent User Services Center

Fluent User Services Center Solver Settings 5-1 Using the Solver Setting Solver Parameters Convergence Definition Monitoring Stability Accelerating Convergence Accuracy Grid Independence Adaption Appendix: Background Finite Volume

More information

Partitioning and Divide-and-Conquer Strategies

Partitioning and Divide-and-Conquer Strategies Chapter 4 Slide 125 Partitioning and Divide-and-Conquer Strategies Slide 126 Partitioning Partitioning simply divides the problem into parts. Divide and Conquer Characterized by dividing problem into subproblems

More information

PARALLEL SIMULATION OF A FLUID FLOW BY MEANS OF THE SPH METHOD: OPENMP VS. MPI COMPARISON. Pawe l Wróblewski, Krzysztof Boryczko

PARALLEL SIMULATION OF A FLUID FLOW BY MEANS OF THE SPH METHOD: OPENMP VS. MPI COMPARISON. Pawe l Wróblewski, Krzysztof Boryczko Computing and Informatics, Vol. 28, 2009, 139 150 PARALLEL SIMULATION OF A FLUID FLOW BY MEANS OF THE SPH METHOD: OPENMP VS. MPI COMPARISON Pawe l Wróblewski, Krzysztof Boryczko Department of Computer

More information

Mass-Spring Systems. Last Time?

Mass-Spring Systems. Last Time? Mass-Spring Systems Last Time? Implicit Surfaces & Marching Cubes/Tetras Collision Detection & Conservative Bounding Regions Spatial Acceleration Data Structures Octree, k-d tree, BSF tree 1 Today Particle

More information

Ray Tracing with Spatial Hierarchies. Jeff Mahovsky & Brian Wyvill CSC 305

Ray Tracing with Spatial Hierarchies. Jeff Mahovsky & Brian Wyvill CSC 305 Ray Tracing with Spatial Hierarchies Jeff Mahovsky & Brian Wyvill CSC 305 Ray Tracing Flexible, accurate, high-quality rendering Slow Simplest ray tracer: Test every ray against every object in the scene

More information

Domain Decomposition for Colloid Clusters. Pedro Fernando Gómez Fernández

Domain Decomposition for Colloid Clusters. Pedro Fernando Gómez Fernández Domain Decomposition for Colloid Clusters Pedro Fernando Gómez Fernández MSc in High Performance Computing The University of Edinburgh Year of Presentation: 2004 Authorship declaration I, Pedro Fernando

More information

Solving the N-Body Problem with the ALiCE Grid System

Solving the N-Body Problem with the ALiCE Grid System Solving the N-Body Problem with the ALiCE Grid System Dac Phuong Ho 1, Yong Meng Teo 2 and Johan Prawira Gozali 2 1 Department of Computer Network, Vietnam National University of Hanoi 144 Xuan Thuy Street,

More information

Center for Computational Science

Center for Computational Science Center for Computational Science Toward GPU-accelerated meshfree fluids simulation using the fast multipole method Lorena A Barba Boston University Department of Mechanical Engineering with: Felipe Cruz,

More information

AP Calculus BC Course Description

AP Calculus BC Course Description AP Calculus BC Course Description COURSE OUTLINE: The following topics define the AP Calculus BC course as it is taught over three trimesters, each consisting of twelve week grading periods. Limits and

More information

arxiv: v1 [math.na] 20 Sep 2016

arxiv: v1 [math.na] 20 Sep 2016 arxiv:1609.06236v1 [math.na] 20 Sep 2016 A Local Mesh Modification Strategy for Interface Problems with Application to Shape and Topology Optimization P. Gangl 1,2 and U. Langer 3 1 Doctoral Program Comp.

More information

Recent Developments in Model-based Derivative-free Optimization

Recent Developments in Model-based Derivative-free Optimization Recent Developments in Model-based Derivative-free Optimization Seppo Pulkkinen April 23, 2010 Introduction Problem definition The problem we are considering is a nonlinear optimization problem with constraints:

More information

Solution for Euler Equations Lagrangian and Eulerian Descriptions

Solution for Euler Equations Lagrangian and Eulerian Descriptions Solution for Euler Equations Lagrangian and Eulerian Descriptions Valdir Monteiro dos Santos Godoi valdir.msgodoi@gmail.com Abstract We find an exact solution for the system of Euler equations, following

More information

SPH: Why and what for?

SPH: Why and what for? SPH: Why and what for? 4 th SPHERIC training day David Le Touzé, Fluid Mechanics Laboratory, Ecole Centrale de Nantes / CNRS SPH What for and why? How it works? Why not for everything? Duality of SPH SPH

More information

COMPUTER VISION > OPTICAL FLOW UTRECHT UNIVERSITY RONALD POPPE

COMPUTER VISION > OPTICAL FLOW UTRECHT UNIVERSITY RONALD POPPE COMPUTER VISION 2017-2018 > OPTICAL FLOW UTRECHT UNIVERSITY RONALD POPPE OUTLINE Optical flow Lucas-Kanade Horn-Schunck Applications of optical flow Optical flow tracking Histograms of oriented flow Assignment

More information

Chapter 6. Curves and Surfaces. 6.1 Graphs as Surfaces

Chapter 6. Curves and Surfaces. 6.1 Graphs as Surfaces Chapter 6 Curves and Surfaces In Chapter 2 a plane is defined as the zero set of a linear function in R 3. It is expected a surface is the zero set of a differentiable function in R n. To motivate, graphs

More information

Ray Tracing III. Wen-Chieh (Steve) Lin National Chiao-Tung University

Ray Tracing III. Wen-Chieh (Steve) Lin National Chiao-Tung University Ray Tracing III Wen-Chieh (Steve) Lin National Chiao-Tung University Shirley, Fundamentals of Computer Graphics, Chap 10 Doug James CG slides, I-Chen Lin s CG slides Ray-tracing Review For each pixel,

More information

Data Structures for Approximate Proximity and Range Searching

Data Structures for Approximate Proximity and Range Searching Data Structures for Approximate Proximity and Range Searching David M. Mount University of Maryland Joint work with: Sunil Arya (Hong Kong U. of Sci. and Tech) Charis Malamatos (Max Plank Inst.) 1 Introduction

More information

NOISE PROPAGATION FROM VIBRATING STRUCTURES

NOISE PROPAGATION FROM VIBRATING STRUCTURES NOISE PROPAGATION FROM VIBRATING STRUCTURES Abstract R. Helfrich, M. Spriegel (INTES GmbH, Germany) Noise and noise exposure are becoming more important in product development due to environmental legislation.

More information

Lab 9: FLUENT: Transient Natural Convection Between Concentric Cylinders

Lab 9: FLUENT: Transient Natural Convection Between Concentric Cylinders Lab 9: FLUENT: Transient Natural Convection Between Concentric Cylinders Objective: The objective of this laboratory is to introduce how to use FLUENT to solve both transient and natural convection problems.

More information

Backcasting: A New Approach to Energy Conservation in Sensor Networks

Backcasting: A New Approach to Energy Conservation in Sensor Networks Backcasting: A New Approach to Energy Conservation in Sensor Networks Rebecca Willett 1, Aline Martin 2, and Robert Nowak 2 1 Rice University, Houston, TX 77005, USA, willett@rice.edu, WWW home page: http://www.ece.rice.edu/

More information

Convex Optimization. 2. Convex Sets. Prof. Ying Cui. Department of Electrical Engineering Shanghai Jiao Tong University. SJTU Ying Cui 1 / 33

Convex Optimization. 2. Convex Sets. Prof. Ying Cui. Department of Electrical Engineering Shanghai Jiao Tong University. SJTU Ying Cui 1 / 33 Convex Optimization 2. Convex Sets Prof. Ying Cui Department of Electrical Engineering Shanghai Jiao Tong University 2018 SJTU Ying Cui 1 / 33 Outline Affine and convex sets Some important examples Operations

More information

CS535 Fall Department of Computer Science Purdue University

CS535 Fall Department of Computer Science Purdue University Spatial Data Structures and Hierarchies CS535 Fall 2010 Daniel G Aliaga Daniel G. Aliaga Department of Computer Science Purdue University Spatial Data Structures Store geometric information Organize geometric

More information

Visualizing the inner structure of n-body data using skeletonization

Visualizing the inner structure of n-body data using skeletonization Rochester Institute of Technology RIT Scholar Works Presentations and other scholarship 2007 Visualizing the inner structure of n-body data using skeletonization Edward Dale Hans-Peter Bischof Follow this

More information

Parallelization of Scientific Applications (II)

Parallelization of Scientific Applications (II) Parallelization of Scientific Applications (II) Parallelization of Particle Based Methods Russian-German School on High Performance Computer Systems, June, 27 th until July, 6 th 2005, Novosibirsk 4. Day,

More information