A Moving Least Squares Material Point Method with Displacement Discontinuity and Two-Way Rigid Body Coupling (Supplementary Document)
|
|
- Jordan Warner
- 5 years ago
- Views:
Transcription
1 A Moving Least Squares Material Point Method with Dislacement Discontinuity and Two-Way Rigid Body Couling (Sulementary Document) Yuanming Hu Yu Fang Ziheng Ge Ziyin Qu Yixin Zhu Andre Pradhana Chenfanfu Jiang 1 Equivalence of APIC/PolyPIC and MLS on velocities 1.1 MLS with a linear olynomial basis In the simle case of a comlete linear olynomial basis (m = l = 1) for meshless solids, MLS is a 1 st -orderconsistent interolation scheme for scattered data and derivatives. With P(x i x) = [1, (x i x) T ] T, the reconstructed function value and its gradient estimation at x are [ ] û û = M (x)q T Ξ(x) u 1. u N, (1) where N is the total number of samle data oints, Ξ(x) is the diagonal weighting matrix with Ξ ii = ξ i (x), and Q(x) = [P(x 1 x),..., P(x N x)] T, M(x) = Q T ΞQ. 1.2 APIC and MLS APIC [Jiang et al., 2015] reresents the local velocity field near each article with v and matrix C. At time n, the momentum transfer m n i v n i = m N i (x n ) ( v n + C n (x i x n ) ) is equivalent to accumulating momentum contributions from article-wise local olynomial (affine) velocity fields to the grid. When we choose the weighting function ξ i (x) to be N i (x) where N i (x) is the B-sline kernel, grid-to-article transfer is exactly evaluating Eq. 1 after grid velocity is integrated from vi n to ˆv n+1 i. To see that, we first write down APIC transfers v n+1 C n+1 = = i ( i N i (x n )ˆv n+1 i, (2) N i (x n )ˆv n+1 i (x i x n ) T ) D, (3) where D = i N i(x n )(x i x n )(x i x n ) T. The equivalence to Eq. 1 relies on realizing that, in the MLS ersective, v n+1 is the reconstructed function value û and C n+1 is the reconstructed function gradient û. 1
2 In this case, the samle data {u 1,..., u N } in Eq. 1 corresonds to the velocities of all grid nodes influenced by article. Corresondingly we have Q T (x n ) = [ 1 (x 1 x n ) T 1 (x 2 x n ) T... ] and M(x n ) = i [ ξ i (x ) 1 (x i x n ) T (x i x n ) (x i x n )(x i x n ) T ]. (4) Substituting them into Eq. 1 and relacing ξ i with N i results in a block-wise result containing Eq. 2 and Eq. 3. It is not immediately clear what the relationshi is between D in APIC and M(x ) in MLS. Note that in other meshless methods like EFG, M can be singular if neighboring data samles are co-lanar, co-linear, or insufficient, in which case the seudo-inverse is required. Note that M(x ) in 3D is a general 4 4 matrix (in linear olynomial sace) if no restriction is ut on ξ i (x). However when i ξ i(x) = 1 and i ξ i(x)(x i x) = 0, we can see that M(x ) in Eq. 4 is block diagonal with its to left entry being just 1. The choice of quadratic/cubic B-slines for N i (x) makes M(x ) even simler. In fact, it is diagonal for APIC. The to left element is simly 1, while the bottom right block D is 1 4 x2 I for quadratic N i (x) and 1 3 x2 I for cubic N i (x), where x is the grid sacing. The simle form for D comes from the roerties of B-slines, and is not generally true for other interolation functions. 1.3 PolyPIC and MLS A related discussion without exlicitly mentioning the notion of MLS is given by Fu et al. [2017] for PolyPIC. Here we briefly oint out the main idea using the MLS view. PolyPIC uses a higher order basis for P(x). While article-to-grid is also accumulating momentum contributions from article-wise local olynomial velocity fields, the grid-to-article transfer for each article is essentially evaluating basis coefficients c(x) for minimizing the functional J x (c). As discussed in the aer, J x (c) reresents a weighted average error encoding the difference between actual data samles ˆv n+1 i and their aroximation through the olynomial basis. In ractice PolyPIC stores coefficients c(x ) on each article. The size of this coefficient vector deends on the dimension of the olynomial sace. When only linear functions are used, PolyPIC reduces to APIC, and the coefficients simly contain v n+1 and all comonents in C n+1. When higher order basis functions are added to P(x) (such as multilinear functions and quadratic functions), PolyPIC rovides a more accurate olynomial fit to the local velocity field. Notice that Fu et al. [2017] additionally use Gram-Schmidt to enforce a mass orthogonal quadratic basis to reach a diagonal moment matrix M(x), which is highly relevant to the classical idea of using an orthogonal olynomial basis in EFG methods [Lu et al., 1994]. 2 MLS-MPM force differential Comuting the MLS-MPM force differential requires treating the force as a function of fictitiously deformed grid node ositions (x i = x n i + tˆv i where ˆv i are the imlicit unknown grid velocities and x n i are regular Cartesian lattice locations). Secifically, f i (x i ) = N i (x n ) D Ψ F (F (x i ))F nt (x n i x n ), (5) where the imlicit deformation gradient udate is related to x i as F (x i ) = (I + tc (x i )) F n, (6) 2
3 and C is given by C (x i ) = ( i N i (x n ) x i x n i t (x n i x n ) T ) M, (7) where M is a constant in the case of quadratic/cubic B-sline N i (x). Based on these relationshis, the differentiation can be carried out directly using the chain rule. Imlicit time steing in MLS-MPM requires comuting the action of the energy Hessian on an arbitrary grid increment δu (just like traditional MPM). f iα = δf iα = j δf iα = δf iα = δf iα = Ψ F nt F γσd N i (x n )(x n iσ x n σ) αγ 2 E x iα x jβ δu jβ V 0 2 Ψ F ωη δu jβ F nt F j αγ F ωη x γσd jβ 2 Ψ j j N i (x n )(x n iσ x n σ) ( D N j (x n )(x n jξ x n F αγ F ξ)f n ) ξηδ ωβ δujβ F nt γσd N i (x n )(x n iσ x n σ) ωη 2 Ψ ( D F αγ F ωη N j (x n )(x n jξ x n ξ)fξη n ) δujω F nt γσd N i (x n )(x n iσ x n σ), (8) therefore, δf i = A F nt D N i (x n )(x n i x n ), (9) where 2 Ψ A = F F : j D N j (x n )δu j (x n j x n ) T F n. (10) 3 High-erformance imlementation details Benchmark settings We uniformly samle 8 articles er cell, with x = , in region [0.1, 0.9] 3. This results in a grid and 8M articles. APIC transfer and a linear elasticity constitutive model is used for simlicity. To get more accurate results and cover the total run time better, we make P2G and G2P cover more oerations: P2G : Calculate force based on deformation gradient Scatter mass, article elasticity force and APIC affine momentum to grid nodes Normalize grid velocity and aly gravity G2P : Gather velocity from grid Gather velocity gradient (if necessary) and the APIC affine velocity field Udate deformation gradient Advect articles 3
4 Imlementation details Our discussion is focused on 3D cases, which are significantly more comutationally exensive than 2D ones. To make discussions concrete, we further assume 32-bit floating-oint recision and an Intel Core CPU with an architecture later than Haswell (released 2013), though most ideas can be used in other settings or architectures. We further focus the imlementation on quadratic B-slines, the smallest kernel that makes all constitutive models stable. 3.1 Algorithmic imrovement As discussed in the main body of our aer, MLS-MPM halves the number of FLOPs needed for each article. The unification of affine velocity field and deformation gradient eliminates the need for N i (x n ), which seeds u both P2G and G2P. During P2G, MLS-MPM fuses the scattering of the affine momentum and article force contribution into N i (x n )Q (x i x n ), where Q = t M Ψ F (Fn )F nt + m C, so that only one matrix-vector multilication is needed for the inner loo (27 iterations for 3D and quadratic B-slines). During deformation gradient udate in G2P, MLS-MPM avoids evaluating v x with N i(x) and can directly re-use C from APIC. 3.2 Performance engineering When the algorithm kees the same, the FLOPs required for each article is at the same level for comarable imlementations. In this case, the erformance of such a comute-intensive rogram is roughly roortional to floating oint unit (FPU) utilization, which means how many floating-oint oerations are actually done er cycle in the back-end of CPUs. Our otimization goal is to (1) reduce the memory bandwidth consumtion and to (2) imrove floating oint unit (FPU) utilization, while (3) keeing the imlementation simle. Our high-erformance MPM solver is designed based on these three rinciles. Note that due to issues such as memory alignment, comilers may not automatically generate vectorized instructions. In our imlementation, we do that manually via CPU intrinsics. As a result, our low-level erformance engineering significantly imroves FPU utilization by reducing memory bandwidth bound and unnecessary run-time comutations such as memory addresses generation. As noted in the main aer, our P2G and G2P imlementations lead to 1.93 and 7.38 FPU utilization comared with [Tamubolon et al., 2017]. Sarse grid storage and blocked transfer SPGrid [Setaluri et al., 2014] is used for background grid storage. SPGrid cleverly uses Morton coding and the translation lookaside buffer (TLB) to maximize locality and minimize access latency. Grid nodes are arranged in the unit of blocks, and our block size is Notably, the blocked nature of SPGrid effectively reduces memory bandwidth usage during transfer. Padded local grids of size nodes are used as arenas for transfers, where articles within the block region will only read from and write to their arenas. Before P2G or G2P, we gather local arena information from the global SPGrid and write back the change only after P2G. This strategy fits grid data into the L1 cache for maximum throughut and minimum latency. We call different otimized kernels for kernels with and without rigid boundary articles. Doing so raises the branching from article level to block level, which maximizes the ower of seculative execution. Practically, 4
5 only a narrow band of rigid boundaries needs CPIC. Tyically, around 90% blocks use regular MLS-MPM transfers. Avoiding data race in P2G Even though P2G will be executed on local grid arenas only, data race will still be an issue when two neighboring local grid arenas are writing back to the same global grid node. Locking is one straightforward solution. We, however, want to avoid locking for erformance and ease of imlementation. To this end, one solution is to divide the blocks into searate grous where no global nodes are shared by two or more blocks in one grou. In this way, rocessing blocks in a single grou in arallel has no data race. One way to achieve such grouing is via the block coordinates, which are defined to be a Cartesian coordinate system in the unit of SPGrid blocks. We grou blocks according to the arity of block coordinates (all axes considered jointly), leading to 4 grous in 2D and 8 grous in 3D. Low-level erformance engineering We exlicitly write CPU intrinsics to force the comiler to generate desired instructions. Fused multily-add (FMA) instructions that comute a b + c are articularly useful in our case. Note that a, b and c are vectors of four 32-bit floating oint numbers, which can be used as 3D vectors. The remaining entry can be mass, which is fully utilized during P2G. Comilers are conservative regarding unrolling, esecially for a relatively large loo (27 elements for 3D quadratic B-sline transfers). In our case, manually unrolling the loos not only saves the cost of maintaining loo variables but also allows the comiler to generate array access offsets at comile time. 4 3D rinted robot Overall, the robot erforms one degree-of-freedom rotation in an alternating triod gait. A stable 4.5V is alied to the motor to ensure constant rotation seed. Secifically, following the descritions in [Li et al., 2013], we use the same robot (RoboXlorer, Smart Lab) as the main body. Six legs are 3D rinted using ABS material and attached to the main body. These legs have the same length of 2R = 4.1 cm, width = 1.0 cm, and thickness = 0.3 cm, but only two different curvatures 1/r = [, 1]/R are used in our exeriments. The belly area of the main body is 13 2cm 2, and the total weight is 85g. The track used for the exeriment is 180 cm long and 40 cm wide, filled with fine-grained sand 8 cm dee. Two cameras are used to record the motion of the robot from the front and the to at 30 FPS. References C. Fu, Q. Guo, T. Gast, C. Jiang, and J. Teran. A olynomial article-in-cell method. ACM Trans Grah, 36(6):222:1 222:12, C. Jiang, C. Schroeder, A. Selle, J. Teran, and A. Stomakhin. The affine article-in-cell method. ACM Trans Grah, 34(4):51:1 51:10, C. Li, T. Zhang, and D. Goldman. A terradynamics of legged locomotion on granular media. Science, 339 (6126): , Y. Lu, T. Belytschko, and L. Gu. A new imlementation of the element-free galerkin method. Comuter methods in alied mechanics and engineering, 113(3-4): , R. Setaluri, M. Aanjaneya, S. Bauer, and E. Sifakis. Sgrid: A sarse aged grid structure alied to adative smoke simulation. ACM Trans Grah, 33(6):205, A. P. Tamubolon, T. Gast, G. Klár, C. Fu, J. Teran, C. Jiang, and K. Museth. Multi-secies simulation of orous sand and water mixtures. ACM Trans Grah, 36(4),
Kinematics. Why inverse? The study of motion without regard to the forces that cause it. Forward kinematics. Inverse kinematics
Kinematics Inverse kinematics The study of motion without regard to the forces that cause it Forward kinematics Given a joint configuration, what is the osition of an end oint on the structure? Inverse
More informationAn Efficient and Highly Accurate Technique for Periodic Planar Scanner Calibration with the Antenna Under Test in Situ
An Efficient and Highly Accurate echnique for Periodic Planar Scanner Calibration with the Antenna Under est in Situ Scott Pierce I echnologies 1125 Satellite Boulevard, Suite 100 Suwanee, Georgia 30024
More informationA New and Efficient Algorithm-Based Fault Tolerance Scheme for A Million Way Parallelism
A New and Efficient Algorithm-Based Fault Tolerance Scheme for A Million Way Parallelism Erlin Yao, Mingyu Chen, Rui Wang, Wenli Zhang, Guangming Tan Key Laboratory of Comuter System and Architecture Institute
More informationStereo Disparity Estimation in Moment Space
Stereo Disarity Estimation in oment Sace Angeline Pang Faculty of Information Technology, ultimedia University, 63 Cyberjaya, alaysia. angeline.ang@mmu.edu.my R. ukundan Deartment of Comuter Science, University
More informationSensitivity Analysis for an Optimal Routing Policy in an Ad Hoc Wireless Network
1 Sensitivity Analysis for an Otimal Routing Policy in an Ad Hoc Wireless Network Tara Javidi and Demosthenis Teneketzis Deartment of Electrical Engineering and Comuter Science University of Michigan Ann
More informationCOMP Parallel Computing. BSP (1) Bulk-Synchronous Processing Model
COMP 6 - Parallel Comuting Lecture 6 November, 8 Bulk-Synchronous essing Model Models of arallel comutation Shared-memory model Imlicit communication algorithm design and analysis relatively simle but
More informationExtracting Optimal Paths from Roadmaps for Motion Planning
Extracting Otimal Paths from Roadmas for Motion Planning Jinsuck Kim Roger A. Pearce Nancy M. Amato Deartment of Comuter Science Texas A&M University College Station, TX 843 jinsuckk,ra231,amato @cs.tamu.edu
More information1.5 Case Study. dynamic connectivity quick find quick union improvements applications
. Case Study dynamic connectivity quick find quick union imrovements alications Subtext of today s lecture (and this course) Stes to develoing a usable algorithm. Model the roblem. Find an algorithm to
More informationLearning Motion Patterns in Crowded Scenes Using Motion Flow Field
Learning Motion Patterns in Crowded Scenes Using Motion Flow Field Min Hu, Saad Ali and Mubarak Shah Comuter Vision Lab, University of Central Florida {mhu,sali,shah}@eecs.ucf.edu Abstract Learning tyical
More informationEquality-Based Translation Validator for LLVM
Equality-Based Translation Validator for LLVM Michael Ste, Ross Tate, and Sorin Lerner University of California, San Diego {mste,rtate,lerner@cs.ucsd.edu Abstract. We udated our Peggy tool, reviously resented
More informationIntroduction to Visualization and Computer Graphics
Introduction to Visualization and Comuter Grahics DH2320, Fall 2015 Prof. Dr. Tino Weinkauf Introduction to Visualization and Comuter Grahics Grids and Interolation Next Tuesday No lecture next Tuesday!
More informationA Novel Iris Segmentation Method for Hand-Held Capture Device
A Novel Iris Segmentation Method for Hand-Held Cature Device XiaoFu He and PengFei Shi Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200030, China {xfhe,
More informationEfficient Parallel Hierarchical Clustering
Efficient Parallel Hierarchical Clustering Manoranjan Dash 1,SimonaPetrutiu, and Peter Scheuermann 1 Deartment of Information Systems, School of Comuter Engineering, Nanyang Technological University, Singaore
More informationResearch on Inverse Dynamics and Trajectory Planning for the 3-PTT Parallel Machine Tool
06 International Conference on aterials, Information, echanical, Electronic and Comuter Engineering (IECE 06 ISBN: 978--60595-40- Research on Inverse Dynamics and rajectory Planning for the 3-P Parallel
More informationUsing Rational Numbers and Parallel Computing to Efficiently Avoid Round-off Errors on Map Simplification
Using Rational Numbers and Parallel Comuting to Efficiently Avoid Round-off Errors on Ma Simlification Maurício G. Grui 1, Salles V. G. de Magalhães 1,2, Marcus V. A. Andrade 1, W. Randolh Franklin 2,
More informationA CLASS OF STRUCTURED LDPC CODES WITH LARGE GIRTH
A CLASS OF STRUCTURED LDPC CODES WITH LARGE GIRTH Jin Lu, José M. F. Moura, and Urs Niesen Deartment of Electrical and Comuter Engineering Carnegie Mellon University, Pittsburgh, PA 15213 jinlu, moura@ece.cmu.edu
More informationLecture 18. Today, we will discuss developing algorithms for a basic model for parallel computing the Parallel Random Access Machine (PRAM) model.
U.C. Berkeley CS273: Parallel and Distributed Theory Lecture 18 Professor Satish Rao Lecturer: Satish Rao Last revised Scribe so far: Satish Rao (following revious lecture notes quite closely. Lecture
More informationAUTOMATIC 3D SURFACE RECONSTRUCTION BY COMBINING STEREOVISION WITH THE SLIT-SCANNER APPROACH
AUTOMATIC 3D SURFACE RECONSTRUCTION BY COMBINING STEREOVISION WITH THE SLIT-SCANNER APPROACH A. Prokos 1, G. Karras 1, E. Petsa 2 1 Deartment of Surveying, National Technical University of Athens (NTUA),
More informationShuigeng Zhou. May 18, 2016 School of Computer Science Fudan University
Query Processing Shuigeng Zhou May 18, 2016 School of Comuter Science Fudan University Overview Outline Measures of Query Cost Selection Oeration Sorting Join Oeration Other Oerations Evaluation of Exressions
More informationGrouping of Patches in Progressive Radiosity
Grouing of Patches in Progressive Radiosity Arjan J.F. Kok * Abstract The radiosity method can be imroved by (adatively) grouing small neighboring atches into grous. Comutations normally done for searate
More informationIEEE Coyright Notice Personal use of this material is ermitted. However, ermission to rerint/reublish this material for advertising or romotional uroses or for creating new collective works for resale
More informationCMSC 425: Lecture 16 Motion Planning: Basic Concepts
: Lecture 16 Motion lanning: Basic Concets eading: Today s material comes from various sources, including AI Game rogramming Wisdom 2 by S. abin and lanning Algorithms by S. M. LaValle (Chats. 4 and 5).
More informationSpace-efficient Region Filling in Raster Graphics
"The Visual Comuter: An International Journal of Comuter Grahics" (submitted July 13, 1992; revised December 7, 1992; acceted in Aril 16, 1993) Sace-efficient Region Filling in Raster Grahics Dominik Henrich
More information12) United States Patent 10) Patent No.: US 6,321,328 B1
USOO6321328B1 12) United States Patent 10) Patent No.: 9 9 Kar et al. (45) Date of Patent: Nov. 20, 2001 (54) PROCESSOR HAVING DATA FOR 5,961,615 10/1999 Zaid... 710/54 SPECULATIVE LOADS 6,006,317 * 12/1999
More informationObjectives. Part 1: Implement Friction Compensation.
ME 446 Laboratory # Inverse Dynamics Joint Control Reort is due at the beginning of your lab time the week of Aril 9 th. One reort er grou. Lab sessions will be held the weeks of March th, March 6 th,
More informationInteractive Image Segmentation
Interactive Image Segmentation Fahim Mannan (260 266 294) Abstract This reort resents the roject work done based on Boykov and Jolly s interactive grah cuts based N-D image segmentation algorithm([1]).
More informationStructured Mesh Refinement in Generalized Interpolation Material Point (GIMP) Method for Simulation of Dynamic Problems
Coyright c 26 Tech Science Press CMES, vol.12, no.3,.213-227, 26 Structured Mesh Refinement in Generalized Interolation Material Point (GIMP) Method for Simulation of Dynamic Problems Jin Ma, Hongbing
More information10 File System Mass Storage Structure Mass Storage Systems Mass Storage Structure Mass Storage Structure FILE SYSTEM 1
10 File System 1 We will examine this chater in three subtitles: Mass Storage Systems OERATING SYSTEMS FILE SYSTEM 1 File System Interface File System Imlementation 10.1.1 Mass Storage Structure 3 2 10.1
More informationA Morphological LiDAR Points Cloud Filtering Method based on GPGPU
A Morhological LiDAR Points Cloud Filtering Method based on GPGPU Shuo Li 1, Hui Wang 1, Qiuhe Ma 1 and Xuan Zha 2 1 Zhengzhou Institute of Surveying & Maing, No.66, Longhai Middle Road, Zhengzhou, China
More informationAUTOMATIC GENERATION OF HIGH THROUGHPUT ENERGY EFFICIENT STREAMING ARCHITECTURES FOR ARBITRARY FIXED PERMUTATIONS. Ren Chen and Viktor K.
inuts er clock cycle Streaming ermutation oututs er clock cycle AUTOMATIC GENERATION OF HIGH THROUGHPUT ENERGY EFFICIENT STREAMING ARCHITECTURES FOR ARBITRARY FIXED PERMUTATIONS Ren Chen and Viktor K.
More informationPREDICTING LINKS IN LARGE COAUTHORSHIP NETWORKS
PREDICTING LINKS IN LARGE COAUTHORSHIP NETWORKS Kevin Miller, Vivian Lin, and Rui Zhang Grou ID: 5 1. INTRODUCTION The roblem we are trying to solve is redicting future links or recovering missing links
More informationMETHOD OF LANDSLIDE MEASUREMENT BY GROUND BASED LIDAR
METHOD OF LANDSLIDE MEASUREMENT BY GROUND BASED LIDAR Ryo INADA* and Masataka TAKAGI** Kochi University of Technology, Kami-shi, Kochi, 782-8502, Jaan *135082@gs.kochi-tech.ac.j **takagi.masataka@kochi-tech.ac.j
More informationAn Efficient Coding Method for Coding Region-of-Interest Locations in AVS2
An Efficient Coding Method for Coding Region-of-Interest Locations in AVS2 Mingliang Chen 1, Weiyao Lin 1*, Xiaozhen Zheng 2 1 Deartment of Electronic Engineering, Shanghai Jiao Tong University, China
More informationOptimization of Collective Communication Operations in MPICH
To be ublished in the International Journal of High Performance Comuting Alications, 5. c Sage Publications. Otimization of Collective Communication Oerations in MPICH Rajeev Thakur Rolf Rabenseifner William
More informationP Z. parametric surface Q Z. 2nd Image T Z
Direct recovery of shae from multile views: a arallax based aroach Rakesh Kumar. Anandan Keith Hanna Abstract Given two arbitrary views of a scene under central rojection, if the motion of oints on a arametric
More informationImproving Trust Estimates in Planning Domains with Rare Failure Events
Imroving Trust Estimates in Planning Domains with Rare Failure Events Colin M. Potts and Kurt D. Krebsbach Det. of Mathematics and Comuter Science Lawrence University Aleton, Wisconsin 54911 USA {colin.m.otts,
More informationCS 428: Fall Introduction to. Geometric Transformations. Andrew Nealen, Rutgers, /15/2010 1
CS 428: Fall 21 Introduction to Comuter Grahics Geometric Transformations Andrew Nealen, Rutgers, 21 9/15/21 1 Toic overview Image formation and OenGL (last week) Modeling the image formation rocess OenGL
More informationDistributed Estimation from Relative Measurements in Sensor Networks
Distributed Estimation from Relative Measurements in Sensor Networks #Prabir Barooah and João P. Hesanha Abstract We consider the roblem of estimating vectorvalued variables from noisy relative measurements.
More informationCollective Communication: Theory, Practice, and Experience. FLAME Working Note #22
Collective Communication: Theory, Practice, and Exerience FLAME Working Note # Ernie Chan Marcel Heimlich Avi Purkayastha Robert van de Geijn Setember, 6 Abstract We discuss the design and high-erformance
More informationAutonomic Physical Database Design - From Indexing to Multidimensional Clustering
Autonomic Physical Database Design - From Indexing to Multidimensional Clustering Stehan Baumann, Kai-Uwe Sattler Databases and Information Systems Grou Technische Universität Ilmenau, Ilmenau, Germany
More informationSource Coding and express these numbers in a binary system using M log
Source Coding 30.1 Source Coding Introduction We have studied how to transmit digital bits over a radio channel. We also saw ways that we could code those bits to achieve error correction. Bandwidth is
More informationA GPU Heterogeneous Cluster Scheduling Model for Preventing Temperature Heat Island
A GPU Heterogeneous Cluster Scheduling Model for Preventing Temerature Heat Island Yun-Peng CAO 1,2,a and Hai-Feng WANG 1,2 1 School of Information Science and Engineering, Linyi University, Linyi Shandong,
More informationCS 450: COMPUTER GRAPHICS 2D TRANSFORMATIONS SPRING 2016 DR. MICHAEL J. REALE
CS 45: COMUTER GRAHICS 2D TRANSFORMATIONS SRING 26 DR. MICHAEL J. REALE INTRODUCTION Now that we hae some linear algebra under our resectie belts, we can start ug it in grahics! So far, for each rimitie,
More informationEfficient Processing of Top-k Dominating Queries on Multi-Dimensional Data
Efficient Processing of To-k Dominating Queries on Multi-Dimensional Data Man Lung Yiu Deartment of Comuter Science Aalborg University DK-922 Aalborg, Denmark mly@cs.aau.dk Nikos Mamoulis Deartment of
More informationIdentity-sensitive Points-to Analysis for the Dynamic Behavior of JavaScript Objects
Identity-sensitive Points-to Analysis for the Dynamic Behavior of JavaScrit Objects Shiyi Wei and Barbara G. Ryder Deartment of Comuter Science, Virginia Tech, Blacksburg, VA, USA. {wei,ryder}@cs.vt.edu
More informationHigh Order Accurate Curved-element Implementation in OpenFOAM for Discontinuous Galerkin Method
,.47-52 htt://dx.doi.org/10.14257/astl.2016.142.08 High Order Accurate Curved-element Imlementation in OenFOAM for Discontinuous Galerkin Method Li-Yang Xu 1, Shuai Ye 1, Yong-Quan Feng 1, Hao Li 1, Xin-Hai
More informationOMNI: An Efficient Overlay Multicast. Infrastructure for Real-time Applications
OMNI: An Efficient Overlay Multicast Infrastructure for Real-time Alications Suman Banerjee, Christoher Kommareddy, Koushik Kar, Bobby Bhattacharjee, Samir Khuller Abstract We consider an overlay architecture
More informationNear-Optimal Routing Lookups with Bounded Worst Case Performance
Near-Otimal Routing Lookus with Bounded Worst Case Performance Pankaj Guta Balaji Prabhakar Stehen Boyd Deartments of Electrical Engineering and Comuter Science Stanford University CA 9430 ankaj@stanfordedu
More informationCollective communication: theory, practice, and experience
CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Comutat.: Pract. Exer. 2007; 19:1749 1783 Published online 5 July 2007 in Wiley InterScience (www.interscience.wiley.com)..1206 Collective
More informationThe VEGA Moderately Parallel MIMD, Moderately Parallel SIMD, Architecture for High Performance Array Signal Processing
The VEGA Moderately Parallel MIMD, Moderately Parallel SIMD, Architecture for High Performance Array Signal Processing Mikael Taveniku 2,3, Anders Åhlander 1,3, Magnus Jonsson 1 and Bertil Svensson 1,2
More informationHardware-Accelerated Formal Verification
Hardare-Accelerated Formal Verification Hiroaki Yoshida, Satoshi Morishita 3 Masahiro Fujita,. VLSI Design and Education Center (VDEC), University of Tokyo. CREST, Jaan Science and Technology Agency 3.
More informationParallel Construction of Multidimensional Binary Search Trees. Ibraheem Al-furaih, Srinivas Aluru, Sanjay Goil Sanjay Ranka
Parallel Construction of Multidimensional Binary Search Trees Ibraheem Al-furaih, Srinivas Aluru, Sanjay Goil Sanjay Ranka School of CIS and School of CISE Northeast Parallel Architectures Center Syracuse
More informationImplementations of Partial Document Ranking Using. Inverted Files. Wai Yee Peter Wong. Dik Lun Lee
Imlementations of Partial Document Ranking Using Inverted Files Wai Yee Peter Wong Dik Lun Lee Deartment of Comuter and Information Science, Ohio State University, 36 Neil Ave, Columbus, Ohio 4321, U.S.A.
More informationarxiv: v1 [cs.dc] 13 Nov 2018
Task Grah Transformations for Latency Tolerance arxiv:1811.05077v1 [cs.dc] 13 Nov 2018 Victor Eijkhout November 14, 2018 Abstract The Integrative Model for Parallelism (IMP) derives a task grah from a
More informationSimulating Ocean Currents. Simulating Galaxy Evolution
Simulating Ocean Currents (a) Cross sections (b) Satial discretization of a cross section Model as two-dimensional grids Discretize in sace and time finer satial and temoral resolution => greater accuracy
More informationA BICRITERION STEINER TREE PROBLEM ON GRAPH. Mirko VUJO[EVI], Milan STANOJEVI] 1. INTRODUCTION
Yugoslav Journal of Oerations Research (00), umber, 5- A BICRITERIO STEIER TREE PROBLEM O GRAPH Mirko VUJO[EVI], Milan STAOJEVI] Laboratory for Oerational Research, Faculty of Organizational Sciences University
More informationConstruction of Irregular QC-LDPC Codes in Near-Earth Communications
Journal of Communications Vol. 9, No. 7, July 24 Construction of Irregular QC-LDPC Codes in Near-Earth Communications Hui Zhao, Xiaoxiao Bao, Liang Qin, Ruyan Wang, and Hong Zhang Chongqing University
More informationIntroduction to Parallel Algorithms
CS 1762 Fall, 2011 1 Introduction to Parallel Algorithms Introduction to Parallel Algorithms ECE 1762 Algorithms and Data Structures Fall Semester, 2011 1 Preliminaries Since the early 1990s, there has
More informationPerformance and Area Tradeoffs in Space-Qualified FPGA-Based Time-of-Flight Systems
Performance and Area Tradeoffs in Sace-Qualified FPGA-Based Time-of-Flight Systems Whitney Hsiong 1, Steve Huntzicker 1, Kevin King 1, Austin Lee 1, Chen Lim 1, Jason Wang 1, Sarah Harris, Ph.D. 1, Jörg-Micha
More informationCS335 Fall 2007 Graphics and Multimedia. 2D Drawings: Lines
CS335 Fall 007 Grahics and Multimedia D Drawings: Lines Primitive Drawing Oerations Digital Concets of Drawing in Raster Arras PIXEL is a single arra element at x, - No smaller drawing unit exists Px,
More informationNon-Strict Independence-Based Program Parallelization Using Sharing and Freeness Information
Non-Strict Indeendence-Based Program Parallelization Using Sharing and Freeness Information Daniel Cabeza Gras 1 and Manuel V. Hermenegildo 1,2 Abstract The current ubiuity of multi-core rocessors has
More informationDetection of Occluded Face Image using Mean Based Weight Matrix and Support Vector Machine
Journal of Comuter Science 8 (7): 1184-1190, 2012 ISSN 1549-3636 2012 Science Publications Detection of Occluded Face Image using Mean Based Weight Matrix and Suort Vector Machine 1 G. Nirmala Priya and
More informationCENTRAL AND PARALLEL PROJECTIONS OF REGULAR SURFACES: GEOMETRIC CONSTRUCTIONS USING 3D MODELING SOFTWARE
CENTRAL AND PARALLEL PROJECTIONS OF REGULAR SURFACES: GEOMETRIC CONSTRUCTIONS USING 3D MODELING SOFTWARE Petra Surynková Charles University in Prague, Faculty of Mathematics and Physics, Sokolovská 83,
More informationCASCH - a Scheduling Algorithm for "High Level"-Synthesis
CASCH a Scheduling Algorithm for "High Level"Synthesis P. Gutberlet H. Krämer W. Rosenstiel Comuter Science Research Center at the University of Karlsruhe (FZI) HaidundNeuStr. 1014, 7500 Karlsruhe, F.R.G.
More informationA Study of Protocols for Low-Latency Video Transport over the Internet
A Study of Protocols for Low-Latency Video Transort over the Internet Ciro A. Noronha, Ph.D. Cobalt Digital Santa Clara, CA ciro.noronha@cobaltdigital.com Juliana W. Noronha University of California, Davis
More informationEVALUATION OF THE ACCURACY OF A LASER SCANNER-BASED ROLL MAPPING SYSTEM
EVALUATION OF THE ACCURACY OF A LASER SCANNER-BASED ROLL MAPPING SYSTEM R.S. Radovanovic*, W.F. Teskey*, N.N. Al-Hanbali** *University of Calgary, Canada Deartment of Geomatics Engineering rsradova@ucalgary.ca
More informationRedundant wavelet processing on the half-axis with applications to signal denoising with small delays: Theory and experiments
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING Int. J. Adat. Control Signal Process. 006; 0:447 474 Published online 8 August 006 in Wiley InterScience (www.interscience.wiley.com)..9
More informationAuto-Tuning Distributed-Memory 3-Dimensional Fast Fourier Transforms on the Cray XT4
Auto-Tuning Distributed-Memory 3-Dimensional Fast Fourier Transforms on the Cray XT4 M. Gajbe a A. Canning, b L-W. Wang, b J. Shalf, b H. Wasserman, b and R. Vuduc, a a Georgia Institute of Technology,
More informationFast Algorithm to Estimate Dense Disparity Fields
Fast Algorithm to Estimate Dense Disarity Fields M Kardouchi a, E Hervet a University of Moncton, Comuter Science Det, Moncton, NB, Canada {kardoum,hervete}@umonctonca This aer describes a method to comute
More informationA 2D Random Walk Mobility Model for Location Management Studies in Wireless Networks Abstract: I. Introduction
A D Random Walk Mobility Model for Location Management Studies in Wireless Networks Kuo Hsing Chiang, RMIT University, Melbourne, Australia Nirmala Shenoy, Information Technology Deartment, RIT, Rochester,
More informationSource-to-Source Code Generation Based on Pattern Matching and Dynamic Programming
Source-to-Source Code Generation Based on Pattern Matching and Dynamic Programming Weimin Chen, Volker Turau TR-93-047 August, 1993 Abstract This aer introduces a new technique for source-to-source code
More informationAPPLICATION OF PARTICLE FILTERS TO MAP-MATCHING ALGORITHM
APPLICATION OF PARTICLE FILTERS TO MAP-MATCHING ALGORITHM Pavel Davidson 1, Jussi Collin 2, and Jarmo Taala 3 Deartment of Comuter Systems, Tamere University of Technology, Finland e-mail: avel.davidson@tut.fi
More informationMatlab Virtual Reality Simulations for optimizations and rapid prototyping of flexible lines systems
Matlab Virtual Reality Simulations for otimizations and raid rototying of flexible lines systems VAMVU PETRE, BARBU CAMELIA, POP MARIA Deartment of Automation, Comuters, Electrical Engineering and Energetics
More informationVariability-Aware Parametric Yield Estimation for Analog/Mixed-Signal Circuits: Concepts, Algorithms and Challenges
Variability-Aware Parametric Yield Estimation for Analog/Mixed-Signal Circuits: Concets, Algorithms and Challenges Fang Gong Cadence Design Systems, Inc. 2655 Seely Ave., San Jose, CA fgong@cadence.com
More information10. Parallel Methods for Data Sorting
10. Parallel Methods for Data Sorting 10. Parallel Methods for Data Sorting... 1 10.1. Parallelizing Princiles... 10.. Scaling Parallel Comutations... 10.3. Bubble Sort...3 10.3.1. Sequential Algorithm...3
More informationImproved heuristics for the single machine scheduling problem with linear early and quadratic tardy penalties
Imroved heuristics for the single machine scheduling roblem with linear early and quadratic tardy enalties Jorge M. S. Valente* LIAAD INESC Porto LA, Faculdade de Economia, Universidade do Porto Postal
More informationMulti-robot SLAM with Unknown Initial Correspondence: The Robot Rendezvous Case
Multi-robot SLAM with Unknown Initial Corresondence: The Robot Rendezvous Case Xun S. Zhou and Stergios I. Roumeliotis Deartment of Comuter Science & Engineering, University of Minnesota, Minneaolis, MN
More informationSingle character type identification
Single character tye identification Yefeng Zheng*, Changsong Liu, Xiaoqing Ding Deartment of Electronic Engineering, Tsinghua University Beijing 100084, P.R. China ABSTRACT Different character recognition
More informationExamination and Analysis of Implementation Choices within the Material Point Method (MPM)
Coyright c 2008 Tech Science Press CMES, vol.31, no.2,.107-127, 2008 Examination and Analysis of Imlementation Choices within the Material Point Method (MPM) M. Steffen 1,P.C.Wallstedt 2,J.E.Guilkey 2,3,R.M.Kirby
More informationRandomized algorithms: Two examples and Yao s Minimax Principle
Randomized algorithms: Two examles and Yao s Minimax Princile Maximum Satisfiability Consider the roblem Maximum Satisfiability (MAX-SAT). Bring your knowledge u-to-date on the Satisfiability roblem. Maximum
More informationThickness computation of trimmed B-Rep model using GPU ray tracing
Thickness comutation of trimmed B-Re model using GPU ray tracing G. Lemasson 1,2,3, J.C. Iehl 2, F. Zara 1, B. Shariat 1, V. Baudet 3, P. Arthaud 3 1 Université de Lyon, CNRS, Université Lyon 1, LIRIS,
More informationarxiv: v1 [cs.mm] 18 Jan 2016
Lossless Intra Coding in with 3-ta Filters Saeed R. Alvar a, Fatih Kamisli a a Deartment of Electrical and Electronics Engineering, Middle East Technical University, Turkey arxiv:1601.04473v1 [cs.mm] 18
More informationLow Power Implementations for Adaptive Filters
Low Power Imlementations for Adative ilters Marius Vollmer Stehan Klauke Jürgen Götze University of ortmund Information Processing Lab htt://www-dt.e-technik.uni-dortmund.de marius.vollmer stehan.klauke
More informationImproved Image Super-Resolution by Support Vector Regression
Proceedings of International Joint Conference on Neural Networks, San Jose, California, USA, July 3 August 5, 0 Imroved Image Suer-Resolution by Suort Vector Regression Le An and Bir Bhanu Abstract Suort
More informationPower Savings in Embedded Processors through Decode Filter Cache
Power Savings in Embedded Processors through Decode Filter Cache Weiyu Tang Rajesh Guta Alexandru Nicolau Deartment of Information and Comuter Science University of California, Irvine Irvine, CA 92697-3425
More informationSEARCH ENGINE MANAGEMENT
e-issn 2455 1392 Volume 2 Issue 5, May 2016. 254 259 Scientific Journal Imact Factor : 3.468 htt://www.ijcter.com SEARCH ENGINE MANAGEMENT Abhinav Sinha Kalinga Institute of Industrial Technology, Bhubaneswar,
More informationSwift Template Matching Based on Equivalent Histogram
Swift emlate Matching ased on Equivalent istogram Wangsheng Yu, Xiaohua ian, Zhiqiang ou * elecommunications Engineering Institute Air Force Engineering University Xi an, PR China *corresonding author:
More informationMitigating the Impact of Decompression Latency in L1 Compressed Data Caches via Prefetching
Mitigating the Imact of Decomression Latency in L1 Comressed Data Caches via Prefetching by Sean Rea A thesis resented to Lakehead University in artial fulfillment of the requirement for the degree of
More informationContinuous Visible k Nearest Neighbor Query on Moving Objects
Continuous Visible k Nearest Neighbor Query on Moving Objects Yaniu Wang a, Rui Zhang b, Chuanfei Xu a, Jianzhong Qi b, Yu Gu a, Ge Yu a, a Deartment of Comuter Software and Theory, Northeastern University,
More informationCS 229 Final Project: Single Image Depth Estimation From Predicted Semantic Labels
CS 229 Final Project: Single Image Deth Estimation From Predicted Semantic Labels Beyang Liu beyangl@cs.stanford.edu Stehen Gould sgould@stanford.edu Prof. Dahne Koller koller@cs.stanford.edu December
More informationAn Efficient VLSI Architecture for Adaptive Rank Order Filter for Image Noise Removal
International Journal of Information and Electronics Engineering, Vol. 1, No. 1, July 011 An Efficient VLSI Architecture for Adative Rank Order Filter for Image Noise Removal M. C Hanumantharaju, M. Ravishankar,
More information2010 First International Conference on Networking and Computing
First International Conference on Networking and Comuting Imlementations of Parallel Comutation of Euclidean Distance Ma in Multicore Processors and GPUs Duhu Man, Kenji Uda, Hironobu Ueyama, Yasuaki Ito,
More informationGDP: Using Dataflow Properties to Accurately Estimate Interference-Free Performance at Runtime
GDP: Using Dataflow Proerties to Accurately Estimate Interference-Free Performance at Runtime Magnus Jahre Deartment of Comuter Science Norwegian University of Science and Technology (NTNU) Email: magnus.jahre@ntnu.no
More informationLecture 2: Fixed-Radius Near Neighbors and Geometric Basics
structure arises in many alications of geometry. The dual structure, called a Delaunay triangulation also has many interesting roerties. Figure 3: Voronoi diagram and Delaunay triangulation. Search: Geometric
More informationarxiv: v1 [math.gt] 1 May 2018
CIRCUIT PRESENTATION AND LATTICE STICK NUMBER WITH EXACTLY 4 z-sticks HYOUNGJUN KIM AND SUNGJONG NO arxiv:805.0023v [math.gt] May 208 Abstract. The lattice stick number s L(L) of a link L is defined to
More information5. UDEC. 5.1 Introduction. 5.2 UDEC: Two-dimensional modelling
5. 5.1 Introduction On a conference in 1971 Peter A. Cundall, the doctorial student of Imerial College, introduced a novel software called Universal Distinct Element Code (). This software (Cundall, 1971)
More informationUPDATING ALGORITHMS FOR CONSTRAINT DELAUNAY TIN
UPDATING ALGORITMS FOR CONSTRAINT DELAUNAY TIN WU LiXin a,b WANG YanBing a, c a Institute for GIS/RS/GPS & Subsidence Engineering Research, China Uni Mining & Technology, Beijing, China, 100083 - awulixin@263.net,
More informationUse of Multivariate Statistical Analysis in the Modelling of Chromatographic Processes
Use of Multivariate Statistical Analysis in the Modelling of Chromatograhic Processes Simon Edwards-Parton 1, Nigel itchener-hooker 1, Nina hornhill 2, Daniel Bracewell 1, John Lidell 3 Abstract his aer
More informationA Symmetric FHE Scheme Based on Linear Algebra
A Symmetric FHE Scheme Based on Linear Algebra Iti Sharma University College of Engineering, Comuter Science Deartment. itisharma.uce@gmail.com Abstract FHE is considered to be Holy Grail of cloud comuting.
More informationAn FPGA Implementation of ( )-Regular Low-Density. Parity-Check Code Decoder
An FPGA Imlementation of ( )-Regular Low-Density Parity-Check Code Decoder Tong Zhang ECSE Det., RPI Troy, NY tzhang@ecse.ri.edu Keshab K. Parhi ECE Det., Univ. of Minnesota Minneaolis, MN arhi@ece.umn.edu
More information