Massively Parallel Graph Analytics
|
|
- Brendan Rodgers
- 5 years ago
- Views:
Transcription
1 Massively Parallel Graph Analytics Manycore graph processing, distributed graph layout, and supercomputing for graph analytics George M. Slota 1,2,3 Kamesh Madduri 2 Sivasankaran Rajamanickam 1 1 Sandia National Laboratories, 2 Penn State University, 3 Blue Waters Fellow gslota@psu.edu, madduri@cse.psu.edu, srajama@sandia.gov Blue Waters Symposium 11 May 2015
2 Research Motivation and Goals Graph analysis is key for the study of biological, chemical, social, and other networks Real-world graphs are big, irregular, complex Graph analytics is one of DARPA s 23 toughest mathematical challenges Web graph: 3.5B sites, 129B hyperlinks Brain graph: 100B neurons, 1,000T synaptic connections Goal: How can we analyze these massive graphs on supercomputers? Modern computational systems like Blue Waters are also big and complex Multiple levels of parallelism, memory hierarchy, hardware configurations, GPUs and coprocessors Goal: How can we generically optimize graph algorithms for varying computational hardware?
3 Methods and Approaches Observation: most graph algorithms follow a tri-nested loop structure Optimize for this general algorithmic structure Transform structure for more parallelism Observation: varying in-memory distributed graph layout affects total execution time Partition graph to minimize per-task computation and communication Order vertices within partition for optimal cache performance Observation: previous approaches for massive graph analytics have only considered external memory solutions Use proper distributed layout to efficiently store graph in distributed memory supercomputer Use algorithmic and layout optimizations to concurrently minimize intra-node execution times and inter-node communication times
4 Results - Improving Computation and Communication Algorithm H MG ML 3 GTEPS DBpedia XyceTest Google Flickr LiveJournal uk 2002 Graph Computational performance rate of a graph analytic with different optimization approaches on GPU (H: hierarchical, MG: global approach, ML: Local approach, Grey bar: baseline) WikiLinks uk 2005 IndoChina RMAT2M GNP2M HV15R Speedup vs LiveJournal Orkut Twitter uk 2005 WebBase sk Partitioner Communication speedups for a complex analytic relative to a random baseline with different distributed layout approaches (DGL-MC: multi-constraint, DGL-MOMC: multi-object)
5 Results - Analyzing the Internet Using performance optimization approaches, we can find communities and most important pages by centrality measures in minutes using Blue Waters Largest Communities Discovered (numbers in millions) Pages Internal Links External Links Representative Page YouTube Tumblr Creative Commons WordPress Amazon Flickr Individual Page Centrality Rankings In Degree PageRank Harmonic YouTube YouTube WordPress WordPress YouTube/t/.. Twitter YouTube/t/.. YouTube/testtube Twitter/privacy YouTube/.. YouTube/.. Twitter/About YouTube/.. Tumblr Twitter/account YouTube/t/.. Google/.. Twitter/about
6 Publications Based on Fellowship Work Distributed Graph Layout for Scalable Small-world Network Analysis George M. Slota, Kamesh Madduri, Sivasankaran Rajamanickam In submission Supercomputing for Web Graph Analytics George M. Slota, Sivasankaran Rajamanickam, Kamesh Madduri Under Review High-performance Graph Analytics on Manycore Processors George M. Slota, Sivasankaran Rajamanickam, Kamesh Madduri To appear in the Proceedings of the 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS15)
7 Summary of Accomplishments Optimizations for manycore parallelism result in up to a 3.25 performance improvement for graph analytics executing on GPU Modifications to in-memory storage of graph structure results in up to a 1.48 performance improvement for distributed analytics running with MPI+OpenMP on Blue Waters First-ever analysis of largest to-date web crawl (129B hyperlinks) on a distributed memory system Running on 256 nodes of Blue Waters, we are able to run several complex graph analytics on the web crawl in only minutes of execution time These approaches will allow further scaling to analyze even larger graphs, such as our brain s neural network (1K trillion connections)
8 Future Work Implement more graph analytic algorithms Subgraph counting Other community detection approaches etc. Further improve scaling and performance Explore parameter space of optimizations Vary layout objectives and constraints per-algorithm Acquire and analyze larger and more complex networks on Blue Waters Planned future presentations of fellowship work: Presentation of manycore-based optimizations strategies at IPDPS15 Poster presentation of overall layout approach at IPDPS15 Presentation and poster presentation of web graph analytics at SC15 (tentative)
9 Acknowledgments This research is part of the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation (awards OCI , ACI , and ACI ) and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications. This work is also supported by NSF grants ACI , CCF , and the DOE Office of Science through the FASTMath SciDAC Institute. Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy s National Nuclear Security Administration under contract DE-AC04-94AL85000.
Irregular Graph Algorithms on Parallel Processing Systems
Irregular Graph Algorithms on Parallel Processing Systems George M. Slota 1,2 Kamesh Madduri 1 (advisor) Sivasankaran Rajamanickam 2 (Sandia mentor) 1 Penn State University, 2 Sandia National Laboratories
More informationExtreme-scale Graph Analysis on Blue Waters
Extreme-scale Graph Analysis on Blue Waters 2016 Blue Waters Symposium George M. Slota 1,2, Siva Rajamanickam 1, Kamesh Madduri 2, Karen Devine 1 1 Sandia National Laboratories a 2 The Pennsylvania State
More informationExtreme-scale Graph Analysis on Blue Waters
Extreme-scale Graph Analysis on Blue Waters 2016 Blue Waters Symposium George M. Slota 1,2, Siva Rajamanickam 1, Kamesh Madduri 2, Karen Devine 1 1 Sandia National Laboratories a 2 The Pennsylvania State
More informationOrder or Shuffle: Empirically Evaluating Vertex Order Impact on Parallel Graph Computations
Order or Shuffle: Empirically Evaluating Vertex Order Impact on Parallel Graph Computations George M. Slota 1 Sivasankaran Rajamanickam 2 Kamesh Madduri 3 1 Rensselaer Polytechnic Institute, 2 Sandia National
More informationPuLP: Scalable Multi-Objective Multi-Constraint Partitioning for Small-World Networks
PuLP: Scalable Multi-Objective Multi-Constraint Partitioning for Small-World Networks George M. Slota 1,2 Kamesh Madduri 2 Sivasankaran Rajamanickam 1 1 Sandia National Laboratories, 2 The Pennsylvania
More informationXtraPuLP. Partitioning Trillion-edge Graphs in Minutes. State University
XtraPuLP Partitioning Trillion-edge Graphs in Minutes George M. Slota 1 Sivasankaran Rajamanickam 2 Kamesh Madduri 3 Karen Devine 2 1 Rensselaer Polytechnic Institute, 2 Sandia National Labs, 3 The Pennsylvania
More informationHPCGraph: Benchmarking Massive Graph Analytics on Supercomputers
HPCGraph: Benchmarking Massive Graph Analytics on Supercomputers George M. Slota 1, Siva Rajamanickam 2, Kamesh Madduri 3 1 Rensselaer Polytechnic Institute 2 Sandia National Laboratories a 3 The Pennsylvania
More informationSimple Parallel Biconnectivity Algorithms for Multicore Platforms
Simple Parallel Biconnectivity Algorithms for Multicore Platforms George M. Slota Kamesh Madduri The Pennsylvania State University HiPC 2014 December 17-20, 2014 Code, presentation available at graphanalysis.info
More informationPULP: Fast and Simple Complex Network Partitioning
PULP: Fast and Simple Complex Network Partitioning George Slota #,* Kamesh Madduri # Siva Rajamanickam * # The Pennsylvania State University *Sandia National Laboratories Dagstuhl Seminar 14461 November
More informationPuLP. Complex Objective Partitioning of Small-World Networks Using Label Propagation. George M. Slota 1,2 Kamesh Madduri 2 Sivasankaran Rajamanickam 1
PuLP Complex Objective Partitioning of Small-World Networks Using Label Propagation George M. Slota 1,2 Kamesh Madduri 2 Sivasankaran Rajamanickam 1 1 Sandia National Laboratories, 2 The Pennsylvania State
More informationCharacterizing Biological Networks Using Subgraph Counting and Enumeration
Characterizing Biological Networks Using Subgraph Counting and Enumeration George Slota Kamesh Madduri madduri@cse.psu.edu Computer Science and Engineering The Pennsylvania State University SIAM PP14 February
More informationBFS and Coloring-based Parallel Algorithms for Strongly Connected Components and Related Problems
20 May 2014 Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy
More informationHigh-performance Graph Analytics
High-performance Graph Analytics Kamesh Madduri Computer Science and Engineering The Pennsylvania State University madduri@cse.psu.edu Papers, code, slides at graphanalysis.info Acknowledgments NSF grants
More informationWar Stories : Graph Algorithms in GPUs
SAND2014-18323PE War Stories : Graph Algorithms in GPUs Siva Rajamanickam(SNL) George Slota, Kamesh Madduri (PSU) FASTMath Meeting Exceptional service in the national interest is a multi-program laboratory
More informationDownloaded 10/31/16 to Redistribution subject to SIAM license or copyright; see
SIAM J. SCI. COMPUT. Vol. 38, No. 5, pp. S62 S645 c 216 Society for Industrial and Applied Mathematics COMPLEX NETWORK PARTITIONING USING LABEL PROPAGATION GEORGE M. SLOTA, KAMESH MADDURI, AND SIVASANKARAN
More informationScalable Community Detection Benchmark Generation
Scalable Community Detection Benchmark Generation Jonathan Berry 1 Cynthia Phillips 1 Siva Rajamanickam 1 George M. Slota 2 1 Sandia National Labs, 2 Rensselaer Polytechnic Institute jberry@sandia.gov,
More informationPartitioning Trillion-edge Graphs in Minutes
Partitioning Trillion-edge Graphs in Minutes George M. Slota Computer Science Department Rensselaer Polytechnic Institute Troy, NY slotag@rpi.edu Sivasankaran Rajamanickam & Karen Devine Scalable Algorithms
More informationAccelerated Load Balancing of Unstructured Meshes
Accelerated Load Balancing of Unstructured Meshes Gerrett Diamond, Lucas Davis, and Cameron W. Smith Abstract Unstructured mesh applications running on large, parallel, distributed memory systems require
More informationKartik Lakhotia, Rajgopal Kannan, Viktor Prasanna USENIX ATC 18
Accelerating PageRank using Partition-Centric Processing Kartik Lakhotia, Rajgopal Kannan, Viktor Prasanna USENIX ATC 18 Outline Introduction Partition-centric Processing Methodology Analytical Evaluation
More informationCo-existence: Can Big Data and Big Computation Co-exist on the Same Systems?
Co-existence: Can Big Data and Big Computation Co-exist on the Same Systems? Dr. William Kramer National Center for Supercomputing Applications, University of Illinois Where these views come from Large
More informationVisual Analysis of Lagrangian Particle Data from Combustion Simulations
Visual Analysis of Lagrangian Particle Data from Combustion Simulations Hongfeng Yu Sandia National Laboratories, CA Ultrascale Visualization Workshop, SC11 Nov 13 2011, Seattle, WA Joint work with Jishang
More informationOn Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs
On Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs Sungpack Hong 2, Nicole C. Rodia 1, and Kunle Olukotun 1 1 Pervasive Parallelism Laboratory, Stanford University
More informationPULP: Scalable Multi-Objective Multi-Constraint Partitioning for Small-World Networks
PULP: Scalable Multi-Objective Multi-Constraint Partitioning for Small-World Networks George M. Slota Kamesh Madduri Department of Computer Science and Engineering The Pennsylvania State University University
More informationParallel Graph Coloring For Many- core Architectures
Parallel Graph Coloring For Many- core Architectures Mehmet Deveci, Erik Boman, Siva Rajamanickam Sandia Na;onal Laboratories Sandia National Laboratories is a multi-program laboratory managed and operated
More informationTanuj Kr Aasawat, Tahsin Reza, Matei Ripeanu Networked Systems Laboratory (NetSysLab) University of British Columbia
How well do CPU, GPU and Hybrid Graph Processing Frameworks Perform? Tanuj Kr Aasawat, Tahsin Reza, Matei Ripeanu Networked Systems Laboratory (NetSysLab) University of British Columbia Networked Systems
More informationImplementing Strassen-like Fast Matrix Multiplication Algorithms with BLIS
Implementing Strassen-like Fast Matrix Multiplication Algorithms with BLIS Jianyu Huang, Leslie Rice Joint work with Tyler M. Smith, Greg M. Henry, Robert A. van de Geijn BLIS Retreat 2016 *Overlook of
More informationFast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs
Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs Sungpack Hong 2, Nicole C. Rodia 1, and Kunle Olukotun 1 1 Pervasive Parallelism Laboratory, Stanford University 2 Oracle
More informationIrregular Graph Algorithms on Modern Multicore, Manycore, and Distributed Processing Systems
Irregular Graph Algorithms on Modern Multicore, Manycore, and Distributed Processing Systems Comprehensive Examination George M. Slota Scalable Computing Laboratory Department of Computer Science and Engineering
More informationBFS and Coloring-based Parallel Algorithms for Strongly Connected Components and Related Problems
BFS and Coloring-based Parallel Algorithms for Strongly Connected Components and Related Problems George M. Slota 1, Sivasankaran Rajamanickam 2, and Kamesh Madduri 1 1 The Pennsylvania State University
More informationGraph Partitioning for Scalable Distributed Graph Computations
Graph Partitioning for Scalable Distributed Graph Computations Aydın Buluç ABuluc@lbl.gov Kamesh Madduri madduri@cse.psu.edu 10 th DIMACS Implementation Challenge, Graph Partitioning and Graph Clustering
More informationA Platform for Provisioning Integrated Data and Visualization Capabilities Presented to SATURN in May 2016 Gerry Giese, Sandia National Laboratories
Photos placed in horizontal position with even amount of white space between photos and header A Platform for Provisioning Integrated Data and Visualization Capabilities Presented to SATURN in May 2016
More informationMeasurements on (Complete) Graphs: The Power of Wedge and Diamond Sampling
Measurements on (Complete) Graphs: The Power of Wedge and Diamond Sampling Tamara G. Kolda plus Grey Ballard, Todd Plantenga, Ali Pinar, C. Seshadhri Workshop on Incomplete Network Data Sandia National
More informationAdvances in Parallel Partitioning, Load Balancing and Matrix Ordering for Scientific Computing
Advances in Parallel Partitioning, Load Balancing and Matrix Ordering for Scientific Computing Erik G. Boman 1, Umit V. Catalyurek 2, Cédric Chevalier 1, Karen D. Devine 1, Ilya Safro 3, Michael M. Wolf
More informationBFS and Coloring-based Parallel Algorithms for Strongly Connected Components and Related Problems
BFS and Coloring-based Parallel Algorithms for Strongly Connected Components and Related Problems George M. Slota, Sivasankaran Rajamanickam, and Kamesh Madduri Computer Science and Engineering, The Pennsylvania
More informationHypergraph Exploitation for Data Sciences
Photos placed in horizontal position with even amount of white space between photos and header Hypergraph Exploitation for Data Sciences Photos placed in horizontal position with even amount of white space
More informationA Case Study of Complex Graph Analysis in Distributed Memory: Implementation and Optimization
A Case Study of Complex Graph Analysis in Distributed Memory: Implementation and Optimization George M. Slota Computer Science and Engineering The Pennsylvania State University University Park, PA gslota@psu.edu
More informationPERFORMANCE PORTABILITY WITH OPENACC. Jeff Larkin, NVIDIA, November 2015
PERFORMANCE PORTABILITY WITH OPENACC Jeff Larkin, NVIDIA, November 2015 TWO TYPES OF PORTABILITY FUNCTIONAL PORTABILITY PERFORMANCE PORTABILITY The ability for a single code to run anywhere. The ability
More informationExploring the Hidden Dimension in Graph Processing
Exploring the Hidden Dimension in Graph Processing Mingxing Zhang, Yongwei Wu, Kang Chen, *Xuehai Qian, Xue Li, and Weimin Zheng Tsinghua University *University of Shouthern California Graph is Ubiquitous
More informationScaling species tree estimation methods to large datasets using NJMerge
Scaling species tree estimation methods to large datasets using NJMerge Erin Molloy and Tandy Warnow {emolloy2, warnow}@illinois.edu University of Illinois at Urbana Champaign 2018 Phylogenomics Software
More informationOptimizing Parallel Sparse Matrix-Vector Multiplication by Corner Partitioning
Optimizing Parallel Sparse Matrix-Vector Multiplication by Corner Partitioning Michael M. Wolf 1,2, Erik G. Boman 2, and Bruce A. Hendrickson 3 1 Dept. of Computer Science, University of Illinois at Urbana-Champaign,
More informationA Classifica*on of Scien*fic Visualiza*on Algorithms for Massive Threading Kenneth Moreland Berk Geveci Kwan- Liu Ma Robert Maynard
A Classifica*on of Scien*fic Visualiza*on Algorithms for Massive Threading Kenneth Moreland Berk Geveci Kwan- Liu Ma Robert Maynard Sandia Na*onal Laboratories Kitware, Inc. University of California at Davis
More informationGetting Started with Memcached. Ahmed Soliman
Getting Started with Memcached Ahmed Soliman In this package, you will find: A Biography of the author of the book A synopsis of the book s content Information on where to buy this book About the Author
More informationRecent Advances in Heterogeneous Computing using Charm++
Recent Advances in Heterogeneous Computing using Charm++ Jaemin Choi, Michael Robson Parallel Programming Laboratory University of Illinois Urbana-Champaign April 12, 2018 1 / 24 Heterogeneous Computing
More informationPractical Near-Data Processing for In-Memory Analytics Frameworks
Practical Near-Data Processing for In-Memory Analytics Frameworks Mingyu Gao, Grant Ayers, Christos Kozyrakis Stanford University http://mast.stanford.edu PACT Oct 19, 2015 Motivating Trends End of Dennard
More informationPreconditioning Linear Systems Arising from Graph Laplacians of Complex Networks
Preconditioning Linear Systems Arising from Graph Laplacians of Complex Networks Kevin Deweese 1 Erik Boman 2 1 Department of Computer Science University of California, Santa Barbara 2 Scalable Algorithms
More informationHigh-Performance Graph Traversal for De Bruijn Graph-Based Metagenome Assembly
1 / 32 High-Performance Graph Traversal for De Bruijn Graph-Based Metagenome Assembly Vasudevan Rengasamy Kamesh Madduri School of EECS The Pennsylvania State University {vxr162, madduri}@psu.edu SIAM
More informationDemystifying Machine Learning
Demystifying Machine Learning Dmitry Figol, WW Enterprise Sales Systems Engineer - Programmability @dmfigol CTHRST-1002 Agenda Machine Learning examples What is Machine Learning Types of Machine Learning
More informationEnzo-P / Cello. Formation of the First Galaxies. San Diego Supercomputer Center. Department of Physics and Astronomy
Enzo-P / Cello Formation of the First Galaxies James Bordner 1 Michael L. Norman 1 Brian O Shea 2 1 University of California, San Diego San Diego Supercomputer Center 2 Michigan State University Department
More informationPortability and Scalability of Sparse Tensor Decompositions on CPU/MIC/GPU Architectures
Photos placed in horizontal position with even amount of white space between photos and header Portability and Scalability of Sparse Tensor Decompositions on CPU/MIC/GPU Architectures Christopher Forster,
More informationSpiNNaker - a million core ARM-powered neural HPC
The Advanced Processor Technologies Group SpiNNaker - a million core ARM-powered neural HPC Cameron Patterson cameron.patterson@cs.man.ac.uk School of Computer Science, The University of Manchester, UK
More informationVisualization of Energy Conversion Processes in a Light Harvesting Organelle at Atomic Detail
Visualization of Energy Conversion Processes in a Light Harvesting Organelle at Atomic Detail Theoretical and Computational Biophysics Group Center for the Physics of Living Cells Beckman Institute for
More informationWedge A New Frontier for Pull-based Graph Processing. Samuel Grossman and Christos Kozyrakis Platform Lab Retreat June 8, 2018
Wedge A New Frontier for Pull-based Graph Processing Samuel Grossman and Christos Kozyrakis Platform Lab Retreat June 8, 2018 Graph Processing Problems modelled as objects (vertices) and connections between
More informationLeveraging Flash in HPC Systems
Leveraging Flash in HPC Systems IEEE MSST June 3, 2015 This work was performed under the auspices of the U.S. Department of Energy by under Contract DE-AC52-07NA27344. Lawrence Livermore National Security,
More informationDesigning parallel algorithms for constructing large phylogenetic trees on Blue Waters
Designing parallel algorithms for constructing large phylogenetic trees on Blue Waters Erin Molloy University of Illinois at Urbana Champaign General Allocation (PI: Tandy Warnow) Exploratory Allocation
More informationVMD: Immersive Molecular Visualization and Interactive Ray Tracing for Domes, Panoramic Theaters, and Head Mounted Displays
VMD: Immersive Molecular Visualization and Interactive Ray Tracing for Domes, Panoramic Theaters, and Head Mounted Displays John E. Stone Theoretical and Computational Biophysics Group Beckman Institute
More informationMaster Course in Computer Science Orientation day
Master Course in Computer Science Orientation day Info on the Department of Computer Science Ranked first (in its area) in 5-year Research Assessment by Ministry of University and Research 2013 e 2017
More informationToward Runtime Power Management of Exascale Networks by On/Off Control of Links
Toward Runtime Power Management of Exascale Networks by On/Off Control of Links, Nikhil Jain, Laxmikant Kale University of Illinois at Urbana-Champaign HPPAC May 20, 2013 1 Power challenge Power is a major
More informationDax: A Massively Threaded Visualiza5on and Analysis Toolkit for Extreme Scale
Dax: A Massively Threaded Visualiza5on and Analysis Toolkit for Extreme Scale GPU Technology Conference March 26, 2014 Kenneth Moreland Sandia Na5onal Laboratories Robert Maynard Kitware, Inc. Sandia National
More informationMicrogrid System Design and Economic Analysis Tools
Microgrid System Design and Economic Analysis Tools DOE Microgrid Workshop 30 August 2011 Jason Stamp, Ph.D. (Sandia National Laboratories) Michael Clark (Encorp) 1 Sandia National Laboratories is a multi-program
More informationCommunication for PIM-based Graph Processing with Efficient Data Partition. Mingxing Zhang, Youwei Zhuo (equal contribution),
GraphP: Reducing Communication for PIM-based Graph Processing with Efficient Data Partition Mingxing Zhang, Youwei Zhuo (equal contribution), Chao Wang, Mingyu Gao, Yongwei Wu, Kang Chen, Christos Kozyrakis,
More informationHarp-DAAL for High Performance Big Data Computing
Harp-DAAL for High Performance Big Data Computing Large-scale data analytics is revolutionizing many business and scientific domains. Easy-touse scalable parallel techniques are necessary to process big
More informationCACHE-GUIDED SCHEDULING
CACHE-GUIDED SCHEDULING EXPLOITING CACHES TO MAXIMIZE LOCALITY IN GRAPH PROCESSING Anurag Mukkara, Nathan Beckmann, Daniel Sanchez 1 st AGP Toronto, Ontario 24 June 2017 Graph processing is memory-bound
More informationSST + MacSim. Case Studies Using SST MacSim. Genie Hsieh Sandia National Labs
Photos placed in horizontal position with even amount of white space between photos and header SST + MacSim Case Studies Using SST MacSim Genie Hsieh Sandia National Labs Sandia National Laboratories is
More informationDistributed State Es.ma.on Algorithms for Electric Power Systems
Distributed State Es.ma.on Algorithms for Electric Power Systems Ariana Minot, Blue Waters Graduate Fellow Professor Na Li, Professor Yue M. Lu Harvard University, School of Engineering and Applied Sciences
More informationDevelopment Environments for HPC: The View from NCSA
Development Environments for HPC: The View from NCSA Jay Alameda National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign DEHPC 15 San Francisco, CA 18 October 2015 Acknowledgements
More informationLarge Data Visualization
Large Data Visualization Seven Lectures 1. Overview (this one) 2. Scalable parallel rendering algorithms 3. Particle data visualization 4. Vector field visualization 5. Visual analytics techniques for
More informationCenter Extreme Scale CS Research
Center Extreme Scale CS Research Center for Compressible Multiphase Turbulence University of Florida Sanjay Ranka Herman Lam Outline 10 6 10 7 10 8 10 9 cores Parallelization and UQ of Rocfun and CMT-Nek
More informationShallow Water Simulations on Graphics Hardware
Shallow Water Simulations on Graphics Hardware Ph.D. Thesis Presentation 2014-06-27 Martin Lilleeng Sætra Outline Introduction Parallel Computing and the GPU Simulating Shallow Water Flow Topics of Thesis
More informationA Comparative Study on Exact Triangle Counting Algorithms on the GPU
A Comparative Study on Exact Triangle Counting Algorithms on the GPU Leyuan Wang, Yangzihao Wang, Carl Yang, John D. Owens University of California, Davis, CA, USA 31 st May 2016 L. Wang, Y. Wang, C. Yang,
More information2. Definitions and notations. 3. Background and related work. 1. Introduction
Exploring Optimizations on Shared-memory Platforms for Parallel Triangle Counting Algorithms Ancy Sarah Tom, Narayanan Sundaram, Nesreen K. Ahmed, Shaden Smith, Stijn Eyerman, Midhunchandra Kodiyath, Ibrahim
More informationExtracting Hidden Messages in Steganographic Images
DIGITAL FORENSIC RESEARCH CONFERENCE Extracting Hidden Messages in Steganographic Images By Tu-Thach Quach Presented At The Digital Forensic Research Conference DFRWS 2014 USA Denver, CO (Aug 3 rd - 6
More informationEarly Evaluation of the "Infinite Memory Engine" Burst Buffer Solution
Early Evaluation of the "Infinite Memory Engine" Burst Buffer Solution Wolfram Schenck Faculty of Engineering and Mathematics, Bielefeld University of Applied Sciences, Bielefeld, Germany Salem El Sayed,
More informationWhen Graph Meets Big Data: Opportunities and Challenges
High Performance Graph Data Management and Processing (HPGDM 2016) When Graph Meets Big Data: Opportunities and Challenges Yinglong Xia Huawei Research America 11/13/2016 The International Conference for
More informationCommercial Data Intensive Cloud Computing Architecture: A Decision Support Framework
Association for Information Systems AIS Electronic Library (AISeL) CONF-IRM 2014 Proceedings International Conference on Information Resources Management (CONF-IRM) 2014 Commercial Data Intensive Cloud
More informationMinimizing Computation in Convolutional Neural Networks
Minimizing Computation in Convolutional Neural Networks Jason Cong and Bingjun Xiao Computer Science Department, University of California, Los Angeles, CA 90095, USA {cong,xiao}@cs.ucla.edu Abstract. Convolutional
More informationPlanar: Parallel Lightweight Architecture-Aware Adaptive Graph Repartitioning
Planar: Parallel Lightweight Architecture-Aware Adaptive Graph Repartitioning Angen Zheng, Alexandros Labrinidis, and Panos K. Chrysanthis University of Pittsburgh 1 Graph Partitioning Applications of
More informationIMPLEMENTATION OF THE. Alexander J. Yee University of Illinois Urbana-Champaign
SINGLE-TRANSPOSE IMPLEMENTATION OF THE OUT-OF-ORDER 3D-FFT Alexander J. Yee University of Illinois Urbana-Champaign The Problem FFTs are extremely memory-intensive. Completely bound by memory access. Memory
More informationThe Constellation Project. Andrew W. Nash 14 November 2016
The Constellation Project Andrew W. Nash 14 November 2016 The Constellation Project: Representing a High Performance File System as a Graph for Analysis The Titan supercomputer utilizes high performance
More informationNERSC Site Update. National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory. Richard Gerber
NERSC Site Update National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory Richard Gerber NERSC Senior Science Advisor High Performance Computing Department Head Cori
More informationMaintaining An Online Publication List
Maintaining An Online Publication List Tamara G. Kolda Sandia National Labs Webpage Expert* * Self-proclaimed Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation,
More informationMulti-GPU Graph Analytics
2017 IEEE International Parallel and Distributed Processing Symposium Multi-GPU Graph Analytics Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, and John D. Owens University of California, Davis Email:
More informationFASCIA. Fast Approximate Subgraph Counting and Enumeration. 2 Oct Scalable Computing Laboratory The Pennsylvania State University 1 / 28
FASCIA Fast Approximate Subgraph Counting and Enumeration George M. Slota Kamesh Madduri Scalable Computing Laboratory The Pennsylvania State University 2 Oct. 2013 1 / 28 Overview Background Motivation
More informationHarnessing GPU speed to accelerate LAMMPS particle simulations
Harnessing GPU speed to accelerate LAMMPS particle simulations Paul S. Crozier, W. Michael Brown, Peng Wang pscrozi@sandia.gov, wmbrown@sandia.gov, penwang@nvidia.com SC09, Portland, Oregon November 18,
More informationExamples of Big Data analytics in ENEA: data sources and information extraction strategies
Examples of Big Data analytics in ENEA: data sources and information extraction strategies Ing. Giovanni Ponti, PhD ENEA DTE-ICT-HPC giovanni.ponti@enea.it DISRUPTIVE DATA 2017 5 Maggio, 2017, Via Santa
More informationAn Execution Strategy and Optimized Runtime Support for Parallelizing Irregular Reductions on Modern GPUs
An Execution Strategy and Optimized Runtime Support for Parallelizing Irregular Reductions on Modern GPUs Xin Huo, Vignesh T. Ravi, Wenjing Ma and Gagan Agrawal Department of Computer Science and Engineering
More informationBlue Waters Local Software To Be Released: Module Improvements and Parfu Parallel Archive Tool
November 15, 16 2016 Blue Waters Local Software To Be Released: Module Improvements and Parfu Parallel Archive Tool Craig P Steffen csteffen@ncsa.illinois.edu Blue Waters Science and Engineering Applications
More informationA CASE STUDY OF COMMUNICATION OPTIMIZATIONS ON 3D MESH INTERCONNECTS
A CASE STUDY OF COMMUNICATION OPTIMIZATIONS ON 3D MESH INTERCONNECTS Abhinav Bhatele, Eric Bohm, Laxmikant V. Kale Parallel Programming Laboratory Euro-Par 2009 University of Illinois at Urbana-Champaign
More informationWalk The Walk Social Media
Walk The Walk Social Media The Social Media Quiz 1. How many Facebook accounts are there in the world? a) 1.2 billion b) 540 million c) 120 million d) 53 million e) 10 million 2. Which do you think is
More informationIntegrating Analysis and Computation with Trios Services
October 31, 2012 Integrating Analysis and Computation with Trios Services Approved for Public Release: SAND2012-9323P Ron A. Oldfield Scalable System Software Sandia National Laboratories Albuquerque,
More informationDesigning High-Performance MPI Collectives in MVAPICH2 for HPC and Deep Learning
5th ANNUAL WORKSHOP 209 Designing High-Performance MPI Collectives in MVAPICH2 for HPC and Deep Learning Hari Subramoni Dhabaleswar K. (DK) Panda The Ohio State University The Ohio State University E-mail:
More informationTwo FPGA-DNN Projects: 1. Low Latency Multi-Layer Perceptrons using FPGAs 2. Acceleration of CNN Training on FPGA-based Clusters
Two FPGA-DNN Projects: 1. Low Latency Multi-Layer Perceptrons using FPGAs 2. Acceleration of CNN Training on FPGA-based Clusters *Argonne National Lab +BU & USTC Presented by Martin Herbordt Work by Ahmed
More informationMosaic: Processing a Trillion-Edge Graph on a Single Machine
Mosaic: Processing a Trillion-Edge Graph on a Single Machine Steffen Maass, Changwoo Min, Sanidhya Kashyap, Woonhak Kang, Mohan Kumar, Taesoo Kim Georgia Institute of Technology Best Student Paper @ EuroSys
More informationMachine Learning with Python
DEVNET-2163 Machine Learning with Python Dmitry Figol, SE WW Enterprise Sales @dmfigol Cisco Spark How Questions? Use Cisco Spark to communicate with the speaker after the session 1. Find this session
More informationHarwich Haven - Surrender to Sanctuary.
Harwich Haven - Surrender to Sanctuary. Website Specification 7 February 2018 Issued by David Cain david@nhscic.org on behalf of New Heritage Solutions C.I.C Office 33 Red Gables Ipswich Road Stowmarket
More informationRevolver: Vertex-centric Graph Partitioning Using Reinforcement Learning
Revolver: Vertex-centric Graph Partitioning Using Reinforcement Learning Mohammad Hasanzadeh Mofrad 1, Rami Melhem 1 and Mohammad Hammoud 2 1 University of Pittsburgh 2 Carnegie Mellon University Qatar
More informationOh, Exascale! The effect of emerging architectures on scien1fic discovery. Kenneth Moreland, Sandia Na1onal Laboratories
Photos placed in horizontal posi1on with even amount of white space between photos and header Oh, $#*@! Exascale! The effect of emerging architectures on scien1fic discovery Ultrascale Visualiza1on Workshop,
More informationDataSToRM: Data Science and Technology Research Environment
The Future of Advanced (Secure) Computing DataSToRM: Data Science and Technology Research Environment This material is based upon work supported by the Assistant Secretary of Defense for Research and Engineering
More informationGunrock: A Fast and Programmable Multi- GPU Graph Processing Library
Gunrock: A Fast and Programmable Multi- GPU Graph Processing Library Yangzihao Wang and Yuechao Pan with Andrew Davidson, Yuduo Wu, Carl Yang, Leyuan Wang, Andy Riffel and John D. Owens University of California,
More informationAn Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization
An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science
More informationAntonio Fernández Anta
Antonio Fernández Anta Joint work with Luis F. Chiroque, Héctor Cordobés, Rafael A. García Leiva, Philippe Morere, Lorenzo Ornella, Fernando Pérez, and Agustín Santos Recommendation Engines (RE) suggest
More information