Visualizing Distributed Memory Computations with Hive Plots. VizSec 2012, October 15, 2012, Seattle, Washington Sophie Engle and Sean Whalen
|
|
- Allen Sparks
- 5 years ago
- Views:
Transcription
1 Visualizing Distributed Memory Computations with Hive Plots Sophie Engle and Sean Whalen
2 2 Introduction
3 Introduction 3 High performance computing environments Used for scientific computing applications at several national laboratories Potential for misuse by insiders and outsiders Anomaly detection Determine normal versus abnormal behavior for these environments to prevent unauthorized use Can classify codes into computational dwarves to determine normal (Asanovic 2006)
4 Introduction 4 Several network measures can be used as features in classification Time consuming to calculate these measures Time consuming to compare how well these measures perform for classification Use visualization to choose network measures to use as classification features Which measures look similar for similar codes? Which measures look distinct for distinct codes?
5 5 Dataset
6 Original Dataset 6 Data collection Collected by NERSC at LBNL Used IPM to monitor MPI calls between ranks (captures communication between compute nodes) Dataset contents Total of 1681 IPM logs Covers 29 different scientific computing codes with varying ranks, parameters, and architectures
7 Original Dataset 7 Src,Dst,MPICall,Bytes,Repeats,Code 0,1,29,99856,52,cactus 0,4,29,99856,52,cactus 0,0,2,4,5,cactus 0,0,2,8,7,cactus 0,1,22,599136,26,cactus 0,-1,5,0,1,cactus 0,4,22,599136,26,cactus 0,16,29,99856,52,cactus
8 Original Dataset 8 Src,Dst,MPICall,Bytes,Repeats,Code 0,1,29,99856,52,cactus 0,4,29,99856,52,cactus 0,0,2,4,5,cactus 0,0,2,8,7,cactus 0,1,22,599136,26,cactus 0,-1,5,0,1,cactus 0,4,22,599136,26,cactus 0,16,29,99856,52,cactus
9 Subset Analyzed 9 Code Description Nodes Edges cactus astrophysics ij algebraic multi-grid milc lattice gauge theory namd molecular dynamics paratec materials science superlu sparse linear algebra tgyro magnetic fusion vasp materials science
10 Subset Analyzed 10 Code Description Nodes Edges cactus astrophysics ij algebraic multi-grid milc lattice gauge theory namd molecular dynamics paratec materials science superlu sparse linear algebra tgyro magnetic fusion vasp materials science
11 Network Measures 11 Measure degree betweenness closeness eccentricity page rank transitivity Description the number of adjacent edges number of shortest paths going through a node measures steps required to reach every other node shortest path distance from farthest node measures relative importance of node probability adjacent nodes are connected (clustering coefficient) Calculated in R using the igraph library.
12 12 Motivation
13 Traditional Degree CCDF Plot 13 cactus ij milc namd superlu tgyro
14 Traditional Degree CCDF Plot 14 Pros: Able to compare individual metrics across datasets Simple approach, widely used Cons: Contains no information on topology Lines look visually similar, may not be appropriate for generating visual signatures
15 Adjacency Matrices 15
16 Adjacency Matrices 16 Pros: Comparable across datasets Easy to see communication patterns Many distinct codes look distinct Cons: No information on metrics needed for classification
17 Issues Identified 17 Traditional CCDF plot does not convey any information about network topology Traditional adjacency matrices do not convey any information about network properties Traditional network layout algorithms are not repeatable or comparable across networks
18 18 Hive Plots
19 Introduction to Hive Plots 19 What are hive plots? Network layout algorithm using network properties for consistent node placement A radially-arranged parallel coordinate plot Why use hive plots? Repeatable, comparable network layouts Integration of network properties with topology
20 Understanding Hive Plots tgyro degree
21 Understanding Hive Plots 21 edges b/w nodes on same axis 32 primary axis duplicate axis edges b/w nodes on different axes node degree b/w 260 and 837 max axis value self loop tgyro degree degree ranges from 0 to 837 across datasets
22 Implementation 22 Existing implementations exist JHive (Java) HiveR (R) HiveGraph (webapp) Prototypes in Perl and d3.js Custom implementation in R and ggplot2 Implements grammar of graphics (Wilkinson) Polar plots to create hive plots Facets to create hive panels* Non-interactive
23 Implementation 23 evenly spaced nodes linear interpolation interpolation and jitter page rank (milc) page rank (cactus)
24 Implementation 24 consistent alpha 32 variable alpha 32 closeness (cactus) degree (superlu)
25 Hive Plot References 25 Hive Plots Rational Approach to Visualizing Networks by Martin Krzywinski, Inanc Birol, Steven JM Jones and Marco A Marra in Briefings in Bioinformatics, volume 13, issue 5, pages , 2012 Hive Plots: Rational Network Visualization Farewell to Hairballs by Martin Krzywinski at online Getting Into Visualization of Large Biological Data Sets by Martin Krzywinski, Inanc Birol, Steven Jones, Marco Marra in BioVis 2012 Posters, 2nd floor foyer, Sunday 8:30am Monday 5:55pm
26 26 Initial Results
27 Degree 27 cactus ij milc namd superlu tgyro
28 Betweenness 28 cactus ij milc namd superlu tgyro
29 Closeness 29 cactus ij milc namd superlu tgyro
30 Eccentricity 30 cactus ij milc namd superlu tgyro
31 Page Rank 31 cactus ij milc namd superlu tgyro
32 Transitivity 32 cactus ij milc namd superlu tgyro
33 Visually Distinct 33 cactus ij milc namd superlu tygro degree x x x x betweenness x x x x x x closeness x x x x eccentricity x x x page rank x x x x transitivity x x x x x x
34 Visually Distinct 34 cactus ij milc namd superlu tygro degree x x x x betweenness x x x x x x closeness x x x x eccentricity x x x page rank x x x x transitivity x x x x x x
35 35 Next Steps
36 Next Steps 36 Improve hive plot visualizations Explore variable-length axes Explore better axes assignment Incorporate more information from data set Multiple-edge connections Type of IPM calls Amount of data transmitted
37 Next Steps 37 Feature identification Compare hive plots for more distinct codes Compare hive plots for similar codes Identify features that visually distinguish codes Classification and anomaly detection Determine if features identified by visualization lead to better classifiers and anomaly detection
38 38 Conclusion
39 Summary 39 Motivation and goals Improve anomaly detection in HPC environments Improve classification of HPC codes Use exploratory visualization for feature selection
40 Summary 40 Motivation and goals Improve anomaly detection in HPC environments Improve classification of HPC codes Use exploratory visualization for feature selection Initial results Hive plots allow visual comparison of HPC codes Some features distinguish distinct HPC codes
41 References 41 Hive Plots Rational Approach to Visualizing Networks by Martin Krzywinski, Inanc Birol, Steven JM Jones and Marco A Marra in Briefings in Bioinformatics, volume 13, issue 5, pages , 2012 Network-Theoretic Classification of Parallel Computation Patterns by Sean Whalen, Sophie Engle, Sean Peisert, and Matt Bishop in International Journal of High Performance Computing Applications (IJHPCA), volume 26, number 2, pages , May 2012 Multiclass Classification of Distributed Memory Parallel Computations by Sean Whalen, Sean Peisert, and Matt Bishop to appear in Pattern Recognition Letters (PRL), 2012
42 Contact Information 42 Sophie Engle University of San Francisco Department of Computer Science w Sean Whalen Mount Sinai School of Medicine Institute for Genomics and Multiscale Biology w
43 Questions? 43
Visualizing Distributed Memory Computations with Hive Plots
Visualizing Distributed Memory Computations with Hive Plots ABSTRACT Sophie Engle University of San Francisco Harney Science Center, Room 532 2130 Fulton Street San Francisco, CA 94117-1080 sjengle@cs.usfca.edu
More informationMulticlass Classification of Distributed Memory Parallel Computations
Multiclass Classification of Distributed Memory Parallel Computations Sean Whalen a,, Sean Peisert b,c, Matt Bishop c a Computer Science Department, Columbia University, New York NY 10027 b Lawrence Berkeley
More informationAn Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization
An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science
More informationCIE L*a*b* color model
CIE L*a*b* color model To further strengthen the correlation between the color model and human perception, we apply the following non-linear transformation: with where (X n,y n,z n ) are the tristimulus
More informationSupervised Clustering of Yeast Gene Expression Data
Supervised Clustering of Yeast Gene Expression Data In the DeRisi paper five expression profile clusters were cited, each containing a small number (7-8) of genes. In the following examples we apply supervised
More informationBasics of Network Analysis
Basics of Network Analysis Hiroki Sayama sayama@binghamton.edu Graph = Network G(V, E): graph (network) V: vertices (nodes), E: edges (links) 1 Nodes = 1, 2, 3, 4, 5 2 3 Links = 12, 13, 15, 23,
More informationPerformance Analysis and Modeling of the SciDAC MILC Code on Four Large-scale Clusters
Performance Analysis and Modeling of the SciDAC MILC Code on Four Large-scale Clusters Xingfu Wu and Valerie Taylor Department of Computer Science, Texas A&M University Email: {wuxf, taylor}@cs.tamu.edu
More information/ Computational Genomics. Normalization
10-810 /02-710 Computational Genomics Normalization Genes and Gene Expression Technology Display of Expression Information Yeast cell cycle expression Experiments (over time) baseline expression program
More informationGeometry and Spatial Reasoning
Geometry and Spatial Reasoning Activity: TEKS: Overview: Materials: Exploring Reflections (7.7) Geometry and spatial reasoning. The student uses coordinate geometry to describe location on a plane. The
More informationAdvanced visualization techniques for Self-Organizing Maps with graph-based methods
Advanced visualization techniques for Self-Organizing Maps with graph-based methods Georg Pölzlbauer 1, Andreas Rauber 1, and Michael Dittenbach 2 1 Department of Software Technology Vienna University
More informationTutorial: Application MPI Task Placement
Tutorial: Application MPI Task Placement Juan Galvez Nikhil Jain Palash Sharma PPL, University of Illinois at Urbana-Champaign Tutorial Outline Why Task Mapping on Blue Waters? When to do mapping? How
More information1. Discovering Important Nodes through Graph Entropy The Case of Enron Database
1. Discovering Important Nodes through Graph Entropy The Case of Enron Email Database ACM KDD 2005 Chicago, Illinois. 2. Optimizing Video Search Reranking Via Minimum Incremental Information Loss ACM MIR
More informationDesigning parallel algorithms for constructing large phylogenetic trees on Blue Waters
Designing parallel algorithms for constructing large phylogenetic trees on Blue Waters Erin Molloy University of Illinois at Urbana Champaign General Allocation (PI: Tandy Warnow) Exploratory Allocation
More informationIntegrated Performance Monitoring: Understanding Applications and Workloads. David Skinner Snowbird, July 2008
Integrated Performance Monitoring: Understanding Applications and Workloads David Skinner Snowbird, July 2008 Outline Overview of IPM The needs of production HPC centers wrt. tools IPM design and implementation
More informationData Mining: Exploring Data
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar But we start with a brief discussion of the Friedman article and the relationship between Data
More information9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology
9/9/ I9 Introduction to Bioinformatics, Clustering algorithms Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Outline Data mining tasks Predictive tasks vs descriptive tasks Example
More informationCompute Node Linux: Overview, Progress to Date & Roadmap
Compute Node Linux: Overview, Progress to Date & Roadmap David Wallace Cray Inc ABSTRACT: : This presentation will provide an overview of Compute Node Linux(CNL) for the CRAY XT machine series. Compute
More informationNetwork-Theoretic Classification of Parallel Computation Patterns
Network-Theoretic Classification of Parallel Computation Patterns Sean Whalen Berkeley Lab and University of California, Davis shwhalen@ucdavis.edu Sean Peisert Berkeley Lab and University of California,
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3
Data Mining: Exploring Data Lecture Notes for Chapter 3 1 What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include
More informationPackage SOFIA. January 22, 2017
Package SOFIA January 22, 2017 Title Making Sophisticated and Aesthetical Figures in R Version 1.0 Author Luis Diaz-Garcia Maintainer Luis Diaz-Garcia Depends R (>= 2.14) Imports
More informationLecture Topic Projects
Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis techniques for non-spatial data Project #1 out 4 Data
More informationUsing Statistical Techniques to Improve the QC Process of Swell Noise Filtering
Using Statistical Techniques to Improve the QC Process of Swell Noise Filtering A. Spanos* (Petroleum Geo-Services) & M. Bekara (PGS - Petroleum Geo- Services) SUMMARY The current approach for the quality
More informationFunction approximation using RBF network. 10 basis functions and 25 data points.
1 Function approximation using RBF network F (x j ) = m 1 w i ϕ( x j t i ) i=1 j = 1... N, m 1 = 10, N = 25 10 basis functions and 25 data points. Basis function centers are plotted with circles and data
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar What is data exploration? A preliminary exploration of the data to better understand its characteristics.
More informationData Mining: Exploring Data. Lecture Notes for Data Exploration Chapter. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Data Exploration Chapter Introduction to Data Mining by Tan, Steinbach, Karpatne, Kumar 02/03/2018 Introduction to Data Mining 1 What is data exploration?
More informationFigure (5) Kohonen Self-Organized Map
2- KOHONEN SELF-ORGANIZING MAPS (SOM) - The self-organizing neural networks assume a topological structure among the cluster units. - There are m cluster units, arranged in a one- or two-dimensional array;
More informationUnsupervised Learning
Unsupervised Learning Unsupervised learning Until now, we have assumed our training samples are labeled by their category membership. Methods that use labeled samples are said to be supervised. However,
More informationYou submitted this quiz on Sat 17 May :19 AM CEST. You got a score of out of
uiz Feedback Coursera 1 of 7 01/06/2014 20:02 Feedback Week 2 Quiz Help You submitted this quiz on Sat 17 May 2014 11:19 AM CEST. You got a score of 10.00 out of 10.00. Question 1 Under the lattice graphics
More informationTopology, Bandwidth and Performance: A New Approach in Linear Orderings for Application Placement in a 3D Torus
Topology, Bandwidth and Performance: A New Approach in Linear Orderings for Application Placement in a 3D Torus Carl Albing Cray Inc., University of Reading; Norm Troullier, Stephen Whalen, Ryan Olson,
More informationFinal Exam. Advanced Methods for Data Analysis (36-402/36-608) Due Thursday May 8, 2014 at 11:59pm
Final Exam Advanced Methods for Data Analysis (36-402/36-608) Due Thursday May 8, 2014 at 11:59pm Instructions: you will submit this take-home final exam in three parts. 1. Writeup. This will be a complete
More informationClustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York
Clustering Robert M. Haralick Computer Science, Graduate Center City University of New York Outline K-means 1 K-means 2 3 4 5 Clustering K-means The purpose of clustering is to determine the similarity
More informationDesign Theory V Boo Virk Simon Andrews.
Design Theory V2017-06 Boo Virk Simon Andrews boo.virk@babraham.ac.uk simon.andrews@babraham.ac.uk Why does good design matter? Good design makes a great first impression Good design makes for effective
More informationHidden Markov Model for Sequential Data
Hidden Markov Model for Sequential Data Dr.-Ing. Michelle Karg mekarg@uwaterloo.ca Electrical and Computer Engineering Cheriton School of Computer Science Sequential Data Measurement of time series: Example:
More informationAn Empirical Study of Lazy Multilabel Classification Algorithms
An Empirical Study of Lazy Multilabel Classification Algorithms E. Spyromitros and G. Tsoumakas and I. Vlahavas Department of Informatics, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
More informationEquations of planes in
Roberto s Notes on Linear Algebra Chapter 6: Lines, planes and other straight objects Section Equations of planes in What you need to know already: What vectors and vector operations are. What linear systems
More informationBig Data Management and NoSQL Databases
NDBI040 Big Data Management and NoSQL Databases Lecture 10. Graph databases Doc. RNDr. Irena Holubova, Ph.D. holubova@ksi.mff.cuni.cz http://www.ksi.mff.cuni.cz/~holubova/ndbi040/ Graph Databases Basic
More informationCluster Validity Classification Approaches Based on Geometric Probability and Application in the Classification of Remotely Sensed Images
Sensors & Transducers 04 by IFSA Publishing, S. L. http://www.sensorsportal.com Cluster Validity ification Approaches Based on Geometric Probability and Application in the ification of Remotely Sensed
More informationFODAVA Partners Leland Wilkinson (SYSTAT & UIC) Robert Grossman (UIC) Adilson Motter (Northwestern) Anushka Anand, Troy Hernandez (UIC)
FODAVA Partners Leland Wilkinson (SYSTAT & UIC) Robert Grossman (UIC) Adilson Motter (Northwestern) Anushka Anand, Troy Hernandez (UIC) Visually-Motivated Characterizations of Point Sets Embedded in High-Dimensional
More informationECM A Novel On-line, Evolving Clustering Method and Its Applications
ECM A Novel On-line, Evolving Clustering Method and Its Applications Qun Song 1 and Nikola Kasabov 2 1, 2 Department of Information Science, University of Otago P.O Box 56, Dunedin, New Zealand (E-mail:
More informationSome Graph Theory for Network Analysis. CS 249B: Science of Networks Week 01: Thursday, 01/31/08 Daniel Bilar Wellesley College Spring 2008
Some Graph Theory for Network Analysis CS 9B: Science of Networks Week 0: Thursday, 0//08 Daniel Bilar Wellesley College Spring 008 Goals this lecture Introduce you to some jargon what we call things in
More informationUsing the RCircos Package
Using the RCircos Package Hongen Zhang, Ph.D. Genetics Branch, Center for Cancer Research, National Cancer Institute, NIH August 01, 2016 Contents 1 Introduction 1 2 Input Data Format 2 3 Plot Track Layout
More informationUnit 2: Graphs and Matrices. ICPSR University of Michigan, Ann Arbor Summer 2015 Instructor: Ann McCranie
Unit 2: Graphs and Matrices ICPSR University of Michigan, Ann Arbor Summer 2015 Instructor: Ann McCranie Four main ways to notate a social network There are a variety of ways to mathematize a social network,
More informationSubgraph Isomorphism. Artem Maksov, Yong Li, Reese Butler 03/04/2015
Subgraph Isomorphism Artem Maksov, Yong Li, Reese Butler 03/04/2015 Similar Graphs The two graphs below look different but are structurally the same. Definition What is Graph Isomorphism? An isomorphism
More informationRecognition of online captured, handwritten Tamil words on Android
Recognition of online captured, handwritten Tamil words on Android A G Ramakrishnan and Bhargava Urala K Medical Intelligence and Language Engineering (MILE) Laboratory, Dept. of Electrical Engineering,
More informationData Visualization. Fall 2016
Data Visualization Fall 2016 Information Visualization Upon now, we dealt with scientific visualization (scivis) Scivisincludes visualization of physical simulations, engineering, medical imaging, Earth
More informationWeek 7 Picturing Network. Vahe and Bethany
Week 7 Picturing Network Vahe and Bethany Freeman (2005) - Graphic Techniques for Exploring Social Network Data The two main goals of analyzing social network data are identification of cohesive groups
More informationNOVEL APPROACH FOR COMPARING SIMILARITY VECTORS IN IMAGE RETRIEVAL
NOVEL APPROACH FOR COMPARING SIMILARITY VECTORS IN IMAGE RETRIEVAL Abstract In this paper we will present new results in image database retrieval, which is a developing field with growing interest. In
More informationMHPE 494: Data Analysis. Welcome! The Analytic Process
MHPE 494: Data Analysis Alan Schwartz, PhD Department of Medical Education Memoona Hasnain,, MD, PhD, MHPE Department of Family Medicine College of Medicine University of Illinois at Chicago Welcome! Your
More informationRadial Basis Function (RBF) Neural Networks Based on the Triple Modular Redundancy Technology (TMR)
Radial Basis Function (RBF) Neural Networks Based on the Triple Modular Redundancy Technology (TMR) Yaobin Qin qinxx143@umn.edu Supervisor: Pro.lilja Department of Electrical and Computer Engineering Abstract
More informationAN INTRODUCTION TO LATTICE GRAPHICS IN R
AN INTRODUCTION TO LATTICE GRAPHICS IN R William G. Jacoby Michigan State University ICPSR Summer Program July 27-28, 2016 I. Introductory Thoughts about Lattice Graphics A. The lattice package 1. Created
More informationAutomatic Document; Retrieval Systems. The conventional library classifies documents by numeric subject codes which are assigned manually (Dewey
I. Automatic Document; Retrieval Systems The conventional library classifies documents by numeric subject codes which are assigned manually (Dewey decimal aystemf Library of Congress system). Cross-indexing
More informationNetwork community detection with edge classifiers trained on LFR graphs
Network community detection with edge classifiers trained on LFR graphs Twan van Laarhoven and Elena Marchiori Department of Computer Science, Radboud University Nijmegen, The Netherlands Abstract. Graphs
More informationOverview. Idea: Reduce CPU clock frequency This idea is well suited specifically for visualization
Exploring Tradeoffs Between Power and Performance for a Scientific Visualization Algorithm Stephanie Labasan & Matt Larsen (University of Oregon), Hank Childs (Lawrence Berkeley National Laboratory) 26
More informationMaximizing Throughput on a Dragonfly Network
Maximizing Throughput on a Dragonfly Network Nikhil Jain, Abhinav Bhatele, Xiang Ni, Nicholas J. Wright, Laxmikant V. Kale Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana,
More informationmpmorfsdb: A database of Molecular Recognition Features (MoRFs) in membrane proteins. Introduction
mpmorfsdb: A database of Molecular Recognition Features (MoRFs) in membrane proteins. Introduction Molecular Recognition Features (MoRFs) are short, intrinsically disordered regions in proteins that undergo
More informationClustering and Visualisation of Data
Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some
More informationCross Reference Strategies for Cooperative Modalities
Cross Reference Strategies for Cooperative Modalities D.SRIKAR*1 CH.S.V.V.S.N.MURTHY*2 Department of Computer Science and Engineering, Sri Sai Aditya institute of Science and Technology Department of Information
More informationUNIVERSITY OF OSLO. Faculty of Mathematics and Natural Sciences
UNIVERSITY OF OSLO Faculty of Mathematics and Natural Sciences Exam: INF 4300 / INF 9305 Digital image analysis Date: Thursday December 21, 2017 Exam hours: 09.00-13.00 (4 hours) Number of pages: 8 pages
More informationRestricted Nearest Feature Line with Ellipse for Face Recognition
Journal of Information Hiding and Multimedia Signal Processing c 2012 ISSN 2073-4212 Ubiquitous International Volume 3, Number 3, July 2012 Restricted Nearest Feature Line with Ellipse for Face Recognition
More informationPATTERN RECOGNITION USING NEURAL NETWORKS
PATTERN RECOGNITION USING NEURAL NETWORKS Santaji Ghorpade 1, Jayshree Ghorpade 2 and Shamla Mantri 3 1 Department of Information Technology Engineering, Pune University, India santaji_11jan@yahoo.co.in,
More informationAmazon Web Services: Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud
Amazon Web Services: Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud Summarized by: Michael Riera 9/17/2011 University of Central Florida CDA5532 Agenda
More informationAS scientific computing matures, the demands for computational
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 1 Communication Requirements and Interconnect Optimization for High-End Scientific Applications Shoaib Kamil, Leonid Oliker, Ali Pinar, John Shalf
More informationIntroduction to Relational Databases
Introduction to Relational Databases Third La Serena School for Data Science: Applied Tools for Astronomy August 2015 Mauro San Martín msmartin@userena.cl Universidad de La Serena Contents Introduction
More informationNearest Neighbor Predictors
Nearest Neighbor Predictors September 2, 2018 Perhaps the simplest machine learning prediction method, from a conceptual point of view, and perhaps also the most unusual, is the nearest-neighbor method,
More informationAlignment of Trees and Directed Acyclic Graphs
Alignment of Trees and Directed Acyclic Graphs Gabriel Valiente Algorithms, Bioinformatics, Complexity and Formal Methods Research Group Technical University of Catalonia Computational Biology and Bioinformatics
More informationFinding sets of annotations that frequently co-occur in a list of genes
Finding sets of annotations that frequently co-occur in a list of genes In this document we illustrate, with a straightforward example, the GENECODIS algorithm for finding sets of annotations that frequently
More informationMultiple Dimensional Visualization
Multiple Dimensional Visualization Dimension 1 dimensional data Given price information of 200 or more houses, please find ways to visualization this dataset 2-Dimensional Dataset I also know the distances
More informationTESLA P100 PERFORMANCE GUIDE. Deep Learning and HPC Applications
TESLA P PERFORMANCE GUIDE Deep Learning and HPC Applications SEPTEMBER 217 TESLA P PERFORMANCE GUIDE Modern high performance computing (HPC) data centers are key to solving some of the world s most important
More informationComputational Discrete Mathematics
Computational Discrete Mathematics Combinatorics and Graph Theory with Mathematica SRIRAM PEMMARAJU The University of Iowa STEVEN SKIENA SUNY at Stony Brook CAMBRIDGE UNIVERSITY PRESS Table of Contents
More informationThe role of Fisher information in primary data space for neighbourhood mapping
The role of Fisher information in primary data space for neighbourhood mapping H. Ruiz 1, I. H. Jarman 2, J. D. Martín 3, P. J. Lisboa 1 1 - School of Computing and Mathematical Sciences - Department of
More informationTHINKING VISUALLY: AN INTRODUCTION TO DATA & INFORMATION VISUALIZATION
THINKING VISUALLY: AN INTRODUCTION TO DATA & INFORMATION VISUALIZATION Learning Event for CES Ontario Li Ka Shing Knowledge Institute June 22, 2016 Jesse Carliner ACTING COMMUNICATIONS & REFERENCE LIBRARIAN
More informationISSN: ISO 9001:2008 Certified International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 2, Issue 4, July 2013
Offline Handwritten Signatures Classification Using Wavelets and Support Vector Machines Poornima G Patil, Ravindra S Hegadi Dayananda Sagar Institutions Bangalore, Solapur University Solapur Abstract
More informationMILC Performance Benchmark and Profiling. April 2013
MILC Performance Benchmark and Profiling April 2013 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the supporting
More informationFebruary 13, notebook
Module 12 Lesson 1: Graphing on the coordinate plane Lesson 2: Independent and dependent variables in tables and graphs Lesson 3: Writing equations from tables Lesson 4: Representing Algebraic relationships
More informationA Fast Approximated k Median Algorithm
A Fast Approximated k Median Algorithm Eva Gómez Ballester, Luisa Micó, Jose Oncina Universidad de Alicante, Departamento de Lenguajes y Sistemas Informáticos {eva, mico,oncina}@dlsi.ua.es Abstract. The
More informationTo make sense of data, you can start by answering the following questions:
Taken from the Introductory Biology 1, 181 lab manual, Biological Sciences, Copyright NCSU (with appreciation to Dr. Miriam Ferzli--author of this appendix of the lab manual). Appendix : Understanding
More informationAction Recognition & Categories via Spatial-Temporal Features
Action Recognition & Categories via Spatial-Temporal Features 华俊豪, 11331007 huajh7@gmail.com 2014/4/9 Talk at Image & Video Analysis taught by Huimin Yu. Outline Introduction Frameworks Feature extraction
More informationHOLY ANGEL UNIVERSITY COLLEGE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY WEB TECHNOLOGIES 1 COURSE SYLLABUS
HOLY ANGEL UNIVERSITY COLLEGE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY WEB TECHNOLOGIES 1 COURSE SYLLABUS Course Code : 6WEBTECH1 Prerequisite : N/A Course Credit : 3 Units (2 hours LEC,3 hours LAB)
More informationvisualizing q uantitative quantitative information information
visualizing quantitative information visualizing quantitative information martin krzywinski outline best practices of graphical data design data-to-ink ratio cartjunk circos the visual display of quantitative
More informationIntroductory Combinatorics
Introductory Combinatorics Third Edition KENNETH P. BOGART Dartmouth College,. " A Harcourt Science and Technology Company San Diego San Francisco New York Boston London Toronto Sydney Tokyo xm CONTENTS
More informationUnit 5 Test 2 MCC5.G.1 StudyGuide/Homework Sheet
Unit 5 Test 2 MCC5.G.1 StudyGuide/Homework Sheet Tuesday, February 19, 2013 (Crosswalk Coach Page 221) GETTING THE IDEA! An ordered pair is a pair of numbers used to locate a point on a coordinate plane.
More informationCP SC 8810 Data Visualization. Joshua Levine
CP SC 8810 Data Visualization Joshua Levine levinej@clemson.edu Lecture 15 Text and Sets Oct. 14, 2014 Agenda Lab 02 Grades! Lab 03 due in 1 week Lab 2 Summary Preferences on x-axis label separation 10
More informationExam Design and Analysis of Algorithms for Parallel Computer Systems 9 15 at ÖP3
UMEÅ UNIVERSITET Institutionen för datavetenskap Lars Karlsson, Bo Kågström och Mikael Rännar Design and Analysis of Algorithms for Parallel Computer Systems VT2009 June 2, 2009 Exam Design and Analysis
More informationLearning to Recognize Faces in Realistic Conditions
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationEvgeny Maksakov Advantages and disadvantages: Advantages and disadvantages: Advantages and disadvantages: Advantages and disadvantages:
Today Problems with visualizing high dimensional data Problem Overview Direct Visualization Approaches High dimensionality Visual cluttering Clarity of representation Visualization is time consuming Dimensional
More informationSupporting Information. High-Throughput, Algorithmic Determination of Nanoparticle Structure From Electron Microscopy Images
Supporting Information High-Throughput, Algorithmic Determination of Nanoparticle Structure From Electron Microscopy Images Christine R. Laramy, 1, Keith A. Brown, 2, Matthew N. O Brien, 2 and Chad. A.
More informationGeneric Fourier Descriptor for Shape-based Image Retrieval
1 Generic Fourier Descriptor for Shape-based Image Retrieval Dengsheng Zhang, Guojun Lu Gippsland School of Comp. & Info Tech Monash University Churchill, VIC 3842 Australia dengsheng.zhang@infotech.monash.edu.au
More informationTopological Classification of Data Sets without an Explicit Metric
Topological Classification of Data Sets without an Explicit Metric Tim Harrington, Andrew Tausz and Guillaume Troianowski December 10, 2008 A contemporary problem in data analysis is understanding the
More informationPreface MOTIVATION ORGANIZATION OF THE BOOK. Section 1: Basic Concepts of Graph Theory
xv Preface MOTIVATION Graph Theory as a well-known topic in discrete mathematics, has become increasingly under interest within recent decades. This is principally due to its applicability in a wide range
More informationPerformance and Energy Usage of Workloads on KNL and Haswell Architectures
Performance and Energy Usage of Workloads on KNL and Haswell Architectures Tyler Allen 1 Christopher Daley 2 Doug Doerfler 2 Brian Austin 2 Nicholas Wright 2 1 Clemson University 2 National Energy Research
More informationLecture 12: Graphs/Trees
Lecture 12: Graphs/Trees Information Visualization CPSC 533C, Fall 2009 Tamara Munzner UBC Computer Science Mon, 26 October 2009 1 / 37 Proposal Writeup Expectations project title (not just 533 Proposal
More informationVisualization Analysis & Design Full-Day Tutorial Session 1
Visualization Analysis & Design Full-Day Tutorial Session 1 Tamara Munzner Department of Computer Science University of British Columbia Sanger Institute / European Bioinformatics Institute June 2014,
More informationNetwork Thinking. Complexity: A Guided Tour, Chapters 15-16
Network Thinking Complexity: A Guided Tour, Chapters 15-16 Neural Network (C. Elegans) http://gephi.org/wp-content/uploads/2008/12/screenshot-celegans.png Food Web http://1.bp.blogspot.com/_vifbm3t8bou/sbhzqbchiei/aaaaaaaaaxk/rsc-pj45avc/
More informationDimension reduction : PCA and Clustering
Dimension reduction : PCA and Clustering By Hanne Jarmer Slides by Christopher Workman Center for Biological Sequence Analysis DTU The DNA Array Analysis Pipeline Array design Probe design Question Experimental
More informationHigh-Performance Scientific Computing
High-Performance Scientific Computing Instructor: Randy LeVeque TA: Grady Lemoine Applied Mathematics 483/583, Spring 2011 http://www.amath.washington.edu/~rjl/am583 World s fastest computers http://top500.org
More informationResearch Article International Journals of Advanced Research in Computer Science and Software Engineering ISSN: X (Volume-7, Issue-6)
International Journals of Advanced Research in Computer Science and Software Engineering ISSN: 77-18X (Volume-7, Issue-6) Research Article June 017 DDGARM: Dotlet Driven Global Alignment with Reduced Matrix
More informationCS395T paper review. Indoor Segmentation and Support Inference from RGBD Images. Chao Jia Sep
CS395T paper review Indoor Segmentation and Support Inference from RGBD Images Chao Jia Sep 28 2012 Introduction What do we want -- Indoor scene parsing Segmentation and labeling Support relationships
More informationLecture 1 (Part 1) Introduction/Overview
UMass Lowell Computer Science 91.503 Analysis of Algorithms Prof. Karen Daniels Fall, 2013 Lecture 1 (Part 1) Introduction/Overview Monday, 9/9/13 Web Page Web Page http://www.cs.uml.edu/~kdaniels/courses/alg_503_f13.html
More informationDictionary learning Based on Laplacian Score in Sparse Coding
Dictionary learning Based on Laplacian Score in Sparse Coding Jin Xu and Hong Man Department of Electrical and Computer Engineering, Stevens institute of Technology, Hoboken, NJ 73 USA Abstract. Sparse
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2013 http://ce.sharif.edu/courses/91-92/2/ce725-1/ Agenda Features and Patterns The Curse of Size and
More information