Energy- Aware Time Change Detec4on Using Synthe4c Aperture Radar On High- Performance Heterogeneous Architectures: A DDDAS Approach
|
|
- Hilary Phillips
- 6 years ago
- Views:
Transcription
1 Energy- Aware Time Change Detec4on Using Synthe4c Aperture Radar On High- Performance Heterogeneous Architectures: A DDDAS Approach Sanjay Ranka (PI) Sartaj Sahni (Co- PI) Mark Schmalz (Co- PI) University of Florida Department of CISE Gainesville, FL DDDAS Program PI Mee>ng 27 Jan 2016
2 1. Research Team 2. Technical Objec4ves 3. Programma4c Informa4on 4. Technical Approach & Results Simula>on Reconstruc>on Coherent Change Detec>on Complexity and Error Analysis Energy and Power Reduc>on 5. Ongoing and Future Work 6. Discussion Overview of Presenta4on 2
3 Research Team q Principal Inves>gator Sanjay Ranka, Ph.D. Research Interests: High- Performance Compu>ng Energy- Aware Compu>ng Big Data Analy>cs q Co- PI Sartaj Sahni, Ph.D. Research Interests: High- Performance Compu>ng Data Structures and Algorithms Signal & Image Processing q Co- PI Research Interests: Mark Schmalz, Ph.D, O.D. High- Performance Compu>ng Signal & Image Processing Simula>on, Error Analysis 3
4 Technical Objec4ves q Develop Energy- Efficient Algorithms for Change Detec4on in Video Synthe4c Aperture Radar (SAR) Imagery q Topics of Inves4ga4on Parallel Architectures (CPUs, GPUs, HMPs) Adap4ve Algorithms for Image Tiling Adap4ve Segmenta4on of SAR Pulse Dataset(s) Efficient SAR Image Reconstruc4on o STEEP Constraints: Space, Time, Error, Energy Profile, and Power Consump4on Change Detec4on Coherent or Incoherent? o Effects of Noise and Clu`er o Incomplete Data o Support for Object Detec4on, Segmenta4on and Recogni4on o Complexity, Efficiency (Time, Space, and Energy or Power Consump4on) 4
5 Technical Approach q Overview of Concept Change detec4on accepts two input images as well as a window size, then generates a difference map. DDDAS Approach: Output of change detector is input to algorithm that assigns spa>al resolu>on to image >les. 5
6 Backprojec4on is decomposed along (a) the output image dimension, where each processing device renders a >le using all the pulse data, or (b) where each processing device renders an en>re image using a subset of the pulses. Tiling is crucial to efficient algorithm performance the selec>on of op>mal >le size is done em- pirically. Pulse Data Selec4on is likewise key to re- construc>on algor- ithm efficiency. We prefetch pulses that contribute to each output (>led) pixel. 6
7 q DDDAS Approach Mul4- Resolu4on DDDAS Resolu4on and Scheduling 1. Master decomposes the problem into atoms (set of pulses to be rendered onto one >le of the output image). 2. Resolu4on Controller determines spa>al resolu>on for render- ing each image >le. 3. Master sends each atom to Mul4level Scheduler that balances load for hetero- geneous devices and maintains locality of access for efficiency. 7
8 q DDDAS Approach Intelligent Pulse Selec4on Four sta4c schemes for distribu4ng 100 iden4cal- resolu4on atoms, each with an equivalent pulse set, to 4 homogeneous processing devices. Dynamic range dimension par44oning redistributes pulse data to minimize com- munica>on cost as the sensor's viewing axis moves around the scene. 8
9 1. Simula4on Construct Experimental Pulse Datasets from Digital Eleva>on Maps Precisely Controlled Test Data Controllable Parameters to Facilitate Performance & Error Analysis 2. Reconstruc4on Mul>resolu>on Superposi>on Algorithm Developed at UF Pulse Dataset Segmenta>on and Output Image Tiling for Efficiency 3. Change Detec4on (CD) CD Algorithm Developed at UF Highlights Regions of Interest Isola>on of Regions Containing Moving Vegeta>on (high frequency ST variance) Construc>on of Mul>resolu>on (Pyramidal) Scene Representa>on Iden>fica>on of Future Target Movement Regions (by variance & context) Sta>s>cal ID and Tracking of Targets by Spa>otemporal (ST) Variance 9
10 4. Complexity and Error Analysis (sta%c : compile- >me and dynamic : run- >me) HIDEF VSAR Processing Algorithm Functional Analysis Computational Analysis Algorithm structure Procedure calls Operation mix Execution trace Complexity Work estimate VSAR HIDEF Algorithm Data Structures Data Structure Layout Memory Analysis Error Propagation Analysis Forward analysis Backward analysis I/O Channel Analysis Uncertainty quantification Architectural Constraints Array indexing - row-major - column-major - custom scan Partitioning/Connectivity Interleaving/Caching Bandwidth/Error rate Packet size & overhead Bandwidth / Connectivity Error characteristics Time-Space-Error Cost Analysis 10
11 5. Energy and Power Consump4on Reduc4on Basis: (1) Algorithmic Model + (2) Energy/Power Measurements Mul4resolu4on Structure Facilitates Region Segmenta4on! Non- Target Regions Reconstructed at Low Resolu4on o Example: Foliage and cover regions o Increased computa>onal efficiency o Decreased power consump>on! Probable Target Regions Reconstructed at Higher Resolu4on o Detail preserved to facilitate target recogni>on and tracking o Increases success of change detec>on algorithm applica>on o Target predic>on model (speed, direc>on) predicts range of posi>ons in next frame o Energy and power consump>on models constrain processing strategy for next frame o Result: Data- and processing- directed reduc>on of power consump>on 11
12 q Results: Simula4on Construct Experimental Pulse Datasets from Digital Eleva4on Maps Receiver Pulse Emi`er z Given: Bidirec>onal Reflec>vity Distribu>on Func>on (BRDF) Emiher Intensity I T ζ sr S Received Intensity I r = I BRDF(θ i ) Emiher Track S = (s 1, s 2,, s P ) S = T monostatic SAR Receiver Track T = (t 1, t 2,, t P ) S T bistatic SAR Number of Pulses P Simulated Ground Plane X x y Pulse Data Resolu>on (per pulse) N B bins Resolu>on NxN pixels of Reconstructed Image defined on X Frame Rate F Objec@ve: Construct pulse dataset D having N P pulses of N B bins each F P N B elements 12
13 q Results: Simula4on & Reconstruc4on Example 1. Construct Digital Eleva>on Map 2. Construct Experimental Pulse Datasets from Digital Eleva>on Maps 3. Compute DEM from Pulse Dataset using SAR Reconstruc>on Algorithm Image of DEM Monosta4c SAR Reconstruc4on with Flare Ar4facts Early Simula4on Results (Reconstruc4on) ζ sr = 45 degrees Emiher alt = 7,071m Receiver alt = 106m Lateral separa>on of central receiver from transmiher = 9,850m 600x600 pixel image 13
14 q Results: Preliminary Reconstruc4on Timing 14
15 q Results: Preliminary Reconstruc4on Timing (May 2015, cont d) 28.4 sec / frame ( pixels) GPU exe 4me 15
16 q Results: Current Algorithm Testbed Master Processor: CPU: Intel Xeon clocked at 2.0 GHz. Opera>ng system : Ubuntu LTS SMP Co- Processor: GPU: Nvidia Tesla C2050, 1 GPU processor with 448 CUDA cores, clocked at GHz GPU Memory 6GB RAM clocked at 1.5 GHz Current Reconstruc4on Timing Results (Jan 2016): Single GPU latency of 2.72 sec / frame [ pixels] Speedup (versus May 2015) = 28.4 sec / 2.72 sec = 10.4X 16
17 q Results: Reconstruc4on Timing, One Nvidia Fermi TM GPU Measure Algorithm Performance with & without External Overhead Execu4on + I/O Time, sec TileSz = 4 TileSz = 8 TileSz = 16 TileSz = 32 Result: Op4mal Tile Size = 16 pixels Image Dimension, pixels Execu4on + I/O Time, sec TileSz = 4 TileSz = 8 TileSz = 16 TileSz = sec / frm ( pixels) GPU exe 4me Image Dimension, pixels 17
18 q Results: Reconstruc4on Timing, Four Nvidia Fermi TM GPUs Measure Algorithm Performance with & without Charm++ Charm++ Slower: Why? ms setup >me - - Lacks automa>c data par>>oning that we have - - Our approach op>m- ized for SAR - - Charm performance improves with data size 18
19 q Results: Preliminary Change Detec4on Algorithm 1: 250m x 170m road scene (SAR) 2: Car traverses road from right to lep 3: Mean of 25 previous frames 4: Standard devia>on of 25 previous frames 19
20 q Results: Preliminary Change Detec4on Algorithm (cont d) False Posi4ve due to tree blowing in the wind. 5: Compute z- score per pixel 6: Apply 15x15- pixel window filter 7: Threshold by mean and st- dev 8: Apply algorithm framewise, per >le Increase Efficiency by focusing on hi- resolu>on areas (probable target) and contextual cues (e.g., car on road) 20
21 Preliminary Conclusions & Future Work q Accomplishments ü Improved Reconstruc4on Algorithm > 10X Faster than Previous ü Preliminary Change Detec4on Algorithm Developed & Tested ü SAR Simulator Developed for Precise Control in Test & Analysis ü Publica4ons (prepared in conjunc>on with this project): " Monograph: preliminary version sub- mihed to John Wiley & Sons John Wiley & Sons, Inc. Journal Paper! Mul4sta4c SAR submihed to IEEE Tr. AES (in revision) 21
22 Preliminary Conclusions & Future Work q Future Work (next 12 months) Ø Performance Op4miza4on: Reconstruc>on & Change Detec>on (RCD) Algorithms Ø Improve CD Algorithm Performance: á Detec>on Probability, â False Alarms Ø Enhancement of Mul4resolu4on Structures to Focus Processing for Efficiency o More Accurately Determine Probable Target Regions (High Resolu>on) o Increase False Alarm Rejec>on via Temporal Spectral Filtering (Low Resolu>on) Ø Measure & Improve: RCD Algorithms Energy Profiles and Power Consump>on Ø Enhance SAR/VSAR Simulator with Flare, Interference, & Diffrac>on Effects Ø Simula4on/Test/Analysis of RCD Algorithms for Distributed Networked Embedded Apps 22
23 Discussion 23
A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System
A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System Ilkay Al(ntas and Daniel Crawl San Diego Supercomputer Center UC San Diego Jianwu Wang UMBC WorDS.sdsc.edu Computa3onal
More informationHypergraph Sparsifica/on and Its Applica/on to Par//oning
Hypergraph Sparsifica/on and Its Applica/on to Par//oning Mehmet Deveci 1,3, Kamer Kaya 1, Ümit V. Çatalyürek 1,2 1 Dept. of Biomedical Informa/cs, The Ohio State University 2 Dept. of Electrical & Computer
More informationDeep Temporal Models (Benchmarks and Applica6ons Analysis)
Deep Temporal Models (Benchmarks and Applica6ons Analysis) Sek Chai SRI Interna6onal Presented at: NICE 2016, March 7, 2016 2016 SRI International Project Summary Goals Analyze Deep Temporal Models (DTMs).
More informationOp#mizing PGAS overhead in a mul#-locale Chapel implementa#on of CoMD
Op#mizing PGAS overhead in a mul#-locale Chapel implementa#on of CoMD Riyaz Haque and David F. Richards This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore
More informationOrigin- des*na*on Flow Measurement in High- Speed Networks
IEEE INFOCOM, 2012 Origin- des*na*on Flow Measurement in High- Speed Networks Tao Li Shigang Chen Yan Qiao Introduc*on (Defini*ons) Origin- des+na+on flow between two routers is the set of packets that
More informationIntroduc)on to Informa)on Visualiza)on
Introduc)on to Informa)on Visualiza)on Seeing the Science with Visualiza)on Raw Data 01001101011001 11001010010101 00101010100110 11101101011011 00110010111010 Visualiza(on Applica(on Visualiza)on on
More informationFORTISSIMO HPGA HIGH PERFORMANCE GEAR ANALYZER. Ing. Antonio Zippo, PhD PULSAR DYNAMICS - UniMORE
FORTISSIMO HPGA HIGH PERFORMANCE GEAR ANALYZER Ing. Antonio Zippo, PhD PULSAR DYNAMICS - UniMORE PARTNERS Applica?on Expert: Vibra?ons and PowerTrain Lab UNIMORE HPC Expert: CINECA Galileo Supercomputer
More informationApplica'on Aware Deadlock Free Oblivious Rou'ng
Applica'on Aware Deadlock Free Oblivious Rou'ng Michel Kinsy, Myong Hyo Cho, Tina Wen, Edward Suh (Cornell University), Marten van Dijk and Srinivas Devadas Massachuse(s Ins,tute of Technology Outline
More informationBigDataBench- S: An Open- source Scien6fic Big Data Benchmark Suite
BigDataBench- S: An Open- source Scien6fic Big Data Benchmark Suite Xinhui Tian, Shaopeng Dai, Zhihui Du, Wanling Gao, Rui Ren, Yaodong Cheng, Zhifei Zhang, Zhen Jia, Peijian Wang and Jianfeng Zhan INSTITUTE
More informationDeformable Part Models
Deformable Part Models References: Felzenszwalb, Girshick, McAllester and Ramanan, Object Detec@on with Discrimina@vely Trained Part Based Models, PAMI 2010 Code available at hkp://www.cs.berkeley.edu/~rbg/latent/
More informationINNOVATING CONFOCAL MICROSCOPY
Sportello Matema9co per l Industria Italiana, Dr. Maurizio Ceseri CrestOp9cs S.p.A., Dr. Andrea San3nelli Authors Sportello Matema9co per l Industria Italiana, Dr. Antonino Sgalambro CrestOp9cs S.p.A.,
More informationOvercoming the Barriers of Graphs on GPUs: Delivering Graph Analy;cs 100X Faster and 40X Cheaper
Overcoming the Barriers of Graphs on GPUs: Delivering Graph Analy;cs 100X Faster and 40X Cheaper November 18, 2015 Super Compu3ng 2015 The Amount of Graph Data is Exploding! Billion+ Edges! 2 Graph Applications
More informationOrganized Segmenta.on
Organized Segmenta.on Alex Trevor, Georgia Ins.tute of Technology PCL TUTORIAL @ICRA 13 Overview Mo.va.on Connected Component Algorithm Planar Segmenta.on & Refinement Euclidean Clustering Timing Results
More informationVideo- to- Video Face Matching: Establishing a Baseline for Unconstrained Face Recogni:on
Video- to- Video Face Matching: Establishing a Baseline for Unconstrained Face Recogni:on Lacey Best- Rowden, Brendan Klare, Joshua Klontz, and Anil K. Jain Biometrics: Theory, Applica:ons, and Systems
More informationCS395T Visual Recogni5on and Search. Gautam S. Muralidhar
CS395T Visual Recogni5on and Search Gautam S. Muralidhar Today s Theme Unsupervised discovery of images Main mo5va5on behind unsupervised discovery is that supervision is expensive Common tasks include
More informationPhoton Mapping. Photon Mapping. Why Map Photons? Sources. What is a Photon? Refrac=on of a Caus=c. Jan Kautz
Refrac=on of a Caus=c Photon Mapping Jan Kautz Monte Carlo ray tracing handles all paths of light: L(D S)*E, but not equally well Has difficulty sampling LS*DS*E paths, e.g. refrac=on of a caus=c Path
More informationExecu&on Templates: Caching Control Plane Decisions for Strong Scaling of Data Analy&cs
Execu&on Templates: Caching Control Plane Decisions for Strong Scaling of Data Analy&cs Omid Mashayekhi Hang Qu Chinmayee Shah Philip Levis July 13, 2017 2 Cloud Frameworks SQL Streaming Machine Learning
More informationNetwork Coding: Theory and Applica7ons
Network Coding: Theory and Applica7ons PhD Course Part IV Tuesday 9.15-12.15 18.6.213 Muriel Médard (MIT), Frank H. P. Fitzek (AAU), Daniel E. Lucani (AAU), Morten V. Pedersen (AAU) Plan Hello World! Intra
More informationW1005 Intro to CS and Programming in MATLAB. Brief History of Compu?ng. Fall 2014 Instructor: Ilia Vovsha. hip://www.cs.columbia.
W1005 Intro to CS and Programming in MATLAB Brief History of Compu?ng Fall 2014 Instructor: Ilia Vovsha hip://www.cs.columbia.edu/~vovsha/w1005 Computer Philosophy Computer is a (electronic digital) device
More informationDecentralized K-Means Clustering with Emergent Computing
Decentralized K-Means Clustering with Emergent Computing Ryan McCune & Greg Madey University of Notre Dame, Computer Science & Engineering Spring Simula?on Mul?- Conference 2014, Tampa, FL Student Colloquium
More informationLecture 7: Spectral Clustering; Linear Dimensionality Reduc:on via Principal Component Analysis
Lecture 7: Spectral Clustering; Linear Dimensionality Reduc:on via Principal Component Analysis Lester Mackey April, Stats 6B: Unsupervised Learning Blackboard discussion See lecture notes Spectral clustering
More informationOutline. In Situ Data Triage and Visualiza8on
In Situ Data Triage and Visualiza8on Kwan- Liu Ma University of California at Davis Outline In situ data triage and visualiza8on: Issues and strategies Case study: An earthquake simula8on Case study: A
More informationPhD in Computer And Control Engineering XXVII cycle. Torino February 27th, 2015.
PhD in Computer And Control Engineering XXVII cycle Torino February 27th, 2015. Parallel and reconfigurable systems are more and more used in a wide number of applica7ons and environments, ranging from
More informationEmbedding System Dynamics in Agent Based Models for Complex Adap;ve Systems
Embedding System Dynamics in Agent Based Models for Complex Adap;ve Systems Kiyan Ahmadizadeh, Maarika Teose, Carla Gomes, Yrjo Grohn, Steve Ellner, Eoin O Mahony, Becky Smith, Zhao Lu, Becky Mitchell
More informationGASPP: A GPU- Accelerated Stateful Packet Processing Framework
GASPP: A GPU- Accelerated Stateful Packet Processing Framework Giorgos Vasiliadis, FORTH- ICS, Greece Lazaros Koromilas, FORTH- ICS, Greece Michalis Polychronakis, Columbia University, USA So5ris Ioannidis,
More informationRapid Extraction and Updating Road Network from LIDAR Data
Rapid Extraction and Updating Road Network from LIDAR Data Jiaping Zhao, Suya You, Jing Huang Computer Science Department University of Southern California October, 2011 Research Objec+ve Road extrac+on
More informationToday s Class. High Dimensional Data & Dimensionality Reduc8on. Readings for This Week: Today s Class. Scien8fic Data. Misc. Personal Data 2/22/12
High Dimensional Data & Dimensionality Reduc8on Readings for This Week: Graphical Histories for Visualiza8on: Suppor8ng Analysis, Communica8on, and Evalua8on, Jeffrey Heer, Jock D. Mackinlay, Chris Stolte,
More informationBoW model. Textual data: Bag of Words model
BoW model Textual data: Bag of Words model With text, categoriza9on is the task of assigning a document to one or more categories based on its content. It is appropriate for: Detec9ng and indexing similar
More informationOp#mizing MapReduce for Highly- Distributed Environments
Op#mizing MapReduce for Highly- Distributed Environments Abhishek Chandra Associate Professor Department of Computer Science and Engineering University of Minnesota hep://www.cs.umn.edu/~chandra 1 Big
More informationA scalability comparison study of data management approaches for smart metering systems
A scalability comparison study of data management approaches for smart metering systems Houssem Chihoub, Chris.ne Collet Grenoble INP houssem.chihoub@imag.fr Journées Plateformes Clermont Ferrand 6-7 octobre
More informationHPCSoC Modeling and Simulation Implications
Department Name (View Master > Edit Slide 1) HPCSoC Modeling and Simulation Implications (Sharing three concerns from an academic research user perspective using free, open tools. Solutions left to the
More informationStarchart*: GPU Program Power/Performance Op7miza7on Using Regression Trees
Starchart*: GPU Program Power/Performance Op7miza7on Using Regression Trees Wenhao Jia, Princeton University Kelly A. Shaw, University of Richmond Margaret Martonosi, Princeton University *Sta7s7cal Tuning
More informationVision: Towards an Extensible App Ecosystem for Home Automa;on through Cloud- Offload
Vision: Towards an Extensible App Ecosystem for Home Automa;on through Cloud- Offload Yuichi Igarashi Hitachi Ltd., Yokohama Research Laboratory Kaustubh Joshi MaL Hiltunen Richard Schlich;ng AT&T Shannon
More informationVolume Visualiza0on. Today s Class. Grades & Homework feedback on Homework Submission Server
11/3/14 Volume Visualiza0on h3p://imgur.com/trjonqk h3p://i.imgur.com/zcjc9kp.jpg Today s Class Grades & Homework feedback on Homework Submission Server Everything except HW4 (didn t get to that yet) &
More informationTools zur Op+mierung eingebe2eter Mul+core- Systeme. Bernhard Bauer
Tools zur Op+mierung eingebe2eter Mul+core- Systeme Bernhard Bauer Agenda Mo+va+on So.ware Engineering & Mul5core Think Parallel Models Added Value Tooling Quo Vadis? The Mul5core Era Moore s Law: The
More informationCloud Computing WSU Dr. Bahman Javadi. School of Computing, Engineering and Mathematics
Cloud Computing Research @ WSU Dr. Bahman Javadi School of Computing, Engineering and Mathematics Research Team and Research Interests Team 4 Academic Staff 5 PhD Students 1 Master Student Resource Scheduling
More informationMobile speed enforcement and video surveillance
Mobile speed enforcement and video surveillance Market leading innovaon ProVida combines high quality digital surveillance with the Police approved speed measurement tool. For over 20 years the ProVida
More informationOLTP on Hadoop: Reviewing the first Hadoop- based TPC- C benchmarks
OLTP on Hadoop: Reviewing the first Hadoop- based TPC- C benchmarks Monte Zweben Co- Founder and Chief Execu6ve Officer John Leach Co- Founder and Chief Technology Officer September 30, 2015 The Tradi6onal
More informationPor$ng Monte Carlo Algorithms to the GPU. Ryan Bergmann UC Berkeley Serpent Users Group Mee$ng 9/20/2012 Madrid, Spain
Por$ng Monte Carlo Algorithms to the GPU Ryan Bergmann UC Berkeley Serpent Users Group Mee$ng 9/20/2012 Madrid, Spain 1 Outline Introduc$on to GPUs Why they are interes$ng How they operate Pros and cons
More informationLUMOS. A Framework with Analy1cal Models for Heterogeneous Architectures. Liang Wang, and Kevin Skadron (University of Virginia)
LUMOS A Framework with Analy1cal Models for Heterogeneous Architectures Liang Wang, and Kevin Skadron (University of Virginia) What is LUMOS A set of first- order analy1cal models targe1ng heterogeneous
More informationPrivacy-Preserving Shortest Path Computa6on
Privacy-Preserving Shortest Path Computa6on David J. Wu, Joe Zimmerman, Jérémy Planul, and John C. Mitchell Stanford University Naviga6on desired des@na@on current posi@on Naviga6on: A Solved Problem?
More informationBroadcas(ng Video in Dense g Networks Using Applica(on FEC and Mul(cast
Broadcas(ng Video in Dense 802.11g Networks Using Applica(on FEC and Mul(cast Last update: 6-10-2011 Dr James Martin School of Computing Clemson University Clemson, SC jim.martin@cs.clemson.edu Dr James
More informationProbabilis)c Temporal Inference on Reconstructed 3D Scenes
Probabilis)c Temporal Inference on Reconstructed 3D Scenes Grant Schindler Frank Dellaert Georgia Ins)tute of Technology The World Changes Over Time How can we reason about )me in structure from mo)on
More informationPredic'ng ALS Progression with Bayesian Addi've Regression Trees
Predic'ng ALS Progression with Bayesian Addi've Regression Trees Lilly Fang and Lester Mackey November 13, 2012 RECOMB Conference on Regulatory and Systems Genomics The ALS Predic'on Prize Challenge: Predict
More informationAr#ficial Intelligence
Ar#ficial Intelligence Advanced Searching Prof Alexiei Dingli Gene#c Algorithms Charles Darwin Genetic Algorithms are good at taking large, potentially huge search spaces and navigating them, looking for
More informationMobile Speed Enforcement and Video Surveillance
Mobile Speed Enforcement and Video Surveillance Market Leading Innovaon ProVida is the world s leading speed enforcement and mobile surveillance brand with over 4,000 systems sold worldwide For over 20
More informationCrowdsourcing the Acquisi3on and Analysis of Mobile Videos for Disaster Response
IEEE Big Data 2015, October 31, 2015 Crowdsourcing the Acquisi3on and Analysis of Mobile Videos for Disaster Response Presented by Hien To Dr. Seon Ho Kim Integrated Media Systems Center University of
More informationOp#miza#on of an On- Board Imaging System for Rapid Radiotherapy. Erica Kemmerling, Meng Wu, He Yang, and Rebecca Fahrig
Op#miza#on of an On- Board Imaging System for Rapid Radiotherapy Erica Kemmerling, Meng Wu, He Yang, and Rebecca Fahrig Problem: mo#on during treatment Two overlapping thorax views during normal breathing
More informationhigh performance medical reconstruction using stream programming paradigms
high performance medical reconstruction using stream programming paradigms This Paper describes the implementation and results of CT reconstruction using Filtered Back Projection on various stream programming
More informationOp#miza#on of a Fast CT System for More Accurate Radiotherapy. Erica Cherry, Meng Wu, and Rebecca Fahrig
Op#miza#on of a Fast CT System for More Accurate Radiotherapy Erica Cherry, Meng Wu, and Rebecca Fahrig Problem: mo#on during treatment Two overlapping thorax views during normal breathing liver lung spine
More informationTerraSwarm. A Machine Learning and Op0miza0on Toolkit for the Swarm. Ilge Akkaya, Shuhei Emoto, Edward A. Lee. University of California, Berkeley
TerraSwarm A Machine Learning and Op0miza0on Toolkit for the Swarm Ilge Akkaya, Shuhei Emoto, Edward A. Lee University of California, Berkeley TerraSwarm Tools Telecon 17 November 2014 Sponsored by the
More informationAsaf Cidon, Assaf Eisenman, Mohammad Alizadeh and Sachin KaH
Cli$anger: Scaling Performance Cliffs in Memory Caches [NSDI 2016] Cache OS: Data Center Dynamic Cache Management Asaf Cidon, Assaf Eisenman, Mohammad Alizadeh and Sachin KaH 1 Key-Value Caches are Essen1al
More informationHow about imaging moving subjects at low signal? Fast snapshot imaging
4/15/13 How to image a moving subject? Mo)on- Robust Super- Resolu)on Magne)c Resonance Imaging By fast shuoer speed: snapshot imaging ISBI 2013 San Francisco, CA Part IV: Mo+on- Robust Super- Resolu+on
More informationAmol Deshpande, University of Maryland Lisa Hellerstein, Polytechnic University, Brooklyn
Amol Deshpande, University of Maryland Lisa Hellerstein, Polytechnic University, Brooklyn Mo>va>on: Parallel Query Processing Increasing parallelism in compu>ng Shared nothing clusters, mul> core technology,
More informationFast BVH Construction on GPUs
Fast BVH Construction on GPUs Published in EUROGRAGHICS, (2009) C. Lauterbach, M. Garland, S. Sengupta, D. Luebke, D. Manocha University of North Carolina at Chapel Hill NVIDIA University of California
More informationAbout the Course. Reading List. Assignments and Examina5on
Uppsala University Department of Linguis5cs and Philology About the Course Introduc5on to machine learning Focus on methods used in NLP Decision trees and nearest neighbor methods Linear models for classifica5on
More informationX shooter pipeline reduc1ons. In 7 (x3) easy steps
X shooter pipeline reduc1ons In 7 (x3) easy steps Cascade X shooter headers, tools Example Run chain(vis) Overview Differences VIS/UVB/NIR chains Examples Outputs Observing strategy : To nod or not to
More informationSeman&c Aware Anomaly Detec&on in Real World Parking Data
Seman&c Aware Anomaly Detec&on in Real World Parking Data Arnamoy Bha+acharyya 1, Weihan Wang 2, Chris&ne Tsang 2, Cris&ana Amza 1 1 University of Toronto, 2 Smarking Inc Mo&va&on Mo&va&on heps://www.engadget.com/2017/01/17/google-
More informationSearch Engines. Informa1on Retrieval in Prac1ce. Annota1ons by Michael L. Nelson
Search Engines Informa1on Retrieval in Prac1ce Annota1ons by Michael L. Nelson All slides Addison Wesley, 2008 Evalua1on Evalua1on is key to building effec$ve and efficient search engines measurement usually
More informationNetSlices: Scalable Mul/- Core Packet Processing in User- Space
NetSlices: Scalable Mul/- Core Packet Processing in - Space Tudor Marian, Ki Suh Lee, Hakim Weatherspoon Cornell University Presented by Ki Suh Lee Packet Processors Essen/al for evolving networks Sophis/cated
More informationGPU Cluster Computing. Advanced Computing Center for Research and Education
GPU Cluster Computing Advanced Computing Center for Research and Education 1 What is GPU Computing? Gaming industry and high- defini3on graphics drove the development of fast graphics processing Use of
More informationPhysis: An Implicitly Parallel Framework for Stencil Computa;ons
Physis: An Implicitly Parallel Framework for Stencil Computa;ons Naoya Maruyama RIKEN AICS (Formerly at Tokyo Tech) GTC12, May 2012 1 è Good performance with low programmer produc;vity Mul;- GPU Applica;on
More informationCtrl-C: Instruction-Aware Control Loop Based Adaptive Cache Bypassing for GPUs
The 34 th IEEE International Conference on Computer Design Ctrl-C: Instruction-Aware Control Loop Based Adaptive Cache Bypassing for GPUs Shin-Ying Lee and Carole-Jean Wu Arizona State University October
More informationSoft GPGPUs for Embedded FPGAS: An Architectural Evaluation
Soft GPGPUs for Embedded FPGAS: An Architectural Evaluation 2nd International Workshop on Overlay Architectures for FPGAs (OLAF) 2016 Kevin Andryc, Tedy Thomas and Russell Tessier University of Massachusetts
More informationEfficient Memory and Bandwidth Management for Industrial Strength Kirchhoff Migra<on
Efficient Memory and Bandwidth Management for Industrial Strength Kirchhoff Migra
More informationDetec%ng Wildlife in Uncontrolled Outdoor Video using Convolu%onal Neural Networks
Detec%ng Wildlife in Uncontrolled Outdoor Video using Convolu%onal Neural Networks Connor Bowley *, Alicia Andes +, Susan Ellis-Felege +, Travis Desell * Department of Computer Science * Department of
More informationThe Case for Heterogeneous HTAP
The Case for Heterogeneous HTAP Raja Appuswamy, Manos Karpathiotakis, Danica Porobic, and Anastasia Ailamaki Data-Intensive Applications and Systems Lab EPFL 1 HTAP the contract with the hardware Hybrid
More informationProgram Op*miza*on and Analysis. Chenyang Lu CSE 467S
Program Op*miza*on and Analysis Chenyang Lu CSE 467S 1 Program Transforma*on op#mize Analyze HLL compile assembly assemble Physical Address Rela5ve Address assembly object load executable link Absolute
More informationVirtualization. Introduction. Why we interested? 11/28/15. Virtualiza5on provide an abstract environment to run applica5ons.
Virtualization Yifu Rong Introduction Virtualiza5on provide an abstract environment to run applica5ons. Virtualiza5on technologies have a long trail in the history of computer science. Why we interested?
More informationLife and Medical Biology Data Accelerator (Lambda)
Life and Medical Biology Data Accelerator (Lambda) : Guangming Tan Ins>tute of Compu>ng Technology, Chinese Academy of Sciences 1 Biological Imaging Data Challenge Higher Resolu>on GAP: O(years)! High-Throughput
More informationCSE 591/392: GPU Programming. Introduction. Klaus Mueller. Computer Science Department Stony Brook University
CSE 591/392: GPU Programming Introduction Klaus Mueller Computer Science Department Stony Brook University First: A Big Word of Thanks! to the millions of computer game enthusiasts worldwide Who demand
More informationAccelerating Data Warehousing Applications Using General Purpose GPUs
Accelerating Data Warehousing Applications Using General Purpose s Sponsors: Na%onal Science Founda%on, LogicBlox Inc., IBM, and NVIDIA The General Purpose is a many core co-processor 10s to 100s of cores
More informationPredic've Modeling in a Polyhedral Op'miza'on Space
Predic've Modeling in a Polyhedral Op'miza'on Space Eunjung EJ Park 1, Louis- Noël Pouchet 2, John Cavazos 1, Albert Cohen 3, and P. Sadayappan 2 1 University of Delaware 2 The Ohio State University 3
More informationExploiting GPU Caches in Sparse Matrix Vector Multiplication. Yusuke Nagasaka Tokyo Institute of Technology
Exploiting GPU Caches in Sparse Matrix Vector Multiplication Yusuke Nagasaka Tokyo Institute of Technology Sparse Matrix Generated by FEM, being as the graph data Often require solving sparse linear equation
More informationAccelerating Satellite Image Based Large- Scale Settlement Detection with GPU!
Accelerating Satellite Image Based Large- Scale Settlement Detection with GPU! Dilip%R.%Patlolla% Anil%M.%Cheriyadat% Eddie%A.%Bright% Jeane9e%E.%Weaver% % Oak%Ridge%Na?onal%Laboratory% Oak%Ridge,%TN%%
More informationObjec0ves. Gain understanding of what IDA Pro is and what it can do. Expose students to the tool GUI
Intro to IDA Pro 31/15 Objec0ves Gain understanding of what IDA Pro is and what it can do Expose students to the tool GUI Discuss some of the important func
More informationA Smartphone Based Real Time Ac5vity Monitoring System
A Smartphone Based Real Time Ac5vity Monitoring System By: Shumei Zhang, Paul McCullagh, Jing Zhang, Tiezhong Yu Presented by: Jane Henderson A Smartphone Based-Real Time Daily Ac5vity Monitoring System
More informationSecure and Ef icient Iris and Fingerprint Identi ication
Secure and Eficient Iris and Fingerprint Identiication Marina Blanton Department of Computer Science and Engineering, University of Notre Dame mblanton@nd.edu Paolo Gas Department of Computer Science,
More informationProfiling & Tuning Applica1ons. CUDA Course July István Reguly
Profiling & Tuning Applica1ons CUDA Course July 21-25 István Reguly Introduc1on Why is my applica1on running slow? Work it out on paper Instrument code Profile it NVIDIA Visual Profiler Works with CUDA,
More informationMul$-objec$ve Visual Odometry Hsiang-Jen (Johnny) Chien and Reinhard Kle=e
Mul$-objec$ve Visual Odometry Hsiang-Jen (Johnny) Chien and Reinhard Kle=e Centre for Robo+cs & Vision Dept. of Electronic and Electric Engineering School of Engineering, Computer, and Mathema+cal Sciences
More informationHigh performance 2D Discrete Fourier Transform on Heterogeneous Platforms. Shrenik Lad, IIIT Hyderabad Advisor : Dr. Kishore Kothapalli
High performance 2D Discrete Fourier Transform on Heterogeneous Platforms Shrenik Lad, IIIT Hyderabad Advisor : Dr. Kishore Kothapalli Motivation Fourier Transform widely used in Physics, Astronomy, Engineering
More informationThe Prac)cal Applica)on of Knowledge Discovery to Image Data: A Prac))oners View in The Context of Medical Image Mining
The Prac)cal Applica)on of Knowledge Discovery to Image Data: A Prac))oners View in The Context of Medical Image Mining Frans Coenen (http://cgi.csc.liv.ac.uk/~frans/) 10th Interna+onal Conference on Natural
More informationDown selecting suitable manycore technologies for the ELT AO RTC. David Barr, Alastair Basden, Nigel Dipper and Noah Schwartz
Down selecting suitable manycore technologies for the ELT AO RTC David Barr, Alastair Basden, Nigel Dipper and Noah Schwartz GFLOPS RTC for AO workshop 27/01/2016 AO RTC Complexity 1.E+05 1.E+04 E-ELT
More informationTechnical Deep-Dive in a Column-Oriented In-Memory Database
Technical Deep-Dive in a Column-Oriented In-Memory Database Carsten Meyer, Martin Lorenz carsten.meyer@hpi.de, martin.lorenz@hpi.de Research Group of Prof. Hasso Plattner Hasso Plattner Institute for Software
More informationScalable Multi Agent Simulation on the GPU. Avi Bleiweiss NVIDIA Corporation San Jose, 2009
Scalable Multi Agent Simulation on the GPU Avi Bleiweiss NVIDIA Corporation San Jose, 2009 Reasoning Explicit State machine, serial Implicit Compute intensive Fits SIMT well Collision avoidance Motivation
More informationUsing CUDA to Accelerate Radar Image Processing
Using CUDA to Accelerate Radar Image Processing Aaron Rogan Richard Carande 9/23/2010 Approved for Public Release by the Air Force on 14 Sep 2010, Document Number 88 ABW-10-5006 Company Overview Neva Ridge
More informationVirtual & Mixed Reality > Near-Eye Displays. Light Propagation through Waveguide with In- & Outcoupling Surface Gratings
Virtual & Mixed Reality > Near-Eye Displays Light Propagation through Waveguide with In- & Outcoupling Surface Gratings Task/System Illustration glass plate with in- & outcoupling surface gratings point
More informationA Script- Based Autotuning Compiler System to Generate High- Performance CUDA code
A Script- Based Autotuning Compiler System to Generate High- Performance CUDA code Malik Khan, Protonu Basu, Gabe Rudy, Mary Hall, Chun Chen, Jacqueline Chame Mo:va:on Challenges to programming the GPU
More informationGeoImaging Accelerator Pansharpen Test Results. Executive Summary
Executive Summary After demonstrating the exceptional performance improvement in the orthorectification module (approximately fourteen-fold see GXL Ortho Performance Whitepaper), the same approach has
More information7 Ways to Increase Your Produc2vity with Revolu2on R Enterprise 3.0. David Smith, REvolu2on Compu2ng
7 Ways to Increase Your Produc2vity with Revolu2on R Enterprise 3.0 David Smith, REvolu2on Compu2ng REvolu2on Compu2ng: The R Company REvolu2on R Free, high- performance binary distribu2on of R REvolu2on
More informationImproving Memory Space Efficiency of Kd-tree for Real-time Ray Tracing Byeongjun Choi, Byungjoon Chang, Insung Ihm
Improving Memory Space Efficiency of Kd-tree for Real-time Ray Tracing Byeongjun Choi, Byungjoon Chang, Insung Ihm Department of Computer Science and Engineering Sogang University, Korea Improving Memory
More informationUtilizing commercial graphics processors in the real-time geo-registration of streaming high-resolution imagery
Utilizing commercial graphics processors in the real-time geo-registration of streaming high-resolution imagery Laurence Flath,, Michael Kartz, Randall Frank Lawrence Livermore National Laboratory GP 2
More informationIntroduc)on to. CS60092: Informa0on Retrieval
Introduc)on to CS60092: Informa0on Retrieval Ch. 4 Index construc)on How do we construct an index? What strategies can we use with limited main memory? Sec. 4.1 Hardware basics Many design decisions in
More informationBotGraph: Large Scale Spamming Botnet Detec5on
BotGraph: Large Scale Spamming Botnet Detec5on Yao Zhao Yinglian Xie *, Fang Yu *, Qifa Ke *, Yuan Yu *, Yan Chen and Eliot Gillum EECS Department, Northwestern University MicrosoK Research Silicon Valley
More informationTelescope Wavefront Errors
Telescope Wavefront Errors Henri Bonnet 1 Tasks of WFC at E-ELT Help System Engineering develop and maintain the technical budgets Develop Control Strategy Define WFC I/F to instruments. 2 How we do it
More informationLecture 6 Proper,es of laser beams *
Lecture 6 Proper,es of laser beams * Min Yan Op,cs and Photonics, KTH 12/04/16 1 * Some figures and texts belong to: O. Svelto, Principles of Lasers, 5th Ed., Springer. Reading Principles of Lasers (5th
More informationECE 1749H: Interconnec1on Networks for Parallel Computer Architectures: Rou1ng. Prof. Natalie Enright Jerger
ECE 1749H: Interconnec1on Networks for Parallel Computer Architectures: Rou1ng Prof. Natalie Enright Jerger Announcements Feedback on your project proposals This week Scheduled extended 1 week Next week:
More informationLecture 13: Tracking mo3on features op3cal flow
Lecture 13: Tracking mo3on features op3cal flow Professor Fei- Fei Li Stanford Vision Lab Lecture 13-1! What we will learn today? Introduc3on Op3cal flow Feature tracking Applica3ons (Problem Set 3 (Q1))
More informationArchitectures for Scalable Media Object Search
Architectures for Scalable Media Object Search Dennis Sng Deputy Director & Principal Scientist NVIDIA GPU Technology Workshop 10 July 2014 ROSE LAB OVERVIEW 2 Large Database of Media Objects Next- Generation
More informationEvaluating the Potential of Graphics Processors for High Performance Embedded Computing
Evaluating the Potential of Graphics Processors for High Performance Embedded Computing Shuai Mu, Chenxi Wang, Ming Liu, Yangdong Deng Department of Micro-/Nano-electronics Tsinghua University Outline
More information