Performing Multi-Phased Radar Processing with a Very Deep FPGA Pipeline

Size: px
Start display at page:

Download "Performing Multi-Phased Radar Processing with a Very Deep FPGA Pipeline"

Transcription

1 Performing Multi-Phased Radar Processing with a Very Deep FPGA Pipeline Jeffrey T. Muehring and John K. Antonio School of Computer Science University of Oklahoma antonio@ou.edu 2000 MAPLD Conference The 3 rd Annual Military and Aerospace Applications of Programmable Devices and Technologies International Conference September 26-28, 2000

2 Outline Review of multiprocessor-based solution for Synthetic Aperture Radar (SAR) Proposed FPGA-based deep-pipeline solution for SAR An initial comparison of multiprocessor-based and FPGA-based solutions for SAR A framework for minimizing power consumption of deep pipelines based on signal activity transformations Summary

3 Typical Scenario for SAR Predator

4 Footprint of Aerial Side-Looking SAR Velocity Footprint Range Azimuth Targets

5 Offset Overlapping Beams Real Azimuth Resolution v R s

6 Synthetic Beams Azimuth v R Compressed Resolution R s

7 Overview of SAR Processing Two Main Phases of Computation for a SAR Processing Phase Range Processing Phase 2 Azimuth Processing Outputs Input Data Stream Typical Timing Diagram for Executing SAR on a Multiprocessor System Processor Sets S 2 S Time

8 Typical Communication Requirements for SAR on a Multiprocessor System Phase Range Processing (shown distributed across 3 processors) Phase 2 Azimuth Processing (shown distributed across 4 processors) azimuth processor 4 range processor 3 azimuth processor 3 range processor 2 distributed corner turn pulses range samples range processor azimuth processor 2 azimuth processor range samples pulses Reference:T. Einstein, Realtime Synthetic Aperture Radar Processing on the RACE Multicomputer, App. Note 203.0, Mercury Computing Sys, 996.

9 Typical Processing Flow for SAR on a Multiprocessor System Input Data (Pulse Returns) m n n Range Processing (r-tap FIR)* n m n Output Image Distributed Corner Turn Azimuth Processing (a-tap FIR)* *Typically performed using FFT-based fast convolution technique m

10 Outline Review of multiprocessor-based solution for Synthetic Aperture Radar (SAR) Proposed FPGA-based deep-pipeline solution for SAR An initial comparison of multiprocessor-based and FPGA-based solutions for SAR A framework for minimizing power consumption of deep pipelines based on signal activity transformations Summary

11 Tracing of Computational Dependencies for a Single Input Range Bin Input Data (Pulse Returns) (assume r = 3) m n n (m/2)-th pulse return Range Processing (r-tap FIR) n m (assume a = 5) n Output Image Distributed Corner Turn Azimuth Processing (a-tap FIR) m

12 Computation of Output Values Associated with Single Input Range Bin using Proposed Deep-Pipe Input Data (Pulse Returns) (assume r = 3, a = 5, n = 0) n Deep Pipeline (a r)-tap FIR interspersed with a (n - r) delay elements (m/2)-th pulse return Note: each output value is the sum of a r weighted input values Computation of Output Image Samples: after c cycles after c + cycles after c + n cycles after c + (a n) cycles n n n n m m m m

13 Structure of the Deep-Pipeline Example: no. range bins = n = 4 range kernel size = r = 2 azimuth kernel size = a = 3 R 0 > R > R 2 > R 3 > R 4 > R 5 > R 6 > R 7 > R 8 > R 9 > input stream a 2 r a 2 r 0 a r a r 0 a 0 r a 0 r no. registers = (a n) (n r) no. KCMs = (a r) + + output stream

14 Outline Review of multiprocessor-based solution for Synthetic Aperture Radar (SAR) Proposed FPGA-based deep-pipeline solution for SAR An initial comparison of multiprocessor-based and FPGA-based solutions for SAR A framework for minimizing power consumption of deep pipelines based on signal activity transformations Summary

15 Example SAR Scenarios The following sets of application parameters define three SAR scenarios Application Parameters Simple Medium Difficult δ 2.5 (resolution, m) v (velocity, m/s) The following radar parameters are assumed for all scenarios Radar Parameters R (range, m) R s (range swath, m) λ (wavelength, m) Parameter Values 00,000 20,000.03

16 Computational Parameters for Three Scenarios Computational Parameters Simple Medium Difficult n 0,000 20,000 40,000 (no. range bins) r,024 2,048 4,096 (range ker. size) a 375,500 6,000 (azim. ker. size) Q (M samples/sec)

17 Comparison of Multiprocessor-based and FPGA-based Approaches for Three Scenarios Computational Approach Simple Medium Difficult Multiprocessor DSPs: = 4 Mem: = 35 MB DSPs: 3 + = 24 Mem: 4 +,080 =. GB DSPs: = 5 Mem: ,640 = 8.7 GB FPGA Taps: M Taps: 3.M Taps: 24 M Regs: 3.75 M Regs: 30 M Regs: 240 M The multiprocessor approach requires complex interconnection network plus significant RAM The FPGA approach only needs simple systolic connections among FPGAs Based on SHARC 2060 DSPs and assumes fast (FFT-based) convolutions

18 Outline Review of multiprocessor-based solution for Synthetic Aperture Radar (SAR) Proposed FPGA-based deep-pipeline solution for SAR An initial comparison of multiprocessor-based and FPGA-based solutions for SAR A framework for minimizing power consumption of deep pipelines based on signal activity transformations Summary

19 Power Consumption Model input stream W Deep Pipeline W output stream a 0 c 0 input signal activities a a 2 c c 2 P avg = P leak + V 2 2 f W i= 0 a i c i M

20 Measured and Predicted Power Consumption for FIR Filter on Xilinx 4036 Power (W) 4.2 S S2 S3 4.0 S4 S5 S6 3.8 S7 S8 S9 3.6 S0 S S2 3.4 S3 S4 P 3.2 P2 P3 P4 3.0 P5 P6 P7 2.8 P8 P9 P0 2.6 P P2 P3 2.4 P4 all measured Data Sets T. Osmulski, et al, A Probabilistic Power Prediction Tool for the Xilinx 4000-Series FPGA, Proceedings of The 5 th International Workshop on Embedded/Distributed HPC Systems and Applications (EHPC 2000), in Lecture Notes in Computer Science, May 2000, pp

21 Using Activity Transformations to Minimize Power Consumption Some input lines are hot (high capacitance) other input lines are cold (low capacitance) May be possible to apply linear transformation to input data to appropriately match input line activity vector to capacitance vector input stream a a T Deep Pipeline Assume Power Model P(a ) T - output stream Assuming activity vector a is known or estimated, determine transformation T to minimize power consumption of deep pipeline: Min {P(T(a))} T

22 Summary An FPGA-based approach was proposed as an alternative to the traditional multiprocessor approach for SAR processing The proposed FPGA-based approach looks promising in terms of inherent hardware complexity, but is probably not practical for implementation with currently available FPGA parts The proposed FPGA-based approach is applicable to other multi-phased embedded radar applications (e.g., STAP Space Time Adaptive Processing) A framework was proposed for minimizing power consumption for a class of FPGA designs based on input signal activity transformations

Optimal Configuration of Compute Nodes for Synthetic Aperture Radar Processing

Optimal Configuration of Compute Nodes for Synthetic Aperture Radar Processing Optimal Configuration of Compute Nodes for Synthetic Aperture Radar Processing Jeffrey T. Muehring and John K. Antonio Deptartment of Computer Science, P.O. Box 43104, Texas Tech University, Lubbock, TX

More information

Final Report for: Configuring Embeddable Adaptive Computing Systems for Multiple Application Domains with Minimal Size, Weight, and Power

Final Report for: Configuring Embeddable Adaptive Computing Systems for Multiple Application Domains with Minimal Size, Weight, and Power Final Report for: Configuring Embeddable Adaptive Computing Systems for Multiple Application Domains with Minimal Size, Weight, and Power DARPA Contract F30602-97-2-0297 Technical Point of Contact: Dr.

More information

Reconfigurable Versus Fixed Versus Hybrid Architectures

Reconfigurable Versus Fixed Versus Hybrid Architectures Reconfigurable Versus Fixed Versus Hybrid Architectures John K. Antonio Oklahoma Supercomputing Symposium 2008 Norman, Oklahoma October 6, 2008 Computer Science, University of Oklahoma Overview The (past)

More information

Optimal Configuration of Combined GPP/DSP/FPGA Systems for Minimal SWAP

Optimal Configuration of Combined GPP/DSP/FPGA Systems for Minimal SWAP Optimal Configuration of Combined GPP/DSP/FPGA Systems for Minimal SWAP Presented by John K. Antonio University of Oklahoma Second Annual Review September 23, 1999 Outline Program Overview and Introduction

More information

Design, Implementation and Performance Evaluation of Synthetic Aperture Radar Signal Processor on FPGAs

Design, Implementation and Performance Evaluation of Synthetic Aperture Radar Signal Processor on FPGAs Design, Implementation and Performance Evaluation of Synthetic Aperture Radar Signal Processor on FPGAs Hemang Parekh Masters Thesis MS(Computer Engineering) University of Kansas 23rd June, 2000 Committee:

More information

Master s Thesis: Optimal Configuration of a Parallel Embedded System for. Synthetic Aperture Radar Processing. Jeffrey T. Muehring

Master s Thesis: Optimal Configuration of a Parallel Embedded System for. Synthetic Aperture Radar Processing. Jeffrey T. Muehring Master s Thesis: Optimal Configuration of a Parallel Embedded System for Synthetic Aperture Radar Processing Jeffrey T. Muehring Department of Computer Science Texas Tech University November 1997 Committee

More information

A real time SAR processor implementation with FPGA

A real time SAR processor implementation with FPGA Computational Methods and Experimental Measurements XV 435 A real time SAR processor implementation with FPGA C. Lesnik, A. Kawalec & P. Serafin Institute of Radioelectronics, Military University of Technology,

More information

A Framework for Real-Time High-Throughput Signal and Image Processing Systems on Workstations

A Framework for Real-Time High-Throughput Signal and Image Processing Systems on Workstations A Framework for Real-Time High-Throughput Signal and Image Processing Systems on Workstations Prof. Brian L. Evans in collaboration with Gregory E. Allen and K. Clint Slatton Department of Electrical and

More information

Evaluating the Potential of Graphics Processors for High Performance Embedded Computing

Evaluating the Potential of Graphics Processors for High Performance Embedded Computing Evaluating the Potential of Graphics Processors for High Performance Embedded Computing Shuai Mu, Chenxi Wang, Ming Liu, Yangdong Deng Department of Micro-/Nano-electronics Tsinghua University Outline

More information

Memorandum. Clint Slatton Prof. Brian Evans Term project idea for Multidimensional Signal Processing (EE381k)

Memorandum. Clint Slatton Prof. Brian Evans Term project idea for Multidimensional Signal Processing (EE381k) Memorandum From: To: Subject: Date : Clint Slatton Prof. Brian Evans Term project idea for Multidimensional Signal Processing (EE381k) 16-Sep-98 Project title: Minimizing segmentation discontinuities in

More information

Frequency Domain Acceleration of Convolutional Neural Networks on CPU-FPGA Shared Memory System

Frequency Domain Acceleration of Convolutional Neural Networks on CPU-FPGA Shared Memory System Frequency Domain Acceleration of Convolutional Neural Networks on CPU-FPGA Shared Memory System Chi Zhang, Viktor K Prasanna University of Southern California {zhan527, prasanna}@usc.edu fpga.usc.edu ACM

More information

High Performance DoD DSP Applications

High Performance DoD DSP Applications High Performance DoD DSP Applications Robert Bond Embedded Digital Systems Group 23 August 2003 Slide-1 Outline DoD High-Performance DSP Applications Middleware (with some streaming constructs) Future

More information

ERS-l SAR processing with CESAR.

ERS-l SAR processing with CESAR. ERS-l SAR processing with CESAR. by Einar-Arne Herland Division for Electronics Norwegian Defence Research Establishment P.O.Box 25, N-2007 Kjeller, Norway Abstract A vector processor called CESAR (Computer

More information

The HPEC Challenge Benchmark Suite

The HPEC Challenge Benchmark Suite The HPEC Challenge Benchmark Suite Ryan Haney, Theresa Meuse, Jeremy Kepner and James Lebak Massachusetts Institute of Technology Lincoln Laboratory HPEC 2005 This work is sponsored by the Defense Advanced

More information

6LPXODWLRQÃRIÃWKHÃ&RPPXQLFDWLRQÃ7LPHÃIRUÃDÃ6SDFH7LPH $GDSWLYHÃ3URFHVVLQJÃ$OJRULWKPÃRQÃDÃ3DUDOOHOÃ(PEHGGHG 6\VWHP

6LPXODWLRQÃRIÃWKHÃ&RPPXQLFDWLRQÃ7LPHÃIRUÃDÃ6SDFH7LPH $GDSWLYHÃ3URFHVVLQJÃ$OJRULWKPÃRQÃDÃ3DUDOOHOÃ(PEHGGHG 6\VWHP LPXODWLRQÃRIÃWKHÃ&RPPXQLFDWLRQÃLPHÃIRUÃDÃSDFHLPH $GDSWLYHÃURFHVVLQJÃ$OJRULWKPÃRQÃDÃDUDOOHOÃ(PEHGGHG \VWHP Jack M. West and John K. Antonio Department of Computer Science, P.O. Box, Texas Tech University,

More information

In-chip and Inter-chip Interconnections and data transportations for Future MPAR Digital Receiving System

In-chip and Inter-chip Interconnections and data transportations for Future MPAR Digital Receiving System In-chip and Inter-chip Interconnections and data transportations for Future MPAR Digital Receiving System A presentation for LMCO-MPAR project 2007 briefing Dr. Yan Zhang School of Electrical and Computer

More information

Gedae cwcembedded.com. The CHAMP-AV6 VPX-REDI. Digital Signal Processing Card. Maximizing Performance with Minimal Porting Effort

Gedae cwcembedded.com. The CHAMP-AV6 VPX-REDI. Digital Signal Processing Card. Maximizing Performance with Minimal Porting Effort Technology White Paper The CHAMP-AV6 VPX-REDI Digital Signal Processing Card Maximizing Performance with Minimal Porting Effort Introduction The Curtiss-Wright Controls Embedded Computing CHAMP-AV6 is

More information

Virtual Prototyping. Digital Signal Processing Systems

Virtual Prototyping. Digital Signal Processing Systems Virtual Prototyping Digital Signal Processing Systems September 23, 1998 1998 Lockheed Martin Radar Technology Symposium Moorestown, NJ Carl Hein chein@atl.lmco.com Lockheed Martin A&E Bldg.-2W, Camden

More information

Digital Processing of Synthetic Aperture Radar Data

Digital Processing of Synthetic Aperture Radar Data Digital Processing of Synthetic Aperture Radar Data Algorithms and Implementation Ian G. Cumming Frank H. Wong ARTECH HOUSE BOSTON LONDON artechhouse.com Contents Foreword Preface Acknowledgments xix xxiii

More information

Kernel Benchmarks and Metrics for Polymorphous Computer Architectures

Kernel Benchmarks and Metrics for Polymorphous Computer Architectures PCAKernels-1 Kernel Benchmarks and Metrics for Polymorphous Computer Architectures Hank Hoffmann James Lebak (Presenter) Janice McMahon Seventh Annual High-Performance Embedded Computing Workshop (HPEC)

More information

Parallel Processing Techniques for the Processing of Synthetic Aperture Radar Data on FPGAs

Parallel Processing Techniques for the Processing of Synthetic Aperture Radar Data on FPGAs Parallel Processing Techniques for the Processing of Synthetic Aperture Radar Data on FPGAs William Chapman, Sanjay Ranka, Sartaj Sahni, and Mark Schmalz University of Florida, Department of CISE, Gainesville

More information

Executable Requirements: Opportunities and Impediments

Executable Requirements: Opportunities and Impediments Executable Requirements: Oppotunities and Impediments Executable Requirements: Opportunities and Impediments G. A. Shaw and A. H. Anderson * Abstract: In a top-down, language-based design methodology,

More information

Performance Characterization, Prediction, and Optimization for Heterogeneous Systems with Multi-Level Memory Interference

Performance Characterization, Prediction, and Optimization for Heterogeneous Systems with Multi-Level Memory Interference The 2017 IEEE International Symposium on Workload Characterization Performance Characterization, Prediction, and Optimization for Heterogeneous Systems with Multi-Level Memory Interference Shin-Ying Lee

More information

Hardware Acceleration of Pulsar Search on FPGAs using OpenCL

Hardware Acceleration of Pulsar Search on FPGAs using OpenCL Hardware Acceleration of Pulsar Search on FPGAs using OpenCL Oliver Sinnen Haomiao Wang & Prabu Thiagaraj (Manchester Uni) Parallel and Reconfigurable Computing Department of Electrical and Computer Engineering

More information

A Scalable Multiprocessor for Real-time Signal Processing

A Scalable Multiprocessor for Real-time Signal Processing A Scalable Multiprocessor for Real-time Signal Processing Daniel Scherrer, Hans Eberle Institute for Computer Systems, Swiss Federal Institute of Technology CH-8092 Zurich, Switzerland {scherrer, eberle}@inf.ethz.ch

More information

Virtual Prototyping and Performance Analysis of RapidIO-based System Architectures for Space-Based Radar

Virtual Prototyping and Performance Analysis of RapidIO-based System Architectures for Space-Based Radar Virtual Prototyping and Performance Analysis of RapidIO-based System Architectures for Space-Based Radar David Bueno, Adam Leko, Chris Conger, Ian Troxel, and Alan D. George HCS Research Laboratory College

More information

Computational Process Networks

Computational Process Networks Computational Process Networks for Real-Time High-Throughput Signal and Image Processing Systems on Workstations Gregory E. Allen EE 382C - Embedded Software Systems 17 February 2000 http://www.ece.utexas.edu/~allen/

More information

Computer Systems Architecture Spring 2016

Computer Systems Architecture Spring 2016 Computer Systems Architecture Spring 2016 Lecture 01: Introduction Shuai Wang Department of Computer Science and Technology Nanjing University [Adapted from Computer Architecture: A Quantitative Approach,

More information

Scalable GMTI Tracker

Scalable GMTI Tracker Scalable GMTI Tracker Thomas Kurien Mercury Computer Systems 199 Riverneck Road Chelmsford, Massachusetts 01824 tkurien@mc.com Abstract - This paper describes the design and preliminary implementation

More information

Accelerating the Pulsar Search Pipeline with FPGAs, Programmed in OpenCL

Accelerating the Pulsar Search Pipeline with FPGAs, Programmed in OpenCL Accelerating the Pulsar Search Pipeline with FPGAs, Programmed in OpenCL Oliver Sinnen, Tyrone Sherwin, and Haomiao Wang & Prabu Thiagaraj (Manchester Uni/Raman Research Institute, Bangalore) Parallel

More information

Parallel Computer Architectures. Lectured by: Phạm Trần Vũ Prepared by: Thoại Nam

Parallel Computer Architectures. Lectured by: Phạm Trần Vũ Prepared by: Thoại Nam Parallel Computer Architectures Lectured by: Phạm Trần Vũ Prepared by: Thoại Nam Outline Flynn s Taxonomy Classification of Parallel Computers Based on Architectures Flynn s Taxonomy Based on notions of

More information

Transparent Offloading and Mapping (TOM) Enabling Programmer-Transparent Near-Data Processing in GPU Systems Kevin Hsieh

Transparent Offloading and Mapping (TOM) Enabling Programmer-Transparent Near-Data Processing in GPU Systems Kevin Hsieh Transparent Offloading and Mapping () Enabling Programmer-Transparent Near-Data Processing in GPU Systems Kevin Hsieh Eiman Ebrahimi, Gwangsun Kim, Niladrish Chatterjee, Mike O Connor, Nandita Vijaykumar,

More information

SEA SURFACE SPEED FROM TERRASAR-X ATI DATA

SEA SURFACE SPEED FROM TERRASAR-X ATI DATA SEA SURFACE SPEED FROM TERRASAR-X ATI DATA Matteo Soccorsi (1) and Susanne Lehner (1) (1) German Aerospace Center, Remote Sensing Technology Institute, 82234 Weßling, Germany, Email: matteo.soccorsi@dlr.de

More information

PARAMETRIC STUDY WITH GEOFRAC: A THREE-DIMENSIONAL STOCHASTIC FRACTURE FLOW MODEL. Alessandra Vecchiarelli, Rita Sousa, Herbert H.

PARAMETRIC STUDY WITH GEOFRAC: A THREE-DIMENSIONAL STOCHASTIC FRACTURE FLOW MODEL. Alessandra Vecchiarelli, Rita Sousa, Herbert H. PROCEEDINGS, Thirty-Eighth Workshop on Geothermal Reservoir Engineering Stanford University, Stanford, California, February 3, 23 SGP-TR98 PARAMETRIC STUDY WITH GEOFRAC: A THREE-DIMENSIONAL STOCHASTIC

More information

Design and Implementation of Real-time High Squint Spotlight SAR Imaging Processor

Design and Implementation of Real-time High Squint Spotlight SAR Imaging Processor Chinese Journal of Electronics Vol.19, No.3, July 2010 Design and Implementation of Real-time High Squint Spotlight SAR Imaging Processor SUN Jinping 1, WANG Jun 1, HONG Wen 2 and MAO Shiyi 1 1.School

More information

by Using a Phase-Error Correction Algorithm

by Using a Phase-Error Correction Algorithm Detecting Moving Targets in SAR Imagery by Using a Phase-Error Correction Algorithm J.R. Fienup and A.M. Kowalczyk Environmental Research Institute of Michigan P.O. Box 134001, Ann Arbor, MI 48113-4001

More information

High dynamic range imaging, computing & I/O load

High dynamic range imaging, computing & I/O load High dynamic range imaging, computing & I/O load RMS ~15µJy/beam RMS ~1µJy/beam S. Bhatnagar NRAO, Socorro Parameterized Measurement Equation Generalized Measurement Equation Obs [ S M V ij = J ij, t W

More information

Configurable Multiprocessing: An FIR

Configurable Multiprocessing: An FIR Configurable Multiprocessing: An FIR Filter Example Cmpware, Inc. Introduction Multiple processors on a device common Thousands of 32-bit RISC CPUs possible Advantages in: Performance Power consumption

More information

Application level challenges and issues of processing different

Application level challenges and issues of processing different Application level challenges and issues of processing different sairam frequency, polarization and incidence angle Synthetic Aperture Radar data using distributed computing resources Dr. R Manavalan, Mangala

More information

Requirements for Scalable Application Specific Processing in Commercial HPEC

Requirements for Scalable Application Specific Processing in Commercial HPEC Requirements for Scalable Application Specific Processing in Commercial HPEC Steven Miller Silicon Graphics, Inc. Phone: 650-933-1899 Email Address: scm@sgi.com Abstract: More and more High Performance

More information

Challenges in Detecting & Tracking Moving Objects with Synthetic Aperture Radar (SAR)

Challenges in Detecting & Tracking Moving Objects with Synthetic Aperture Radar (SAR) Challenges in Detecting & Tracking Moving Objects with Synthetic Aperture Radar (SAR) Michael Minardi PhD Sensors Directorate Air Force Research Laboratory Outline Focusing Moving Targets Locating Moving

More information

Amortised Optimisation as a Means to Achieve Genetic Improvement

Amortised Optimisation as a Means to Achieve Genetic Improvement Amortised Optimisation as a Means to Achieve Genetic Improvement Hyeongjun Cho, Sungwon Cho, Seongmin Lee, Jeongju Sohn, and Shin Yoo Date 2017.01.30, The 50th CREST Open Workshop Offline Improvement Expensive

More information

Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks

Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks Naveen Suda, Vikas Chandra *, Ganesh Dasika *, Abinash Mohanty, Yufei Ma, Sarma Vrudhula, Jae-sun Seo, Yu

More information

Application of Parallel Processors to Real-Time Sensor Array Processing 1

Application of Parallel Processors to Real-Time Sensor Array Processing 1 Application of Parallel Processors to Real-Time Sensor Array Processing David R. Martinez MIT Lincoln Laboratory dmartinez@ll.mit.edu Abstract Historically, most radar sensor array processing has been

More information

Energy Efficient K-Means Clustering for an Intel Hybrid Multi-Chip Package

Energy Efficient K-Means Clustering for an Intel Hybrid Multi-Chip Package High Performance Machine Learning Workshop Energy Efficient K-Means Clustering for an Intel Hybrid Multi-Chip Package Matheus Souza, Lucas Maciel, Pedro Penna, Henrique Freitas 24/09/2018 Agenda Introduction

More information

Support for Programming Reconfigurable Supercomputers

Support for Programming Reconfigurable Supercomputers Support for Programming Reconfigurable Supercomputers Miriam Leeser Nicholas Moore, Albert Conti Dept. of Electrical and Computer Engineering Northeastern University Boston, MA Laurie Smith King Dept.

More information

SAR training processor

SAR training processor Rudi Gens This manual describes the SAR training processor (STP) that has been developed to introduce students to the complex field of processed synthetic aperture radar (SAR) data. After a brief introduction

More information

XPU A Programmable FPGA Accelerator for Diverse Workloads

XPU A Programmable FPGA Accelerator for Diverse Workloads XPU A Programmable FPGA Accelerator for Diverse Workloads Jian Ouyang, 1 (ouyangjian@baidu.com) Ephrem Wu, 2 Jing Wang, 1 Yupeng Li, 1 Hanlin Xie 1 1 Baidu, Inc. 2 Xilinx Outlines Background - FPGA for

More information

ARM Cortex core microcontrollers 3. Cortex-M0, M4, M7

ARM Cortex core microcontrollers 3. Cortex-M0, M4, M7 ARM Cortex core microcontrollers 3. Cortex-M0, M4, M7 Scherer Balázs Budapest University of Technology and Economics Department of Measurement and Information Systems BME-MIT 2018 Trends of 32-bit microcontrollers

More information

Halmstad University Post-Print

Halmstad University Post-Print Halmstad University Post-Print Reconfigurable Optical Interconnection System Supporting Concurrent Application- Specific Parallel Computing Sacki Agelis and Magnus Jonsson N.B.: When citing this work,

More information

On-board Payload Data Processing, for SAR and Multispectral data processing, on-board satellites (LEON2/FFTC)

On-board Payload Data Processing, for SAR and Multispectral data processing, on-board satellites (LEON2/FFTC) UNCLASSIFIED Nationaal Lucht- en Ruimtevaartlaboratorium National Aerospace Laboratory NLR Executive summary On-board Payload Data Processing, for SAR and Multispectral data processing, on-board satellites

More information

Parallel FIR Filters. Chapter 5

Parallel FIR Filters. Chapter 5 Chapter 5 Parallel FIR Filters This chapter describes the implementation of high-performance, parallel, full-precision FIR filters using the DSP48 slice in a Virtex-4 device. ecause the Virtex-4 architecture

More information

PERFORMANCE ANALYSIS OF ALTERNATIVE STRUCTURES FOR 16-BIT INTEGER FIR FILTER IMPLEMENTED ON ALTIVEC SIMD PROCESSING UNIT

PERFORMANCE ANALYSIS OF ALTERNATIVE STRUCTURES FOR 16-BIT INTEGER FIR FILTER IMPLEMENTED ON ALTIVEC SIMD PROCESSING UNIT PERFORMANCE ANALYSIS OF ALTERNATIVE STRUCTURES FOR -BIT INTEGER FIR FILTER IMPLEMENTED ON ALTIVEC SIMD PROCESSING UNIT Grzegorz Kraszewski Białystok Technical University, Department of Electric Engineering

More information

QuiXilica V5 Architecture

QuiXilica V5 Architecture QuiXilica V5 Architecture: The High Performance Sensor I/O Processing Solution for the Latest Generation and Beyond Andrew Reddig President, CTO TEK Microsystems, Inc. Military sensor data processing applications

More information

Heath Yardley University of Adelaide Radar Research Centre

Heath Yardley University of Adelaide Radar Research Centre Heath Yardley University of Adelaide Radar Research Centre Radar Parameters Imaging Geometry Imaging Algorithm Gamma Remote Sensing Modular SAR Processor (MSP) Motion Compensation (MoCom) Calibration Polarimetric

More information

02 - Numerical Representations

02 - Numerical Representations September 3, 2014 Todays lecture Finite length effects, continued from Lecture 1 Floating point (continued from Lecture 1) Rounding Overflow handling Example: Floating Point Audio Processing Example: MPEG-1

More information

HPEC-SI Demonstration: Common Imagery Processor

HPEC-SI Demonstration: Common Imagery Processor MTR 03B0000106 MITRE TECHNICAL REPORT HPEC-SI Demonstration: Common Imagery Processor December 2003 Brian Sroka Richard Games Sponsor: OSD Contract No.: FA8721-04-C-0001 Dept. No.: D730 Project No.: 14011234-1C

More information

A SWITCHED-ANTENNA NADIR-LOOKING INTERFEROMETRIC SAR ALTIMETER FOR TERRAIN-AIDED NAVIGATION

A SWITCHED-ANTENNA NADIR-LOOKING INTERFEROMETRIC SAR ALTIMETER FOR TERRAIN-AIDED NAVIGATION A SWITCHED-ANTENNA NADIR-LOOKING INTERFEROMETRIC SAR ALTIMETER FOR TERRAIN-AIDED NAVIGATION ABSTRACT Inchan Paek 1, Jonghun Jang 2, Joohwan Chun 3 and Jinbae Suh 3 1 PGM Image Sensor Centre, Hanwha Thales,

More information

Digital Signal Processor Core Technology

Digital Signal Processor Core Technology The World Leader in High Performance Signal Processing Solutions Digital Signal Processor Core Technology Abhijit Giri Satya Simha November 4th 2009 Outline Introduction to SHARC DSP ADSP21469 ADSP2146x

More information

Compressive Sensing Applications and Demonstrations: Synthetic Aperture Radar

Compressive Sensing Applications and Demonstrations: Synthetic Aperture Radar Compressive Sensing Applications and Demonstrations: Synthetic Aperture Radar Shaun I. Kelly The University of Edinburgh 1 Outline 1 SAR Basics 2 Compressed Sensing SAR 3 Other Applications of Sparsity

More information

An Advanced Graph Processor Prototype

An Advanced Graph Processor Prototype An Advanced Graph Processor Prototype Vitaliy Gleyzer GraphEx 2016 DISTRIBUTION STATEMENT A. Approved for public release: distribution unlimited. This material is based upon work supported by the Assistant

More information

The Staggered SAR Concept: Imaging a Wide Continuous Swath with High Resolution

The Staggered SAR Concept: Imaging a Wide Continuous Swath with High Resolution The Staggered SAR Concept: Imaging a Wide Continuous Swath with High Resolution Michelangelo Villano *, Gerhard Krieger *, Alberto Moreira * * German Aerospace Center (DLR), Microwaves and Radar Institute

More information

Adaptive Computing Systems (ACS) Domain for Implementing DSP Algorithms in Reconfigurable Hardware. Objective/Approach/Process

Adaptive Computing Systems (ACS) Domain for Implementing DSP Algorithms in Reconfigurable Hardware. Objective/Approach/Process Adaptive Computing Systems (ACS) Domain for Implementing DSP Algorithms in Reconfigurable Hardware John Zaino, Eric Pauer, Ken Smith, Paul Fiore, Jairam Ramanathan, Cory Myers {john.c.aino, ken.smith,

More information

ADDRESS TRANSLATION AND TLB

ADDRESS TRANSLATION AND TLB ADDRESS TRANSLATION AND TLB Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture Overview Announcement Homework 3 submission deadline: Nov.

More information

New Software-Designed Instruments

New Software-Designed Instruments 1 New Software-Designed Instruments Nicholas Haripersad Field Applications Engineer National Instruments South Africa Agenda What Is a Software-Designed Instrument? Why Software-Designed Instrumentation?

More information

ADDRESS TRANSLATION AND TLB

ADDRESS TRANSLATION AND TLB ADDRESS TRANSLATION AND TLB Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture Overview Announcement Homework 4 submission deadline: Mar.

More information

When speed matters.

When speed matters. When speed matters. www.zetec.com The Ideal Combination of Speed, Power and Flexibility QuartZ is the latest member of Zetec s Ultrasonic Instruments product family. It is the answer to the requirements

More information

C-Based Hardware Design Platform for Dynamically Reconfigurable Processor

C-Based Hardware Design Platform for Dynamically Reconfigurable Processor C-Based Hardware Design Platform for Dynamically Reconfigurable Processor September 22 nd, 2005 IPFlex Inc. Agenda Merits of C-Based hardware design Hardware enabling C-Based hardware design DAPDNA-FW

More information

Using GPUs to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation

Using GPUs to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation Using GPUs to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation GPU Technology Conference 2012 May 15, 2012 Thomas M. Benson, Daniel P. Campbell, Daniel A. Cook thomas.benson@gtri.gatech.edu

More information

Introduction to Digital Logic Missouri S&T University CPE 2210 Hardware Implementations

Introduction to Digital Logic Missouri S&T University CPE 2210 Hardware Implementations Introduction to Digital Logic Missouri S&T University CPE 2210 Hardware Implementations Egemen K. Çetinkaya Egemen K. Çetinkaya Department of Electrical & Computer Engineering Missouri University of Science

More information

Synthetic Aperture Radar Modeling using MATLAB and Simulink

Synthetic Aperture Radar Modeling using MATLAB and Simulink Synthetic Aperture Radar Modeling using MATLAB and Simulink Naivedya Mishra Team Lead Uurmi Systems Pvt. Ltd. Hyderabad Agenda What is Synthetic Aperture Radar? SAR Imaging Process Challenges in Design

More information

Computational Process Networks a model and framework for high-throughput signal processing

Computational Process Networks a model and framework for high-throughput signal processing Computational Process Networks a model and framework for high-throughput signal processing Gregory E. Allen Ph.D. Defense 25 April 2011 Committee Members: James C. Browne Craig M. Chase Brian L. Evans

More information

Verification and Validation of X-Sim: A Trace-Based Simulator

Verification and Validation of X-Sim: A Trace-Based Simulator http://www.cse.wustl.edu/~jain/cse567-06/ftp/xsim/index.html 1 of 11 Verification and Validation of X-Sim: A Trace-Based Simulator Saurabh Gayen, sg3@wustl.edu Abstract X-Sim is a trace-based simulator

More information

SDA: Software-Defined Accelerator for Large- Scale DNN Systems

SDA: Software-Defined Accelerator for Large- Scale DNN Systems SDA: Software-Defined Accelerator for Large- Scale DNN Systems Jian Ouyang, 1 Shiding Lin, 1 Wei Qi, Yong Wang, Bo Yu, Song Jiang, 2 1 Baidu, Inc. 2 Wayne State University Introduction of Baidu A dominant

More information

HIGH PERFORMANCE ELECTRONICS FOR THE NEW SPACE AGE

HIGH PERFORMANCE ELECTRONICS FOR THE NEW SPACE AGE APP18-01c APPLICATION NOTE NOVO SPACE HIGH PERFORMANCE ELECTRONICS FOR THE NEW SPACE AGE INTRODUCTION Most subsystem can be partially or fully implemented using one or more of Novo s components. This document

More information

John Bloomfield, Mercury Computer Systems, Inc. HPEC September Mercury Computer Systems, Inc.

John Bloomfield, Mercury Computer Systems, Inc. HPEC September Mercury Computer Systems, Inc. 3DUWLWLRQLQJ &RPSXWDWLRQD7DVNV LWKLQDQ)3*$5,6& +HWHURJHQHRXV 0XWLFRPSXWHU John Bloomfield, Mercury Computer Systems, Inc. HPEC September 2002 Agenda Why worry about partitioning? How we partitioned a real-world

More information

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis EU H2020 FETHPC project ANTAREX (g.a. 671623) EU FP7 ERC Project MULTITHERMAN (g.a.291125) EETHPC,

More information

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis EU H2020 FETHPC project ANTAREX (g.a. 671623) EU FP7 ERC Project MULTITHERMAN (g.a.291125) HPC

More information

IMPLEMENTATION OF CSAR IMAGING ALGORITHM USING WAVEFRONT RECONSTRUCTION THEORY

IMPLEMENTATION OF CSAR IMAGING ALGORITHM USING WAVEFRONT RECONSTRUCTION THEORY IMPLEMENTATION OF CSAR IMAGING ALGORITHM USING WAVEFRONT RECONSTRUCTION THEORY M.Poornima Pranayini, Naveen Namdeo, P.Dhanalakshmi ABSTRACT: Circular synthetic aperture radar is a SAR mode where the radar-carrying

More information

Intelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018

Intelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Intelligent Enterprise meets Science of Where Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Value The Esri & SAP journey Customer Impact Innovation Track Record Customer

More information

Anisotropy-preserving 5D interpolation by hybrid Fourier transform

Anisotropy-preserving 5D interpolation by hybrid Fourier transform Anisotropy-preserving 5D interpolation by hybrid Fourier transform Juefu Wang and Shaowu Wang, CGG Summary We present an anisotropy-preserving interpolation method based on a hybrid 5D Fourier transform,

More information

MCM Based FIR Filter Architecture for High Performance

MCM Based FIR Filter Architecture for High Performance ISSN No: 2454-9614 MCM Based FIR Filter Architecture for High Performance R.Gopalana, A.Parameswari * Department Of Electronics and Communication Engineering, Velalar College of Engineering and Technology,

More information

SDA: Software-Defined Accelerator for Large- Scale DNN Systems

SDA: Software-Defined Accelerator for Large- Scale DNN Systems SDA: Software-Defined Accelerator for Large- Scale DNN Systems Jian Ouyang, 1 Shiding Lin, 1 Wei Qi, 1 Yong Wang, 1 Bo Yu, 1 Song Jiang, 2 1 Baidu, Inc. 2 Wayne State University Introduction of Baidu A

More information

Breaking the memory barrier (for finite difference modeling)

Breaking the memory barrier (for finite difference modeling) Breaking the memory barrier (for finite difference modeling) Jon Marius Venstad Norwegian University of Science and Technology (NTNU) Department of Petroleum Engineering & Applied Geophysics E-mail: venstad@gmail.com

More information

A real-time UAV INSAR raw signal simulator for HWIL simulation system

A real-time UAV INSAR raw signal simulator for HWIL simulation system A real-time UAV INSAR raw simulator for HWIL simulation system Wei Li, Houxiang Zhang, Hans Petter Hildre Faculty of Maritime Technology and Operations Aalesund University College N-605, Aalesund, Norway

More information

High Performance Radar Signal Processsing

High Performance Radar Signal Processsing High Performance Radar Signal Processsing First Semester Report Fall 2012 by Justin Haze Prepared to partially fulfill the requirements for ECE401 Department of Electrical and Computer Engineering Colorado

More information

Planning Land 3-D Seismic Surveys

Planning Land 3-D Seismic Surveys Planning Land 3-D Seismic Surveys Andreas Cordsen, Mike Galbraith, and John Peirce Edited by Bob A. Hardage Series Editor: Stephen J. Hill Geophysical Developments Series No. 9 Society of Exploration Geophysicists

More information

Metropolitan Road Traffic Simulation on FPGAs

Metropolitan Road Traffic Simulation on FPGAs Metropolitan Road Traffic Simulation on FPGAs Justin L. Tripp, Henning S. Mortveit, Anders Å. Hansson, Maya Gokhale Los Alamos National Laboratory Los Alamos, NM 85745 Overview Background Goals Using the

More information

RapidIO.org Update. Mar RapidIO.org 1

RapidIO.org Update. Mar RapidIO.org 1 RapidIO.org Update rickoco@rapidio.org Mar 2015 2015 RapidIO.org 1 Outline RapidIO Overview & Markets Data Center & HPC Communications Infrastructure Industrial Automation Military & Aerospace RapidIO.org

More information

BlueDBM: An Appliance for Big Data Analytics*

BlueDBM: An Appliance for Big Data Analytics* BlueDBM: An Appliance for Big Data Analytics* Arvind *[ISCA, 2015] Sang-Woo Jun, Ming Liu, Sungjin Lee, Shuotao Xu, Arvind (MIT) and Jamey Hicks, John Ankcorn, Myron King(Quanta) BigData@CSAIL Annual Meeting

More information

A KASSPER Real-Time Signal Processor Testbed

A KASSPER Real-Time Signal Processor Testbed A KASSPER Real-Time Signal Processor Testbed Glenn Schrader 244 Wood St. exington MA, 02420 Phone: (781)981-2579 Fax: (781)981-5255 gschrad@ll.mit.edu The Knowledge Aided Sensor Signal Processing and Expert

More information

Digital Filters in Radiation Detection and Spectroscopy

Digital Filters in Radiation Detection and Spectroscopy Digital Filters in Radiation Detection and Spectroscopy Digital Radiation Measurement and Spectroscopy NE/RHP 537 1 Classical and Digital Spectrometers Classical Spectrometer Detector Preamplifier Analog

More information

Product Brochure PREDATOR RADAR

Product Brochure PREDATOR RADAR PREDATOR RADAR PREDATOR RADAR FULL 1080P HIGH DEFINITION VIDEO IP68 Introducing the Predator Radar Unique Integrated Radar & Predator 1080p HD Camera Cost Effective Perimeter/Wide Area Surveillance Upto

More information

Plattformübergreifende Softwareentwicklung für heterogene Multicore-Systeme

Plattformübergreifende Softwareentwicklung für heterogene Multicore-Systeme Plattformübergreifende Softwareentwicklung für heterogene Multicore-Systeme Dr.-Ing. Timo Stripf 1 Managing Director Technolgy Outline Multicore Motivation Automatic Parallelization Interactive Parallelization

More information

REAL-TIME PARALLEL SOFTWARE DESIGN CASE STUDY: IMPLEMENTATION OF THE RASSP SAR BENCHMARK ON THE INTEL PARAGON

REAL-TIME PARALLEL SOFTWARE DESIGN CASE STUDY: IMPLEMENTATION OF THE RASSP SAR BENCHMARK ON THE INTEL PARAGON RL-TR-95-288 Final Technical Report January 1996 REAL-TIME PARALLEL SOFTWARE DESIGN CASE STUDY: IMPLEMENTATION OF THE RASSP SAR BENCHMARK ON THE INTEL PARAGON The MITRE Corporation Curtis P. Brown, Richard

More information

A Challenge Problem for 2D/3D Imaging of Targets from a Volumetric Data Set in an Urban Environment

A Challenge Problem for 2D/3D Imaging of Targets from a Volumetric Data Set in an Urban Environment A Challenge Problem for 2D/3D Imaging of Targets from a Volumetric Data Set in an Urban Environment Curtis H. Casteel, Jr,*, LeRoy A. Gorham, Michael J. Minardi, Steven M. Scarborough, Kiranmai D. Naidu,

More information

Industry Collaboration and Innovation

Industry Collaboration and Innovation Industry Collaboration and Innovation OpenCAPI Topics Industry Background Technology Overview Design Enablement OpenCAPI Consortium Industry Landscape Key changes occurring in our industry Historical microprocessor

More information

Power Consumption in 65 nm FPGAs

Power Consumption in 65 nm FPGAs White Paper: Virtex-5 FPGAs R WP246 (v1.2) February 1, 2007 Power Consumption in 65 nm FPGAs By: Derek Curd With the introduction of the Virtex -5 family, Xilinx is once again leading the charge to deliver

More information

3 - SYNTHETIC APERTURE RADAR (SAR) SUMMARY David Sandwell, SIO 239, January, 2008

3 - SYNTHETIC APERTURE RADAR (SAR) SUMMARY David Sandwell, SIO 239, January, 2008 1 3 - SYNTHETIC APERTURE RADAR (SAR) SUMMARY David Sandwell, SIO 239, January, 2008 Fraunhoffer diffraction To understand why a synthetic aperture in needed for microwave remote sensing from orbital altitude

More information

Improving FPGA Design Robustness with Partial TMR

Improving FPGA Design Robustness with Partial TMR Improving FPGA Design Robustness with Partial TMR Brian Pratt, Michael Caffrey, Paul Graham, Keith Morgan, Michael Wirthlin Abstract This paper describes an efficient approach of applying mitigation to

More information