A KASSPER Real-Time Signal Processor Testbed

Similar documents
Dana Sinno MIT Lincoln Laboratory 244 Wood Street Lexington, MA phone:

An Efficient Architecture for Ultra Long FFTs in FPGAs and ASICs

CASE STUDY: Using Field Programmable Gate Arrays in a Beowulf Cluster

Requirements for Scalable Application Specific Processing in Commercial HPEC

High-Assurance Security/Safety on HPEC Systems: an Oxymoron?

Sparse Linear Solver for Power System Analyis using FPGA

Setting the Standard for Real-Time Digital Signal Processing Pentek Seminar Series. Digital IF Standardization

Distributed Real-Time Embedded Video Processing

Parallel Matlab: RTExpress on 64-bit SGI Altix with SCSL and MPT

Information, Decision, & Complex Networks AFOSR/RTC Overview

Service Level Agreements: An Approach to Software Lifecycle Management. CDR Leonard Gaines Naval Supply Systems Command 29 January 2003

An Update on CORBA Performance for HPEC Algorithms. Bill Beckwith Objective Interface Systems, Inc.

4. Lessons Learned in Introducing MBSE: 2009 to 2012

Dr. ing. Rune Aasgaard SINTEF Applied Mathematics PO Box 124 Blindern N-0314 Oslo NORWAY

Dr. Stuart Dickinson Dr. Donald H. Steinbrecher Naval Undersea Warfare Center, Newport, RI May 10, 2011

Multi-Modal Communication

DoD Common Access Card Information Brief. Smart Card Project Managers Group

High-Performance Linear Algebra Processor using FPGA

MODELING AND SIMULATION OF LIQUID MOLDING PROCESSES. Pavel Simacek Center for Composite Materials University of Delaware

ARINC653 AADL Annex Update

COMPUTATIONAL FLUID DYNAMICS (CFD) ANALYSIS AND DEVELOPMENT OF HALON- REPLACEMENT FIRE EXTINGUISHING SYSTEMS (PHASE II)

Empirically Based Analysis: The DDoS Case

Technological Advances In Emergency Management

Terrain categorization using LIDAR and multi-spectral data

ASPECTS OF USE OF CFD FOR UAV CONFIGURATION DESIGN

DEVELOPMENT OF A NOVEL MICROWAVE RADAR SYSTEM USING ANGULAR CORRELATION FOR THE DETECTION OF BURIED OBJECTS IN SANDY SOILS

Washington University

FUDSChem. Brian Jordan With the assistance of Deb Walker. Formerly Used Defense Site Chemistry Database. USACE-Albuquerque District.

A Distributed Parallel Processing System for Command and Control Imagery

Data Reorganization Interface

INTEGRATING LOCAL AND GLOBAL NAVIGATION IN UNMANNED GROUND VEHICLES

Accuracy of Computed Water Surface Profiles

Using Model-Theoretic Invariants for Semantic Integration. Michael Gruninger NIST / Institute for Systems Research University of Maryland

The extreme Adaptive DSP Solution to Sensor Data Processing

Architecting for Resiliency Army s Common Operating Environment (COE) SERC

Col Jaap de Die Chairman Steering Committee CEPA-11. NMSG, October

Corrosion Prevention and Control Database. Bob Barbin 07 February 2011 ASETSDefense 2011

FPGA Acceleration of Information Management Services

CENTER FOR ADVANCED ENERGY SYSTEM Rutgers University. Field Management for Industrial Assessment Centers Appointed By USDOE

Topology Control from Bottom to Top

Concept of Operations Discussion Summary

UMass at TREC 2006: Enterprise Track

Parallelization of a Electromagnetic Analysis Tool

A Methodology for End-to-End Evaluation of Arabic Document Image Processing Software

Kathleen Fisher Program Manager, Information Innovation Office

Title: An Integrated Design Environment to Evaluate Power/Performance Tradeoffs for Sensor Network Applications 1

Towards a Formal Pedigree Ontology for Level-One Sensor Fusion

SINOVIA An open approach for heterogeneous ISR systems inter-operability

Space and Missile Systems Center

Vision Protection Army Technology Objective (ATO) Overview for GVSET VIP Day. Sensors from Laser Weapons Date: 17 Jul 09 UNCLASSIFIED

73rd MORSS CD Cover Page UNCLASSIFIED DISCLOSURE FORM CD Presentation

Computer Aided Munitions Storage Planning

Advanced Numerical Methods for Numerical Weather Prediction

Exploring the Query Expansion Methods for Concept Based Representation

SETTING UP AN NTP SERVER AT THE ROYAL OBSERVATORY OF BELGIUM

Energy Security: A Global Challenge

Using Templates to Support Crisis Action Mission Planning

New Dimensions in Microarchitecture Harnessing 3D Integration Technologies

US Army Industry Day Conference Boeing SBIR/STTR Program Overview

A Review of the 2007 Air Force Inaugural Sustainability Report

Cyber Threat Prioritization

Radar Tomography of Moving Targets

C2-Simulation Interoperability in NATO

Office of Global Maritime Situational Awareness

Increased Underwater Optical Imaging Performance Via Multiple Autonomous Underwater Vehicles

Introducing I 3 CON. The Information Interpretation and Integration Conference

Running CyberCIEGE on Linux without Windows

DoD M&S Project: Standardized Documentation for Verification, Validation, and Accreditation

DATA COLLECTION AND TESTING TOOL: SAUDAS

ENVIRONMENTAL MANAGEMENT SYSTEM WEB SITE (EMSWeb)

Space and Missile Systems Center

AFRL-ML-WP-TM

The C2 Workstation and Data Replication over Disadvantaged Tactical Communication Links

Air Virtual At Sea (VAST) Platform Stimulation Analysis

73rd MORSS CD Cover Page UNCLASSIFIED DISCLOSURE FORM CD Presentation

Edwards Air Force Base Accelerates Flight Test Data Analysis Using MATLAB and Math Works. John Bourgeois EDWARDS AFB, CA. PRESENTED ON: 10 June 2010

Monte Carlo Techniques for Estimating Power in Aircraft T&E Tests. Todd Remund Dr. William Kitto EDWARDS AFB, CA. July 2011

Considerations for Algorithm Selection and C Programming Style for the SRC-6E Reconfigurable Computer

DAFS Storage for High Performance Computing using MPI-I/O: Design and Experience

Smart Data Selection (SDS) Brief

Nationwide Automatic Identification System (NAIS) Overview. CG 939 Mr. E. G. Lockhart TEXAS II Conference 3 Sep 2008

75th Air Base Wing. Effective Data Stewarding Measures in Support of EESOH-MIS

Current Threat Environment

Headquarters U.S. Air Force. EMS Play-by-Play: Using Air Force Playbooks to Standardize EMS

2013 US State of Cybercrime Survey

ASSESSMENT OF A BAYESIAN MODEL AND TEST VALIDATION METHOD

NATO-IST-124 Experimentation Instructions

Compliant Baffle for Large Telescope Daylight Imaging. Stacie Williams Air Force Research Laboratory ABSTRACT

Rockwell Science Center (RSC) Technologies for WDM Components

Wireless Connectivity of Swarms in Presence of Obstacles

A Multilevel Secure MapReduce Framework for Cross-Domain Information Sharing in the Cloud

WAITING ON MORE THAN 64 HANDLES

U.S. Army Research, Development and Engineering Command (IDAS) Briefer: Jason Morse ARMED Team Leader Ground System Survivability, TARDEC

Directed Energy Using High-Power Microwave Technology

SURVIVABILITY ENHANCED RUN-FLAT

David W. Hyde US Army Engineer Waterways Experiment Station Vicksburg, Mississippi ABSTRACT

NEW FINITE ELEMENT / MULTIBODY SYSTEM ALGORITHM FOR MODELING FLEXIBLE TRACKED VEHICLES

Wind Tunnel Validation of Computational Fluid Dynamics-Based Aero-Optics Model

75 TH MORSS CD Cover Page. If you would like your presentation included in the 75 th MORSS Final Report CD it must :

Enhanced Predictability Through Lagrangian Observations and Analysis

Transcription:

A KASSPER Real-Time Signal Processor Testbed Glenn Schrader 244 Wood St. exington MA, 02420 Phone: (781)981-2579 Fax: (781)981-5255 gschrad@ll.mit.edu The Knowledge Aided Sensor Signal Processing and Expert Reasoning (KASSPER) Program is a Defense Advanced Research Projects Agency (DARPA) program which has the goal of improving the performance of Ground Moving Target Indicator (GMTI) radar systems by incorporating external sources of knowledge into the signal processing chain. The KASSPER Real-Time Signal Processor Testbed and its associated Signal Processing Architecture is a prototype radar system scheduling and signal processing framework that has been developed at Massachusetts Institute of Technology incoln aboratory (MIT ). A typical scenario in which KASSPER processing could be useful is depicted in Fig. 1. Note that the GMTI clutter environment is heterogeneous (trees, open areas, an urban area, etc). By taking advantage of prior knowledge about this environment (i.e. locations of roads, terrain contours, types of ground cover, etc.), processing algorithms can have an opportunity to improve their performance by avoiding invalid assumptions about the environment. A top level diagram of the KASSPER architecture is shown in Fig. 2. The components of this architecture that are different from a conventional radar signal processor are the knowledge database, knowledge cache, and the look-ahead scheduler. Fig. 1. A representative KASSPER problem. This work is sponsored by the Defense Advanced Research Projects Agency, under Air Force Contract F19628-00-C-0002. Opinions, interpretations, conclusions, and recommendations are those of the author and are not necessarily endorsed by the United States Government.

Report Documentation Page Form Approved OMB No. 0704-0188 Public reporting burden for the collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden, to Washington Headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington VA 22202-4302. Respondents should be aware that notwithstanding any other provision of law, no person shall be subject to a penalty for failing to comply with a collection of information if it does not display a currently valid OMB control number. 1. REPORT DATE 01 FEB 2005 2. REPORT TYPE N/A 3. DATES COVERED - 4. TITE AND SUBTITE A KASSPERReal-Time Embedded Signal Processor Testbed 5a. CONTRACT NUMBER 5b. GRANT NUMBER 5c. PROGRAM EEMENT NUMBER 6. AUTHOR(S) 5d. PROJECT NUMBER 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) Massachusetts Institute of Technologyincoln aboratory 8. PERFORMING ORGANIZATION REPORT NUMBER 9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES) 10. SPONSOR/MONITOR S ACRONYM(S) 12. DISTRIBUTION/AVAIABIITY STATEMENT Approved for public release, distribution unlimited 11. SPONSOR/MONITOR S REPORT NUMBER(S) 13. SUPPEMENTARY NOTES See also ADM001742, HPEC-7 Volume 1, Proceedings of the Eighth Annual High Performance Embedded Computing (HPEC) Workshops, 28-30 September 2004., The original document contains color images. 14. ABSTRACT 15. SUBJECT TERMS 16. SECURITY CASSIFICATION OF: 17. IMITATION OF ABSTRACT UU a. REPORT unclassified b. ABSTRACT unclassified c. THIS PAGE unclassified 18. NUMBER OF PAGES 31 19a. NAME OF RESPONSIBE PERSON Standard Form 298 (Rev. 8-98) Prescribed by ANSI Std Z39-18

The knowledge database stores the 'knowledge' that both the the signal processing chain and the scheduler need to perform 'intelligent' processing. The knowledge database reformats the knowledge data into a form that is usable by the processing chain during a pre-mission data load. During system operation, the knowledge cache holds the portion of the knowledge store that is within the radar s field of view. The knowledge pre-processing step performs real-time operations such as transformations between geographic coordinates and radar system coordinates. Fig. 2 KASSPER Architecture Block Diagram An important factor in the performance of knowledge aided signal processing algorithms is that the application of knowledge must be performed locally. This tends to cause the signal processing to be done at a finer grain than is comfortable for achieving good computational efficiency. As an aid to making processor scaling choices for knowledge aided algorithms, signal processing and knowledge database benchmarks have been developed in order to quantify the efficiencies that will actually be achievable. This talk will discuss the design of the testbed, its architecture; the testbed s software to hardware mapping, knowledge aided processing algorithms, as well as computational performance data that have been collected for important knowledge-aided processing kernels. A number of topics (such as the look-ahead scheduler) are currently areas of ongoing research. An overview of the potential benefits as well as a vision of how the architecture will incorporate knowledge aided scheduling will be presented.

A KASSPER Real-Time Embedded Signal Processor Testbed Glenn Schrader Massachusetts Institute of Technology incoln aboratory 28 September 2004 This work is sponsored by the Defense Advanced Research Projects Agency, under Air Force Contract F19628-00-C-0002. Opinions, interpretations, conclusions, and recommendations are those of the author and are not necessarily endorsed by the United States Government. HPEC 2004 KASSPER Testbed-1

Outline KASSPER Program Overview KASSPER Processor Testbed Computational Kernel Benchmarks STAP Weight Application Knowledge Database Pre-processing Summary HPEC 2004 KASSPER Testbed-2

Knowledge-Aided Sensor Signal Processing and Expert Reasoning (KASSPER) MIT Program Objectives Develop systems-oriented approach to revolutionize ISR signal processing for operation in difficult environments Develop and evaluate highperformance integrated radar system/algorithm approaches Incorporate knowledge into radar resource scheduling, signal, and data processing functions Define, develop, and demonstrate appropriate real-time signal processing architecture Ground Moving Target Indication (GMTI) Search HPEC 2004 KASSPER Testbed-3 Synthetic Aperture Radar (SAR) Stripmap SAR Spot GMTI Track

Representative KASSPER Problem Radar System Challenges Heterogenous clutter environments Clutter discretes Dense target environments Results in decreased system performance (e.g. higher Minimum Detectable Velocity, Probability of Detection vs. False Alarm Rate) KASSPER System Characteristics Knowledge Processing Knowledge-enabled Algorithms Knowledge ISR Repository Platform Multi-function Radar with KASSPER Multiple Waveforms System + Algorithms Intelligent Scheduling Knowledge Types Mission Goals Static Knowledge Dynamic Knowledge Convoy B auncher Convoy A. HPEC 2004 KASSPER Testbed-4

Outline KASSPER Program Overview KASSPER Processor Testbed Computational Kernel Benchmarks STAP Weight Application Knowledge Database Pre-processing Summary HPEC 2004 KASSPER Testbed-5

High evel KASSPER Architecture Intertial Navigation System (INS), Global Positioning System (GPS), etc. Sensor Data ook-ahead Scheduler Signal Processing Target Detects Data Processor (Tracker, etc) Track Data Knowledge Database e.g. Mission Knowledge e.g. Terrain Knowledge e.g. Real-time Intelligence Operator HPEC 2004 KASSPER Testbed-6 Static (i.e. Off-line ) Knowledge Source External System

Baseline Signal Processing Chain GMTI Signal Processing Raw Sensor Data INS, GPS, etc Data Input Task Beamformer Task GMTI Functional Pipeline Filtering Filter Task Task Detect Task Data Output Task Target Detects Data Processor (Tracker, etc) ook-ahead Scheduler Knowledge Processing Track Data Knowledge Database Baseline data cube sizes: 3 channels x 131 pulses x 1676 range cells 12 channels x 35 pulses x 1666 range cells. HPEC 2004 KASSPER Testbed-7

KASSPER Real-Time Processor Testbed INS, GPS, etc Processing Architecture Sensor Data Software Architecture ook-ahead Scheduler Signal Processing Results Data Processor (Tracker, etc) Knowledge Database Application Application Application Parallel Vector ibrary (PV) Kernel Kernel Kernel Off-line Knowledge Source (DTED, VMAP, etc) KASSPER Signal Processor Sensor Data Storage Surrogate for actual radar system HPEC 2004 KASSPER Testbed-8 Knowledge Storage Rapid Prototyping High evel Abstractions Scalability Productivity Rapid Algorithm Insertion 120 PPC G4 CPUs @ 500 MHZ 4 GFlop/Sec/CPU = 480 GFlop/Sec

KASSPER Knowledge-Aided Processing Benefits HPEC 2004 KASSPER Testbed-9

Outline KASSPER Program Overview KASSPER Processor Testbed Computational Kernel Benchmarks STAP Weight Application Knowledge Database Pre-processing Summary HPEC 2004 KASSPER Testbed-10

Space Time Adaptive Processing (STAP) Range cell of interest? Training Data (A) Steering Vector (v) HPEC 2004 KASSPER Testbed-11 sample 1 sample 2 Solve for weights ( R=A H A, W=R -1 v) sample N-1 sample N STAP Weights (W) Training samples are chosen to form an estimate of the background Multiplying the STAP Weights by the data at a range cell suppresses features that are similar to the features that were in the training data X Range cell/w suppressed clutter

Space Time Adaptive Processing (STAP) Breaks Assumptions Range cell of interest? Training Data (A) Steering Vector (v) sample 1 sample 2 Solve for weights ( R=A H A, W=R -1 v) sample N-1 sample N STAP Weights (W) X Range cell/w suppressed clutter Simple training selection approaches may lead to degraded performance since the clutter in training set may not match the clutter in the range cell of interest HPEC 2004 KASSPER Testbed-12

Space Time Adaptive Processing (STAP) Range cell of interest? Training Data (A) Steering Vector (v) sample sample 1 2 Solve for weights ( R=A H A, W=R -1 v) sample N-1 STAP Weights (W) sample N X Range cell/w suppressed clutter Use of terrain knowledge to select training samples can mitigate the degradation HPEC 2004 KASSPER Testbed-13

STAP Weights and Knowledge STAP weights computed from training samples within a training region To improve processor efficiency, many designs apply weights across a swath of range (i.e. matrix multiply) With knowledge enhanced STAP techniques, MUST assume that adjacent range cells will not use the same weights (i.e. weight selection/computation is data dependant) STAP Input Training STAP Input select 1 of N Training * Weights * Weights External data STAP Output STAP Output HPEC 2004 KASSPER Testbed-14 Equivalent to Matrix Multiply since same weights are used for all columns Cannot use Matrix Multiply since the weights applied to each column are data dependant

Benchmark Algorithm Overview Benchmark Reference Algorithm (Single Weight Multiply) STAP Output = Weights X STAP Input Benchmark Data Dependant Algorithm (Multiple Weight Multiply) STAP Input Training Set STAP Weights N pre-computed Weight vectors * Weights applied to a column selected sequentially modulo N STAP Output Benchmark s goal is to determine the achievable performance compared to the well-known matrix multiply algorithm. HPEC 2004 KASSPER Testbed-15

Test Case Overview Data Set J x K x Single Weight Multiply K J X X K Multiple Weight Multiply K J N K N = # of weight matrices = 10 J and K dimension length Complex Data Single precision interleaved Two test architectures PowerPC G4 (500Mhz) Pentium 4 (2.8Ghz) HPEC 2004 KASSPER Testbed-16 dimension length Four test cases Single weight multiply/wo cache flush Single weight multiply/w cache flush Multiple weight multiply/wo cache flush Multiple weight multiply/w cache flush

Data Set #1 - xx Single Weight Multiply X MFOP/Sec MFOP/Sec Multiple Weight Multiply 10 X ength () ength () Single/wo flush Single/w flush Multiple/wo flush Multiple/w flush HPEC 2004 KASSPER Testbed-17

Data Set #2 32x32x Single Weight Multiply 32 32 X 32 MFOP/Sec MFOP/Sec Multiple Weight Multiply 32 32 X 32 10 ength () ength () Single/wo flush Single/w flush Multiple/wo flush Multiple/w flush HPEC 2004 KASSPER Testbed-18

Data Set #3 20x20x Single Weight Multiply 20 20 X 20 MFOP/Sec MFOP/Sec Multiple Weight Multiply 20 20 X 20 10 ength () ength () Single/wo flush Single/w flush Multiple/wo flush Multiple/w flush HPEC 2004 KASSPER Testbed-19

Data Set #4 12x12x Single Weight Multiply 12 12 X 12 MFOP/Sec MFOP/Sec Multiple Weight Multiply 12 12 X 12 10 ength () ength () Single/wo flush Single/w flush Multiple/wo flush Multiple/w flush HPEC 2004 KASSPER Testbed-20

Data Set #5 8x8x Single Weight Multiply 8 8 X 8 MFOP/Sec MFOP/Sec Multiple Weight Multiply 8 8 10 X 8 ength () ength () Single/wo flush Single/w flush Multiple/wo flush Multiple/w flush HPEC 2004 KASSPER Testbed-21

Performance Observations Data dependent processing cannot be optimized as well Branching prevents compilers from unrolling loops to improve pipelining Data dependent processing does not allow efficient reuse of already loaded data Algorithms cannot make simplifying assumptions about already having the data that is needed. Data dependent processing does not vectorize Using either Altivec or SSE is very difficult since data movement patterns become data dependant Higher memory bandwidth reduces cache impact Actual performance scales roughly with clock rate rather than theoretical peak performance Approx 3x to 10x performance degradation, Processing efficiency of 2-5% depending on CPU architecture. HPEC 2004 KASSPER Testbed-22

Outline KASSPER Program Overview KASSPER Processor Testbed Computational Kernel Benchmarks STAP Weight Application Knowledge Database Pre-processing Summary HPEC 2004 KASSPER Testbed-23

Knowledge Database Architecture Knowledge Manager New Knowledge (Time Critial Targets, etc) ookup GPS, INS, User Inputs, Update etc Command ook-ahead Scheduler Off-line Pre-Processing Knowledge Database Sensor Data oad/ Store/ Send New Knowledge Stored Data Signal Processing Results Stored Data Downstream Processing (Tracker, etc) Stored Data oad Knowledge Cache Knowledge Store Store Send Knowledge Processing Off-line Knowledge Reformatter Raw a priori Raster & Vector Knowledge (DTED,VMAP,etc) HPEC 2004 KASSPER Testbed-24

Static Knowledge Vector Data Each feature represented by a set of point locations: Points - longitude and latitude (i.e. towers, etc) ines - list of points, first and last points are not connected (i.e. roads, rail, etc) Areas - list of points, first and last point are connected (i.e. forest, urban, etc) Standard Vector Formats Vector Smart Map (VMAP) Matrix Data Rectangular arrays of evenly spaced data points Standard Raster Formats Digital Terrain Elevation Data (DTED) Trees (area) Roads (line) DTED HPEC 2004 KASSPER Testbed-25

Terrain Interpolation Sampled Terrain Height & Type data Data Geographic Position N Slant Range Terain Height Earth Radius Converting from radar to geographic coordinates requires iterative refinement of estimate HPEC 2004 KASSPER Testbed-26

Terrain Interpolation Performance Results CPU Type Time per interpolation Processing Rate P4 2.8 1.27uSec/interpolation 144MFlop/sec (1.2%) G4 500 3.97uSec/interpolation 46MFlop/sec (2.3%) Data dependent processing does not vectorize Using either Altivec or SSE is very difficult Data dependent processing cannot be optimized as well Compilers cannot unroll loops to improve pipelining Data dependent processing does not allow efficient reuse of already loaded data Algorithms cannot make simplifying assumptions about already having the data that is needed Processing efficiency of 1-3% depending on CPU architecture. HPEC 2004 KASSPER Testbed-27

Outline KASSPER Program Overview KASSPER Processor Testbed Computational Kernel Benchmarks STAP Weight Application Knowledge Database Pre-processing Summary HPEC 2004 KASSPER Testbed-28

Summary Observations The use of knowledge processing provides a significant system performance improvement Compared to traditional signal processing algorithms, the implementation of knowledge enabled algorithms can result in a factor of 4 or 5 lower CPU efficiency (as low as 1%-5%) Next-generation systems that employ cognitive algorithms are likely to have similar performance issues Important Research Directions Heterogeneous processors (i.e. a mix of COTS CPUs, FPGAs, GPUs, etc) can improve efficiency by better matching hardware to the individual problems being solved. For what class of systems is the current technology adequate? Are emerging architectures for cognitive information processing needed (e.g. Polymorphous Computing Architecture PCA, Application Specific Instruction set Processors - ASIPs)? HPEC 2004 KASSPER Testbed-29