MultiChipSat: an Innovative Spacecraft Bus Architecture. Alvar Saenz-Otero

Similar documents
ECE 471 Embedded Systems Lecture 2

SINGLE BOARD COMPUTER FOR SPACE

CNES requirements w.r.t. Next Generation General Purpose Microprocessor

Evolution of Computers & Microprocessors. Dr. Cahit Karakuş

Space Micro Satellite Computer Goals Space Computer Performance Goals: >1,000 MIPS throughput Less than 1 SEU in 1,000 days Less than 10 watts power R

High temperature / radiation hardened capable ARM Cortex -M0 microcontrollers

University Würzburg Am Hubland, Würzburg

FAULT TOLERANT SYSTEMS

Xilinx Virtex-5QV Update and Space Roadmap Kangsen Huey Space Marketing Aerospace & Defense 17th, March, 2016

Command & Data Handling. By: Justin Hadella Brandon Gilles

ELC4438: Embedded System Design Embedded Processor

On Design for Reliability

Lab Assignment Each team will independently implement the launch interceptor specification For this assignment, you re writing portable C code

SpaceWire Technologies deliver multi-gigabit data rates for on-board Spacecraft. SpaceTech Expo Gregor Cranston Business Development Manager

SCS750. Super Computer for Space. Overview of Specifications

PowerPC- based Processor Modules for Space Applications

ECE 259 / CPS 221 Advanced Computer Architecture II (Parallel Computer Architecture) Availability. Copyright 2010 Daniel J. Sorin Duke University

Computer Architecture. Fall Dongkun Shin, SKKU

ADVANCED ELECTRONIC SOLUTIONS AVIATION SERVICES COMMUNICATIONS AND CONNECTIVITY MISSION SYSTEMS

Growth outside Cell Phone Applications

Error Mitigation of Point-to-Point Communication for Fault-Tolerant Computing

4. Hardware Platform: Real-Time Requirements

Improving the Fault Tolerance of a Computer System with Space-Time Triple Modular Redundancy

Outline of Presentation Field Programmable Gate Arrays (FPGAs(

Configurable Fault Tolerant Processor (CFTP) for Space Based Applications

Embedded Systems Design (630414) Lecture 1 Introduction to Embedded Systems Prof. Kasim M. Al-Aubidy Computer Eng. Dept.

Chapter 1 Computer System Overview

Part 1 of 3 -Understand the hardware components of computer systems

Next-Generation Distributed Satellite Bus Information Systems

Mitigation of SCU and MCU effects in SRAM-based FPGAs: placement and routing solutions

Detector Control System board for FAIR. J. A. Lucio Martínez Infrastructure and Computer Systems in Data Processing (IRI) Goethe University Frankfurt

S950 3U cpci Radiation Tolerant PowerPC SBC

PROGRAMMABLE MODULE WHICH USES FIRMWARE TO REALISE AN ASTRIUM PATENTED COSMIC RANDOM NUMBER GENERATOR FOR GENERATING SECURE CRYPTOGRAPHIC KEYS

Space: The Final Frontier FPGAs for Space and Harsh Environments

AVR Microcontrollers Architecture

About using FPGAs in radiation environments

ELEG3923 Microprocessor Ch.0 & Ch.1 Introduction to Microcontroller

RAD6000 Space Computers

Plug and Play Satellite Evolution

Example of Technology Development Program

Advanced Computing, Memory and Networking Solutions for Space

Chapter 2: Computer-System Structures. Hmm this looks like a Computer System?

Uniprocessor Computer Architecture Example: Cray T3E

Cycles Per Instruction For This Microprocessor

Chapter 5. Introduction ARM Cortex series

Computer-System Organization (cont.)

Homeschool Enrichment. The System Unit: Processing & Memory

Memory Systems IRAM. Principle of IRAM

COTS Commercial is not always advertising Monica Alderighi

New STM32 F7 Series. World s 1 st to market, ARM Cortex -M7 based 32-bit MCU

Multimedia in Mobile Phones. Architectures and Trends Lund

Development Status for JAXA Critical Parts, 2008

Dependable Systems made by FIRST

Ultra Low-Cost Defect Protection for Microprocessor Pipelines

Multicore computer: Combines two or more processors (cores) on a single die. Also called a chip-multiprocessor.

Implementation of the Fault Tolerance Module in PHOENIX CubeSat

Hardware RAID, RAID 6, and Windows Storage Server

SAE AS5643 and IEEE1394 Deliver Flexible Deterministic Solution for Aerospace and Defense Applications

Development of Formation Flight and Docking Algorithms Using the SPHERES Testbed

Chapter 1: Introduction. Operating System Concepts 8th Edition,

Basic Components of Digital Computer

The special radiation-hardened processors for new highly informative experiments in space

PRIMEQUEST 400 Series & SQL Server 2005 Technical Whitepaper (November, 2005)

Multi-Core Microprocessor Chips: Motivation & Challenges

Introduction to Embedded Systems

Building Reliable High-Performance Hardware. Fraunhofer FOKUS. Fraunhofer FOKUS

Mastering The Behavior of Multi-Core Systems to Match Avionics Requirements

I/O CANNOT BE IGNORED

Operating Systems: Internals and Design Principles, 7/E William Stallings. Chapter 1 Computer System Overview

Embedded Systems: Hardware Components (part II) Todor Stefanov

Introduction to the Personal Computer

MicroCore Labs. MCL51 Application Note. Lockstep. Quad Modular Redundant System

HISTORY OF MICROPROCESSORS

Embedded Systems: Architecture

Embedded Systems. 7. System Components

Flight Computer: Managing the Complexity

THE SMART BACKPLANE LOWERING THE COST OF SPACECRAFT AVIONICS BY IMPROVING THE RADIATION TOLERANCE OF COTS ELECTRONIC SYSTEMS Daniel Gleeson, MSc.

Challenges for Future Interconnection Networks Hot Interconnects Panel August 24, Dennis Abts Sr. Principal Engineer

These slides contain projections or other forward-looking statements within the meaning of Section 27A of the Securities Act of 1933, as amended, and

Executive Summary. Functional and Performance Validation of the 80S32 µc. Deliverable D5.2 - Report Workpackage 5 Evaluation Kit

Intel iapx 432-VLSI building blocks for a fault-tolerant computer

Fault tolerance in consumer products. Ben Pronk

Dillon M. Collins, Brendan S. Surrusco, Sven G. Bilén, Charles C. Croskey The Pennsylvania State University

Computer Architecture!

1993 Paper 3 Question 6

1 COMMAND AND DATA HANDLING (C&DH)

LECTURE 5: MEMORY HIERARCHY DESIGN

The Use of Cloud Computing Resources in an HPC Environment

ESA round table. September L. Goulard PY. Bretécher

Computer Architecture A Quantitative Approach, Fifth Edition. Chapter 2. Memory Hierarchy Design. Copyright 2012, Elsevier Inc. All rights reserved.


Embedded Systems Introduction

Introduction to the Service Availability Forum

Functional Safety and Safety Standards: Challenges and Comparison of Solutions AA309

This Unit: Putting It All Together. CIS 371 Computer Organization and Design. What is Computer Architecture? Sources

Copyright 2012, Elsevier Inc. All rights reserved.

Instruction Set And Architectural Features Of A Modern Risc Processor

Computer Architecture. A Quantitative Approach, Fifth Edition. Chapter 2. Memory Hierarchy Design. Copyright 2012, Elsevier Inc. All rights reserved.

Your Company Logo Here. Flying High-Performance FPGAs on Satellites: Two Case Studies

SYSTEM BUS AND MOCROPROCESSORS HISTORY

Transcription:

MultiChipSat: an Innovative Spacecraft Bus Architecture Alvar Saenz-Otero 29-11-6

Motivation Objectives Architecture Overview Other architectures Hardware architecture Software architecture Challenges Outline 2

Introduction Can distinguish two trends in microprocessor requirements: Require high processing power for new complex missions, developed using COTS CPUs Need small (low cost/power/mass) processors for micro-satellites to perform simpler missions Existing use of radhard processors does not help with either of these trends The microprocessor (main controller) of a spacecraft remains a substantial investment, usually for technology at least one generation old Today: about $5k - $25k for a board based on a processor from the 199 s or FPGA from early 2 s Usually the processor does not have redundancy, rather being a high reliability radiation hardened element Especially true in small satellites, which have a small budget to start with New software is being more accepted for space use; it needs newer processors Autonomy and AI techniques depend on object oriented programming 3

Objectives Use multiple processors to achieve the same performance as modern radhard devices Demonstrate the same fault-tolerance performance as radhard processors Demonstrate that radiation tolerance is sufficient when redundancy can overcome any upsets Create redundancy in critical functions while fully utilizing the remainder of the processing capabilitis for other functions Enhance performance by allowing the use of newer COTS microprocessors, and/or Demonstrate sufficient performance from new generation MCU s to have the same capabilities of radhard microprocessors Create a cost-effective increase in sub-system redundancy Reduce total power requirements Utilize the same of smaller board area (m 3 and kg) as a radhard system Reduce satellite bus manufacturing without reducing performance Achieve the performance using COTS parts at regular industry prices 4

Architecture Overview Utilize multiple interconnected small footprint (cost, mass power) COTS micro processors and/or micro controllers to create a fault-tolerant system with similar performance as a custom radhard processor Enable future spacecraft to utilize current generation electronics, rather than depend on the small supply of radhard processors Utilize complete system processors Reduce component count and complexity E.g. PIC, AVR, ARM Shared robust bus MCU/DSP E.g. CAN bus Processors check each other using the bus Develop robust/redundant Towards no single-point failure (for critical tasks) Shared Bus MCU/DSP Redundant DIO Critical Devices MCU/DSP Bus Devices 5

Processor Architectures Review Four main architectures compared: MCU - single-chip solutions to very spcific and limited applications. Include processor core, memory, DIO, common communication buses. They do not have strong mathematical functions. DSP - core unit designed specifically for mathematical functions. Low power consumption. Needs external memory and interface IC s to create DIO and communication buses. Highly optimized for mathematical functions, including availability of single-instruction floating point cores. TI C671 radhard version has been used in multiple space missions. General Purpose MP - core unit designed for general applications, not usually used in embedded systems. Needs external memory and interface IC s to create DIO and communication buses. Usually have the highest performance, but require the most development work. The PowerPC has been adopted for space use, with existing radhard versions in use. FPGA - it is common to find FPGA s implementing virtual processors, especially PowerPC cores, as part of the system on a chip trend. Their implementation in an FPGA sometimes enables them to not need any external components. Radhard FPGA s are expected in 21. 6

Processor Architectures (2) Of the four technologies, MCUs do not need external components. FPGA s require external non-volatile memory. DSP s and general purpose processors require the most external elements. MCU/DSP MCU/DSP Cache MCU DSP FPGA FPGA 2 MCU/DSP Cache Chipset General Purpose Processor Drives Video 7

State of the Art Processors The table lists some of the newest processors in all four categories General purpose processors provide the most MIPS, but also consume the most power Radhard processors have capabilities like the processors of 1 years ago MCU s have the least capabilities, but only a few of them could be combined to achieve most capabilities of current radhard processors Processor MHz MIPS (min/max) Program Memory Data Memory Cache External Parts Power $ MicroChip MCU 8 8-124 512k 32k 16 None Many.25W $8 PIC32MX36F256L Atmel MCU 21 21 512k 32k 24k Optional Many.33W $22 AT91SAM9XE512 C671DSP 14 14-512k - 16MB- 512k Multiple None 2W >$4 (radhard) 1G 16MB 128MB PowerPC 132 26 4MB 36MB 1M Multiple Bus 25W >$2k RAD75 (radhard) Intel 25 up to 4GB+ up to 12M Many Bus 45W $12 2 Extreme Quad 1, 8GB FPGA: PowerPC NASA MSL 4 6 2G 256MB TBD Min 1 Many 2W $8 8

Multi Processor Architectures Multi-core products have increased the interest in multi-processor systems We want to use the same algorithms, whiel ensuring that its possible to create a fully redundant system (no single-point failures) General purpose multi-core systems have too many shared components MCU/DSP MCU/DSP FPGA designs could be used, as long as multiple FPGA s with multiple external IC s are used MSL has similar concepts to MultiChipSat, but currently is implementing everything on a single FPGA Robust to SEU s which can be overcome by a system reset Not robust to permanent HW failures (ie, silicon damage) MCU/DSP Cache Chipset General Purpose Processor Drives Video Shared Bus (Dual?) Redundant MultiChipSat MCU s FPGA Redundant DIO Critical Devices Bus Devices Bus Devices DIO Critical Devices FPGA (with added redundancy) 9

Software Architecture MultiChipSat involves the development of a Kernel which provides oversight of the multi-processor system Two main fields in computer science Parallel processing: achieving full redundancy of critical tasks and using parallel threads to improve performance of overall system Voting algorithms / graceful degradation: enable fault detection and isolation, followed by recovery algorithms that return the processor(s) to a valid state in case of recoverable failures or put the system in a valid reduced capability state where critical tasks still complete Evolutionary Recovery Electronics research at JPL detects functionality changes in FPTA s and reconfigures the array to recover the functionality, specifically to address radiation issues in sub.5micron process IC s. (Adrian Stoica, Tughrul Arslan, et al, Evolutionary Recovery of Electronic Circuits from Radiation Induced Faults ) 1

Failure Types Single-Event-Upsets data corruption which results in a processor creating a bad output or halting operations the event is recoverable by restarting the processor and returning to a known state Permanent Hardware Failures high energy particles physically damage the silicon in the processor, rendering it inoperable the software must then redistribute the load of critical tasks and stop performing non-critical tasks that are no longer possible 11

SW Example (1) Kernel: the OS developed for MultiChipSat Critical: tasks required to maintain the satellite operational High Priority: collecting the necessary data Low Priority: processing data locally (reducing comm req) Idle: margin Nominal operations Kernel has minimal impact 1 8 6 4 2 1 8 6 4 2 Processor #1 Processor #2 Failed Idle Low Priority High Priority Critical Kernel All operations complete, including low priority tasks 1 8 6 4 2 Processor #3 Event (Timeline) 12

SW Example (2) Nominal operations Failure occurs One processor stops working completely The other processors initially continue operating as they were, but the kernel is using the shared bus to check for failures... 1 8 6 4 2 1 8 6 4 2 Processor #1 Processor #2 Failed Idle Low Priority High Priority Critical Kernel 1 8 6 4 2 Processor #3 Event (Timeline) 13

SW Example (3) Nominal operations Failure occurs Error detection The kernel detects a processor has failed Kernel priority increases in operational processors to identify the failure No idle time Low priority tasks may not run 1 8 6 4 2 1 8 6 4 2 1 8 6 4 2 Processor #1 Processor #2 Processor #3 Failed Idle Low Priority High Priority Critical Kernel Event (Timeline) 14

SW Example (4) Nominal operations Failure occurs Error detection Recovery starts The critical & high priority tasks of the failed processor are taken over by the operational processors Low priority tasks are ignored One oversight processor maintains a high priority kernel to find the solution The other processor maximizes its ability to continue critical tasks from the failed processor 1 8 6 4 2 1 8 6 4 2 1 8 6 4 2 Processor #1 Processor #2 Processor #3 Event (Timeline) Failed Idle Low Priority High Priority Critical Kernel 15

SW Example (5) Nominal operations Failure occurs Error detection Recovery starts Recovery After the failed processor restarts, the oversight processor communicates to it necessary information to return to a valid state for the current operations The processors then return to normal operation Must be able to complete this cycle without negatively affecting satellite operations! 1 8 6 4 2 1 8 6 4 2 1 8 6 4 2 Processor #1 Processor #2 Processor #3 Event (Timeline) Failed Idle Low Priority High Priority Critical Kernel 16

Ensure no single-point failure Main Challenges Both in hardware and software - hard to identify single point failures in software Single-point failures in : how to combine s from multiple MCUs with no or minimal external components Large memory/processor requirements for Kernel A Kernel is pressumed to be small and have minimal impact on processing requirements The MultiChipSat Kernel might require more memory and than is normally available in MCU s Not considering payload requirements At this point the idea is to demonstrate the concept MCU memory limitations especially of concern for complex payloads Not considering rest of avionics system Power systems, signal conditioning, etc Demonstration of radiation effects It is not trivial to reproduce the space radiation environment Must initially perform tests with simulated failures; are they good enough? 17

Potential Benefits Considering the process to create a complete system board, MultiChipSat technology could: Improve MIPS if able to utilize new generation microprocessors or Provide equal MIPS to radhard processors with substantial less power/cost requirements Introduce redundancy in processor system Reduced cost by several orders of magnitude Reduce power consumption Reduce implementation time System # Processors MIPS Power [W] Cost [$] RAD75 1 26 25 ~$2k C671 Space 1 14-1G 2 ~$5 Intel 2 Quad 3 >7.5G 15 ~$6 Virtex II PowerPC 3 1.8G 6W ~$6 Atmel MCU 3 63 1W ~$1 PIC MCU 3 24.75W ~$1 18

Questions & Comments Thank you