Evaluation of Machine Learning Algorithms for Satellite Operations Support

Similar documents
Application of Deep Learning Techniques in Satellite Telemetry Analysis.

Python With Data Science

Introduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core)

SQL Server Machine Learning Marek Chmel & Vladimir Muzny

Oracle Machine Learning Notebook

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

Machine Learning in the Process Industry. Anders Hedlund Analytics Specialist

Analytics and Visualization

Certified Data Science with Python Professional VS-1442

SCIENCE. An Introduction to Python Brief History Why Python Where to use

scikit-learn (Machine Learning in Python)

Knowledge Discovery. URL - Spring 2018 CS - MIA 1/22

Tutorial on Machine Learning Tools

Distributed Anomaly Detection using Autoencoder Neural Networks in WSN for IoT

Motivation. Problem: With our linear methods, we can train the weights but not the basis functions: Activator Trainable weight. Fixed basis function

Build a system health check for Db2 using IBM Machine Learning for z/os

ODL based AI/ML for Networks Prem Sankar Gopannan, Ericsson YuLing Chen, Cisco

ML 프로그래밍 ( 보충 ) Scikit-Learn

Machine Learning in Python. Rohith Mohan GradQuant Spring 2018

Rapid growth of massive datasets

ESOC s Successes, Complications and Opportunities in using Cloud Computing and Big Data Technology

Machine Learning with Python

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

CS 229 Project Report:

SUPERVISED LEARNING METHODS. Stanley Liang, PhD Candidate, Lassonde School of Engineering, York University Helix Science Engagement Programs 2018

REGRESSION ANALYSIS : LINEAR BY MAUAJAMA FIRDAUS & TULIKA SAHA

Specialist ICT Learning

Analytics Platform for ATLAS Computing Services

HMC CS 158, Fall 2017 Problem Set 3 Programming: Regularized Polynomial Regression

Scalable Machine Learning in R. with H2O

Machine Learning With Python. Bin Chen Nov. 7, 2017 Research Computing Center

Keras: Handwritten Digit Recognition using MNIST Dataset

Introduction to Data Mining and Data Analytics

Machine Learning: Think Big and Parallel

Python Certification Training

Data Analytics Training Program

Ch.1 Introduction. Why Machine Learning (ML)?

Keras: Handwritten Digit Recognition using MNIST Dataset

Facial Expression Classification with Random Filters Feature Extraction

Building High Performance Apps using NoSQL. Swami Sivasubramanian General Manager, AWS NoSQL

CPSC 340: Machine Learning and Data Mining. More Linear Classifiers Fall 2017

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS

CIS192 Python Programming

Data Analytics Training Program using

ECE 285 Final Project

Intrusion Detection System using AI and Machine Learning Algorithm

Lecture #11: The Perceptron

Gradient Descent. Wed Sept 20th, James McInenrey Adapted from slides by Francisco J. R. Ruiz

Why data science is the new frontier in software development

Machine Learning Part 1

Fishing Activity Visualization with Free Software Bigdata Analytics Institute

Self Programming Networks

Interpretable Machine Learning with Applications to Banking

A Dendrogram. Bioinformatics (Lec 17)

Machine Learning in Action

DATA SCIENCE INTRODUCTION QSHORE TECHNOLOGIES. About the Course:

COMPUTATIONAL INTELLIGENCE (INTRODUCTION TO MACHINE LEARNING) SS18. Lecture 2: Linear Regression Gradient Descent Non-linear basis functions

Machine Learning In A Snap. Thomas Parnell Research Staff Member IBM Research - Zurich

Creating Situational Awareness with Spacecraft Data Trending and Monitoring

Data Science Bootcamp Curriculum. NYC Data Science Academy

Lesson 5 Nimbits. Chapter-6 L05: "Internet of Things ", Raj Kamal, Publs.: McGraw-Hill Education

Python for. Data Science. by Luca Massaron. and John Paul Mueller

Leveraging Adaptive Software Standards to Enable the Rapid Standup of Small Satellite Ground Systems

Tackling Big Data Using MATLAB

M. Sc. (Artificial Intelligence and Machine Learning)

Intelligent Edge Computing and ML-based Traffic Classifier. Kwihoon Kim, Minsuk Kim (ETRI) April 25.

Optimization. 1. Optimization. by Prof. Seungchul Lee Industrial AI Lab POSTECH. Table of Contents

Scaled Machine Learning at Matroid

ML4Bio Lecture #1: Introduc3on. February 24 th, 2016 Quaid Morris

A Multilevel Secure MapReduce Framework for Cross-Domain Information Sharing in the Cloud

SVM multiclass classification in 10 steps 17/32

Intel Distribution for Python* и Intel Performance Libraries

Big Data and FrameWorks; Perspectives to Applied Machine Learning

Data Science with PostgreSQL

Machine Learning in Biology

Integrating Advanced Analytics with Big Data

Connecting ArcGIS with R and Conda. Shaun Walbridge

Ch.1 Introduction. Why Machine Learning (ML)? manual designing of rules requires knowing how humans do it.

What is machine learning?

Frameworks in Python for Numeric Computation / ML

Recurrent Neural Network (RNN) Industrial AI Lab.

Clustering algorithms and autoencoders for anomaly detection

COPYRIGHT DATASHEET

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016

Accessing ODBC API Using Python for Detecting Fraudulent Financial Journal Entries

Based on Big Data: Hype or Hallelujah? by Elena Baralis

ABSTRACT I. INTRODUCTION. Dr. J P Patra 1, Ajay Singh Thakur 2, Amit Jain 2. Professor, Department of CSE SSIPMT, CSVTU, Raipur, Chhattisgarh, India

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver

CS535 Big Data Fall 2017 Colorado State University 10/10/2017 Sangmi Lee Pallickara Week 8- A.

Introduction to Data Analytics. David Walling

Opportunities and challenges in personalization of online hotel search

ndpi & Machine Learning A future concrete idea

Index. Umberto Michelucci 2018 U. Michelucci, Applied Deep Learning,

Using Machine Learning to Identify Security Issues in Open-Source Libraries. Asankhaya Sharma Yaqin Zhou SourceClear

Allstate Insurance Claims Severity: A Machine Learning Approach

A new approach for supervised power disaggregation by using a deep recurrent LSTM network

MIT Samberg Center Cambridge, MA, USA. May 30 th June 2 nd, by C. Rea, R.S. Granetz MIT Plasma Science and Fusion Center, Cambridge, MA, USA

Transcription:

Evaluation of Machine Learning Algorithms for Satellite Operations Support Julian Spencer-Jones, Spacecraft Engineer Telenor Satellite AS Greg Adamski, Member of Technical Staff L3 Technologies Telemetry and RF Products 2017 by L3 Technologies and Telenor Satellite AS. Published by The Aerospace Corporation with permission.

Agenda Goals and background Machine learning tools overview Archiving subsystem ease of data access Machine learning for telemetry data - concepts Analyzing data from Telenor fleet: techniques, algorithms and results Lessons learned and future development

Problem statement and background Goal: predict battery voltage levels through eclipse season to prevent a catastrophic power failure. We will be analyzing data from the past 7 years and using machine learning algorithms to attempt a realistic prediction. Secondary goal: gain experience in machine learning algorithms and their applicability to satellite telemetry data analysis. Ease of use of the algorithm was of paramount importance: Quick implementation Results that can be explained Good defaults and as little requirement on data preparation as possible Lack of data science background the work was done by a Satellite Engineer and a Software Engineer We expect this is true for many organizations

Tools Machine learning - scikit-learn: open source library for machine learning - Powerful and includes a wide range of algorithms and tools - Used for all the work in this project scikit-learn NumPy SciPy matplotlib Python Many alternatives available in a variety of languages (not tested) Especially in Python and R Also: TensorFlow (Google), MSCNTK, Including cloud based like Amazon AI

Data Storage Nebula Archiver From proprietary to open Legacy archiving based on closed, proprietary file formats Data access possible only through proprietary APIs or GUIs Nebula Archiver power to the user Based on Cassandra No-SQL database (other DBs supported as needed) Almost unlimited scalability Lightning fast data access Data laid out to match typical access patterns Ease of access just a few lines of python code needed to retrieve and plot data Easily accessible from any programming platform

Nebula Archiver high level architecture Real time feed from operational fleet InControl Real time data Cassandra Driver CQL Calls Historical reports Happy operator Historical data for analytics Legacy Data Store Source reader 3 rd party data source Nebula Importer Data Updates Cassandra Cluster Nebula Archiver Data access for ML Machine Learning/ Analytics Tools Other system Data Updates/Import

Nebula Archiver - Benefits For the operator Quick overview of large portions of data [view 1 year of data in as little as a few seconds] Reporting has no impact on the real time processing For the engineer Very fast and simple access to stored data using open tools (python/java/js) Use analytical tools without impacting the real time system Almost unlimited scalability For the manager Happy operator/engineer Reduced hardware cost and ability to expand storage subsystem as needs grow

Machine learning - concepts Machine learning: data analytics technique that uses historical data to predict future system behavior Supervised learning: using engineering knowledge in conjunction with data to train the algorithm Unsupervised learning: predicting future behavior without user input Classification: Automatic assignment of an event to a class based on historical data and other inputs Overfitting: learning the underlying data vs. trends in the data

Regression Algorithms Several different regression algorithms were tested: Linear regression stochastic gradient descent State vector machine linear, polynomial and radial basis function Random forest extra trees regressor Extra trees regressor produced the best results in most cases tested, especially for time series. SVM RBF is faster and sometimes a good alternative.

Regression Use Cases Monitoring battery cell voltage through eclipse seasons. Dynamic limits potentially used for any predictable telemetry point. Predicting the attitude disturbance of momentum wheel unloads. Based on the momentum stored on the wheels.

Regression Example Battery Cell Voltages Battery cell voltages are monitored during eclipse seasons. No model has previously been found to reliably predict them.

Regression Example Battery Cell Voltages Extra Trees Regressor creates a good prediction. Model is updated during the eclipse season.

Regression Example Battery Cell Voltages Algorithm has learnt the data from seven previous eclipse seasons. Uses temperature, power and load telemetry as well as cell voltages.

Regression Example Dynamic Limits Predicts the trends ahead and sets limits. Prediction accuracy deteriorates with time. More work needed to reduce false alarms.

Classification Classification was tested for use in monitoring battery charging. Supervised training of previous charges to identify nominal charge characteristics for the various modes and states. More work needed to develop a useful tool. Would be useful for further work in anomaly detection.

Lessons Learned and Future Directions Lessons Learned: Challenge is to define the task and preprocess data. Some useful tools developed. Too few anomalies. Hard to train algorithms without failures. Future Development: Expand and improve current tools - Plenty of other potential applications Anomaly detection - Dynamic limits, out of family behaviour, component degradation Investigate techniques for more general learning