Assessing Applications of Cloud Computing to NASA's Earth Observing System Data and Information System (EOSDIS)


Chris Lynnes, Katie Baynes, Mark McInerney
NASA/GSFC ESDIS

Earth Observing System Data and Information System (EOSDIS)
[Diagram: EOSDIS data flow. Labels: data downlink; capture and clean; process; archive; subset; distribute; Users (Research, Applications, Education)]


Opportunities from Cloud Computing
1. Data Volume
   a. Challenge for Archives
   b. Challenge for Users
2. Performance
   a. Load balancing and scaling
   b. Resiliency
3. Cost Reduction(?)

Overall Approach
- Program of prototypes
  - Demonstration prototypes: will this work?
  - Operational prototypes: how does this change the way we work?
- Focus on public clouds for maximum cost savings
- Leverage existing software and efforts...
- ...but refactor software to be cloud-native, not forklift migrations

EOSDIS Cloud Prototypes
[Figure key: Archive Mgmt; Analytics Support; Application Hosting]

Data Volume Challenge for Archives
[Chart: Archive Volume and Distribution Volume, in petabytes]

Archive Cloud Prototypes
Benefits from Archive in the Cloud
- Cost savings for storage of Big Data?
- Avoid data downloading and local data management

Archive Cloud Prototypes
- Alaska Satellite Facility Web Object Storage prototype: distribute Sentinel radar data from Amazon storage
  [Diagram labels: ESA data hub; NASA Sentinel Gateway; Data cannon; EDGE Server in cloud; EDGE Server cluster at EDC*; Alaska Satellite Facility]
  *EDC = EROS (Earth Resources Observation and Science) Data Center
- Global Imagery Browse Service in the Cloud (AWS Lambda)
- Ingest and Archive management prototype (AWS Lambda)
- NISAR Mission Preparatory Prototype

Data Volume Challenge for the Users
One Solution: Move more analysis closer to the data

Analysis
[Diagram: analysis operations ordered by increasing complexity. Labels: Subset Data; Variable; Spatial Area; Reprojection; Quality Filter; Mosaicking; Transform; Simple Stats; Analyze; Complex Stats; End User's Algorithm; Seasonal Time Series]
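As an illustration of the simpler end of that range, the sketch below chains a spatial subset, a quality filter, and simple statistics. The record layout, field names, and quality-flag convention (0 = good) are assumptions made for this example, not taken from any EOSDIS product.

```python
from statistics import mean

# Hypothetical observations; field names and values are illustrative only.
OBS = [
    {"lat": 10.0, "lon": 20.0, "value": 2.0, "quality": 0},
    {"lat": 12.0, "lon": 22.0, "value": 4.0, "quality": 0},
    {"lat": 11.0, "lon": 21.0, "value": 100.0, "quality": 2},
    {"lat": 50.0, "lon": 60.0, "value": 9.0, "quality": 0},
]


def subset_area(obs, lat_min, lat_max, lon_min, lon_max):
    """Keep only observations inside the bounding box (inclusive)."""
    return [
        o for o in obs
        if lat_min <= o["lat"] <= lat_max and lon_min <= o["lon"] <= lon_max
    ]


def quality_filter(obs, max_flag=0):
    """Keep only observations whose quality flag is at most max_flag."""
    return [o for o in obs if o["quality"] <= max_flag]


def simple_stats(obs):
    """Summarize the remaining values; returns None fields when empty."""
    values = [o["value"] for o in obs]
    if not values:
        return {"count": 0, "mean": None, "min": None, "max": None}
    return {
        "count": len(values),
        "mean": mean(values),
        "min": min(values),
        "max": max(values),
    }


stats = simple_stats(quality_filter(subset_area(OBS, 5, 15, 15, 25)))
```

Each step only narrows or summarizes the data, so the same chain could in principle run next to the archive and return just the small result.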

Cloud Analytics Prototypes
Benefits from Cloud Analytics
- Analyze data at scale
- Analyze datasets together easily
- Avoid data downloading and local management

Users work with the data "in place": avoid tedious cycles of download-preprocess-store-analyze
[Diagram: "Analysis Today" (Archive A, Archive B, Archive C, Local Computer) versus "A New Paradigm" (Analysis Cloud)]

Distributing the data enables distributed, parallel computing. Results are computed for chunks of the problem simultaneously and then recombined and reduced.
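A minimal sketch of that chunk-then-reduce pattern, using only the Python standard library. The function names are made up for this example, and a thread pool stands in for a real cluster; a framework such as Spark would distribute the chunks across machines instead.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def split_into_chunks(values, n_chunks):
    """Split a list into at most n_chunks contiguous, roughly equal pieces."""
    size = max(1, -(-len(values) // n_chunks))  # ceiling division
    return [values[i:i + size] for i in range(0, len(values), size)]


def partial_stats(chunk):
    """Map step: summarize one chunk as (sum, count)."""
    return (sum(chunk), len(chunk))


def combine(a, b):
    """Reduce step: merge two partial summaries into one."""
    return (a[0] + b[0], a[1] + b[1])


def parallel_mean(values, n_chunks=4):
    """Compute the mean by summarizing chunks concurrently, then reducing."""
    chunks = split_into_chunks(values, n_chunks)
    with ThreadPoolExecutor(max_workers=n_chunks) as pool:
        partials = list(pool.map(partial_stats, chunks))
    total, count = reduce(combine, partials, (0.0, 0))
    return total / count if count else float("nan")


result = parallel_mean([1, 2, 3, 4, 5, 6, 7, 8])
```

The mean works here because each chunk's (sum, count) summary can be merged in any order; statistics without such a mergeable summary need a different reduce step.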

Cloud Analytics Prototypes
- Analysis support toolbox to attract users to cloud analytics
- NEXUS prototype to accelerate processing
  - Spark + Cassandra DB
- Community open source tools
- DAAC-developed tools
- Cloud analytics examples and recipes

Analytics in the Cloud
Long Term Paradigm Shift: How Scientists Work on Data
- Scientists will work on data in place instead of downloading
- High-value data will (also) be in databases
- Pre-existing toolsets will be easy to find and use

Application-Hosting & Processing Prototypes
Cloud Application Deployments
- Port existing applications to public cloud
- Leverage NASA-Compliant General Application Platform (NGAP)

NGAP: Compliance-as-a-Service
[Diagram: service stack]
- Software-as-a-Service
- Compliance-as-a-Service: security controls, Authorization to Operate; governance; procurement and accounting; reliability and availability
- Platform-as-a-Service
- Infrastructure-as-a-Service

Application-Hosting & Processing in the Cloud
Paradigm Shift: How We Implement Systems
- Off-the-shelf systems reduce effort for hardware procurement and deployment
- Automate everything: testing, deployment, scaling, failover, ...
Benefits
- Science users get: new capabilities sooner
- NASA gets: more reuse, lower hardware costs

EOSDIS Cloud Progress so far...
Archive Prototypes
- Serving data to the public from the Alaska Satellite Facility Web Object Storage prototype in AWS Simple Storage Service (S3)
- End-to-end Lambda workflow has been demonstrated for the Ingest / Archive management prototype
- Global Imagery Browse Service is undergoing system testing
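The slides do not describe how the Ingest / Archive management workflow is built. Purely as a hypothetical illustration of one Lambda-style ingest step, the sketch below turns an S3 "object created" notification into catalog records. The `Records[].s3.bucket.name` and `Records[].s3.object.key` fields follow the standard S3 event layout; the record fields (`granule_id`, `size_bytes`) and the sample bucket and key are invented for this example.

```python
from urllib.parse import unquote_plus


def handler(event, context):
    """Hypothetical ingest step: build one catalog record per new S3 object."""
    records = []
    for rec in event.get("Records", []):
        s3 = rec["s3"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(s3["object"]["key"])
        records.append({
            "bucket": s3["bucket"]["name"],
            "key": key,
            "size_bytes": s3["object"].get("size", 0),
            # Assumption for this sketch: the file name serves as granule ID.
            "granule_id": key.rsplit("/", 1)[-1],
        })
    return {"ingested": len(records), "records": records}


# Invented sample event with the minimal fields the handler reads.
sample_event = {
    "Records": [
        {
            "s3": {
                "bucket": {"name": "example-archive-bucket"},
                "object": {"key": "incoming/granule%3A001.h5", "size": 1024},
            }
        }
    ]
}
result = handler(sample_event, None)
```

A real workflow would follow this step with writes to a metadata catalog and archive storage; those calls are left out because the source gives no detail about them.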

EOSDIS Cloud Progress so far...
Analytics Prototypes
- NEXUS analytics algorithms benchmarked vs. Giovanni
- Roughly order-of-magnitude speedup achieved on a cluster
- Now porting to cloud-native architecture

EOSDIS Cloud Progress so far...
Application Hosting Prototypes
- NASA-Compliant General Application Platform is now authorized to operate publicly
- Earthdata Search client is operational and accessible to the public in AWS
- Modified processes to account for new costing mechanisms

Terra Incognita
1. Vendor Lock-in
2. Future storage costs
3. Uncapped egress costs
4. Security Restrictions and Network trust