Prediction of workflow execution time using provenance traces: practical applications in medical data processing

Size: px
Start display at page:

Download "Prediction of workflow execution time using provenance traces: practical applications in medical data processing"

Transcription

1 Prediction of workflow execution time using provenance traces: practical applications in medical data processing Hugo Hiden Simon Woodman Paul Watson

2 How long will my program take to run?

3 Part of a bigger picture Can I repeat my results? What are the implications of errors How long will my program take to run? What version of the program ran? How was a result generated?

4 Provenance Research Used to answer these questions Important in scientific research Lots of work done to capture and represent provenance Active research area OPM PROV

5 e-science Central Source of all our provenance data Platform used for many projects Repository of code and data Users can add their own code Well instrumented and understood Used to collect OPM Now PROV Plenty of data sets Diverse projects Large applications Workflows for data processing

6 The workflow model Simple workflow implementation Acyclic directed graph Composed of connected Blocks Deploys at reasonable scale in clouds

7 Modelling performance Execution time for a single block Workflow is some combination of individual block models There should be some predictors: The input data sizes The configuration of the block The machine it is running on The issues are: What types of model are most appropriate How accurate are they

8 Execution time of a block time=f(input-size, block-code, block-settings, random-factors) More data increases execution time Each block has different characteristics, so a model is needed for each block The configuration of the block instance can change behavior Machine load, network traffic, hardware variations, A workflow is a connected pathway of blocks

9 Requirements for a real system Proactively build models In response to more data When more blocks are added Select the most appropriate model Pick based on best error Aim to always return some estimate Mechanisms to return estimate if no models are available

10 Complications Gathering data Collect data non-invasively Model types Different blocks display different characteristics Different algorithms and versions Dynamic environment New blocks being added Block behaviour only becomes apparent as data is collected

11 Data collected via provenance Provenance collection already captures: Data sizes Code versions Algorithm settings Extra instrumentation for Block start and end times Number of concurrent workflows CPU / Memory usage

12 e-sc Architecture Tooling - Maven Plugins - File uploader - Domain specific apps/websites Workflow Engines Workflow Queue External API REST User MGMT - Friends - Groups - Projects - Quotas HTTP e-sc DB Postgres MySQL SQL Server Service/Lib Cache Control Topic Security - ACL - Authentication Migration Queue e-sc Blob Store New e-sc Blob Store Filesystem S3 Postgres Azure Blob Store HDFS Private API RMI Storage - Versioning - Archiving Provenance Store OpenID Shibboleth External Auth REST Processing - Services - Workflows - Libs Provenance Queue Postgres Neo4j SWORD Provenance/ Audit - Capture - Query/Search - Presentation Archive Queues Archive Filesystem AWS Glacier

13 Data capture architecture Provenance and performance data capture Data / model storage Tooling - Maven Plugins - File uploader - Domain specific apps/websites Workflow Engines Workflow Queue External API REST User MGMT - Friends - Groups - Projects - Quotas HTTP Service/Lib Cache Control Topic Security - ACL - Authentication Data Models Private API RMI Storage - Versioning - Archiving OpenID Shibboleth REST Model building / updating External Auth Processing - Services - Workflows - Libs SWORD Provenance/ Audit - Capture - Query/Search - Presentation

14 Data collected Each execution of a block creates a single data point: ID, Version Setting_1, Setting_2, Memory Use, Input_size Duration, Output_size ID, Version Setting_1, Setting_2, Memory Use, Input_size Duration, Output_size ID, Version Setting_1, Setting_2, Memory Use, Input_size Duration, Output_size Identifying data Model X data Model Y data

15 Block models Execution Time No relationship Blocks may exhibit very different behaviors depending on their implementation details / configuration Execution Time Observed Execution Data Linear relationship Observed Execution Data Execution Time Non-linear relationship Observed Execution Data

16 Selecting the most appropriate model

17 Selecting the most appropriate model

18 Selecting the most appropriate model

19 Selecting the most appropriate model

20 Dynamic model updating Impossible (difficult) to know what the best model will be Gathering more data may change our view Need to implement model updating Models can be rebuilt and replaced on the fly Return best available estimate at a given time This may improve

21 Panel of experts pattern Maintain a suite of different models Rebuild them all when new data arrives Use the best one until the next update Drug modelling project: Quantitative Structure Activity Relationship Activity f( )

22 Model fallbacks What happens if there is no model? Still want to return something We used the following logic: Use version agnostic model Use average execution time of block Use average execution time of all blocks This will always return some prediction as long as a single block of any type has executed

23 Medical data processing Measure acceleration in 3-axes Typically 100Hz Worn for 2 weeks Analyse sleep patterns, general activity levels etc Data collected and analysed Clinicians view results and modify exercise regime Collections of 100k data sets (24TB) Wrist worn accelerometers

24 Results Physical Activity Classification (PAC1) Prediction Fitted Ideal Prediction Fitted Ideal Predicted (KB) Predicted (seconds) Actual (KB) Actual (seconds) Output size model Duration model

25 Results GGIR GENEActiv processing Prediction Fitted Ideal Prediction (RMSE=34.670, r 2 =0.987) Fitted Ideal Predicted (KB) Predicted (seconds) Actual (KB) Actual (seconds) Output size model Duration model

26 Not always successful Prediction Fitted Ideal 60 Predicted (seconds) Actual (seconds)

27 Predicting Workflow duration Modelling is complicated by connected nature of workflow All data for model readily available not the case here? how big are the intermediate data transfers??????

28 Data volume produced by a block size=f(input-size, block-code, block-settings, random-factors) More data increases execution time Each block has different characteristics, so a model is needed for each block The configuration of the block instance can change behavior Machine load, network traffic, hardware variations, phase of moon

29 Modelling total execution time Execution time = Sum(block predictions)

30 Models built for each individual block Prediction generated by propagating size predictions Results Chemical property modelling Predicted (seconds) Training Prediction (RMSE=5.008,r 2 =0.980) Testing Prediction (RMSE=4.698,r 2 =0.981) Fitted Training Ideal Actual (seconds)

31 Modelling workflows: caveats Much harder to model workflow duration Propagation of errors Works for simple workflows Rapidly fails for larger workflows Possible solutions More data collection Model groups of blocks Build models of whole workflows

32 Conclusions Extended provenance capture to build predictive models Asynchronous collection of data and model building Demonstrated it is possible to model block execution time Show it may be possible to combine predictions to estimate workflow execution time Large workflows / poor block models are issues

Microsoft Architecting Microsoft Azure Solutions.

Microsoft Architecting Microsoft Azure Solutions. Microsoft 70-535 Architecting Microsoft Azure Solutions https://killexams.com/pass4sure/exam-detail/70-535 QUESTION: 106 Your organization has developed and deployed several Azure App Service Web and API

More information

MS-20487: Developing Windows Azure and Web Services

MS-20487: Developing Windows Azure and Web Services MS-20487: Developing Windows Azure and Web Services Description In this course, students will learn how to design and develop services that access local and remote data from various data sources. Students

More information

Users Application Virtual Machine Users Application Virtual Machine Users Application Virtual Machine Private Cloud Users Application Virtual Machine On-Premise Service Providers Private Cloud Users Application

More information

Most real programs operate somewhere between task and data parallelism. Our solution also lies in this set.

Most real programs operate somewhere between task and data parallelism. Our solution also lies in this set. for Windows Azure and HPC Cluster 1. Introduction In parallel computing systems computations are executed simultaneously, wholly or in part. This approach is based on the partitioning of a big task into

More information

Developing Windows Azure and Web Services

Developing Windows Azure and Web Services Developing Windows Azure and Web Services Course 20487B; 5 days, Instructor-led Course Description In this course, students will learn how to design and develop services that access local and remote data

More information

Course Outline: Course 50466A: Windows Azure Solutions with Microsoft Visual Studio 2010

Course Outline: Course 50466A: Windows Azure Solutions with Microsoft Visual Studio 2010 Course Outline: Course 50466A: Windows Azure Solutions with Microsoft Visual Studio 2010 Learning Method: Instructor-led Classroom Learning Duration: 3.00 Day(s)/ 24 hrs Overview: This class is an introduction

More information

Backup & Recovery on AWS

Backup & Recovery on AWS EBOOK: Backup & Recovery on AWS 2017, Amazon Web Services, Inc. or its affiliates. All rights reserved. Contents Backup and Recovery on AWS AWS Object Storage Services RDX APN Storage Partner Benefits

More information

Exam Questions

Exam Questions Exam Questions 70-475 Designing and Implementing Big Data Analytics Solutions https://www.2passeasy.com/dumps/70-475/ 1. Drag and Drop You need to recommend data storage mechanisms for the solution. What

More information

Oracle Cloud IaaS: Compute and Storage Fundamentals

Oracle Cloud IaaS: Compute and Storage Fundamentals Oracle University Contact Us: 1.800.529.0165 Oracle Cloud IaaS: Compute and Storage Fundamentals Duration: 3 Days What you will learn This Oracle Cloud IaaS: Compute and Storage Fundamentals training gives

More information

Developing Microsoft Azure Solutions

Developing Microsoft Azure Solutions Course 20532C: Developing Microsoft Azure Solutions Course details Course Outline Module 1: OVERVIEW OF THE MICROSOFT AZURE PLATFORM This module reviews the services available in the Azure platform and

More information

Course Outline. Introduction to Azure for Developers Course 10978A: 5 days Instructor Led

Course Outline. Introduction to Azure for Developers Course 10978A: 5 days Instructor Led Introduction to Azure for Developers Course 10978A: 5 days Instructor Led About this course This course offers students the opportunity to take an existing ASP.NET MVC application and expand its functionality

More information

20532D - Version: 1. Developing Microsoft Azure Solutions

20532D - Version: 1. Developing Microsoft Azure Solutions 20532D - Version: 1 Developing Microsoft Azure Solutions Developing Microsoft Azure Solutions 20532D - Version: 1 5 days Course Description: This course offers students the opportunity to take an existing

More information

Developing Enterprise Cloud Solutions with Azure

Developing Enterprise Cloud Solutions with Azure Developing Enterprise Cloud Solutions with Azure Java Focused 5 Day Course AUDIENCE FORMAT Developers and Software Architects Instructor-led with hands-on labs LEVEL 300 COURSE DESCRIPTION This course

More information

Developing Solutions for Google Cloud Platform (CPD200) Course Agenda

Developing Solutions for Google Cloud Platform (CPD200) Course Agenda Developing Solutions for Google Cloud Platform (CPD200) Course Agenda Module 1: Developing Solutions for Google Cloud Platform Identify the advantages of Google Cloud Platform for solution development

More information

COURSE 20487B: DEVELOPING WINDOWS AZURE AND WEB SERVICES

COURSE 20487B: DEVELOPING WINDOWS AZURE AND WEB SERVICES ABOUT THIS COURSE In this course, students will learn how to design and develop services that access local and remote data from various data sources. Students will also learn how to develop and deploy

More information

Science-as-a-Service

Science-as-a-Service Science-as-a-Service The iplant Foundation Rion Dooley Edwin Skidmore Dan Stanzione Steve Terry Matthew Vaughn Outline Why, why, why! When duct tape isn t enough Building an API for the web Core services

More information

Azure Application Building Blocks

Azure Application Building Blocks Azure Application Building Blocks database storage cloud services identity media CDN caching messaging Commonly used components inside the building blocks 1. Cloud Services Azure WebAPI, Azure WebJob 2.

More information

Cloud Storage with AWS: EFS vs EBS vs S3 AHMAD KARAWASH

Cloud Storage with AWS: EFS vs EBS vs S3 AHMAD KARAWASH Cloud Storage with AWS: EFS vs EBS vs S3 AHMAD KARAWASH Cloud Storage with AWS Cloud storage is a critical component of cloud computing, holding the information used by applications. Big data analytics,

More information

Integrate MATLAB Analytics into Enterprise Applications

Integrate MATLAB Analytics into Enterprise Applications Integrate Analytics into Enterprise Applications Aurélie Urbain MathWorks Consulting Services 2015 The MathWorks, Inc. 1 Data Analytics Workflow Data Acquisition Data Analytics Analytics Integration Business

More information

About Intellipaat. About the Course. Why Take This Course?

About Intellipaat. About the Course. Why Take This Course? About Intellipaat Intellipaat is a fast growing professional training provider that is offering training in over 150 most sought-after tools and technologies. We have a learner base of 600,000 in over

More information

Sentinet for Microsoft Azure SENTINET

Sentinet for Microsoft Azure SENTINET Sentinet for Microsoft Azure SENTINET Sentinet for Microsoft Azure 1 Contents Introduction... 2 Customer Benefits... 2 Deployment Topologies... 3 Cloud Deployment Model... 3 Hybrid Deployment Model...

More information

Oracle Database Cloud for Oracle DBAs Ed 3

Oracle Database Cloud for Oracle DBAs Ed 3 Oracle University Contact Us: 800-260-690 Oracle Database Cloud for Oracle DBAs Ed 3 Duration: 3 Days What you will learn Note: No hands-on lab environment for the Training On Demand course format This

More information

Dataverse: Modular Storage and Migration to the Cloud

Dataverse: Modular Storage and Migration to the Cloud Dataverse: Modular Storage and Migration to the Cloud Gustavo Durand, Dataverse Technical Lead / Architect Leonid Andreev, Dataverse Senior Developer Dataverse Overview An open-source platform to publish,

More information

Developing Microsoft Azure Solutions

Developing Microsoft Azure Solutions 1 Developing Microsoft Azure Solutions Course Prerequisites A general understanding of ASP.NET and C# concepts Upon Completion of this Course, you will accomplish following: Compare the services available

More information

Developing Microsoft Azure and Web Services. Course Code: 20487C; Duration: 5 days; Instructor-led

Developing Microsoft Azure and Web Services. Course Code: 20487C; Duration: 5 days; Instructor-led Developing Microsoft Azure and Web Services Course Code: 20487C; Duration: 5 days; Instructor-led WHAT YOU WILL LEARN In this course, students will learn how to design and develop services that access

More information

IBM Spectrum NAS, IBM Spectrum Scale and IBM Cloud Object Storage

IBM Spectrum NAS, IBM Spectrum Scale and IBM Cloud Object Storage IBM Spectrum NAS, IBM Spectrum Scale and IBM Cloud Object Storage Silverton Consulting, Inc. StorInt Briefing 2017 SILVERTON CONSULTING, INC. ALL RIGHTS RESERVED Page 2 Introduction Unstructured data has

More information

Windows Azure Solutions with Microsoft Visual Studio 2010

Windows Azure Solutions with Microsoft Visual Studio 2010 Windows Azure Solutions with Microsoft Visual Studio 2010 Course No. 50466 3 Days Instructor-led, Hands-on Introduction This class is an introduction to cloud computing and specifically Microsoft's public

More information

Design patterns for data-driven research acceleration

Design patterns for data-driven research acceleration Design patterns for data-driven research acceleration Rachana Ananthakrishnan, Kyle Chard, and Ian Foster The University of Chicago and Argonne National Laboratory Contact: rachana@globus.org Introduction

More information

MCSE Productivity. A Success Guide to Prepare- Core Solutions of Microsoft SharePoint Server edusum.com

MCSE Productivity. A Success Guide to Prepare- Core Solutions of Microsoft SharePoint Server edusum.com 70-331 MCSE Productivity A Success Guide to Prepare- Core Solutions of Microsoft SharePoint Server 2013 edusum.com Table of Contents Introduction to 70-331 Exam on Core Solutions of Microsoft SharePoint

More information

Let s say that hosting a cloudbased application is like car ownership

Let s say that hosting a cloudbased application is like car ownership Let s say that hosting a cloudbased application is like car ownership Azure App Service App Service Features & Capabilities All features and capabilities are shared across all of App Service application

More information

Azure Certification BootCamp for Exam (Developer)

Azure Certification BootCamp for Exam (Developer) Azure Certification BootCamp for Exam 70-532 (Developer) Course Duration: 5 Days Course Authored by CloudThat Description Microsoft Azure is a cloud computing platform and infrastructure created for building,

More information

The New Economics of Cloud Storage

The New Economics of Cloud Storage The New Economics of Cloud Storage How Wasabi Hot Cloud Storage Compares With the Economics of Amazon Web Services, Google Cloud and Microsoft Azure Executive Overview Wasabi is fundamentally transforming

More information

Azure Development Course

Azure Development Course Azure Development Course About This Course This section provides a brief description of the course, audience, suggested prerequisites, and course objectives. COURSE DESCRIPTION This course is intended

More information

How to pimp high volume PHP websites. 27. September 2008, PHP conference Barcelona. By Jens Bierkandt

How to pimp high volume PHP websites. 27. September 2008, PHP conference Barcelona. By Jens Bierkandt How to pimp high volume PHP websites 27. September 2008, PHP conference Barcelona By Jens Bierkandt 1 About me Jens Bierkandt Working with PHP since 2000 From Germany, living in Spain, speaking English

More information

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti S3IT: Service and Support for Science IT Cloud middleware Part2: Let s pick one cloud IaaS middleware: OpenStack Sergio Maffioletti S3IT: Service and Support for Science IT, University of Zurich http://www.s3it.uzh.ch/

More information

Deccansoft Software Services

Deccansoft Software Services Azure Syllabus Cloud Computing What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages and Disadvantages of Cloud Computing Getting

More information

Databricks, an Introduction

Databricks, an Introduction Databricks, an Introduction Chuck Connell, Insight Digital Innovation Insight Presentation Speaker Bio Senior Data Architect at Insight Digital Innovation Focus on Azure big data services HDInsight/Hadoop,

More information

Developing Microsoft Azure Solutions

Developing Microsoft Azure Solutions Developing Microsoft Azure Solutions Duration: 5 Days Course Code: M20532 Overview: This course is intended for students who have experience building web applications. Students should also have experience

More information

MATLAB. Senior Application Engineer The MathWorks Korea The MathWorks, Inc. 2

MATLAB. Senior Application Engineer The MathWorks Korea The MathWorks, Inc. 2 1 Senior Application Engineer The MathWorks Korea 2017 The MathWorks, Inc. 2 Data Analytics Workflow Business Systems Smart Connected Systems Data Acquisition Engineering, Scientific, and Field Business

More information

Cloud Computing /AWS Course Content

Cloud Computing /AWS Course Content Cloud Computing /AWS Course Content 1. Amazon VPC What is Amazon VPC? How to Get Started with Amazon VPC Create New VPC Launch an instance (Server) to use this VPC Security in Your VPC Networking in Your

More information

WEBSCALE CONVERGED APPLICATION DELIVERY PLATFORM

WEBSCALE CONVERGED APPLICATION DELIVERY PLATFORM SECURITY ANALYTICS WEBSCALE CONVERGED APPLICATION DELIVERY PLATFORM BLAZING PERFORMANCE, HIGH AVAILABILITY AND ROBUST SECURITY FOR YOUR CRITICAL WEB APPLICATIONS OVERVIEW Webscale is a converged multi-cloud

More information

SoftNAS Cloud Performance Evaluation on Microsoft Azure

SoftNAS Cloud Performance Evaluation on Microsoft Azure SoftNAS Cloud Performance Evaluation on Microsoft Azure November 30, 2016 Contents SoftNAS Cloud Overview... 3 Introduction... 3 Executive Summary... 4 Key Findings for Azure:... 5 Test Methodology...

More information

How can you implement this through a script that a scheduling daemon runs daily on the application servers?

How can you implement this through a script that a scheduling daemon runs daily on the application servers? You ve been tasked with implementing an automated data backup solution for your application servers that run on Amazon EC2 with Amazon EBS volumes. You want to use a distributed data store for your backups

More information

Sentinet for Windows Azure VERSION 2.2

Sentinet for Windows Azure VERSION 2.2 Sentinet for Windows Azure VERSION 2.2 Sentinet for Windows Azure 1 Contents Introduction... 2 Customer Benefits... 2 Deployment Topologies... 3 Isolated Deployment Model... 3 Collocated Deployment Model...

More information

WHITEPAPER. MemSQL Enterprise Feature List

WHITEPAPER. MemSQL Enterprise Feature List WHITEPAPER MemSQL Enterprise Feature List 2017 MemSQL Enterprise Feature List DEPLOYMENT Provision and deploy MemSQL anywhere according to your desired cluster configuration. On-Premises: Maximize infrastructure

More information

Approaching the Petabyte Analytic Database: What I learned

Approaching the Petabyte Analytic Database: What I learned Disclaimer This document is for informational purposes only and is subject to change at any time without notice. The information in this document is proprietary to Actian and no part of this document may

More information

Building High Performance Apps using NoSQL. Swami Sivasubramanian General Manager, AWS NoSQL

Building High Performance Apps using NoSQL. Swami Sivasubramanian General Manager, AWS NoSQL Building High Performance Apps using NoSQL Swami Sivasubramanian General Manager, AWS NoSQL Building high performance apps There is a lot to building high performance apps Scalability Performance at high

More information

Oracle Autonomous Database

Oracle Autonomous Database Oracle Autonomous Database Maria Colgan Master Product Manager Oracle Database Development August 2018 @SQLMaria #thinkautonomous Safe Harbor Statement The following is intended to outline our general

More information

CIMERA ARCHITECTURE. Release 4.2.x

CIMERA ARCHITECTURE. Release 4.2.x CIMERA ARCHITECTURE Release 4.2.x Version 1.0, 13-May 2015 Gwyn Carwardine, Jon Bentley gwyn.carwardine@propelsystems.com jon.bentley@propelsystems.com Propel Systems, 2015 Cimera Architecture R4.2.x Page

More information

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES May, 2017 Contents Introduction... 2 Overview... 2 Architecture... 2 SDFS File System Service... 3 Data Writes... 3 Data Reads... 3 De-duplication

More information

Scaling Massive Content Stores in the Cloud. CloudExpo New York June Alfresco Founder & CTO

Scaling Massive Content Stores in the Cloud. CloudExpo New York June Alfresco Founder & CTO Scaling Massive Content Stores in the Cloud CloudExpo New York June 2016 @johnnewton Alfresco Founder & CTO Alfresco Customers Government Financial Services Healthcare Manufacturing Corporate Somewhere

More information

20532D: Developing Microsoft Azure Solutions

20532D: Developing Microsoft Azure Solutions 20532D: Developing Microsoft Azure Solutions Course Details Course Code: Duration: Notes: 20532D 5 days Elements of this syllabus are subject to change. About this course This course is intended for students

More information

Examining Public Cloud Platforms

Examining Public Cloud Platforms Examining Public Cloud Platforms A Survey Copyright 2012 Chappell & Associates Agenda What is Cloud Computing? Cloud Platform Technologies: An Overview Public Cloud Platforms: Reviewing the Terrain What

More information

Prepare. Model. Operationalize

Prepare. Model. Operationalize Prepare Model Operationalize Model Re-Code Validate Deploy How do we operationalize R? Turn R analytics Web services in one line of code; Swagger-based REST APIs, easy to consume, with any programming

More information

ARCHITECTURE GUIDE. Campaign Manager 6.0

ARCHITECTURE GUIDE. Campaign Manager 6.0 ARCHITECTURE GUIDE Campaign Manager 6.0 VERSION CONTROL Version Date Author Changes 1.0 28 April 2017 D Cooper Release RELATED DOCUMENTS The related documents are located in the Alterian product help.

More information

Introduction to Windows Azure Cloud Computing Futures Group, Microsoft Research Roger Barga, Jared Jackson, Nelson Araujo, Dennis Gannon, Wei Lu, and

Introduction to Windows Azure Cloud Computing Futures Group, Microsoft Research Roger Barga, Jared Jackson, Nelson Araujo, Dennis Gannon, Wei Lu, and Introduction to Windows Azure Cloud Computing Futures Group, Microsoft Research Roger Barga, Jared Jackson, Nelson Araujo, Dennis Gannon, Wei Lu, and Jaliya Ekanayake Range in size from edge facilities

More information

EMC ISILON HARDWARE PLATFORM

EMC ISILON HARDWARE PLATFORM EMC ISILON HARDWARE PLATFORM Three flexible product lines that can be combined in a single file system tailored to specific business needs. S-SERIES Purpose-built for highly transactional & IOPSintensive

More information

Fusion Registry 9 SDMX Data and Metadata Management System

Fusion Registry 9 SDMX Data and Metadata Management System Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance

More information

Exam Questions AWS-Certified- Developer-Associate

Exam Questions AWS-Certified- Developer-Associate Exam Questions AWS-Certified- Developer-Associate Amazon AWS Certified Developer Associate https://www.2passeasy.com/dumps/aws-certified- Developer-Associate/ 1. When using Amazon SQS how much data can

More information

New research on Key Technologies of unstructured data cloud storage

New research on Key Technologies of unstructured data cloud storage 2017 International Conference on Computing, Communications and Automation(I3CA 2017) New research on Key Technologies of unstructured data cloud storage Songqi Peng, Rengkui Liua, *, Futian Wang State

More information

Zumobi Brand Integration(Zbi) Platform Architecture Whitepaper Table of Contents

Zumobi Brand Integration(Zbi) Platform Architecture Whitepaper Table of Contents Zumobi Brand Integration(Zbi) Platform Architecture Whitepaper Table of Contents Introduction... 2 High-Level Platform Architecture Diagram... 3 Zbi Production Environment... 4 Zbi Publishing Engine...

More information

Hive Metadata Caching Proposal

Hive Metadata Caching Proposal Hive Metadata Caching Proposal Why Metastore Cache During Hive 2 benchmark, we find Hive metastore operation take a lot of time and thus slow down Hive compilation. In some extreme case, it takes much

More information

Steven Edouard SDET, US - DX Audience West Microsoft Bruno Terkaly Principal Software Engineer - Microsoft

Steven Edouard SDET, US - DX Audience West Microsoft Bruno Terkaly Principal Software Engineer - Microsoft Steven Edouard SDET, US - DX Audience West Microsoft Bruno Terkaly Principal Software Engineer - Microsoft Steven and Bruno Info Section 3 Software Architecture Section 7 The Protocol challenge Progress

More information

Welcome to IBM Security Guardium Analyzer!

Welcome to IBM Security Guardium Analyzer! Welcome to IBM Security Guardium Analyzer! To help you get started with IBM Security Guardium Analyzer, please refer to these frequently asked questions: What is IBM Security Guardium Analyzer? Guardium

More information

Oracle Database 11g: New Features for Administrators DBA Release 2

Oracle Database 11g: New Features for Administrators DBA Release 2 Oracle Database 11g: New Features for Administrators DBA Release 2 Duration: 5 Days What you will learn This Oracle Database 11g: New Features for Administrators DBA Release 2 training explores new change

More information

Overview SENTINET 3.1

Overview SENTINET 3.1 Overview SENTINET 3.1 Overview 1 Contents Introduction... 2 Customer Benefits... 3 Development and Test... 3 Production and Operations... 4 Architecture... 5 Technology Stack... 7 Features Summary... 7

More information

Azure Integration Services

Azure Integration Services Azure Integration Services 2018 Microsoft Corporation. All rights reserved. This document is provided "as-is." Information and views expressed in this document, including URL and other Internet Web site

More information

Integrate MATLAB Analytics into Enterprise Applications

Integrate MATLAB Analytics into Enterprise Applications Integrate Analytics into Enterprise Applications Lyamine Hedjazi 2015 The MathWorks, Inc. 1 Data Analytics Workflow Preprocessing Data Business Systems Build Algorithms Smart Connected Systems Take Decisions

More information

Citizen developer tools are not just for citizen developers!

Citizen developer tools are not just for citizen developers! Citizen developer tools are not just for citizen developers! a.k.a Using Azure Functions and Cognitive Services Text API to enrich a Flow that fills Metadata for new items in a Modern SharePoint Team Site

More information

Azure Logic Apps. The big picture. API Apps, Logic Apps & API Management

Azure Logic Apps. The big picture. API Apps, Logic Apps & API Management Azure Logic Apps The big picture API Apps, Logic Apps & API Management Hyper-scale Agility Insight RICH EXPERIENCES BREADTH OF DEVICES { } One integrated offering WEB APPS Web apps that scale with your

More information

Edge Foundational Training

Edge Foundational Training Edge Foundational Training Give your team the tools to get up and running with Edge Edge Foundational Training provides the tools and information needed to start using Edge whether in the cloud or on premises.

More information

Schema Validate With Oracle 10g Database Status

Schema Validate With Oracle 10g Database Status Schema Validate With Oracle 10g Database Status This exam has been validated against Oracle Database 10g, Oracle Database 11g, Oracle Database 11g Release 2, and Oracle Database 12c Release 1. Copy whole

More information

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages

More information

Testing storage and metadata backends

Testing storage and metadata backends Testing storage and metadata backends Hugo González Labrador, Arno Formella LIA2, University of Vigo CS3: Cloud Storage Services for Novel Applications and Workflows Zürich, January 2016 Outline Origin

More information

Cisco Tetration Analytics

Cisco Tetration Analytics Cisco Tetration Analytics Enhanced security and operations with real time analytics John Joo Tetration Business Unit Cisco Systems Security Challenges in Modern Data Centers Securing applications has become

More information

Technical Comparison Sheet: ez Platform Cloud vs Other Hosting Approaches

Technical Comparison Sheet: ez Platform Cloud vs Other Hosting Approaches Technical Comparison Sheet: vs Other Approaches This is a technical comparison worksheet between the various approaches to deploy and host your ez Platform project. Each approach is unique and has its

More information

Is Your Project in Trouble on System Performance?

Is Your Project in Trouble on System Performance? Is Your Project in Trouble on System Performance? Charles Chow May 2017 Is SATURN Your Project 2017 in Trouble - Is Your on System Project Performance? in Trouble on System Performance? May 2017 1 4, [Copyright

More information

The Open Monolith. Keeping Your Codebase (and Your Headaches) CON3449. Matthew sbgrid.

The Open Monolith. Keeping Your Codebase (and Your Headaches) CON3449. Matthew sbgrid. CON3449 The Open Monolith Keeping Your Codebase (and Your Headaches) Small Michael Bar-Sinai @michbarsinai mbarsinai.com Matthew Dunlap @disbliss sbgrid.org/about/staff/ @dataverseorg Large, monolithic

More information

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4 Cloud & container monitoring 04.05.2018, Lars Michelsen Some cloud definitions Applications Data Runtime Middleware O/S Virtualization Servers Storage Networking Software-as-a-Service (SaaS) Applications

More information

Microsoft Developing Windows Azure and Web Services

Microsoft Developing Windows Azure and Web Services 1800 ULEARN (853 276) www.ddls.com.au Microsoft 20487 - Developing Windows Azure and Web Services Length 5 days Price $4510.00 (inc GST) Version B Overview In this course, students will learn how to design

More information

Course Outline. Module 1: Microsoft Azure for AWS Experts Course Overview

Course Outline. Module 1: Microsoft Azure for AWS Experts Course Overview Course Outline Module 1: Microsoft Azure for AWS Experts Course Overview In this module, you will get an overview of Azure services and features including deployment models, subscriptions, account types

More information

Outline. ASP 2012 Grid School

Outline. ASP 2012 Grid School Distributed Storage Rob Quick Indiana University Slides courtesy of Derek Weitzel University of Nebraska Lincoln Outline Storage Patterns in Grid Applications Storage

More information

Scaling DreamFactory

Scaling DreamFactory Scaling DreamFactory This white paper is designed to provide information to enterprise customers about how to scale a DreamFactory Instance. The sections below talk about horizontal, vertical, and cloud

More information

Software + Services for Data Storage, Management, Discovery, and Re-Use

Software + Services for Data Storage, Management, Discovery, and Re-Use Software + Services for Data Storage, Management, Discovery, and Re-Use CODATA 22 Conference Stellenbosch, South Africa 25 October 2010 Alex D. Wade Director Scholarly Communication Microsoft External

More information

Loosely coupled: asynchronous processing, decoupling of tiers/components Fan-out the application tiers to support the workload Use cache for data and content Reduce number of requests if possible Batch

More information

Oralogic Education Systems

Oralogic Education Systems Oralogic Education Systems Next Generation IT Education Systems Introduction: In the Oracle Database 12c: Performance Management and Tuning course, learn about the performance analysis and tuning tasks

More information

0-1 Million in 46 Days Scaling a Facebook Application in Rails

0-1 Million in 46 Days Scaling a Facebook Application in Rails 0-1 Million in 46 Days Scaling a Facebook Application in Rails Ikai Lan Linkedin Ikai Lan From 0 to 1,000,000 in 46 Days: Scaling a Facebook Application in Rails Slide 1 Hi! I m Ikai Lan Ikai Lan From

More information

Cloud Computing. Amazon Web Services (AWS)

Cloud Computing. Amazon Web Services (AWS) Cloud Computing What is Cloud Computing? Benefit of cloud computing Overview of IAAS, PAAS, SAAS Types Of Cloud private, public & hybrid Amazon Web Services (AWS) Introduction to Cloud Computing. Introduction

More information

IBM Integration Bus v9.0 System Administration: Course Content By Yuvaraj C Panneerselvam

IBM Integration Bus v9.0 System Administration: Course Content By Yuvaraj C Panneerselvam IBM Integration Bus v9.0 System Administration: Course Content By Yuvaraj C Panneerselvam 1. COURSE OVERVIEW As part of this course, you will learn how to administer IBM Integration Bus on distributed

More information

What is Drupal? What is this Drew-Paul thing you do?

What is Drupal? What is this Drew-Paul thing you do? What is Drupal? Or What is this Drew-Paul thing you do? Drupal for the average person Drupal lets me build websites that help people build their own websites without needing to know anything about programming

More information

Techno Expert Solutions

Techno Expert Solutions Course Content of Microsoft Windows Azzure Developer: Course Outline Module 1: Overview of the Microsoft Azure Platform Microsoft Azure provides a collection of services that you can use as building blocks

More information

Questions and Answers No.:81-90(143q)

Questions and Answers No.:81-90(143q) Exam Code: 70-534 Exam Name: Architecting Microsoft Azure Solutions Certification Provider: Microsoft Corresponding Certification: Microsoft Specialist: Microsoft Azure Questions and Answers No.:81-90(143q)

More information

New Features and Enhancements in Big Data Management 10.2

New Features and Enhancements in Big Data Management 10.2 New Features and Enhancements in Big Data Management 10.2 Copyright Informatica LLC 2017. Informatica, the Informatica logo, Big Data Management, and PowerCenter are trademarks or registered trademarks

More information

CloudOpen Europe 2013 SYNNEFO: A COMPLETE CLOUD STACK OVER TECHNICAL LEAD, SYNNEFO

CloudOpen Europe 2013 SYNNEFO: A COMPLETE CLOUD STACK OVER TECHNICAL LEAD, SYNNEFO SYNNEFO: A COMPLETE CLOUD STACK OVER GOOGLE GANETI. VANGELIS KOUKIS TECHNICAL LEAD, SYNNEFO 1 Running a public cloud: ~okeanos History - Design started late 2010 - Production since July 2011 Numbers -

More information

Healthcare IT A Monitoring Primer

Healthcare IT A Monitoring Primer Healthcare IT A Monitoring Primer Published: February 2019 PAGE 1 OF 13 Contents Introduction... 3 The Healthcare IT Environment.... 4 Traditional IT... 4 Healthcare Systems.... 4 Healthcare Data Format

More information

New Features... 1 Upgrade Changes... 1 Installation and Upgrade... 1 Known Limitations... 2 Informatica Global Customer Support...

New Features... 1 Upgrade Changes... 1 Installation and Upgrade... 1 Known Limitations... 2 Informatica Global Customer Support... Informatica Corporation Data Integration Hub 10.2.1 Release Notes December 2017 Copyright Informatica LLC 2013, 2017 Contents New Features... 1 Upgrade Changes... 1 Installation and Upgrade... 1 Known

More information

Cloud object storage : the right way. Orit Wasserman Open Source Summit 2018

Cloud object storage : the right way. Orit Wasserman Open Source Summit 2018 Cloud object storage : the right way Orit Wasserman Open Source Summit 2018 1 About me 20+ years of development 10+ in open source: Nested virtualization for KVM Maintainer of live migration in Qemu/kvm

More information

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages

More information

CISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL

CISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL CISC 7610 Lecture 5 Distributed multimedia databases Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL Motivation YouTube receives 400 hours of video per minute That is 200M hours

More information

[MS20487]: Developing Windows Azure and Web Services

[MS20487]: Developing Windows Azure and Web Services [MS20487]: Developing Windows Azure and Web Services Length : 5 Days Audience(s) : Developers Level : 300 Technology : Cross-Platform Development Delivery Method : Instructor-led (Classroom) Course Overview

More information