Streaming Data: The Opportunity & How to Work With It
|
|
- Hollie Barker
- 6 years ago
- Views:
Transcription
1 Streaming Data: The Opportunity & How to Work With It Roger Barga, GM Amazon Kinesis April , Amazon Web Services, Inc. or its Affiliates. All rights reserved.
2 Interest in and demand for stream data processing is rapidly increasing * * Understatement of the year
3 Why? Most data is produced continuously user-identifier frank [10/Oct/2000:13:55: ] "GET /apache_pb.gif HTTP/1.0" Common Log Entry SeattlePublicWater/Kinesis/123/Realtime MQTT Record { "payerid": "Joe", "productcode": "AmazonS3", "clientproductcode": "AmazonS3", "usagetype": "Bandwidth", "operation": "PUT", "value": "22490", "timestamp": " " } Metering Record <R,AMZN,T,G,R1> NASDAQ OMX Record <165> T22:14:15.003Z mymachine.example.com evntslog - ID47 [examplesdid@32473 iut="3" eventsource="application" eventid="1011"][examplepriority@32473 class="high"] Syslog Entry
4 Why? Time is money Perishable Insights (Forrester) Hourly server logs: how your systems were misbehaving an hour ago Weekly / Monthly Bill: What you spent this past billing cycle? Daily fraud reports: tells you if there was fraud yesterday CloudWatch metrics: what just went wrong now Real-time spending alerts/caps: guaranteeing you can t overspend Real-time detection: blocks fraudulent use now
5 Why? Time is money Perishable Insights (Forrester) A more efficient implementation Most Big Data deployments process continuously generated data (batched)
6 Amazon Kinesis: Streaming Data Done the AWS Way Makes it easy to capture, deliver, and process real-time data streams Easy to provision, deploy, and manage Elastically scalable Real-time latencies Pay as you go, no up-front costs Right services for your specific use cases
7 Amazon Kinesis: Streaming data made easy Services make it easy to capture, deliver and process streams on AWS Kinesis Streams For Technical Developers Kinesis Firehose For all developers, data scientists Kinesis Analytics For all developers, data enginers, data scientists Build your own custom applications that process or analyze streaming data Easily load massive volumes of streaming data into S3 and Redshift Easily analyze data streams using standard SQL queries
8 Amazon Kinesis Streams Build your own data streaming applications Easy administration: Simply create a new stream, and set the desired level of capacity with shards. Scale to match your data throughput rate and volume. Build real-time applications: Perform continual processing on streaming big data using Kinesis Client Library (KCL), Apache Spark/Storm, AWS Lambda, and more. Low cost: Cost-efficient for workloads of any scale.
9 Kinesis Streaming Data Ingestion Streams are made of Shards Each Shard ingests data up to 1MB/sec, and up to 1000 TPS Producers use PUT call to store data in a Stream: PutRecord {Data, PartitionKey, StreamName} Each shard emits 2MB/sec All data is stored for 24 hours Scale Kinesis streams by adding or removing Shards Replay data from stream (log)
10 Kinesis Producer Library Highly configurable library to write to Kinesis Collects records and uses PutRecords for high throughput writes Integrates seamlessly with the Amazon Kinesis Client Library (KCL) to de-aggregate batched records Submits Amazon CloudWatch metrics on your behalf to provide visibility into producer performance
11 Extended Retention in Kinesis Default 24 Hours but configurable to 7 days 2 New APIs IncreaseStreamRetentionPeriod(String StreamName, int RetentionPeriodHours) DecreaseStreamRetentionPeriod(String StreamName, int RetentionPeriodHours) Use it in one of two-modes As always-on extended retention Raise retention period in response to operational event or planned maintenance of record processing application to extend retention.
12 25 40ms ms Aggregate and archive to S3 Millions of sources producing 100s of terabytes per hour Front End Authentication Authorization AZ AZ AZ Durable, highly consistent storage replicates data across three data centers (availability zones) Amazon Web Services Inexpensive: Inexpensive: $0.014 $0.028 per 1,000,000 per million PUT putspayload Units Ordered stream of events supports multiple readers Real-time dashboards and alarms Machine learning algorithms or sliding window analytics Custom-built Streaming Applications (KCL) Real-Time Streaming Data Ingestion Aggregate analysis in Hadoop or a data warehouse
13 Building Applications: Kinesis Client Library For fault-tolerant, scalable stream processing applications Open Source Java client library, also available for Python, Ruby, Node.JS,.NET. Available on Github Connects to the stream and enumerates the shards Instantiates a record processor for every shard it manages Pulls data records from the stream Pushes the records to the corresponding record processor
14 Building Applications: Kinesis Client Library For fault-tolerant, scalable stream processing applications Open Source Java client library, also available for Python, Ruby, Node.JS,.NET. Available on Github Connects to the stream and enumerates the shards Instantiates a record processor for every shard it manages Pulls data records from the stream Pushes the records to the corresponding record processor Checkpoints processed records Balances shard-worker associations when the worker instance count changes Balances shard-worker associations when shards are split or merged
15 Real-Time Streaming Data with Kinesis Streams 1 TB+/day game data analyzed in real-time Gaming 5 billion events/wk from connected devices IoT 17 PB of game data per season Entertainment 100 billion ad impressions/day, 30 ms response time Ad Tech 100 GB/day click streams 250+ sites Enterprise 50 billion ad impressions/day sub-50 ms responses Ad Tech 17 million events/day Technology 1 billion transactions per day Bitcoin
16 Amazon Kinesis: Streaming data made easy Services make it easy to capture, deliver and process streams on AWS Kinesis Streams For Technical Developers Kinesis Firehose For all developers, data scientists Kinesis Analytics For all developers, data enginers, data scientists Build your own custom applications that process or analyze streaming data Easily load massive volumes of streaming data into S3 and Redshift Easily analyze data streams using standard SQL queries
17 Amazon Kinesis Firehose Capture and submit streaming data to Firehose Firehose loads streaming data continuously into S3 and Redshift Analyze streaming data using your favorite BI tools Zero Admin: Capture and deliver streaming data into S3, Redshift, and more without writing an application or managing infrastructure Direct-to-data store integration: Batch, compress, and encrypt streaming data for delivery into S3, and other destinations in as little as 60 secs, set up in minutes Seamless elasticity: Seamlessly scales to match data throughput
18 Amazon Kinesis Firehose 3 Simple Concepts 1. Delivery Stream: The underlying entity of Firehose. Use Firehose by creating a delivery stream to a specified destination and send data to it. You do not have to create a stream or provision shards. You do not have to specify partition keys. 2. Records: The data producer sends data blobs as large as 1,000 KB to a delivery stream. That data blob is called a record. 3. Data Producers: Producers send records to a Delivery Stream. For example, a web server sending log data to a delivery stream is a data producer.
19 Amazon Kinesis Firehose Console Experience Unified Console Experience for Firehose and Streams
20 Amazon Kinesis Firehose Console Experience (S3) Create fully managed resources for delivery without building an app
21 Amazon Kinesis Firehose Console Experience Configure data delivery options simply using the console
22 Amazon Kinesis Agent Software agent makes submitting data to Firehose easy Monitors files and sends new data records to a delivery stream Handles file rotation, checkpointing, and retry upon failures Delivers all data in a reliable, timely, and simple manner Emits AWS CloudWatch metrics to help you better monitor and troubleshoot the streaming process Supported on Amazon Linux AMI with version or later, or Red Hat Enterprise Linux version 7 or later. Install on Linux-based server environments such as web servers, front ends, log servers, and more Also enabled for Kinesis Streams
23 Amazon Kinesis Firehose Pricing Simple, pay-as-you-go and no up-front costs Dimension Value Per 1 GB of data ingested $0.035
24 Hearst s wonderful world of streaming data Digital Media Publication brands Mobile Apps Re-targeting platforms
25 Hearst Streaming Data Architecture Clickstream API Ready Data Users to Hearst Properties Node.JS App- Proxy Kinesis Streams ETL on EMR Amazon Redshift Data Science Application Buzzing API LATENCY THROUGHPUT Milliseconds 100GB/day Kinesis Firehose Agg Data Models S3 Storage 30 seconds 5GB/day 100 seconds 1G/day 5 seconds 1G/day
26 AWS Endpoint Amazon Kinesis Firehose Fully Managed Service for Delivering Data Streams into AWS Destinations Data Sources No Partition Keys No Provisioning End to End Elastic [Batch, Compress, Encrypt] S3 Redshift
27 Amazon Kinesis Streams is a service for workloads that requires custom processing, per incoming record, with sub-1 second processing latency, and a choice of stream processing frameworks. Amazon Kinesis Firehose is a service for workloads that require zero administration, ability to use existing analytics tools based on S3 or Redshift, and a data latency of 60 seconds or higher.
28 Real-Time Analytics Value of insights on some data degrades quickly with time, like stock markets, traffic, device telemetry, patient monitoring, etc. Our customers asked for a way to produce results fast Static Queries, but need very fast output (alerts, real-time control) Ability to transform streaming data and deliver enhanced data to other destinations (services & storage) Dynamic and interactive queries (data exploration)
29 Amazon Kinesis: Streaming data made easy Services make it easy to capture, deliver and process streams on AWS Kinesis Streams For Technical Developers Kinesis Firehose For all developers, data scientists Kinesis Analytics For all developers, data enginers, data scientists Build your own custom applications that process or analyze streaming data Easily load massive volumes of streaming data into S3 and Redshift Easily analyze data streams using standard SQL queries
30 Amazon Kinesis Analytics Analyze data streams continuously with standard SQL Announcement Only! Connect to Kinesis streams, Firehose delivery streams Run standard SQL queries against data streams Kinesis Analytics can send processed data to analytics tools so you can create alerts and respond in real-time Apply SQL on streams: Easily connect to data streams and apply existing SQL skills. Build real-time applications: Perform continual processing on streaming big data with sub-second processing latencies Scale elastically: Elastically scales to match data throughput without any operator intervention.
31 Scenarios for Streaming Data Analytics Windowed analytics such as running totals, weighted moving averages using sliding and tumbling windows Streaming ETL to parse, validate, and enrich streaming data, and then aggregate and filter before sending to persistent stores like S3, Redshift, ElasticSearch, and more Real-time alerts on data streams using simple thresholds or advanced machine learning algorithms like anomaly detect Real-time visualizations on calculated key performance indicators in near real-time and more sophisticated use cases such as temporal analysis, detecting event sequences, complex event processing, advanced analytics.
32 Amazon Kinesis Analytics Service Overview
33 Use ANSI SQL to build real-time streaming applications Connect to Streaming Input Data Select Kinesis Streams or Firehoses as input data for your SQL code Easily write SQL code to process streaming data Author applications that perform simple aggregations to sophisticated multi-step pipelines with advanced analytical functions Continuously deliver SQL results Configure one to many outputs for your processed data for real-time alerts, visualizations, and batch analytics
34 Connect to Streaming Input Data Source Single streaming input, Kinesis Stream or Firehose Single reference input up to 1 GB from S3 Input formats include JSON, CSV, variable column (TSV, pipe-delimited); complex, nested JSON structures Each input has a schema Schema is inferred but you can edit Deep nesting (2+ levels) supported by specifying row paths of desired fields Multiple record types are supported through canonical schema and/or by manipulating structures in SQL using string functions
35 Easily write SQL to process streaming data Build streaming application with SQL statements Extensions to SQL standard to work seamlessly with streaming data (STREAM, Windows, ROWTIME) Robust SQL support including large number of functions including: Simple mathematical operators (AVG, STDEV, etc.) String manipulations (SUBSTRING, POSITION) Advanced analytics (random sampling, anomaly detection) Support for at-least-once processing semantics
36 Real-Time Analytics Patterns Simple counting (e.g. failure count) Counting with Windows ( e.g. failure count every hour) Preprocessing: filter, transform (e.g. data cleanup) Alerts, thresholds (e.g. Alarm on high temperature) Data correlation, detect missing events, detect erroneous data (e.g. detecting failed sensors) Joining event streams (e.g. detect a hit on soccer ball) Merge with data in a databaseupdate data conditionally
37 Realtime Analytics Patterns (contd.) Detect Event Sequence Patterns (e.g. small transaction followed by large transaction) Tracking - follow some related entity s state in space, time etc. (e.g. location of airline baggage, vehicle tracking, ) Detect trends Rise, turn, fall, Outliers, Complex trends like triple bottom etc., (e.g. algorithmic trading, SLA, load balancing) And more.
38 Continuously deliver SQL results Up to five outputs, including S3, Redshift through Firehose, Kinesis Streams, and more... Firehose allows Kinesis Analytics to separate processing and delivery of enriched streaming data Delivery speed will be heavily dependent upon your SQL queries (i.e. simple ETL versus 10 minute aggregations) Other destination(s) will be included at GA. Output formats include JSON, CSV, variable column (TSV, pipe-delimited)
39 Questions
40
Real-time Streaming Applications on AWS Patterns and Use Cases
Real-time Streaming Applications on AWS Patterns and Use Cases Paul Armstrong - Solutions Architect (AWS) Tom Seddon - Data Engineering Tech Lead (Deliveroo) 28 th June 2017 2016, Amazon Web Services,
More informationA Deeper Look into Amazon Kinesis
A Deeper Look into Amazon Kinesis Shawn Gandhi, Solutions Architect @shawnagram AWS Meetup NYC 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed
More informationServerless Computing. Redefining the Cloud. Roger S. Barga, Ph.D. General Manager Amazon Web Services
Serverless Computing Redefining the Cloud Roger S. Barga, Ph.D. General Manager Amazon Web Services Technology Triggers Highly Recommended http://a16z.com/2016/12/16/the-end-of-cloud-computing/ Serverless
More informationDeep Dive Amazon Kinesis. Ian Meyers, Principal Solution Architect - Amazon Web Services
Deep Dive Amazon Kinesis Ian Meyers, Principal Solution Architect - Amazon Web Services Analytics Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure
More informationEnergy Management with AWS
Energy Management with AWS Kyle Hart and Nandakumar Sreenivasan Amazon Web Services August [XX], 2017 Tampa Convention Center Tampa, Florida What is Cloud? The NIST Definition Broad Network Access On-Demand
More informationBig Data on AWS. Peter-Mark Verwoerd Solutions Architect
Big Data on AWS Peter-Mark Verwoerd Solutions Architect What to get out of this talk Non-technical: Big Data processing stages: ingest, store, process, visualize Hot vs. Cold data Low latency processing
More informationSplunk & AWS. Gain real-time insights from your data at scale. Ray Zhu Product Manager, AWS Elias Haddad Product Manager, Splunk
Splunk & AWS Gain real-time insights from your data at scale Ray Zhu Product Manager, AWS Elias Haddad Product Manager, Splunk Forward-Looking Statements During the course of this presentation, we may
More informationBig Data on AWS. Big Data Agility and Performance Delivered in the Cloud. 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Big Data on AWS Big Data Agility and Performance Delivered in the Cloud 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Big Data Technologies and techniques for working productively
More informationLambda Architecture for Batch and Stream Processing. October 2018
Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.
More informationCloud Analytics and Business Intelligence on AWS
Cloud Analytics and Business Intelligence on AWS Enterprise Applications Virtual Desktops Sharing & Collaboration Platform Services Analytics Hadoop Real-time Streaming Data Machine Learning Data Warehouse
More informationManaging IoT and Time Series Data with Amazon ElastiCache for Redis
Managing IoT and Time Series Data with ElastiCache for Redis Darin Briskman, ElastiCache Developer Outreach Michael Labib, Specialist Solutions Architect 2016, Web Services, Inc. or its Affiliates. All
More informationLambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015
Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document
More informationAnalyzing Streaming Data in Real-Time with Amazon Kinesis Analytics
Analyzing Streaming Data in Real-Time with Amazon Kinesis Analytics Dr. Steffen Hausmann, Solutions Architect, AWS May 18, 2017 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon
More informationWerden Sie ein Teil von Internet der Dinge auf AWS. AWS Enterprise Summit 2015 Dr. Markus Schmidberger -
Werden Sie ein Teil von Internet der Dinge auf AWS AWS Enterprise Summit 2015 Dr. Markus Schmidberger - schmidbe@amazon.de Internet of Things is the network of physical objects or "things" embedded with
More informationIntro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect
Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect Igor Roiter Big Data Cloud Solution Architect Working as a Data Specialist for the last 11 years 9 of them as a Consultant specializing
More informationWHITEPAPER. MemSQL Enterprise Feature List
WHITEPAPER MemSQL Enterprise Feature List 2017 MemSQL Enterprise Feature List DEPLOYMENT Provision and deploy MemSQL anywhere according to your desired cluster configuration. On-Premises: Maximize infrastructure
More informationBig Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara
Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case
More informationTowards a Real- time Processing Pipeline: Running Apache Flink on AWS
Towards a Real- time Processing Pipeline: Running Apache Flink on AWS Dr. Steffen Hausmann, Solutions Architect Michael Hanisch, Manager Solutions Architecture November 18 th, 2016 Stream Processing Challenges
More informationMONITORING SERVERLESS ARCHITECTURES
MONITORING SERVERLESS ARCHITECTURES CAN YOU HELP WITH SOME PRODUCTION PROBLEMS? Your Manager (CC) Rachel Gardner Rafal Gancarz Lead Consultant @ OpenCredo WHAT IS SERVERLESS? (CC) theaucitron Cloud-native
More informationAgenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache
Databases on AWS 2017 Amazon Web Services, Inc. and its affiliates. All rights served. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon Web Services,
More informationIncrease Value from Big Data with Real-Time Data Integration and Streaming Analytics
Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Cy Erbay Senior Director Striim Executive Summary Striim is Uniquely Qualified to Solve the Challenges of Real-Time
More informationVOLTDB + HP VERTICA. page
VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics
More informationSplunk & Amazon Web Services
Splunk & Amazon Web Services June 2016 Tony Bolander tbolander@splunk.com Daniel Lew dalew@splunk.com 1 Industry Leading Platform For Machine Data Machine Data: Any Location, Type, Volume Answer Any Question
More informationExperiences with Serverless Big Data
Experiences with Serverless Big Data AWS Meetup Munich 2016 Markus Schmidberger, Head of Data Service Munich, 17.10.16 Key Components of our Data Service Real-Time Monitoring Enable our development teams
More informationAutomate best practices and operational health for your AWS resources with Trusted Advisor and AWS Health
Automate best practices and operational health for your AWS resources with Trusted Advisor and AWS Health Heitor Lessa, Solutions Architect @ AWS Stephen Gran, Senior Technical Architect @ Piksel June
More informationFAQs. Business (CIP 2.2) AWS Market Place Troubleshooting and FAQ Guide
FAQs 1. What is the browser compatibility for logging into the TCS Connected Intelligence Data Lake for Business Portal? Please check whether you are using Mozilla Firefox 18 or above and Google Chrome
More informationAWS 101. Patrick Pierson, IonChannel
AWS 101 Patrick Pierson, IonChannel What is AWS? Amazon Web Services (AWS) is a secure cloud services platform, offering compute power, database storage, content delivery and other functionality to help
More informationARCHITECTING WEB APPLICATIONS FOR THE CLOUD: DESIGN PRINCIPLES AND PRACTICAL GUIDANCE FOR AWS
ARCHITECTING WEB APPLICATIONS FOR THE CLOUD: DESIGN PRINCIPLES AND PRACTICAL GUIDANCE FOR AWS Dr Adnene Guabtni, Senior Research Scientist, NICTA/Data61, CSIRO Adnene.Guabtni@csiro.au EC2 S3 ELB RDS AMI
More informationGetting Started with AWS IoT
Getting Started with AWS IoT Denis V. Batalov, PhD @dbatalov Sr. Solutions Architect, AWS EMEA 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Things are becoming connected Source:
More informationWhat to expect from the session Technical recap VMware Cloud on AWS {Sample} Integration use case Services introduction & solution designs Solution su
LHC3376BES AWS Native Services Integration with VMware Cloud on AWS Technical Deep Dive Ian Massingham, Worldwide Lead, AWS Technical Evangelism Paul Bockelman, AWS Principal Solutions Architect (WWPS)
More informationIntroduction to Database Services
Introduction to Database Services Shaun Pearce AWS Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Today s agenda Why managed database services? A non-relational
More informationEnable IoT Solutions using Azure
Internet Of Things A WHITE PAPER SERIES Enable IoT Solutions using Azure 1 2 TABLE OF CONTENTS EXECUTIVE SUMMARY INTERNET OF THINGS GATEWAY EVENT INGESTION EVENT PERSISTENCE EVENT ACTIONS 3 SYNTEL S IoT
More informationIntegrating Splunk with AWS services:
Integrating Splunk with AWS services: Using Redshi+, Elas0c Map Reduce (EMR), Amazon Machine Learning & S3 to gain ac0onable insights via predic0ve analy0cs via Splunk Patrick Shumate Solutions Architect,
More informationData Lake Best Practices
Data Lake Best Practices Agenda Why Data Lake Key Components of a Data Lake Modern Data Architecture Some Best Practices Case Study Summary Takeaways What is a Data Lake? What, why etc. What is a data
More informationGabriel Villa. Architecting an Analytics Solution on AWS
Gabriel Villa Architecting an Analytics Solution on AWS Cloud and Data Architect Skilled leader, solution architect, and technical expert focusing primarily on Microsoft technologies and AWS. Passionate
More informationDeep Dive into Concepts and Tools for Analyzing Streaming Data
Deep Dive into Concepts and Tools for Analyzing Streaming Data Dr. Steffen Hausmann Sr. Solutions Architect, Amazon Web Services Data originates in real-time Photo by mountainamoeba https://www.flickr.com/photos/mountainamoeba/2527300028/
More informationQLIK INTEGRATION WITH AMAZON REDSHIFT
QLIK INTEGRATION WITH AMAZON REDSHIFT Qlik Partner Engineering Created August 2016, last updated March 2017 Contents Introduction... 2 About Amazon Web Services (AWS)... 2 About Amazon Redshift... 2 Qlik
More informationAWS IoT Overview. July 2016 Thomas Jones, Partner Solutions Architect
AWS IoT Overview July 2016 Thomas Jones, Partner Solutions Architect AWS customers are connecting physical things to the cloud in every industry imaginable. Healthcare and Life Sciences Municipal Infrastructure
More informationOracle NoSQL Database Enterprise Edition, Version 18.1
Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across
More informationHow can you implement this through a script that a scheduling daemon runs daily on the application servers?
You ve been tasked with implementing an automated data backup solution for your application servers that run on Amazon EC2 with Amazon EBS volumes. You want to use a distributed data store for your backups
More informationAWS Storage Optimization. AWS Whitepaper
AWS Storage Optimization AWS Whitepaper AWS Storage Optimization: AWS Whitepaper Copyright 2018 Amazon Web Services, Inc. and/or its affiliates. All rights reserved. Amazon's trademarks and trade dress
More informationDURATION : 03 DAYS. same along with BI tools.
AWS REDSHIFT TRAINING MILDAIN DURATION : 03 DAYS To benefit from this Amazon Redshift Training course from mildain, you will need to have basic IT application development and deployment concepts, and good
More informationStore, Protect, Optimize Your Healthcare Data in AWS
Healthcare reform, increasing patient expectations, exponential data growth, and the threat of cyberattacks are forcing healthcare providers to re-evaluate their data management strategies. Healthcare
More informationExtend NonStop Applications with Cloud-based Services. Phil Ly, TIC Software John Russell, Canam Software
Extend NonStop Applications with Cloud-based Services Phil Ly, TIC Software John Russell, Canam Software Agenda Cloud Computing and Microservices Amazon Web Services (AWS) Integrate NonStop with AWS Managed
More informationAWS Agility + Splunk Visibility = Cloud Success. Splunk App for AWS Demo. Laura Ripans, AWS Alliance Manager
AWS Agility + Splunk Visibility = Cloud Success Splunk App for AWS Demo Laura Ripans, AWS Alliance Manager Disruptive innovation and business transformation starts with data I HAVE BEEN GIVEN AN AWS ACCOUNT!!!
More information4) An organization needs a data store to handle the following data types and access patterns:
1) A company needs to deploy a data lake solution for their data scientists in which all company data is accessible and stored in a central S3 bucket. The company segregates the data by business unit,
More informationUnderstanding the latent value in all content
Understanding the latent value in all content John F. Kennedy (JFK) November 22, 1963 INGEST ENRICH EXPLORE Cognitive skills Data in any format, any Azure store Search Annotations Data Cloud Intelligence
More informationHow to Route Internet Traffic between A Mobile Application and IoT Device?
Whitepaper How to Route Internet Traffic between A Mobile Application and IoT Device? Website: www.mobodexter.com www.paasmer.co 1 Table of Contents 1. Introduction 3 2. Approach: 1 Uses AWS IoT Setup
More informationHigh-Performance Distributed DBMS for Analytics
1 High-Performance Distributed DBMS for Analytics 2 About me Developer, hardware engineering background Head of Analytic Products Department in Yandex jkee@yandex-team.ru 3 About Yandex One of the largest
More informationData Analytics at Logitech Snowflake + Tableau = #Winning
Welcome # T C 1 8 Data Analytics at Logitech Snowflake + Tableau = #Winning Avinash Deshpande I am a futurist, scientist, engineer, designer, data evangelist at heart Find me at Avinash Deshpande Chief
More informationFluentd + MongoDB + Spark = Awesome Sauce
Fluentd + MongoDB + Spark = Awesome Sauce Nishant Sahay, Sr. Architect, Wipro Limited Bhavani Ananth, Tech Manager, Wipro Limited Your company logo here Wipro Open Source Practice: Vision & Mission Vision
More informationBest Practices and Performance Tuning on Amazon Elastic MapReduce
Best Practices and Performance Tuning on Amazon Elastic MapReduce Michael Hanisch Solutions Architect Amo Abeyaratne Big Data and Analytics Consultant ANZ 12.04.2016 2016, Amazon Web Services, Inc. or
More informationAmazon Search Services. Christoph Schmitter
Amazon Search Services Christoph Schmitter csc@amazon.de What we'll cover Overview of Amazon Search Services Understand the difference between Cloudsearch and Amazon ElasticSearch Service Q&A Amazon Search
More informationCloud Computing & Visualization
Cloud Computing & Visualization Workflows Distributed Computation with Spark Data Warehousing with Redshift Visualization with Tableau #FIUSCIS School of Computing & Information Sciences, Florida International
More informationOverview of Data Services and Streaming Data Solution with Azure
Overview of Data Services and Streaming Data Solution with Azure Tara Mason Senior Consultant tmason@impactmakers.com Platform as a Service Offerings SQL Server On Premises vs. Azure SQL Server SQL Server
More informationArchitectural challenges for building a low latency, scalable multi-tenant data warehouse
Architectural challenges for building a low latency, scalable multi-tenant data warehouse Mataprasad Agrawal Solutions Architect, Services CTO 2017 Persistent Systems Ltd. All rights reserved. Our analytics
More informationData 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.
Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020
More informationBuild, Deploy & Operate Intelligent Chatbots with Amazon Lex
Build, Deploy & Operate Intelligent Chatbots with Amazon Lex Ian Massingham AWS Technical Evangelist @IanMmmm aws.amazon.com/lex 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
More informationStanislav Harvan Internet of Things
Stanislav Harvan v-sharva@microsoft.com Internet of Things IoT v číslach Gartner: V roku 2020 bude na Internet pripojených viac ako 25mld zariadení: 1,5mld smart TV 2,5mld pc 5mld smart phone 16mld dedicated
More informationUn'introduzione a Kafka Streams e KSQL and why they matter! ITOUG Tech Day Roma 1 Febbraio 2018
Un'introduzione a Kafka Streams e KSQL and why they matter! ITOUG Tech Day Roma 1 Febbraio 2018 R E T H I N K I N G Stream Processing with Apache Kafka Kafka the Streaming Data Platform 1.0 Enterprise
More informationAnalytics and Visualization
GU I DE NO. 4 Analytics and Visualization AWS IoT Analytics Mini-User Guide Introduction As IoT applications scale, so does the data generated from these various IoT devices. This data is raw, unstructured,
More informationOverview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::
Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized
More informationMicrosoft Exam
Volume: 42 Questions Case Study: 1 Relecloud General Overview Relecloud is a social media company that processes hundreds of millions of social media posts per day and sells advertisements to several hundred
More informationCONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM
CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications
More informationNew Oracle NoSQL Database APIs that Speed Insertion and Retrieval
New Oracle NoSQL Database APIs that Speed Insertion and Retrieval O R A C L E W H I T E P A P E R F E B R U A R Y 2 0 1 6 1 NEW ORACLE NoSQL DATABASE APIs that SPEED INSERTION AND RETRIEVAL Introduction
More informationData 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp.
17-18 March, 2018 Beijing Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020 Today, 80% of organizations
More informationAutomating Elasticity. March 2018
Automating Elasticity March 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only. It represents AWS s current product
More informationSearch Engines and Time Series Databases
Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Search Engines and Time Series Databases Corso di Sistemi e Architetture per Big Data A.A. 2017/18
More informationCisco Tetration Analytics
Cisco Tetration Analytics Enhanced security and operations with real time analytics John Joo Tetration Business Unit Cisco Systems Security Challenges in Modern Data Centers Securing applications has become
More informationData Acquisition. The reference Big Data stack
Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Data Acquisition Corso di Sistemi e Architetture per Big Data A.A. 2017/18 Valeria Cardellini The reference
More informationGain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved.
Gain Insights From Unstructured Data Using Pivotal HD 1 Traditional Enterprise Analytics Process 2 The Fundamental Paradigm Shift Internet age and exploding data growth Enterprises leverage new data sources
More informationAugust 23, 2017 Revision 0.3. Building IoT Applications with GridDB
August 23, 2017 Revision 0.3 Building IoT Applications with GridDB Table of Contents Executive Summary... 2 Introduction... 2 Components of an IoT Application... 2 IoT Models... 3 Edge Computing... 4 Gateway
More informationAbout Intellipaat. About the Course. Why Take This Course?
About Intellipaat Intellipaat is a fast growing professional training provider that is offering training in over 150 most sought-after tools and technologies. We have a learner base of 600,000 in over
More informationWe are ready to serve Latest IT Trends, Are you ready to learn? New Batches Info
We are ready to serve Latest IT Trends, Are you ready to learn? New Batches Info START DATE : TIMINGS : DURATION : TYPE OF BATCH : FEE : FACULTY NAME : LAB TIMINGS : Storage & Database Services : Introduction
More informationIntroduction to Amazon Web Services. Jeff Barr Senior AWS /
Introduction to Amazon Web Services Jeff Barr Senior AWS Evangelist @jeffbarr / jbarr@amazon.com What Does It Take to be a Global Online Retailer? The Obvious Part And the Not-So Obvious Part How Did
More informationFUJITSU Software ServerView Cloud Monitoring Manager V1.0. Overview
FUJITSU Software ServerView Cloud Monitoring Manager V1.0 Overview J2UL-2073-01ENZ0(00) November 2015 Trademarks Copyright FUJITSU LIMITED 2015 LINUX is a registered trademark of Linus Torvalds. The OpenStack
More informationTable 1 The Elastic Stack use cases Use case Industry or vertical market Operational log analytics: Gain real-time operational insight, reduce Mean Ti
Solution Overview Cisco UCS Integrated Infrastructure for Big Data with the Elastic Stack Cisco and Elastic deliver a powerful, scalable, and programmable IT operations and security analytics platform
More informationCisco Tetration Analytics
Cisco Tetration Analytics Enhanced security and operations with real time analytics Christopher Say (CCIE RS SP) Consulting System Engineer csaychoh@cisco.com Challenges in operating a hybrid data center
More informationOracle NoSQL Database Enterprise Edition, Version 18.1
Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across
More informationFlash Storage Complementing a Data Lake for Real-Time Insight
Flash Storage Complementing a Data Lake for Real-Time Insight Dr. Sanhita Sarkar Global Director, Analytics Software Development August 7, 2018 Agenda 1 2 3 4 5 Delivering insight along the entire spectrum
More informationEvent Sponsors. Expo Sponsors. Expo Light Sponsors
Event Sponsors Expo Sponsors Expo Light Sponsors IoT for the BI professional David L. Bojsen - Principal Architect What to expect Level 200 session Which basically means PowerPoint and talking Enthusiastic
More informationCIT 668: System Architecture. Amazon Web Services
CIT 668: System Architecture Amazon Web Services Topics 1. AWS Global Infrastructure 2. Foundation Services 1. Compute 2. Storage 3. Database 4. Network 3. AWS Economics Amazon Services Architecture Regions
More informationMicrosoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud
Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions
More informationStreaming Integration and Intelligence For Automating Time Sensitive Events
Streaming Integration and Intelligence For Automating Time Sensitive Events Ted Fish Director Sales, Midwest ted@striim.com 312-330-4929 Striim Executive Summary Delivering Data for Time Sensitive Processes
More informationStreaming ETL of High-Velocity Big Data Using SAS Event Stream Processing and SAS Viya
SAS 1679-2018 Streaming ETL of High-Velocity Big Data Using SAS Event Stream Processing and SAS Viya ABSTRACT Joydeep Bhattacharya and Manish Jhunjhunwala, SAS Institute Inc. A typical ETL happens once
More informationAWS & Intel: A Partnership Dedicated to fueling your Innovations. Thomas Kellerer BDM CSP, Intel Central Europe
AWS & Intel: A Partnership Dedicated to fueling your Innovations Thomas Kellerer BDM CSP, Intel Central Europe The Digital Service Economy Growth in connected devices enables new business opportunities
More informationMicrosoft Azure Stream Analytics
Microsoft Azure Stream Analytics Marcos Roriz and Markus Endler Laboratory for Advanced Collaboration (LAC) Departamento de Informática (DI) Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio)
More informationServerless Architecture Hochskalierbare Anwendungen ohne Server. Sascha Möllering, Solutions Architect
Serverless Architecture Hochskalierbare Anwendungen ohne Server Sascha Möllering, Solutions Architect Agenda Serverless Architecture AWS Lambda Amazon API Gateway Amazon DynamoDB Amazon S3 Serverless Framework
More informationAmazon AWS-Solution-Architect-Associate Exam
Volume: 858 Questions Question: 1 You are trying to launch an EC2 instance, however the instance seems to go into a terminated status immediately. What would probably not be a reason that this is happening?
More informationSolutions For Your Internet Of Things
Solutions For Your Internet Of Things 29 March 2017 Arun Shankar Our Journey Nov 14 stealth Sep 15 Jul 16 Sep 16 Nov 16 Feb 17 Founded in Japan 5K+ clients US launch Europe launch 2017 More exciting things
More informationAzure-persistence MARTIN MUDRA
Azure-persistence MARTIN MUDRA Storage service access Blobs Queues Tables Storage service Horizontally scalable Zone Redundancy Accounts Based on Uri Pricing Calculator Azure table storage Storage Account
More informationBuilding Big Data Storage Solutions (Data Lakes) for Maximum Flexibility. AWS Whitepaper
Building Big Data Storage Solutions (Data Lakes) for Maximum Flexibility AWS Whitepaper Building Big Data Storage Solutions (Data Lakes) for Maximum Flexibility: AWS Whitepaper Copyright 2018 Amazon Web
More informationzspotlight: Spark on z/os
zspotlight: Spark on z/os Avijit Chatterjee, Ph.D. achatter@us.ibm.com, @ChatterAvijit STSM, IBM Competitive Project Office 1 CEOs are increasingly focused on customers as individuals leveraging contextual
More informationBERLIN. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved
BERLIN 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Building Multi-Region Applications Jan Metzner, Solutions Architect Brian Wagner, Solutions Architect 2015, Amazon Web Services,
More informationCloudExpo November 2017 Tomer Levi
CloudExpo November 2017 Tomer Levi About me Full Stack Engineer @ Intel s Advanced Analytics group. Artificial Intelligence unit at Intel. Responsible for (1) Radical improvement of critical processes
More informationAWS Lambda: Event-driven Code in the Cloud
AWS Lambda: Event-driven Code in the Cloud Dean Bryen, Solutions Architect AWS Andrew Wheat, Senior Software Engineer - BBC April 15, 2015 London, UK 2015, Amazon Web Services, Inc. or its affiliates.
More informationLog Analytics with Amazon Elasticsearch Service. Christoph Schmitter
Log Analytics with Amazon Elasticsearch Service Christoph Schmitter (csc@amazon.de) What we'll cover Understanding Elasticsearch capabilities Elasticsearch, the technology Aggregations; ad-hoc analysis
More informationEvolution of an Apache Spark Architecture for Processing Game Data
Evolution of an Apache Spark Architecture for Processing Game Data Nick Afshartous WB Analytics Platform May 17 th 2017 May 17 th, 2017 About Me nafshartous@wbgames.com WB Analytics Core Platform Lead
More informationCloud Providers more AWS, Aneka
Basics of Cloud Computing Lecture 6 Cloud Providers more AWS, Aneka and GAE Satish Srirama Outline More AWS Some more PaaS Aneka Google App Engine Force.com 16.05.2012 Satish Srirama 2/51 Recap Last lecture
More informationAmazon Elastic File System
Amazon Elastic File System Choosing Between the Different Throughput & Performance Modes July 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided
More information