Get Data, Build Apps and Analyze Data Using IBM Bluemix Data and Analytics (Session 6748)
- Jonas Rose
Transcription
1 Get Data, Build Apps and Analyze Data Using IBM Bluemix Data and Analytics (Session 6748). Eric Cattoir, Yves Debeer, Bert Waltniel. © 2015 IBM Corporation
2 IBM Bluemix: The Digital Innovation Platform
3 Innovation is the new currency. "Two guys in a Starbucks can have access to the same computing power as a Fortune 500 company." Jim Deters, Founder, Galvanize
4 Anatomy of a Disruptive Idea
5 To really disrupt, a business should focus on building differentiation and rent the rest. Developers can quickly compose apps with new APIs and digital services to add features and increase engagement in areas like:
- Analytics, cognition
- Mobile, location
- Internet of Things
- Social engagement
- Identity
- Reviews
- Travel
- Messaging
- Their company's private APIs and services
6 Bluemix works for disruptors.
7 Bluemix started as a public PaaS. Bluemix started with a major focus on developer productivity in the public cloud.
[Figure: Infrastructure as a Service and Platform as a Service stacks compared layer by layer (Code, Data, Runtime, Middleware, OS, Virtualization, Servers, Storage, Networking), showing which layers are customer managed and which are service provider managed. IBM SoftLayer.]
8 We listened. Now we're evolving to become even more flexible. Capabilities in Bluemix now span PaaS and IaaS and can be delivered as a public, dedicated, or on-premises* implementation.
[Figure: Infrastructure as a Service and Platform as a Service stacks compared layer by layer (Code, Data, Runtime, Middleware, OS, Virtualization, Servers, Storage, Networking), showing which layers are customer managed and which are service provider managed. Built on technologies: IBM SoftLayer.]
9 How does Bluemix work? Always focused on what's next. Bluemix is underpinned by three key open compute technologies: Cloud Foundry, Docker, and OpenStack. It extends each of these with a growing number of services, robust DevOps tooling, integration capabilities, and a seamless developer experience.
- DevOps Tooling
- Your Own Hosted Apps / Services
- Integration and API Mgmt
- Catalog of Services that Extend Apps' Functionality: Web, Data, Mobile, Cognitive, Analytics, IoT, Security, Yours
- Flexible Compute Options to Run Apps / Services: Instant Runtimes, Containers, Virtual Machines
- Platform Deployment Options that Meet Your Workload Requirements: Bluemix Public and other Bluemix deployments, powered by IBM SoftLayer or in your data center
*Bluemix Local coming Summer 2015
10 Bluemix is built on IBM SoftLayer. The highest performing cloud infrastructure available.
A different kind of data center
- Every location designed, built, and operated to the same standardized, pod-based spec
- 24/7 on-site security and rigorous controls
- Expanding to 40 data centers worldwide
Global network of networks
- Public, private, and management networks all separate
- More than 2,000 Gbps between data centers and network points of presence (PoPs)
- Unmetered inbound public bandwidth and fully unmetered bandwidth between data centers
Entirely automated
- SoftLayer API controls everything: more than 3,000 documented methods and 180 distinct services
- Bare metal and virtualized servers in the same platform
[Map: Bluemix Public locations (Dallas; London, now) and SoftLayer data centers]
11 Sign up in minutes. Pay for what you use. Cloud-based pricing models to serve developer needs.
Friction-free adoption
- 30-day trial (no credit card required): designed to allow testing of an entire application on the platform
- Free tier for every service: encourages experimentation with new services for applications already running on Bluemix
Multiple commitment models
- Pay-as-you-go: optimized for flexibility, no term commitment
- Subscription: term based, optimized for cost, discounted from pay-as-you-go rates
Self service
- Zero to coding in less than 5 minutes
- Credit card over the web in many countries, or through your IBM rep
12 Let's see it! *click*
13 Business Agility through Data
Traditional:
- Requirements Based
- Top-Down Design
- Integration and Reuse
- Competence Centers
- Better Decisions
- Enterprise Focus
Agile Data Analytics:
- Opportunity-Oriented
- Experimentation
- Throwaway
- Hackathons
- Business Innovation
- Functional Focus
14 But How?
15 Agility and Elasticity through Cloud
- Elastic Provisioning
- Pay-as-You-Go
- Manage High Volume External Data Sources
- Self-Service Through a Browser
- SQL / NoSQL
- Unstructured Data
- Access Data Anywhere, Anytime
- Leverage Current Cloud Apps
16 IBM Data and Analytics Services: Open for Data. A comprehensive portfolio of open source data services
17 Work with Cloud Data Services in Bluemix
[Figure: an iterative cycle of GET DATA, PUT DATA TO WORK, ANALYZE DATA, INTERACT, GAIN INSIGHT, VISUALIZE. Internal and external data sources: sensors, internet, social media, customer conversations, back office applications. Services: DataWorks, Cloudant, MongoDB, Notebooks, Predictive Analytics, Apache Kafka, Streams, DB2, Postgres, RethinkDB, Redis, Object Storage, Graph DB, dashDB. Output: your own data and analytics applications.]
18 Example: Health Management Platform
[Figure: a cycle of GET DATA, PUT DATA TO WORK, ANALYZE DATA, INTERACT, GAIN INSIGHT, VISUALIZE. Internal and external data sources: clinical and wearable device sensors (Fitbit, Jawbone), device data, lab results, patient conversations, health results from RDBMS. Services: DataWorks, Streams, Cloudant, dashDB, Notebooks.]
19 Get Data from Own or Public Data
- Import Data into data services, e.g. dashDB, Cloudant, Mongo, through their respective load tools
- Create Connections to diverse data sources, on-premises or cloud, for use in analytics, e.g. Notebooks
- Load Data from diverse sources into cloud data services in-context, powered by DataWorks
20 Get Data from Bluemix's Analytics Exchange
- Explore available data sets
- Find interesting data
- Access data from Bluemix apps
- Analyze Data in Apache Spark & Notebooks, dashDB, Watson Analytics
21 Build Applications using Bluemix Data Services
Connect your applications to use Data in Bluemix:
- Select the Database Service Instance you want to use and pick your plan (upper right)
- Get service credentials to use in your code (lower right)
- Use the APIs, passing the credentials you obtained, from Node.js, Liberty, or other apps on Bluemix, or from apps running on other platforms or devices
- Manage from the context of Bluemix under your Bluemix login
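The "get service credentials" step can be sketched in code. On Cloud Foundry based platforms such as Bluemix, the credentials of bound services are exposed to a running app as JSON in the `VCAP_SERVICES` environment variable. The sketch below is an illustration under assumptions and not material from the session: `ServiceCredentials` and `firstStringField` are hypothetical names, and a regular expression stands in for a real JSON parser only to keep the example free of dependencies.

```scala
import java.util.regex.Pattern

object ServiceCredentials {
  // Returns the first string value stored under `field` in a JSON document.
  // Hypothetical helper: a production app should use a proper JSON library.
  def firstStringField(json: String, field: String): Option[String] = {
    val pattern = ("\"" + Pattern.quote(field) + "\"\\s*:\\s*\"([^\"]*)\"").r
    pattern.findFirstMatchIn(json).map(_.group(1))
  }

  def main(args: Array[String]): Unit = {
    // VCAP_SERVICES is set by Cloud Foundry for apps with bound services.
    val url = sys.env.get("VCAP_SERVICES").flatMap(firstStringField(_, "url"))
    // Report only whether a URL was found; credentials should not be logged.
    println(if (url.isDefined) "service url found" else "no bound service url found")
  }
}
```

Outside Bluemix the variable is normally unset, so the program reports that no URL was found; apps on other platforms would instead pass the credentials copied from the service dashboard.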
22 Analyze Data with IBM Analytics for Spark
Go to Work with Data -> Analytics and create a new service instance.
Interactive Notebooks
- Use Python, Scala with Spark
- Associate an Object Storage for accessing and uploading data
- Connect to data sources, e.g. files, Cloudant, dashDB, on-premises DBs
Spark Submit
- Download Apache Spark Submit
- Develop your own Spark jobs
- Run and monitor your jobs
23 Example: New York Accidents Analytics
NY City Public Data
- Accident Data from NYPD
- Road Condition Data
- Weather History Data
We created a set of Notebooks to
- Cleanse Data to get it in proper shape for visualization and analytics
- Visualize Data to better understand its content and structure
- Analyze Data to identify patterns and correlations in the data
- Predict future Incident Likelihoods from data
- Visualize Insights from descriptive and predictive analytics
34 Spark Technical Discussion
35 Resilient Distributed Datasets (RDDs)
- A collection of elements that Spark works on in parallel.
- May be kept in memory or on disk. Applications can also explicitly tell Spark to cache an RDD, which is great for iterative algorithms.
- An RDD contains the raw data, plus the function to compute it.
- Fault tolerance: if any partition of an RDD is lost, it will automatically be recomputed using the transformations that originally created it.
[Figures: an RDD built from a Java collection; an RDD built from an external dataset (local FS, HDFS, HBase, ...)]
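The two ways of building an RDD, and explicit caching, can be sketched as follows. This is a minimal illustration, not code from the session: the object and method names are hypothetical, and the `local[*]` master is an assumption that lets the sketch run on a single machine with Spark on the classpath.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

object RddBasics {
  // RDD built from an in-memory collection, cached, then reduced.
  def sumOfSquares(sc: SparkContext, numbers: Seq[Int]): Long = {
    val rdd: RDD[Int] = sc.parallelize(numbers)
    // cache() asks Spark to keep the computed squares in memory for reuse.
    val squares = rdd.map(n => n.toLong * n).cache()
    squares.reduce(_ + _)
  }

  // RDD built from an external dataset; each element is one line of the file.
  def countLines(sc: SparkContext, path: String): Long =
    sc.textFile(path).count()

  def main(args: Array[String]): Unit = {
    // local[*] is an assumption for running this sketch without a cluster.
    val conf = new SparkConf().setAppName("RddBasics").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try println(sumOfSquares(sc, 1 to 4))
    finally sc.stop()
  }
}
```

In a notebook, where a SparkContext is usually already provided, the two methods could be called directly with that context instead of creating a new one.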
36 Working with RDDs: Transformations and Actions
- Transformations are lazy: they do not compute their results right away. They are added to the RDD's list of pending operations, which lets Spark optimize the required calculations and recover from lost data partitions. Examples: map(func), filter(func), union(), join(), groupByKey()
- Actions are executed immediately, and trigger execution of all prior transformations on an RDD. Examples: reduce(func), collect(), saveAsSequenceFile()
- func are Java/Scala/Python functions that you write
- Call persist() on an RDD if you plan to reuse it later
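The laziness of transformations can be made visible with a counter. The sketch below is an illustration under assumptions rather than code from the session: `LazyDemo` is a hypothetical name, `longAccumulator` requires Spark 2.0 or later, and the `local[*]` master is assumed so the example runs without a cluster. It counts how often the function passed to `map` has run before and after an action.

```scala
import org.apache.spark.{SparkConf, SparkContext}

object LazyDemo {
  // Returns how many times the map function had run before and after collect().
  def callsBeforeAndAfterAction(sc: SparkContext): (Long, Long) = {
    val calls = sc.longAccumulator("map calls")
    // Transformation: only recorded here, not executed.
    val doubled = sc.parallelize(1 to 5).map { n => calls.add(1); n * 2 }
    val before: Long = calls.value
    // Action: triggers execution of the recorded map.
    doubled.collect()
    val after: Long = calls.value
    (before, after)
  }

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("LazyDemo").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try println(callsBeforeAndAfterAction(sc))
    finally sc.stop()
  }
}
```

Running a second action on `doubled` would execute the map again unless `persist()` had been called on it first, which is the reuse case the slide mentions.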
37 Execution Model
[Figure: a Driver Program, started through spark-submit or a notebook, connected to several Executors]
38 Spark in Action: Word Count in Scala

    import org.apache.spark.{SparkConf, SparkContext}

    val conf = new SparkConf().setAppName("WordCount")
    val sc = new SparkContext(conf)

    // Each element of this RDD is one line of the document
    val file = sc.textFile("swift://filecontainer.spark/input.txt")

    // Transformations (lazy)
    val words = file.flatMap(line => tokenize(line))
    val wordMap = words.map(x => (x, 1))
    val wordCounts = wordMap.reduceByKey(_ + _)

    // Action (triggers the computation)
    wordCounts.saveAsTextFile("swift://filecontainer.spark/output.txt")

Tokenize is

    // Lowercase the text, strip punctuation, split on whitespace
    def tokenize(text: String): Array[String] = {
      text.toLowerCase.replaceAll("[^a-zA-Z0-9\\s]", "").split("\\s+")
    }

// Adapted from Word Count example on
40 DataFrames
41 Combine Services: Analytics of Twitter Data
- Reader (Node.js): receives Twitter data stream and writes to Topic
- Message Hub (Kafka Topic): provides elastic high velocity message queue
- Spark Streaming (Notebook): algorithms in Scala detect Tweets of interest
- Watson Tone Analyzer: Watson enriches Tweets with tone & sentiment info
- Results (Cloudant DB): Cloudant stores insight data with HADR at scale
- REST API (Node.js)
- Insights App (Node.js): lets users explore and interact with results
42 Streaming Analytics using multiple Data Sources
[Figure: components shown: Insight App (Node.js), Notebook, REST API (Node.js), three Readers (Node.js), four Kafka Topics, Results (Cloudant DB), Spark Streaming, Stock Quotes, Alert Gen (Node.js), Push Service, Watson Tone Analyzer, Predictive Analytics]
43 Conclusion
- You can achieve greater Business Agility and faster Insights through Cloud-based Innovation without upfront investment
- IBM Cloud Data Services provide open, cloud-based data and analytics services that enable fast cloud-based innovation
- Bluemix Data & Analytics features and integrates cloud data services, enabling you to
  - Get Data from your own or public data sources
  - Build Applications using cloud data & analytics services
  - Analyze Data with Spark & Notebooks, Hadoop, dashDB
  - Combine and Integrate cloud data & analytics services with each other, as well as with other Bluemix services, e.g. through the new Message Hub based on Apache Kafka
44 Combine Services: Analytics of Twitter Data
45 clouddataservices/sentiment-analysis-oftwitter-hashtags/
46 Thank You
Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document
More informationContinuous delivery of Java applications. Marek Kratky Principal Sales Consultant Oracle Cloud Platform. May, 2016
Continuous delivery of Java applications using Oracle Cloud Platform Services Marek Kratky Principal Sales Consultant Oracle Cloud Platform May, 2016 Safe Harbor Statement The following is intended to
More informationIBM POWER SYSTEMS: YOUR UNFAIR ADVANTAGE
IBM POWER SYSTEMS: YOUR UNFAIR ADVANTAGE Choosing IT infrastructure is a crucial decision, and the right choice will position your organization for success. IBM Power Systems provides an innovative platform
More informationAnalytics in Spark. Yanlei Diao Tim Hunter. Slides Courtesy of Ion Stoica, Matei Zaharia and Brooke Wenig
Analytics in Spark Yanlei Diao Tim Hunter Slides Courtesy of Ion Stoica, Matei Zaharia and Brooke Wenig Outline 1. A brief history of Big Data and Spark 2. Technical summary of Spark 3. Unified analytics
More informationNTT Com Press Conference March 1, 2016 #enterprisecloud
NTT Com Press Conference March 1, 2016 #enterprisecloud 1 Significant Enhancement of Enterprise Cloud - Realizing Digital Transformation - NTT Communications March 1, 2016 2 NTT Communications Initiatives
More informationIndustry-leading Application PaaS Platform
Industry-leading Application PaaS Platform Solutions Transactional Apps Digital Marketing LoB App Modernization Services Web Apps Web App for Containers API Apps Mobile Apps IDE Enterprise Integration
More informationKhadija Souissi. Auf z Systems November IBM z Systems Mainframe Event 2016
Khadija Souissi Auf z Systems 07. 08. November 2016 @ IBM z Systems Mainframe Event 2016 Acknowledgements Apache Spark, Spark, Apache, and the Spark logo are trademarks of The Apache Software Foundation.
More informationPUBLIC AND HYBRID CLOUD: BREAKING DOWN BARRIERS
PUBLIC AND HYBRID CLOUD: BREAKING DOWN BARRIERS Jane R. Circle Manager, Red Hat Global Cloud Provider Program and Cloud Access Program June 28, 2016 WHAT WE'LL DISCUSS TODAY Hybrid clouds and multi-cloud
More informationMicrosoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud
Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions
More informationAWS 101. Patrick Pierson, IonChannel
AWS 101 Patrick Pierson, IonChannel What is AWS? Amazon Web Services (AWS) is a secure cloud services platform, offering compute power, database storage, content delivery and other functionality to help
More informationCenturyLink for Microsoft
Strategic Partner Alliances CenturyLink for Microsoft EMPOWER REACH AGILITY 2017 CenturyLink. All Rights Reserved. The CenturyLink mark, pathways logo and certain CenturyLink product names are the property
More informationIBM API Connect: Introduction to APIs, Microservices and IBM API Connect
IBM API Connect: Introduction to APIs, Microservices and IBM API Connect Steve Lokam, Sr. Principal at OpenLogix @openlogix @stevelokam slokam@open-logix.com (248) 869-0083 What do these companies have
More informationPlease give me your feedback
#HPEDiscover Please give me your feedback Session ID: B4385 Speaker: Aaron Spurlock Use the mobile app to complete a session survey 1. Access My schedule 2. Click on the session detail page 3. Scroll down
More informationIntegrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers
Oracle zsig Conference IBM LinuxONE and z System Servers Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers Sam Amsavelu Oracle on z Architect IBM Washington
More informationHow to Keep UP Through Digital Transformation with Next-Generation App Development
How to Keep UP Through Digital Transformation with Next-Generation App Development Peter Sjoberg Jon Olby A Look Back, A Look Forward Dedicated, data structure dependent, inefficient, virtualized Infrastructure
More informationApplied Spark. From Concepts to Bitcoin Analytics. Andrew F.
Applied Spark From Concepts to Bitcoin Analytics Andrew F. Hart ahart@apache.org @andrewfhart My Day Job CTO, Pogoseat Upgrade technology for live events 3/28/16 QCON-SP Andrew Hart 2 Additionally Member,
More informationCloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018
Cloud Computing 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning
More informationData Acquisition. The reference Big Data stack
Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Data Acquisition Corso di Sistemi e Architetture per Big Data A.A. 2016/17 Valeria Cardellini The reference
More information