zspotlight: Spark on z/os
|
|
- Pierce Sherman Newton
- 6 years ago
- Views:
Transcription
1 zspotlight: Spark on z/os Avijit Chatterjee, STSM, IBM Competitive Project Office 1
2 CEOs are increasingly focused on customers as individuals leveraging contextual mobile and cognitive technologies 2015 IBM Global C-Suite Study: The CEO Perspective 2
3 To focus on customers as individuals, real-time analytics is heavily leveraged across many industries Banking Card: Use scoring to determine transaction risk based on spending history Money laundering risk: Based on money wiring to multiple accounts keeping amount below threshold Retail Sales opportunity: Real-time scoring for target marketing 3 Government Compliance: Score to detect non-compliant behavior and tax evasion Social Services: Assess likelihood that individual will need multiple agency support to proactively engage various agencies to create best outcome and manage costs
4 Running Analytics on enterprise data off-platform doesn t pay for a mainframe-centric business Analytical Analytical Analytical A large European bank: 120 database images created from bulk data transfers 1,000 applications on 750 cores with 14,000 software titles ETL consuming 28% of total distributed cores and 16% of total MIPS A large Asian bank: Source: IBM Eagle Studies Operational Analytical Analytical One mainframe devoted exclusively to bulk data transfers ETL consuming 8% of total distributed core and 18% of total MIPS With this strategy, IT costs grow faster than business growth 4
5 Rather it leads to significant data transfer costs 5 Example: Source: IBM CPO internal study Operational x86 1 TB of data transferred per day ODS Analytical Analytical Analytical ETL Calculator: Estimated 4 yr. cost summary System costs = $9,864,412 Labor costs = $393,927 Total = $10,258,339 Assuming 4 cores on z13 running at 85% utilization and 12 cores on x86 servers run at 45% utilization, transfer will burn 519 MIPS and use 10 x86 cores per day
6 Apache Spark powers a federated Analytics architecture keeping data in place Unified analytics platform for structure and unstructured data Flexibility & agility with multi-language support Efficient structure 100x vs. map reduce Rich set of built-in functions with consistent APIs: Spark SQL, Spark MLib, Spark Streaming, GraphX, Spark Spark Spark DB2 Linux on z Systems CICS Spark VSAM IMS DB SMF Syslog IMS Spark Physical Sequential z/os Tape Spark WAS DB2 z/os Log Streams ADABAS Spark Spark Power Spark Spark x86 Spark Spark Cloud 6
7 Apache Spark empowers users to accelerate the insight economy What they want to do: Build applications that leverage advanced analytics in partnership with the data scientist and data engineer Follow agile design methodologies Optimize performance and meet SLAs Scientist the convincer What they want to do: Identify patterns, trends, risks, and opportunities in data Discover new actionable insights How Spark can help: Supports the entire data science workflow: from data access and integration, to machine learning, to visualization Provides a growing library of machine learning algorithms 7 How Spark can help: Supports the top analytics app languages such as Python and Scala Eliminates programming complexity with libraries such as MLlib and simplifies DevOps Makes it easy to embed advanced analytics into applications App Developer the builder Engineer the thinker What they want to do: Bridge between the Scientist and the App Developer Implement machine learning algorithms at scale Put the right data system to work for the job at hand How Spark can help: Abstract data access complexity (Spark doesn t care what your data store is) Enables solutions at web-scale
8 Apache Spark is an operating system for Analytics Spark Streaming: Enables scalable, high-throughput processing of live data streams Live stream chopped into batches based on time window Spark SQL: Provides capability to perform relational queries via SQL (subset of HiveQL) Mix SQL queries with Spark applications Spark MLIB Provides scalable machine learning library, has common machine learning functions Provides classification, regression, clustering, filtering, etc. from Spark GraphX Spark APIs for graph style processing and iterative graph computations Spark Core: Foundation providing task dispatching, scheduling, i/o Representation of Spark s basic unit of data: RDD 8
9 On-platform Operational Analytics using Spark on z/os z/os JDBC/AZK DB2 z/os NYSE data Spark on z/os joins multiple data types for fast, complete analytics, without moving the data 1 Scala query using sql syntax to access 3 data sources (over 1 billion rows of data) in 3 different formats JDBC/AZ K JDBC/AZ K Flat file Nasdaq data VSAM S&P data 1.1 billion rows of source data Filtered to 50M rows of Trades for brokers in 1 region Summarized to show activity for each broker across the big 3 exchanges Completed in < 2 minutes with 1 GP (60% utilized) and 11 ziips on a z13 LPAR with 512Gb memory Use Case: Filtered data pull from 1 Billion rows, using Spark filtering to access 50 Million rows, and then summarize using Spark aggregation 9 Resource specs for run: 6 executors, 6Gb driver mem, 80Gb per executor, 4 vcores per executor (ziips in SMT-2), 12 concurrent threads, 512Gb total memory in LPAR, 13 ziips, 2 GPs. AZK accessing all data, spark unionall to merge data. 55% of 1 GP utilized during peek resource consumption
10 On-platform Operational Analytics achieves 67% lower TCA CICS DB2 ETL Apache Spark Parquet $2,105,990 (3 yr. TCA) 10 Trade 166GB Brokerage aggregation query workload across Trades tables from 3 exchanges (over 5 Billion trades, 500GB) * 3-Year TCA includes 3-year US prices for Hardware, Software, Maintenance and Support as of 05/16/2016. Price and performance for x86 environment includes cost of ETL and elapsed time to transfer the data. This is based on an IBM internal study designed to replicate a typical IBM customer workload usage in the marketplace. z/os z Competitor x86 System Intel E v2 2.7GHz 12co CICS DB2 Apache Spark z/os z ziips Linux 67% $697,106 (3 yr. TCA) lower TCA* For systems compared
11 It doesn t pay to move data to x86 to run Analytics beyond 150GB Competitive Project Office $4,000,000 Comparing 3-year TCA factoring Cost of ETL $3,500,000 $3,000,000 $2,500,000 $2,000,000 $1,500,000 $1,000,000 $500,000 $0 100GB 300GB 500GB 700GB 900GB Spark on z/os Spark on x86 11
12 Spark on z/os offering is now generally available What is in the Offering? IBM z/os Platform for Apache Spark (IBM product): Apache Spark enabled for z/os Optimized Integration Layer No License Charge product Support & Service available from IBM Very aggressive pricing for ziips and memory for Spark z/os workload Lab Services install, config, tune Jumpstart service for data science use cases FactZ quick strike POCs for Spark analytic business applications Ecosystem GitHub z/os-spark repository Jupyter Notebook IDEs (Scala Workbench, Interactive Insights Workbench) Apache Job Server Sample data & code snippets Rocket: Industry vertical mappings, e.g. ISO for card data In progress: R support FactZ: Custom Solutions for banking & insurance Zementis: Fast, Scalable, In-Transaction Predictive Scoring integration Apache Spark 12
13 Common use cases of Spark on z/os Analytics across OLTP & Warehouse information Analytics combining business-owned data and external IoT and social data Analytics to improve system performance and operations in realtime using streaming as well as archived data 13
14 Why Spark on z/os is best fit Optimized and parallel access to almost all z/os data environments and distributed data sources analyzing data in place Spark memory structures with sensitive data are governed with z/os security capabilities Leverages z/os memory management, compression, and RDMA communications to provide a high-performance scale up and scale out architecture. Uses large pages, incorporating DRAM with large amounts of Flash as an attractive means to provide scalable elastic memory Best fit analytic capability for the investments made in SMF in-memory analytics Leverages zedc compression when compressing internal data for caching and shuffling SMT2 and SIMD on select operations for enhanced performance Very high ziip offload -- for affordability Intra-SQL and intra-partition parallelism for optimal data access 14
15 Seize the moment with Analytics on z Systems adopt or be disrupted Enhance customer experience by providing right product at the right place and right time Locate analytics exactly where your data resides Power real time analytics with in-memory analytics engine Spark on z/os Get started today: ibm.biz/zanalytics 15
16 IBM z Systems CPO Resources IBM Competitive Project Office Our mission is to perform hands-on competitive research that will enable sales and technical professionals to boost market awareness, increase pipeline and sales of IBM z Systems hardware and IBM software. Please visit our website for additional information on how we can help you generate leads and close business. Customer Briefing Bringing to life the results of IBM's lab-based, hands-on competitive research with your clients in face-to-face events. "IBM z Systems More Potential than Ever" "IBM LinuxONE - Enabling IT to Solve Business Challenges" For First in Enterprise (FIE) IBM CPO Competitive Sales Assists (CSA) A CPO CSA helps to make the advantages of IBM's solutions more real and compelling to a customer evaluating competitive alternatives. Visit our website to learn how CPO provides one-to-one clients engagements to assist sellers with closing competitive opportunities. IBM CPO Communication Share the external z Systems website with your clients. Click here to subscribe to IBM CPO Communications IBM Redbooks Point-of-View and Redpaper publications Authored by IBM CPO Read more about our research - CPO Technical Deliverables Join the CPO Community 16 For information on z Systems CPO Program or Events: Sally Touscany touscany@us.ibm.com
Khadija Souissi. Auf z Systems November IBM z Systems Mainframe Event 2016
Khadija Souissi Auf z Systems 07. 08. November 2016 @ IBM z Systems Mainframe Event 2016 Acknowledgements Apache Spark, Spark, Apache, and the Spark logo are trademarks of The Apache Software Foundation.
More informationIBM Data Virtualization Manager for z/os Leverage data virtualization synergy with API economy to evolve the information architecture on IBM Z
IBM for z/os Leverage data virtualization synergy with API economy to evolve the information architecture on IBM Z IBM z Analytics Agenda Big Data vs. Dark Data Traditional Data Integration Mainframe Data
More informationIBM DATA VIRTUALIZATION MANAGER FOR z/os
IBM DATA VIRTUALIZATION MANAGER FOR z/os Any Data to Any App John Casey Senior Solutions Advisor jcasey@rocketsoftware.com IBM z Analytics A New Era of Digital Business To Remain Competitive You must deliver
More informationANY Data for ANY Application Exploring IBM Data Virtualization Manager for z/os in the era of API Economy
ANY Data for ANY Application Exploring IBM for z/os in the era of API Economy Francesco Borrello francesco.borrello@it.ibm.com IBM z Analytics Traditional Data Integration Inadequate No longer Viable to
More informationFast Innovation requires Fast IT
Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:
More informationIBM DB2 Analytics Accelerator Trends and Directions
March, 2017 IBM DB2 Analytics Accelerator Trends and Directions DB2 Analytics Accelerator for z/os on Cloud Namik Hrle IBM Fellow Peter Bendel IBM STSM Disclaimer IBM s statements regarding its plans,
More informationOverview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::
Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized
More informationAbstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight
ESG Lab Review InterSystems Data Platform: A Unified, Efficient Data Platform for Fast Business Insight Date: April 218 Author: Kerry Dolan, Senior IT Validation Analyst Abstract Enterprise Strategy Group
More informationSaving ETL Costs Through Data Virtualization Across The Enterprise
Saving ETL Costs Through Virtualization Across The Enterprise IBM Virtualization Manager for z/os Marcos Caurim z Analytics Technical Sales Specialist 2017 IBM Corporation What is Wrong with Status Quo?
More informationOracle Exadata: Strategy and Roadmap
Oracle Exadata: Strategy and Roadmap - New Technologies, Cloud, and On-Premises Juan Loaiza Senior Vice President, Database Systems Technologies, Oracle Safe Harbor Statement The following is intended
More informationIBM DB2 Analytics Accelerator use cases
IBM DB2 Analytics Accelerator use cases Ciro Puglisi Netezza Europe +41 79 770 5713 cpug@ch.ibm.com 1 Traditional systems landscape Applications OLTP Staging Area ODS EDW Data Marts ETL ETL ETL ETL Historical
More informationIBM Systems for Cognitive Solutions IBM Machine Learning for z/os
IBM Systems for Cognitive Solutions IBM Machine Learning for z/os Khadija Souissi IBM Client Center Boeblingen Machine Learning takes center stage Gartner identifies Machine Learning as the Top Trend in
More informationTaming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems
1 Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems The Defacto Choice For Convergence 2 ABSTRACT & SPEAKER BIO Dealing with enormous data growth is a key challenge for
More informationData Analytics at Logitech Snowflake + Tableau = #Winning
Welcome # T C 1 8 Data Analytics at Logitech Snowflake + Tableau = #Winning Avinash Deshpande I am a futurist, scientist, engineer, designer, data evangelist at heart Find me at Avinash Deshpande Chief
More informationFluentd + MongoDB + Spark = Awesome Sauce
Fluentd + MongoDB + Spark = Awesome Sauce Nishant Sahay, Sr. Architect, Wipro Limited Bhavani Ananth, Tech Manager, Wipro Limited Your company logo here Wipro Open Source Practice: Vision & Mission Vision
More informationIBM IMS Tools Keynote
IBM IMS TECHNICAL SYMPOSIUM 2016 IBM IMS Tools Keynote Janet LeBlanc IMS Tools Offering Manager 2016 IBM Corporation Agenda Our journey where we have been A couple of products you should see this week:
More informationProgress DataDirect For Business Intelligence And Analytics Vendors
Progress DataDirect For Business Intelligence And Analytics Vendors DATA SHEET FEATURES: Direction connection to a variety of SaaS and on-premises data sources via Progress DataDirect Hybrid Data Pipeline
More informationData Virtualization for the Enterprise
Data Virtualization for the Enterprise New England Db2 Users Group Meeting Old Sturbridge Village, 1 Old Sturbridge Village Road, Sturbridge, MA 01566, USA September 27, 2018 Milan Babiak Client Technical
More informationIBM Data Virtualization Manager in Detail + Demo Atlanta DB2 User Group Meeting December 7, 2018
IBM Data Virtualization Manager in Detail + Demo Atlanta DB2 User Group Meeting December 7, 2018 Milan Babiak Client Technical Professional, Analytics on Z Systems North America IBM Canada Milan.Babiak@ca.ibm.com
More informationBuilding a Data Strategy for a Digital World
Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service
More informationTECHED USER CONFERENCE MAY 3-4, 2016
TECHED USER CONFERENCE MAY 3-4, 2016 Bruce Beaman, Senior Director Adabas and Natural Product Marketing Software AG Software AG s Future Directions for Adabas and Natural WHAT CUSTOMERS ARE TELLING US
More informationApache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context
1 Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes
More informationIBM Db2 Event Store Simplifying and Accelerating Storage and Analysis of Fast Data. IBM Db2 Event Store
IBM Db2 Event Store Simplifying and Accelerating Storage and Analysis of Fast Data IBM Db2 Event Store Disclaimer The information contained in this presentation is provided for informational purposes only.
More informationFrom Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019
From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways
More informationBig Data. Big Data Analyst. Big Data Engineer. Big Data Architect
Big Data Big Data Analyst INTRODUCTION TO BIG DATA ANALYTICS ANALYTICS PROCESSING TECHNIQUES DATA TRANSFORMATION & BATCH PROCESSING REAL TIME (STREAM) DATA PROCESSING Big Data Engineer BIG DATA FOUNDATION
More informationAnalytics on z Systems, what s new?
Khadija Souissi Analytics on z Systems, what s new? IBM Architektentage 16.11.2016 Agenda Spark on z Systems What is Spark Spark Details The ecosystem for Spark on z/os Use Cases New Dimension for the
More informationMigrate from Netezza Workload Migration
Migrate from Netezza Automated Big Data Open Netezza Source Workload Migration CASE SOLUTION STUDY BRIEF Automated Netezza Workload Migration To achieve greater scalability and tighter integration with
More informationTHINK DIGITAL RETHINK LEGACY
THINK DIGITAL RETHINK LEGACY Adabas & 2050+ Platform Strategy & Roadmap Bruce Beddoe VP Adabas Systems 1 % BUSINESS & MISSION-CRITICAL 2 For internal use only Billions invested in DIFFERENTIATING business
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationADABAS & NATURAL 2050+
ADABAS & NATURAL 2050+ Guido Falkenberg SVP Global Customer Innovation DIGITAL TRANSFORMATION #WITHOUTCOMPROMISE 2017 Software AG. All rights reserved. ADABAS & NATURAL 2050+ GLOBAL INITIATIVE INNOVATION
More informationExploiting IT Log Analytics to Find and Fix Problems Before They Become Outages
Exploiting IT Log Analytics to Find and Fix Problems Before They Become Outages Clyde Richardson (richarcl@us.ibm.com) Technical Sales Specialist Sarah Knowles (seli@us.ibm.com) Strategy and Portfolio
More informationBringing Data to Life
Bringing Data to Life Data management and Visualization Techniques Benika Hall Rob Harrison Corporate Model Risk March 16, 2018 Introduction Benika Hall Analytic Consultant Wells Fargo - Corporate Model
More information1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda
Agenda Oracle9i Warehouse Review Dulcian, Inc. Oracle9i Server OLAP Server Analytical SQL Mining ETL Infrastructure 9i Warehouse Builder Oracle 9i Server Overview E-Business Intelligence Platform 9i Server:
More informationBUILD BETTER MICROSOFT SQL SERVER SOLUTIONS Sales Conversation Card
OVERVIEW SALES OPPORTUNITY Lenovo Database Solutions for Microsoft SQL Server bring together the right mix of hardware infrastructure, software, and services to optimize a wide range of data warehouse
More informationData 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.
Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020
More informationAccelerate your Azure Hybrid Cloud Business with HPE. Ken Won, HPE Director, Cloud Product Marketing
Accelerate your Azure Hybrid Cloud Business with HPE Ken Won, HPE Director, Cloud Product Marketing Mega trend: Customers are increasingly buying cloud services from external service providers Speed of
More informationUnderstanding the latent value in all content
Understanding the latent value in all content John F. Kennedy (JFK) November 22, 1963 INGEST ENRICH EXPLORE Cognitive skills Data in any format, any Azure store Search Annotations Data Cloud Intelligence
More informationA Tutorial on Apache Spark
A Tutorial on Apache Spark A Practical Perspective By Harold Mitchell The Goal Learning Outcomes The Goal Learning Outcomes NOTE: The setup, installation, and examples assume Windows user Learn the following:
More informationWEB-APIs DRIVING DIGITAL INNOVATION
WEB-APIs DRIVING DIGITAL INNOVATION Importance of Web-APIs Simply put, Web-APIs are the medium to make a company s digital assets consumable to any channel, which has a current or latent need. It helps
More informationTransforming IT: From Silos To Services
Transforming IT: From Silos To Services Chuck Hollis Global Marketing CTO EMC Corporation http://chucksblog.emc.com @chuckhollis IT is being transformed. Our world is changing fast New Technologies New
More informationModernizing Business Intelligence and Analytics
Modernizing Business Intelligence and Analytics Justin Erickson Senior Director, Product Management 1 Agenda What benefits can I achieve from modernizing my analytic DB? When and how do I migrate from
More informationBig Data Infrastructures & Technologies
Big Data Infrastructures & Technologies Spark and MLLIB OVERVIEW OF SPARK What is Spark? Fast and expressive cluster computing system interoperable with Apache Hadoop Improves efficiency through: In-memory
More informationThe Never Ending Value of z Systems Focus on Analytics & Big Data
The Never Ending Value of z Systems Focus on Analytics & Big Data Hélène Lyon Distinguished Engineer & CTO, Analytics on z Systems for Europe Europe IMS SWAT Technical Executive IBM Systems, zsoftware
More informationWhen, Where & Why to Use NoSQL?
When, Where & Why to Use NoSQL? 1 Big data is becoming a big challenge for enterprises. Many organizations have built environments for transactional data with Relational Database Management Systems (RDBMS),
More informationModernize Without. Compromise. Modernize Without Compromise- All Flash. All-Flash Portfolio. Haider Aziz. System Engineering Manger- Primary Storage
Modernize Without Modernize Without Compromise- All Flash Compromise All-Flash Portfolio Haider Aziz Haider Aziz System Engineering Manger- Primary Storage System Engineering Manger- Primary Storage Modern
More informationEnd to End Analysis on System z IBM Transaction Analysis Workbench for z/os. James Martin IBM Tools Product SME August 10, 2015
End to End Analysis on System z IBM Transaction Analysis Workbench for z/os James Martin IBM Tools Product SME August 10, 2015 Please note IBM s statements regarding its plans, directions, and intent are
More informationEnhancing Security With SQL Server How to balance the risks and rewards of using big data
Enhancing Security With SQL Server 2016 How to balance the risks and rewards of using big data Data s security demands and business opportunities With big data comes both great reward and risk. Every company
More informationUNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX
UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX 1 Successful companies know that analytics are key to winning customer loyalty, optimizing business processes and beating their
More informationThe Power to Stream z IT Operational Data to the Analytic Engine of Your Choice
The Power to Stream z IT Operational Data to the Analytic Engine of Your Choice Domenico D Alterio IBM November 2018 Session OK Agenda Business challenges IBM Common Data Provider for z Systems Overview
More informationDATACENTER SERVICES DATACENTER
SERVICES SOLUTION SUMMARY ALL CHANGE React, grow and innovate faster with Computacenter s agile infrastructure services Customers expect an always-on, superfast response. Businesses need to release new
More informationBlurring the Line Between Developer and Data Scientist
Blurring the Line Between Developer and Data Scientist Notebooks with PixieDust va barbosa va@us.ibm.com Developer Advocacy IBM Watson Data Platform WHY ARE YOU HERE? More companies making bet-the-business
More informationDATA SCIENCE USING SPARK: AN INTRODUCTION
DATA SCIENCE USING SPARK: AN INTRODUCTION TOPICS COVERED Introduction to Spark Getting Started with Spark Programming in Spark Data Science with Spark What next? 2 DATA SCIENCE PROCESS Exploratory Data
More informationSoftware Defined Storage
Software Defined Storage IBM Spectrum Portfolio Ian Hancock ian.hancock@uk.ibm.com Business challenges are IT challenges Create new business models (CEO) Transform financial & management processes (CFO)
More informationData 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp.
17-18 March, 2018 Beijing Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020 Today, 80% of organizations
More informationThe Evolution of Big Data Platforms and Data Science
IBM Analytics The Evolution of Big Data Platforms and Data Science ECC Conference 2016 Brandon MacKenzie June 13, 2016 2016 IBM Corporation Hello, I m Brandon MacKenzie. I work at IBM. Data Science - Offering
More informationMapR Enterprise Hadoop
2014 MapR Technologies 2014 MapR Technologies 1 MapR Enterprise Hadoop Top Ranked Cloud Leaders 500+ Customers 2014 MapR Technologies 2 Key MapR Advantage Partners Business Services APPLICATIONS & OS ANALYTICS
More informationBenchmarking Spark ML using BigBench. Sweta Singh TPCTC 2016
Benchmarking Spark ML using BigBench Sweta Singh singhswe@us.ibm.com TPCTC 2016 Motivation Study the performance of Machine Learning use cases on large data warehouses in context of assessing Alternate
More informationDemystifying the Cloud With a Look at Hybrid Hosting and OpenStack
Demystifying the Cloud With a Look at Hybrid Hosting and OpenStack Robert Collazo Systems Engineer Rackspace Hosting The Rackspace Vision Agenda Truly a New Era of Computing 70 s 80 s Mainframe Era 90
More informationDeploying, Managing and Reusing R Models in an Enterprise Environment
Deploying, Managing and Reusing R Models in an Enterprise Environment Making Data Science Accessible to a Wider Audience Lou Bajuk-Yorgan, Sr. Director, Product Management Streaming and Advanced Analytics
More informationUnifying Big Data Workloads in Apache Spark
Unifying Big Data Workloads in Apache Spark Hossein Falaki @mhfalaki Outline What s Apache Spark Why Unification Evolution of Unification Apache Spark + Databricks Q & A What s Apache Spark What is Apache
More informationHadoop 2.x Core: YARN, Tez, and Spark. Hortonworks Inc All Rights Reserved
Hadoop 2.x Core: YARN, Tez, and Spark YARN Hadoop Machine Types top-of-rack switches core switch client machines have client-side software used to access a cluster to process data master nodes run Hadoop
More informationDQpowersuite. Superior Architecture. A Complete Data Integration Package
DQpowersuite Superior Architecture Since its first release in 1995, DQpowersuite has made it easy to access and join distributed enterprise data. DQpowersuite provides an easy-toimplement architecture
More informationMicrosoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud
Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions
More informationLatest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager
Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager Please Note IBM s statements regarding its plans, directions, and intent are subject to change or
More informationLeading-edge Technology on System z
Leading-edge Technology on System z David Rhoderick IBM Corporation Tuesday 11 th March 2014 Session Number 15033 Test link: www.share.org Fifty years ago, IBM introduced the first mainframe computer It
More informationz/os Update Jeff Magdall z/os PDT Lead February 4, IBM Corporation
z/os Update Jeff Magdall z/os PDT Lead magdall@us.ibm.com February 4, 2013 Topics Update on 2012 z/os Version 2 Statement of Direction zec12 Announcement February 5 th Preview New Solutions Announcement
More informationOLAP Introduction and Overview
1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata
More informationmicrosoft
70-775.microsoft Number: 70-775 Passing Score: 800 Time Limit: 120 min Exam A QUESTION 1 Note: This question is part of a series of questions that present the same scenario. Each question in the series
More informationIntelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018
Intelligent Enterprise meets Science of Where Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Value The Esri & SAP journey Customer Impact Innovation Track Record Customer
More information2016 IBM Corporation 1
1 Driving Systems Competitive Advantage through Collaborative Innovation Tom Rosamilia IBM Investor Briefing 2016 Senior Vice President, IBM Systems 2 Systems $9.5B $1.7B Revenue Growth 2010 2015 z Systems
More informationWas ist dran an einer spezialisierten Data Warehousing platform?
Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction
More informationREALIZE YOUR. DIGITAL VISION with Digital Private Cloud from Atos and VMware
REALIZE YOUR DIGITAL VISION with Digital Private Cloud from Atos and VMware Today s critical business challenges and their IT impact Business challenges Maximizing agility to accelerate time to market
More informationIBM Data Replication for Big Data
IBM Data Replication for Big Data Highlights Stream changes in realtime in Hadoop or Kafka data lakes or hubs Provide agility to data in data warehouses and data lakes Achieve minimum impact on source
More informationCONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM
CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications
More information5 Fundamental Strategies for Building a Data-centered Data Center
5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse
More informationCloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)
CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) Microsoft Solution Latest Sl Area Refresh No. Course ID Run ID Course Name Mapping Date 1 AZURE202x 2 Microsoft
More informationIBM Storage Solutions & Software Defined Infrastructure
IBM Storage Solutions & Software Defined Infrastructure Strategy, Trends, Directions Calline Sanchez, Vice President, IBM Enterprise Systems Storage Twitter: @cksanche LinkedIn: www.linkedin.com/pub/calline-sanchez/9/599/b09/
More informationIBM dashdb Local. Using a software-defined environment in a private cloud to enable hybrid data warehousing. Evolving the data warehouse
IBM dashdb Local Using a software-defined environment in a private cloud to enable hybrid data warehousing Evolving the data warehouse Managing a large-scale, on-premises data warehouse environments to
More informationIBM DB2 Analytics Accelerator
June, 2017 IBM DB2 Analytics Accelerator DB2 Analytics Accelerator for z/os on Cloud for z/os Update Peter Bendel IBM STSM Disclaimer IBM s statements regarding its plans, directions, and intent are subject
More informationEvolving To The Big Data Warehouse
Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from
More informationOptimizing Data Transformation with Db2 for z/os and Db2 Analytics Accelerator
Optimizing Data Transformation with Db2 for z/os and Db2 Analytics Accelerator Maryela Weihrauch, IBM Distinguished Engineer, WW Analytics on System z March, 2017 Please note IBM s statements regarding
More informationIBM Db2 Analytics Accelerator Version 7.1
IBM Db2 Analytics Accelerator Version 7.1 Delivering new flexible, integrated deployment options Overview Ute Baumbach (bmb@de.ibm.com) 1 IBM Z Analytics Keep your data in place a different approach to
More informationOracle Big Data Connectors
Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process
More informationApproaching the Petabyte Analytic Database: What I learned
Disclaimer This document is for informational purposes only and is subject to change at any time without notice. The information in this document is proprietary to Actian and no part of this document may
More informationexam. Microsoft Perform Data Engineering on Microsoft Azure HDInsight. Version 1.0
70-775.exam Number: 70-775 Passing Score: 800 Time Limit: 120 min File Version: 1.0 Microsoft 70-775 Perform Data Engineering on Microsoft Azure HDInsight Version 1.0 Exam A QUESTION 1 You use YARN to
More information2/26/2017. Originally developed at the University of California - Berkeley's AMPLab
Apache is a fast and general engine for large-scale data processing aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes Low latency: sub-second
More informationBig Data Architect.
Big Data Architect www.austech.edu.au WHAT IS BIG DATA ARCHITECT? A big data architecture is designed to handle the ingestion, processing, and analysis of data that is too large or complex for traditional
More informationBig data systems 12/8/17
Big data systems 12/8/17 Today Basic architecture Two levels of scheduling Spark overview Basic architecture Cluster Manager Cluster Cluster Manager 64GB RAM 32 cores 64GB RAM 32 cores 64GB RAM 32 cores
More informationQLogic/Lenovo 16Gb Gen 5 Fibre Channel for Database and Business Analytics
QLogic/ Gen 5 Fibre Channel for Database Assessment for Database and Business Analytics Using the information from databases and business analytics helps business-line managers to understand their customer
More informationTransformation in Technology Barbara Duck Chief Information Officer. Investor Day 2018
Transformation in Technology Barbara Duck Chief Information Officer Investor Day 2018 Key Takeaways 1Transformation in Technology driving out cost, supporting a more technologyenabled business Our new
More informationPerformance Innovations with Oracle Database In-Memory
Performance Innovations with Oracle Database In-Memory Eric Cohen Solution Architect Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information
More informationThat Set the Foundation for the Private Cloud
for Choosing Virtualization Solutions That Set the Foundation for the Private Cloud solutions from work together to harmoniously manage physical and virtual environments, enabling the use of multiple hypervisors
More informationSAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC
SAP Agile Data Preparation Simplify the Way You Shape Data Introduction SAP Agile Data Preparation Overview Video SAP Agile Data Preparation is a self-service data preparation application providing data
More informationTransformation Through Innovation
Transformation Through Innovation A service provider strategy to prosper from digitization People will have 11.6 billion mobile-ready devices and connections by 2020. For service providers to thrive today
More informationTECHED USER CONFERENCE MAY 3-4, 2016
TECHED USER CONFERENCE MAY 3-4, 2016 Bob Jeffcott Software AG Big Data Adabas In Memory Data Management with Terracotta 2016 Software AG. All rights reserved. For internal use only AGENDA 1. ADABAS/NATURAL
More informationExam Questions
Exam Questions 70-775 Perform Data Engineering on Microsoft Azure HDInsight (beta) https://www.2passeasy.com/dumps/70-775/ NEW QUESTION 1 You are implementing a batch processing solution by using Azure
More informationTen Innovative Financial Services Applications Powered by Data Virtualization
Ten Innovative Financial Services Applications Powered by Data Virtualization DATA IS THE NEW ALPHA In an industry driven to deliver alpha, where might financial services firms find opportunities when
More informationREGULATORY REPORTING FOR FINANCIAL SERVICES
REGULATORY REPORTING FOR FINANCIAL SERVICES Gordon Hughes, Global Sales Director, Intel Corporation Sinan Baskan, Solutions Director, Financial Services, MarkLogic Corporation Many regulators and regulations
More informationIBM POWER SYSTEMS: YOUR UNFAIR ADVANTAGE
IBM POWER SYSTEMS: YOUR UNFAIR ADVANTAGE Choosing IT infrastructure is a crucial decision, and the right choice will position your organization for success. IBM Power Systems provides an innovative platform
More informationFIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION
FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION The process of planning and executing SQL Server migrations can be complex and risk-prone. This is a case where the right approach and
More information