Top Five Reasons for Data Warehouse Modernization Philip Russom
Transcription
1 Top Five Reasons for Data Warehouse Modernization Philip Russom TDWI Research Director for Data Management May 28, 2014
2 Sponsor
3 Speakers
- Philip Russom, TDWI Research Director, Data Management
- Steve Sarsfield, Product Marketing Manager, HP Vertica
4 Agenda
PLEASE #TDWI, #EDW, #DataWarehouse, #DataArchitecture, #Analytics, #RealTime
- Background
  - Why many users' DWs need modernization
  - What is it?
- Top Five Reasons (there are many reasons, but I'll boil it down to five)
  - Analytics
  - Scale
  - Speed
  - Productivity
  - Cost Control
- New DW Architectures Resulting from Modernization
- Recommendations
5 DW Modernization has many meanings
- Additions to existing data warehouse
  - New data subjects, sources, tables, dimensions, etc.
- More standalone data platforms and tools
  - Complement DW without replacing it
  - More marts and ODSs
  - New appliances, columnar databases, Hadoop, NoSQL, etc.
- Architectural Adjustments
  - All the above
  - Better design
- Upgrades
  - Newer versions of current DBMS software
  - More hardware
- Rip and Replace
  - Decommission current DW platform and migrate to another
6 Contact Information
If you have further questions or comments:
- Philip Russom, TDWI
- Randy Lea, Teradata
7 Top Five Goals for DW Modernization
- I'll mostly focus on improvements to: Analytics, Scale, Speed
  - These regularly rank high in TDWI surveys, for example: 1. ANALYTICS 2. SCALE 3. SPEED
  - SOURCE: 2014 TDWI Report: Evolving Data Warehouse Architectures, Figure 4
- I'll also mention improvements to: Productivity, Cost Control
  - These regularly come up in TDWI interviews with users
8 DW Modernization Goals are Related
- Analytics needs better productivity
- Speed contributes to scale and productivity
- The challenge is to gain improvements with the first four goals without incurring more of the fifth: cost.
[Figure: HIGH PERFORMANCE DATA WAREHOUSING (HiPer DW), four dimensions. SOURCE: 2012 TDWI Report: High Performance Data Warehousing, Figure 1.]
- SPEED: Streaming Big Data; Event Processing; Real-Time Operation; Operational BI; Near-Time Analytics; Dashboard Refresh; Fast Queries
- CONCURRENCY: Competing Workloads (Reporting, Real Time, OLAP, Adv. Analytics, etc.); Intra-Day Data Loads; Thousands of Users; Ad hoc Queries
- SCALE: Big Data Volumes; Detailed Source Data; Thousands of Reports; Scale Out Into: Clouds, clusters, grids, distributed architectures
- COMPLEXITY: Big Data Variety; Unstructured Data; Machine/sensor Data; Web & Social Media; Many Sources/Targets; Complex Models & SQL; High Availability
9 BEYOND OLAP & REPORTING TO Advanced Analytics
- Organizations need more analytic insights
  - To compete, serve customers, be profitable, control costs, improve quality, grow, etc.
- Analytics is becoming a larger portion of BI work
  - Reporting and OLAP are still important
- Organizations need advanced forms of analytics
  - Technologies: Extreme SQL, data mining, statistics, natural language processing, text mining, AI, graph, etc.
  - Methods: Predictive, clustering, segmentation, risk, fraud detection, etc.
- Most users designed EDWs for reporting and OLAP
  - Analytics requirements differ from reports and OLAP
- Users face multiple paths to enabling advanced analytics
  - Retrofit analytics onto report-focused EDW
  - Deploy an analytic data platform that complements the EDW
  - Replace the EDW's platform with one that handles all workloads
10 Scale TO MORE DATA, USERS, REPORTS, ANALYSES
- Data's Growing Volumes are a Challenge
  - Large data warehouses: data for both reporting and analytics
  - Big Data: volume aside, also diversity of data type, source, latency
- Scale is also a Challenge to Basic BI Functions, like Reporting
  - Thousands of Concurrent BI Users; Thousands of Reports
  - Eventually, thousands of analytic users
- Scale to Increasing Complexity
  - More processing for ETL, integration, quality, analytics, real time, etc.
  - Distributed DW architectures have more moving parts
- Scale despite Growing Numbers of Concurrent Workloads
  - Reporting, Real Time, OLAP, Analytics, Data Loads, Ad hoc Queries
- Users have a number of choices for scaling
  - Scale Up: More hardware for more data; efficient storage
  - Scale Out: Clouds, clusters, grids, racks, distributed architectures
  - Deploy or migrate to data platforms built for analytics with big data: columnar databases, data warehouse appliances, newer brands of databases, Hadoop, NoSQL, etc.
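The "Scale Out" choice on slide 10 spreads data across many nodes. As a rough illustration only, the Python sketch below assigns rows to nodes by hashing a key. The function names, the key format, and the choice of SHA-256 are assumptions made for this example; the presentation does not describe any specific distribution scheme.

```python
import hashlib
from collections import defaultdict


def node_for(key: str, node_count: int) -> int:
    """Map a key to a node index using a stable hash (illustrative choice)."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % node_count


def distribute(keys, node_count):
    """Group keys by the node each one hashes to."""
    placement = defaultdict(list)
    for key in keys:
        placement[node_for(key, node_count)].append(key)
    return dict(placement)


if __name__ == "__main__":
    # Hypothetical customer keys spread over a hypothetical 4-node cluster.
    keys = [f"cust-{i}" for i in range(20)]
    for node, assigned in sorted(distribute(keys, 4).items()):
        print(node, len(assigned))
```

A simple modulo scheme like this reshuffles most keys when the node count changes, which is one reason real distributed platforms tend to use more elaborate placement strategies.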
11 EVERYTHING NEEDS MORE Speed
- Speed involves a temporal continuum
  - From high performance to near time and true real time
- Speed is enabled by a functional continuum
  - From hardware to perky queries to event processing
- Many options are available for modernizing EDWs and analytics
  - High performance functionality: In-memory databases, in-database analytics, columnar databases, DW appliances, solid-state drives, modern CPUs, big memory in servers
  - Near-time functionality: Microbatches, federation, virtualization, replication, services, query optimization, etc.
  - Real-time functionality: Complex event processing (CEP), stream processing, operational intelligence, etc.
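Microbatches, named on slide 11 as a near-time option, load data in many small batches rather than one large nightly run. The Python sketch below is a minimal illustration under stated assumptions: the events are an in-memory list, and `load_batch` is a hypothetical callback standing in for a warehouse bulk load. Neither comes from the presentation.

```python
from typing import Callable, Iterable, Iterator, List


def microbatches(events: Iterable, batch_size: int) -> Iterator[List]:
    """Group a stream of events into small fixed-size batches."""
    batch: List = []
    for event in events:
        batch.append(event)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # flush the final, partially filled batch
        yield batch


def run_loader(events: Iterable, batch_size: int,
               load_batch: Callable[[List], None]) -> int:
    """Hand each microbatch to the loader; return how many loads ran."""
    loads = 0
    for batch in microbatches(events, batch_size):
        load_batch(batch)
        loads += 1
    return loads


if __name__ == "__main__":
    warehouse = []  # stand-in for a target table
    events = [{"id": i} for i in range(7)]
    loads = run_loader(events, 3, warehouse.extend)
    print(loads, len(warehouse))
```

Production microbatching usually also flushes on a timer, so a slow stream does not leave rows waiting for a batch to fill. That is omitted here to keep the sketch short.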
12 MORE SOLUTIONS IN LESS TIME Productivity
- Agile and lean development methods
  - Early prototype, built out iteratively
  - Instead of older big bang deliverables
  - Biz folks review/guide each iteration
  - To assure IT-to-biz alignment
- Requirements gathering (RG) now done online
  - Data exploration, discovery, profiling replace RG
  - Requirements captured online, applied directly to solution
- Fast tools and platforms make analytics productive
  - Speed-of-thought iterative analysis
  - Fast queries & bulk loads build analytic datasets fast
- Less time per project means
  - More projects
  - Organization uses solution sooner
  - Greater agility for the business
13 DATA VARIES IN VALUE; MANAGE IT ACCORDINGLY Economics
- As you modernize a DW environment, rethink its economics
- Cost continuum of data platforms:
  - High $/TB: Traditional Platforms
  - New Affordable Platforms, built for DW/Analytics
  - Cheap Open Source: Hadoop, NoSQL
- Choose a platform that fits a given data workload but also fits the value of data
  - High-value data on the core EDW: Modeling, cleansing, aggregating, and documenting data (which is required for reports and OLAP) increases its value
  - Analytic datasets in the mid tier: This data is lightly prepared or prepped on the fly; temp sandboxes
  - Source & archival data on the back tier: This is more of a data lake that preserves data in its original form, so it can be repurposed repeatedly, as analytic projects arise
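The three-tier placement on slide 13 can be read as a simple decision rule. The Python sketch below encodes one possible reading. The `Dataset` fields and the order of the checks are assumptions made for illustration, not a policy stated in the presentation.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    name: str
    feeds_reports_or_olap: bool  # modeled, cleansed, documented data
    used_for_analytics: bool     # lightly prepared analytic datasets


def choose_tier(ds: Dataset) -> str:
    """Pick a storage tier from the value of the data (assumed rule)."""
    if ds.feeds_reports_or_olap:
        return "core EDW"   # high-value data
    if ds.used_for_analytics:
        return "mid tier"   # analytic datasets and temp sandboxes
    return "back tier"      # source and archival data kept in original form


if __name__ == "__main__":
    # Hypothetical datasets, one per tier.
    for ds in (
        Dataset("sales_summary", True, False),
        Dataset("churn_sandbox", False, True),
        Dataset("raw_clickstream", False, False),
    ):
        print(ds.name, "->", choose_tier(ds))
```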
14 ONE WAY TO MODERNIZE A DW Multi-Platform Data Warehouse Environments
Many enterprise data warehouses (EDWs) are evolving into multi-platform data warehouse environments (DWEs). Users continue to add standalone data platforms to their warehouse tool and platform portfolio. The new platforms don't replace the core warehouse, because it is still the best platform for the data that goes into standard reports, dashboards, performance management, and OLAP. Instead, the new platforms complement the warehouse, because they are optimized for workloads that manage, process, and analyze new forms of big data, non-structured data, and real-time data.
15 Modern DW System Architectures can be Complex
The technology stack for DW, BI, analytics, and data integration has always been a multi-platform environment. What's new? The trend toward a portfolio of many data platforms has accelerated. Why? More platform types to serve more data and workload types.
[Figure: platforms added to the DW environment "Over The Passage of Time": Federated Data Marts; Customer Mart or ODS; Real Time ODS; DW from a Merger; Columnar DBMS; Map Reduce; Complex Event Processing; Data Warehouse; Star or Snowflake Scheme; Multidimensional Data Models; Data Staging Areas; Metrics for Performance Mgt; OLAP Cubes; OLAP DBMSs; Detailed Source Data; Analytic Sand Box; Data Federation & Virtualization; Hadoop Distributed File Sys; DW Appliances; No-SQL Database; Streaming Data Tools]
16 Good Reasons for Integrating Hadoop with Relational EDW
- A Relational DBMS is good at:
  - Metadata management
  - Complex query optimization
  - Query federation
  - Table joins, views, keys, etc.
  - Security, including roles, directories
  - Much more mature development tools
- HDFS & other Hadoop tools are good at:
  - Massive scalability
  - Lower cost than most DW platforms & analytic DBMSs
  - Multi-structured data & no-schema data
  - Some ETL functions; late binding; custom code for analytics
- Use HDFS like a very scalable operational data store or data staging area, to modernize your existing DW environment
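"Late binding" and "no-schema data" on slide 16 refer to storing raw records as they arrive and applying a schema only when the data is read. The Python sketch below illustrates the idea with plain JSON lines held in memory. The sample records, field names, and the `read_with_schema` helper are hypothetical, and no Hadoop API is involved.

```python
import json

# Raw records kept in their original form, as they might sit in a staging area.
RAW_LINES = [
    '{"user": "a", "amount": "12.50", "ts": "2014-05-28"}',
    '{"user": "b", "amount": "3", "extra": {"device": "phone"}}',
    'not json at all',
]


def read_with_schema(lines, schema):
    """Apply a schema at read time: select fields, cast them, skip bad records."""
    for line in lines:
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # unreadable record: skipped here, still preserved in storage
        if not isinstance(record, dict):
            continue
        row = {}
        for field, cast in schema.items():
            value = record.get(field)
            row[field] = cast(value) if value is not None else None
        yield row


if __name__ == "__main__":
    # Two different projects bind two different schemas to the same raw data.
    print(list(read_with_schema(RAW_LINES, {"user": str, "amount": float})))
    print(list(read_with_schema(RAW_LINES, {"user": str, "ts": str})))
```

Because the stored records are never rewritten, a later project can read the same lines with a different schema. This matches the idea of repurposing source data as new analytic projects arise.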
17 Recommendations
- Reevaluate your data warehouse and related systems
  - There's always room for improvement
  - Change is afoot, in both biz & tech
- Prioritize modernization by putting biz goals first
  - Biz wants to manage big data and leverage it
  - Biz wants to compete on analytics
  - Biz needs real-time tech to operate faster
  - Biz needs BI/DW solutions sooner, more agile
- Technology goals are also important, though secondary
  - Greater productivity from tech personnel
  - Assuring capacity for growth
  - Diversifying data platform and tool portfolio to support more types of data, workloads, development methods, etc.
  - Migration to new platforms that are faster, more scalable, tuned for analytics, cost less, etc.
18 Cost Optimized Storage Steve Sarsfield, Product Marketing Manager, HP Vertica Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
19 Feeling the Pain
Recognizing that it's time to modernize
- TIME IS MONEY: What would be the business impact of reducing time from days to hours (hours to minutes)?
- READY FOR BIG DATA: What is your plan for managing the need for real-time data analysis as your data volumes continue to scale?
- ANALYTIC INNOVATION: Are you getting the business insights from your organization's data when you need them?
20 Big Data Warehouse Key Features
[Figure: key features of a big data warehouse]
- Advanced Analytics
- Manage Huge Data Volumes
- Support Data Scientists
- Work with Legacy Tools
- Deliver Fast Analytics
Supporting labels in the figure: Joins, Complex Data Types; SQL-based Predictive Analytics; Petabyte Scale; Python and R; What-if, A/B testing; SQL-based Visualization; ETL
21 Analytics Capabilities
[Figure: three architecture types compared on the capabilities Advanced Analytics, Manage Huge Data Volumes, Support Data Scientists, Work with Legacy Tools, and Deliver Fast Analytics]
- Reinforced Legacy Architectures
- Purpose-built Big Data Analytics Platform
- New NoSQL Architectures
22 Cost-Optimized Storage - ILM
Tier-off older data
[Figure: data tiers with Location and Format columns; the stage labels Serve, Value Discovery, Explore, and Store also appear]
- Interactive Data: Frequently queried; Vertica data cache; Hot
- Batch Data: Vertica data cache; Cool; Convert data to Vertica storage format
- Archive Data: Vertica data cache; Any format; Cold
- Dark Data: Any format
23 Core Capabilities Impact
How Do We Achieve Huge Performance Increases?
24 Secret Sauce of HP Vertica
- Columnar Storage: Speeds Query Time by Reading Only Necessary Data
- Compression: Lowers costly I/O to boost overall performance
- MPP Scale-Out: Provides high scalability on clusters with no name node or other single point of failure
- Distributed Query: Any node can initiate the queries and use other nodes for work. No single point of failure
- Projections: Combine high availability with special optimizations for query performance
[Figure: columns A to E stored across three nodes, each node with its own CPU, memory, and disk]
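The first two items on slide 24, reading only the necessary columns and compressing data to cut I/O, can be illustrated with a generic Python sketch. It pivots rows into columns and run-length encodes one of them. The sample rows and function names are made up for this example, and the sketch does not represent HP Vertica's actual storage format or encodings.

```python
from itertools import groupby

# Hypothetical order rows; a row store would keep each record together.
ROWS = [
    {"order_id": 1, "region": "EAST", "amount": 10.0},
    {"order_id": 2, "region": "EAST", "amount": 20.0},
    {"order_id": 3, "region": "WEST", "amount": 5.0},
    {"order_id": 4, "region": "WEST", "amount": 15.0},
]


def to_columns(rows):
    """Pivot row records into one list per column."""
    return {name: [row[name] for row in rows] for name in rows[0]}


def run_length_encode(values):
    """Compress runs of repeated values into (value, count) pairs."""
    return [(value, len(list(run))) for value, run in groupby(values)]


if __name__ == "__main__":
    columns = to_columns(ROWS)
    # An aggregate over "amount" touches that one column and nothing else.
    print(sum(columns["amount"]))
    # Sorted, low-cardinality columns collapse into a few runs.
    print(run_length_encode(columns["region"]))
```

Run-length encoding pays off most when a column is sorted and has few distinct values, which is why column stores typically pair compression with a chosen sort order.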
25 To find out more
- Purpose built for Big Data from the first line of code
- Download and Try: Community Edition supports up to 1 TB on 3 nodes
- Contact us for more information or 30 day trial
- Contact Steve.Sarsfield@HP.com
26 Questions?
27 Contact Information
If you have further questions or comments:
- Philip Russom, TDWI
- Steve Sarsfield, HP
Drawing the Big Picture
Drawing the Big Picture Multi-Platform Data Architectures, Queries, and Analytics Philip Russom TDWI Research Director for Data Management August 26, 2015 Sponsor 2 Speakers Philip Russom TDWI Research
More informationModernize Data Warehousing
Modernize Data Warehousing with Hadoop, Data Virtualization, and In-Memory Techniques Philip Russom TDWI Research Director for Data Management July 24, 2014 Sponsor Speakers Philip Russom TDWI Research
More informationMaking Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST
Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0 WEBINAR MAY 15 th, 2018 1PM EST 10AM PST Welcome and Logistics If you have problems with the sound on your computer, switch
More informationFast Innovation requires Fast IT
Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationPřehled novinek v SQL Server 2016
Přehled novinek v SQL Server 2016 Martin Rys, BI Competency Leader martin.rys@adastragrp.com https://www.linkedin.com/in/martinrys 20.4.2016 1 BI Competency development 2 Trends, modern data warehousing
More informationWHERE HADOOP FITS IN YOUR DATA WAREHOUSE ARCHITECTURE
TDWI RESEARCH TDWI CHECKLIST REPORT WHERE HADOOP FITS IN YOUR DATA WAREHOUSE ARCHITECTURE By Philip Russom Sponsored by tdwi.org JUNE 2013 TDWI CHECKLIST REPORT WHERE HADOOP FITS IN YOUR DATA WAREHOUSE
More informationEvolving To The Big Data Warehouse
Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from
More informationBig Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara
Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case
More informationNetezza The Analytics Appliance
Software 2011 Netezza The Analytics Appliance Michael Eden Information Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 Information Management 2011IBM Corporation Thought for
More informationMicrosoft Analytics Platform System (APS)
Microsoft Analytics Platform System (APS) The turnkey modern data warehouse appliance Matt Usher, Senior Program Manager @ Microsoft About.me @two_under Senior Program Manager 9 years at Microsoft Visual
More informationCombine Native SQL Flexibility with SAP HANA Platform Performance and Tools
SAP Technical Brief Data Warehousing SAP HANA Data Warehousing Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools A data warehouse for the modern age Data warehouses have been
More informationData-Intensive Distributed Computing
Data-Intensive Distributed Computing CS 451/651 431/631 (Winter 2018) Part 5: Analyzing Relational Data (1/3) February 8, 2018 Jimmy Lin David R. Cheriton School of Computer Science University of Waterloo
More information5 Fundamental Strategies for Building a Data-centered Data Center
5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse
More informationHybrid Data Platform
UniConnect-Powered Data Aggregation Across Enterprise Data Warehouses and Big Data Storage Platforms A Percipient Technology White Paper Author: Ai Meun Lim Chief Product Officer Updated Aug 2017 2017,
More informationACCELERATE YOUR ANALYTICS GAME WITH ORACLE SOLUTIONS ON PURE STORAGE
ACCELERATE YOUR ANALYTICS GAME WITH ORACLE SOLUTIONS ON PURE STORAGE An innovative storage solution from Pure Storage can help you get the most business value from all of your data THE SINGLE MOST IMPORTANT
More informationData Management Glossary
Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative
More informationComposite Software Data Virtualization The Five Most Popular Uses of Data Virtualization
Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization Composite Software, Inc. June 2011 TABLE OF CONTENTS INTRODUCTION... 3 DATA FEDERATION... 4 PROBLEM DATA CONSOLIDATION
More informationWhen, Where & Why to Use NoSQL?
When, Where & Why to Use NoSQL? 1 Big data is becoming a big challenge for enterprises. Many organizations have built environments for transactional data with Relational Database Management Systems (RDBMS),
More informationBI ENVIRONMENT PLANNING GUIDE
BI ENVIRONMENT PLANNING GUIDE Business Intelligence can involve a number of technologies and foster many opportunities for improving your business. This document serves as a guideline for planning strategies
More informationVOLTDB + HP VERTICA. page
VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics
More informationTop Trends in DBMS & DW
Oracle Top Trends in DBMS & DW Noel Yuhanna Principal Analyst Forrester Research Trend #1: Proliferation of data Data doubles every 18-24 months for critical Apps, for some its every 6 months Terabyte
More informationSAP IQ Software16, Edge Edition. The Affordable High Performance Analytical Database Engine
SAP IQ Software16, Edge Edition The Affordable High Performance Analytical Database Engine Agenda Agenda Introduction to Dobler Consulting Today s Data Challenges Overview of SAP IQ 16, Edge Edition SAP
More informationCHAPTER 3 Implementation of Data warehouse in Data Mining
CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected
More information@Pentaho #BigDataWebSeries
Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of
More informationModernizing Business Intelligence and Analytics
Modernizing Business Intelligence and Analytics Justin Erickson Senior Director, Product Management 1 Agenda What benefits can I achieve from modernizing my analytic DB? When and how do I migrate from
More informationPart 1: Indexes for Big Data
JethroData Making Interactive BI for Big Data a Reality Technical White Paper This white paper explains how JethroData can help you achieve a truly interactive interactive response time for BI on big data,
More informationApril Copyright 2013 Cloudera Inc. All rights reserved.
Hadoop Beyond Batch: Real-time Workloads, SQL-on- Hadoop, and the Virtual EDW Headline Goes Here Marcel Kornacker marcel@cloudera.com Speaker Name or Subhead Goes Here April 2014 Analytic Workloads on
More informationDATABASE SCALE WITHOUT LIMITS ON AWS
The move to cloud computing is changing the face of the computer industry, and at the heart of this change is elastic computing. Modern applications now have diverse and demanding requirements that leverage
More informationOracle 1Z0-515 Exam Questions & Answers
Oracle 1Z0-515 Exam Questions & Answers Number: 1Z0-515 Passing Score: 800 Time Limit: 120 min File Version: 38.7 http://www.gratisexam.com/ Oracle 1Z0-515 Exam Questions & Answers Exam Name: Data Warehousing
More informationShine a Light on Dark Data with Vertica Flex Tables
White Paper Analytics and Big Data Shine a Light on Dark Data with Vertica Flex Tables Hidden within the dark recesses of your enterprise lurks dark data, information that exists but is forgotten, unused,
More informationAutomating Information Lifecycle Management with
Automating Information Lifecycle Management with Oracle Database 2c The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
More informationTaming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems
1 Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems The Defacto Choice For Convergence 2 ABSTRACT & SPEAKER BIO Dealing with enormous data growth is a key challenge for
More informationAgenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache
Databases on AWS 2017 Amazon Web Services, Inc. and its affiliates. All rights served. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon Web Services,
More informationStages of Data Processing
Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,
More informationLeveraging Customer Behavioral Data to Drive Revenue the GPU S7456
Leveraging Customer Behavioral Data to Drive Revenue the GPU way 1 Hi! Arnon Shimoni Senior Solutions Architect I like hardware & parallel / concurrent stuff In my 4 th year at SQream Technologies Send
More informationTDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems
Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your
More informationCONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM
CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications
More informationCloud Computing & Visualization
Cloud Computing & Visualization Workflows Distributed Computation with Spark Data Warehousing with Redshift Visualization with Tableau #FIUSCIS School of Computing & Information Sciences, Florida International
More informationEvolution of Big Data Facebook. Architecture Summit, Shenzhen, August 2012 Ashish Thusoo
Evolution of Big Data Architectures@ Facebook Architecture Summit, Shenzhen, August 2012 Ashish Thusoo About Me Currently Co-founder/CEO of Qubole Ran the Data Infrastructure Team at Facebook till 2011
More informationFull file at
Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits
More informationData Warehousing 11g Essentials
Oracle 1z0-515 Data Warehousing 11g Essentials Version: 6.0 QUESTION NO: 1 Indentify the true statement about REF partitions. A. REF partitions have no impact on partition-wise joins. B. Changes to partitioning
More informationFrom Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019
From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways
More informationIBM dashdb Local. Using a software-defined environment in a private cloud to enable hybrid data warehousing. Evolving the data warehouse
IBM dashdb Local Using a software-defined environment in a private cloud to enable hybrid data warehousing Evolving the data warehouse Managing a large-scale, on-premises data warehouse environments to
More informationData 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.
Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020
More informationBig Data Facebook
Big Data Architectures@ Facebook QCon London 2012 Ashish Thusoo Outline Big Data @ Facebook - Scope & Scale Evolution of Big Data Architectures @ FB Past, Present and Future Questions Big Data @ FB: Scale
More informationAcquiring Big Data to Realize Business Value
Acquiring Big Data to Realize Business Value Agenda What is Big Data? Common Big Data technologies Use Case Examples Oracle Products in the Big Data space In Summary: Big Data Takeaways
More informationWhy All Column Stores Are Not the Same Twelve Low-Level Features That Offer High Value to Analysts
White Paper Analytics & Big Data Why All Column Stores Are Not the Same Twelve Low-Level Features That Offer High Value to Analysts Table of Contents page Compression...1 Early and Late Materialization...1
More informationSyncsort DMX-h. Simplifying Big Data Integration. Goals of the Modern Data Architecture SOLUTION SHEET
SOLUTION SHEET Syncsort DMX-h Simplifying Big Data Integration Goals of the Modern Data Architecture Data warehouses and mainframes are mainstays of traditional data architectures and still play a vital
More informationHadoop Beyond Batch: Real-time Workloads, SQL-on- Hadoop, and thevirtual EDW Headline Goes Here
Hadoop Beyond Batch: Real-time Workloads, SQL-on- Hadoop, and thevirtual EDW Headline Goes Here Marcel Kornacker marcel@cloudera.com Speaker Name or Subhead Goes Here 2013-11-12 Copyright 2013 Cloudera
More informationNew Approach to Unstructured Data
Innovations in All-Flash Storage Deliver a New Approach to Unstructured Data Table of Contents Developing a new approach to unstructured data...2 Designing a new storage architecture...2 Understanding
More informationData Analytics at Logitech Snowflake + Tableau = #Winning
Welcome # T C 1 8 Data Analytics at Logitech Snowflake + Tableau = #Winning Avinash Deshpande I am a futurist, scientist, engineer, designer, data evangelist at heart Find me at Avinash Deshpande Chief
More informationMassive Scalability With InterSystems IRIS Data Platform
Massive Scalability With InterSystems IRIS Data Platform Introduction Faced with the enormous and ever-growing amounts of data being generated in the world today, software architects need to pay special
More informationAnAlytic DAtAbAses for big DAtA
TDWI research TDWI CheCklIsT RepoRT AnAlytic DAtAbAses for big DAtA By Philip Russom Sponsored by tdwi.org October 2012 TDWI Checklist Report Analytic Databases for Big Data By Philip Russom TABLE OF CONTENTS
More informationIBM DB2 BLU Acceleration vs. SAP HANA vs. Oracle Exadata
Research Report IBM DB2 BLU Acceleration vs. SAP HANA vs. Oracle Exadata Executive Summary The problem: how to analyze vast amounts of data (Big Data) most efficiently. The solution: the solution is threefold:
More informationIntroduction to Data Science
UNIT I INTRODUCTION TO DATA SCIENCE Syllabus Introduction of Data Science Basic Data Analytics using R R Graphical User Interfaces Data Import and Export Attribute and Data Types Descriptive Statistics
More informationIncrease Value from Big Data with Real-Time Data Integration and Streaming Analytics
Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Cy Erbay Senior Director Striim Executive Summary Striim is Uniquely Qualified to Solve the Challenges of Real-Time
More informationGabriel Villa. Architecting an Analytics Solution on AWS
Gabriel Villa Architecting an Analytics Solution on AWS Cloud and Data Architect Skilled leader, solution architect, and technical expert focusing primarily on Microsoft technologies and AWS. Passionate
More informationAnalytics in Action with Teradata In-Memory Optimizations
Analytics in Action with Teradata In-Memory Optimizations Performance Study by Large Manufacturer Richard Hackathorn, Bolder Technology 03.16 EB9292 Table of Contents 2 Context 4 Customer Experience 6
More informationOracle #1 RDBMS Vendor
Oracle #1 RDBMS Vendor IBM 20.7% Microsoft 18.1% Other 12.6% Oracle 48.6% Source: Gartner DataQuest July 2008, based on Total Software Revenue Oracle 2 Continuous Innovation Oracle 11g Exadata Storage
More informationMigrate from Netezza Workload Migration
Migrate from Netezza Automated Big Data Open Netezza Source Workload Migration CASE SOLUTION STUDY BRIEF Automated Netezza Workload Migration To achieve greater scalability and tighter integration with
More information<Insert Picture Here> Introduction to Big Data Technology
Introduction to Big Data Technology The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into
More informationBig Data with Hadoop Ecosystem
Diógenes Pires Big Data with Hadoop Ecosystem Hands-on (HBase, MySql and Hive + Power BI) Internet Live http://www.internetlivestats.com/ Introduction Business Intelligence Business Intelligence Process
More informationCapture Business Opportunities from Systems of Record and Systems of Innovation
Capture Business Opportunities from Systems of Record and Systems of Innovation Amit Satoor, SAP March Hartz, SAP PUBLIC Big Data transformation powers digital innovation system Relevant nuggets of information
More informationMining for insight. Osma Ahvenlampi, CTO, Sulake Implementing business intelligence for Habbo
Mining for insight Osma Ahvenlampi, CTO, Sulake Implementing business intelligence for Habbo Virtual world 3 Social Play 4 Habbo Countries 5 Leading virtual world» 129 million registered Habbo-characters
More informationLambda Architecture for Batch and Stream Processing. October 2018
Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.
More informationBig Data The end of Data Warehousing?
Big Data The end of Data Warehousing? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Big data, data warehousing, advanced analytics, Hadoop, unstructured data Introduction If there was an Unwort
More informationMigrate from Netezza Workload Migration
Migrate from Netezza Automated Big Data Open Netezza Source Workload Migration CASE SOLUTION STUDY BRIEF Automated Netezza Workload Migration To achieve greater scalability and tighter integration with
More informationVirtuoso Infotech Pvt. Ltd.
Virtuoso Infotech Pvt. Ltd. About Virtuoso Infotech Fastest growing IT firm; Offers the flexibility of a small firm and robustness of over 30 years experience collectively within the leadership team Technology
More informationEmbedded Technosolutions
Hadoop Big Data An Important technology in IT Sector Hadoop - Big Data Oerie 90% of the worlds data was generated in the last few years. Due to the advent of new technologies, devices, and communication
Oracle Database 11g for Data Warehousing and Business Intelligence
An Oracle White Paper September, 2009 Oracle Database 11g for Data Warehousing and Business Intelligence Introduction Oracle Database 11g is a comprehensive database platform for data warehousing and business
Simplifying your upgrade and consolidation to BW/4HANA. Pravin Gupta (Teklink International Inc.) Bhanu Gupta (Molex LLC)
Simplifying your upgrade and consolidation to BW/4HANA Pravin Gupta (Teklink International Inc.) Bhanu Gupta (Molex LLC) AGENDA What is BW/4HANA? Stepping stones to SAP BW/4HANA How to get your system
This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.
About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This
Approaching the Petabyte Analytic Database: What I learned
Disclaimer This document is for informational purposes only and is subject to change at any time without notice. The information in this document is proprietary to Actian and no part of this document may
#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru.
Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending Presented by: Trishla Maru Agenda Overview MultiSource Data Federation Use Cases Design Considerations Data
BEST PRACTICES IN SELECTING AND DEVELOPING AN ANALYTIC APPLIANCE
BEST PRACTICES IN SELECTING AND DEVELOPING AN ANALYTIC APPLIANCE Author: Dr. Robert McCord Dr. McCord boasts twenty
PERSPECTIVE. Data Virtualization A Potential Antidote for Big Data Growing Pains. Abstract
PERSPECTIVE Data Virtualization A Potential Antidote for Big Data Growing Pains Abstract Enterprises are already facing challenges around data consolidation, heterogeneity, quality, and value. Now they
Demystifying Cloud Data Warehousing
YOUR DATA, NO LIMITS Demystifying Cloud Data Warehousing Nicolas Baret Director of Pre-Sales EMEA @Snowflake TDWI Helsinki, October 2017 1 What is a Cloud Data Warehouse and what should we expect? 2 What
What is the maximum file size you have dealt so far? Movies/Files/Streaming video that you have used? What have you observed?
Simple to start What is the maximum file size you have dealt so far? Movies/Files/Streaming video that you have used? What have you observed? What is the maximum download speed you get? Simple computation
Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda
Agenda Oracle9i Warehouse Review Dulcian, Inc. Oracle9i Server OLAP Server Analytical SQL Mining ETL Infrastructure 9i Warehouse Builder Oracle 9i Server Overview E-Business Intelligence Platform 9i Server:
How Apache Hadoop Complements Existing BI Systems. Dr. Amr Awadallah Founder, CTO Cloudera,
How Apache Hadoop Complements Existing BI Systems Dr. Amr Awadallah Founder, CTO Cloudera, Inc. Twitter: @awadallah, @cloudera 2 The Problems with Current Data Systems BI Reports + Interactive Apps RDBMS
Management Information Systems Review Questions. Chapter 6 Foundations of Business Intelligence: Databases and Information Management
Management Information Systems Review Questions Chapter 6 Foundations of Business Intelligence: Databases and Information Management 1) The traditional file environment does not typically have a problem
Building an Integrated Big Data & Analytics Infrastructure September 25, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle
Building an Integrated Big Data & Analytics Infrastructure September 25, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to
BIG DATA ANALYTICS A PRACTICAL GUIDE
BIG DATA ANALYTICS A PRACTICAL GUIDE STEP 1: GETTING YOUR DATA PLATFORM IN ORDER Big Data Analytics A Practical Guide / Step 1: Getting your Data Platform in Order 1 INTRODUCTION Everybody keeps extolling
Accelerating BI on Hadoop: Full-Scan, Cubes or Indexes?
White Paper Accelerating BI on Hadoop: Full-Scan, Cubes or Indexes? How to Accelerate BI on Hadoop: Cubes or Indexes? Why not both? 1 +1(844)384-3844 INFO@JETHRO.IO Overview Organizations are storing more
Enterprise Data Warehousing
Enterprise Data Warehousing SQL Server 2005 Ron Dunn Data Platform Technology Specialist Integrated BI Platform Integrated BI Platform Agenda Can SQL Server cope? Do I need Enterprise Edition? Will I avoid
Answer: A Reference: http://www.vertica.com/wpcontent/uploads/2012/05/MicroStrategy_Vertica_12.pdf (page 1, first para)
HP - HP2-N44 Selling HP Vertica Big Data Solutions QUESTION: 1 When is Vertica a better choice than SAP HANA? A. The customer wants a closed ecosystem for BI and analytics, and is unconcerned with support
Tutorial Outline. Map/Reduce vs. DBMS. MR vs. DBMS [DeWitt and Stonebraker 2008] Acknowledgements. MR is a step backwards in database access
Map/Reduce vs. DBMS Sharma Chakravarthy Information Technology Laboratory Computer Science and Engineering Department The University of Texas at Arlington, Arlington, TX 76009 Email: sharma@cse.uta.edu
MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure
MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for
How to integrate data into Tableau
WHITE PAPER How to integrate data into Tableau: a comparison of 3 approaches: ETL, Tableau self-service and
Appliances and DW Architecture. John O'Brien President and Executive Architect Zukeran Technologies
Appliances and DW Architecture John O'Brien President and Executive Architect Zukeran Technologies OBJECTIVES To define an appliance Understand critical components of a DW appliance Learn how DW appliances
HOW TO ACHIEVE REAL-TIME ANALYTICS ON A DATA LAKE USING GPUS. Mark Brooks - Principal System Engineer @ Kinetica May 09, 2017
HOW TO ACHIEVE REAL-TIME ANALYTICS ON A DATA LAKE USING GPUS Mark Brooks - Principal System Engineer @ Kinetica May 09, 2017 The Challenge: How to maintain analytic performance while dealing with: Larger
Service-Level Agreement (SLA) based Reliability, Availability, and Scalability (RAS) for analytics The solution has no single point of failure. The Ve
Solution Overview Cisco Integrated Infrastructure for Big Data and Analytics with Vertica Advanced Analytics Platform Highlights Proven enterprise-ready converged data platform The solution uses a fabric-centric
Thursday, March 13, 2014. From Big Data to Big Value Infrastructure Needs and Huawei Best Practice
Thursday, March 13, 2014 From Big Data to Big Value Infrastructure Needs and Huawei Best Practice Data-driven insight Making better, more informed decisions, faster Raw Data Capture Store Process Insight 1 Data
September 2013 Alberto Abelló & Oscar Romero 1
MapReduce-I September 2013 Alberto Abelló & Oscar Romero 1 Knowledge objectives 1. Enumerate several use cases of MapReduce 2. Describe what the MapReduce environment is 3. Explain 6 benefits of using MapReduce 4.
Strategic Briefing Paper Big Data
Strategic Briefing Paper Big Data The promise of Big Data is improved competitiveness, reduced cost and minimized risk by taking better decisions. This requires affordable solution architectures which
Was ist dran an einer spezialisierten Data Warehousing platform?
Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Keywords: Data warehousing, Exadata, specialized hardware proprietary hardware Introduction
SQL Server 2014 Column Store Indexes. Vivek Sanil Microsoft Sr. Premier Field Engineer
SQL Server 2014 Column Store Indexes Vivek Sanil Microsoft Vivek.sanil@microsoft.com Sr. Premier Field Engineer Trends in the Data Warehousing Space Approximate data volume managed by DW Less than 1TB
Data Warehousing in the Age of In-Memory Computing and Real-Time Analytics. Erich Schneider, Daniel Rutschmann June 2014
Data Warehousing in the Age of In-Memory Computing and Real-Time Analytics Erich Schneider, Daniel Rutschmann June 2014 Disclaimer This presentation outlines our general product direction and should not
Introduction to Big-Data
Introduction to Big-Data Ms.N.D.Sonwane 1, Mr.S.P.Taley 2 1 Assistant Professor, Computer Science & Engineering, DBACER, Maharashtra, India 2 Assistant Professor, Information Technology, DBACER, Maharashtra,