基于 Hadoop 和 RDBMS 的 Oracle 大数据分析 Corey Wei 技术顾问甲骨文公司. 版权所有 2014,Oracle 和 / 或其关联公司 保留所有权利
|
|
- Eleanore Bates
- 5 years ago
- Views:
Transcription
1 基于 Hadoop 和 RDBMS 的 Oracle 大数据分析 Corey Wei 技术顾问甲骨文公司
2 Agenda Big Data Solution Overview Big Data Appliance Oracle NoSQL Database Big Data SQL Big Data Connectors Oracle Advanced Analytics Case Study 2
3 Big Data Solution Overview 3
4 Big Data Definition Big Data: Techniques and Technologies that Make Handling Data at Extreme Scale Economical. Brian Hopkins and Boris Evelson, Forrester Research, Expand Your Digital Horizons with Big Data (September 2011) 4
5 Applications In-Database Analytics Oracle Big Data Approach - Functional View Decide Real-Time Dashboards, Reporting & Query Information Discovery Event / Stream Data Capture Hadoop NoSQL Bridge Unstructured/ Structured Predictive Analytics Data Warehouse Log / File Data Capture Predictive Analytics ETL Data Marts / ODS Stream Acquire Organize Analyze 5
6 Applications In-Database Analytics Oracle Big Data Approach - Product View Decide Oracle Real-Time Decisions Oracle BI Enterprise Edition Endeca Information Discovery Oracle Event Processing Cloudera Hadoop Oracle NoSQL Database Oracle Big Data Connectors Oracle Advanced Analytics Oracle Industry Data Model(s) Apache Flume Oracle R Distribution Oracle Data Integrator Data Warehouse Oracle Database Stream Acquire Organize Analyze 6
7 Applications In-Database Analytics Oracle Big Data Approach Engineered Systems Decide Oracle Real-Time Decisions Oracle BI Enterprise Edition Endeca Information Discovery Oracle Event Processing Cloudera Hadoop Complete Oracle Big Data Connectors Oracle Advanced Analytics Apache Flume Oracle NoSQL Database Oracle R Distribution Integrated Oracle Data Integrator Scalable Data Warehouse Oracle Database Stream Acquire Organize Analyze 7
8 Big Data Appliance 8
9 Big Data Appliance X4-2 Sun Oracle X4-2L Servers with per server: 2 * 8 Core Intel Xeon E5 Processors 64 GB Memory 48TB Disk space Integrated Software: Oracle Linux Oracle Java VM Cloudera Distribution of Apache Hadoop (CDH) Cloudera Manager All Cloudera Options Oracle R Distribution Oracle NoSQL Database All integrated software (except NoSQL DB CE) is supported as part of Premier Support for Systems and Premier Support for Operating Systems 9
10 Big Data Appliance Product Family Starter Rack is a fully cabled and configured for growth with 6 servers In-Rack Expansion delivers 6 server modular expansion block Full Rack delivers optimal blend of capacity and expansion options Grow by adding rack up to 18 racks without additional switches 10
11 Big Data Appliance Engineered Systems Benefits Lower TCO than DIY Hadoop Clusters Faster Time to Value Higher Performance out-of-box Lower Management Overhead Integrated and Comprehensive Security Tight Integration with your Infrastructure 11
12 Cumulative Cost and Savings Engineered Systems Benefits TCO Data Points: 18 servers (DL380 vs. X4-2L) $1,400,000 $1,200,000 List Price Comparisons 864TB Raw Storage 288 Cores 1152GB Total Memory Cloudera Enterprise Subscription with all options Subscription vs. Perpetual Equivalent Installation Cost Not calculated: $1,000,000 $800,000 $600,000 $400,000 $200,000 Oracle BDA HP + Cloudera Savings Soft Cost (people and time to value) Data integration licenses $0 Year 1 Year 2 Year 3 Year 4 Year 5 12
13 Engineered Systems Benefits Management Console Single Command Patching and Upgrade Full Stack Patching and Upgrading Automatic Cluster Re- Configuration Security (AAA) out-of-box Encryption out-of-box (network and at-rest) InfiniBand + Optimizations Stack Tuning (OS, Java, Hadoop) BDA 3.0 DIY CDH
14 BDA Security Overview Authentication through Kerberos Authorization through Apache Sentry Auditing through Oracle Audit Vault Encryption for Data-at-Rest Network Encryption 14
15 Integrated Management Framework Management Infrastructure combines EM and CM Quick view of Hardware and Software status in Oracle Enterprise Manager 15
16 Oracle NoSQL Database 16
17 Oracle NoSQL Database Scalable, Highly Available, Key-Value Database Features Key-value, JSON & RDF data Large Object API BASE & ACID Transactions Data Center Support Online Rolling Upgrade Online Cluster Management Table data model Secondary Indices Secondary Zones (Data Centers) Security Application Application NoSQL DB Driver Storage Nodes Datacenter A Application Application NoSQL DB Driver Storage Nodes Datacenter B 17
18 Features - Failover Automatic Failover Automatic election of new Master Replication factor = 5 Rejoining nodes automatically synchronize with the Master Isolated nodes can still service reads All nodes are symmetric Rep Node Master Rep Node Replica Rep Node Replica Rep Node Replica Rep Node Replica New Master 18
19 Features Flexible Data Model Key-Value pairs Simple data model key-value pair (major+minor-key paradigm) Simple operations read/insert/update/delete, RMW support Major key: hashed to a Shard (partition), Minor key Btree within a Shard Raw Key/Value and JSON schema APIs supported Strings Major key: Minor key: subscriptions userid address picture Byte Array Value: expiration date phone # id.jpg Value Options: Key-Value JSON RDF Triples Tables/Rows 19
20 Features Flexible Data Model NoSQL DB Table Model Benefits Lower barrier to adoption, shorter time to market Simplified application modeling Uses familiar table concepts Features Layered on top of distributed key-value model Compatible with Release 2.0 JSON schemas Supports table evolution, retains flexible client access Sets foundation for future capabilities 20
21 Features Configurable Transactions Greater Flexibility Configurable Durability per operation Configurable Consistency per operation ACID by default Transaction scope is single API call Records share same major key Multiple operations supported 21
22 Features Elasticity On-Demand Cluster Expansion On Demand Application NoSQL DB Driver Increase Data Capacity Add more storage nodes New shards automatically created Increase Data Throughput More shards = better write throughput More replicas/shard = better read throughput Master Replica Replica Shard-1 Master Replica Replica Shard-2 StorageNode StorageNode StorageNode 22
23 Features Automatic Rebalancing Improve Performance Storage Node 1 Storage Node 2 Storage Node 3 Supports heterogeneous storage topology Replicas move from over-utilized to under-utilized storage nodes Number of shards and replication factor remain unchanged Represents a partition 23
24 Time to Upgrade (min) Features Online Rolling Upgrades Ever tried to upgrade a 200 node system while it s active? What s the Big Deal 17.5 Online Rolling Upgrade We did do it! Admin commands available to describe safe upgrade order Scripted available hands-free upgrade experience Read/Write availability throughout the upgrade process (24x3) 144 (48x3) 216 (72x3) Total Nodes (Shards x Rep. Factor) 24
25 Features Integration Oracle NoSQL Database: Integrated out of the box Query NoSQL data from Oracle Database Access NoSQL data from Hadoop for DW and analytics Share data with Coherence for extensible in-memory cache grid Persist history & event streams for processing with OEP Store & query RDF data using Oracle RDF for NoSQL 25
26 Throughput (ops/sec) Average Latency (ms) Benchmark Results - YCSB (Yahoo Cloud Scalability Benchmark) Mixed Throughput 1.25M ops/sec 2 billion records 2 TB of data 95% read, 5% update Low latency High Scalability 1,400,000 1,200,000 1,000, , , , , (2x3) 12 (4x3) 24 (8x3) 30 (10x3) Cluster Size Throughput (ops/sec) Read Latency (ms) Write Latency (ms) 26
27 Big Data SQL 27
28 Strengths of Both Systems Hadoop is good at some things Databases are good at others SQL is very important 28
29 Embrace Innovation and Integrate Big Data Appliance + Hadoop & NoSQL Unify Development languages Security Administration Support Workload management Lifecycle management Availability Exadata + Oracle Database 29
30 Publish Hadoop Metadata to Oracle Catalog Big Data Appliance + Hadoop Hive metadata HDFS Name Node HDFS Data Node HDFS Data Node create table customer_address ( ca_customer_id number(10,0), ca_street_number char(10), ca_state char(2), ca_zip char(10) ) organization external ( TYPE ORACLE_HIVE DEFAULT DIRECTORY DEFAULT_DIR ACCESS PARAMETERS (com.oracle.bigdata.cluster hadoop_cl_1) LOCATION ('hive://customer_address') ) Oracle Catalog Hive metadata External Table External Table Exadata + Oracle Database 30
31 Executing Queries on Hadoop Hive metadata HDFS Name Node HDFS Data Node HDFS Data Node HDFS Data Node HDFS Data Node Determine: Data locations Data structure Parallelism Send to specific data nodes: Data request Context Select c_customer_id, c_customer_last_name, ca_county From customers, customer_address where c_customer_id = ca_customer_id and ca_state = CA Oracle Catalog Hive metadata External Table External Table 31
32 Executing Queries on Hadoop Hive metadata Do I/O and Smart Scan: Filter rows Project columns Select c_customer_id, c_customer_last_name, ca_county From customers, customer_address where c_customer_id = ca_customer_id and ca_state = CA Oracle Catalog HDFS Name Node HDFS Data Node HDFS Data Node HDFS Data Node HDFS Data Node Move only relevant data Relevant rows Relevant columns Hive metadata External Table External Table Apply join with database data Tables 32
33 Optimizing Scans on Hadoop Hive metadata HDFS Name Node HDFS Data Node HDFS Data Node HDFS Data Node HDFS Data Node Storage Indexes Min Max Min Max Min Max Automatically collect and store the minimum and maximum value within a storage unit Before scanning a storage unit, verify whether the data requires falls within the Min- Max If not, skip scanning the block and reduce scan time Blocks 33
34 Oracle Big Data SQL One Query Spanning Oracle Database, Hadoop & NoSQL Query Data in RDBMS, Hadoop & NoSQL Fast Oracle SQL Massive Parallelism Storage Indexes Filtered Locally Oracle NoSQL DB HDFS Data Node Oracle Database Storage Server Minimized Data Movement Oracle NoSQL DB HDFS Data Node Oracle Database Storage Server 34
35 Big Data Connectors 35
36 Oracle Loader for Hadoop Partition, sort, and convert into Oracle data types on Hadoop ORACLE LOADER FOR HADOOP MAP MAP MAP SHUFFLE /SORT Connect to the database from reducer nodes, load into database partitions in parallel (JDBC or direct path) REDUCE REDUCE Offloads data preprocessing from the database server to Hadoop Works with a range of input data formats Automatic balancing in case of skew in input data MAP REDUCE Online and offline modes MAP REDUCE MAP SHUFFLE /SORT REDUCE Kerberos authentication 36
37 Oracle SQL Connector for HDFS Use Oracle SQL to Access Data on HDFS Generate external table in database pointing to HDFS data Load into database or query data in place on HDFS Fine-grained control over data type mapping Parallel load with automatic load balancing Kerberos authentication Hadoop Access or load into the database in parallel using external table mechanism Oracle Database OSCH OSCH OSCH OSCH HDFS Client External Table SQL Query 37
38 Oracle R Advanced Analytics for Hadoop R Analytics leveraging Hadoop and HDFS Oracle R Client Linearly Scale a Robust Set of R Algorithms MAP MAP REDUCE MAP REDUCE MAP HDFS Hadoop Leverage MapReduce for R Calculations Compute Intensive Parallelism for Simulations 38
39 Oracle Data Integrator Application Adapters for Hadoop Transforms Via MapReduce(HIVE) Oracle Data Integrator Oracle Loader for Hadoop Activates Loads Benefits Consistent tooling across BI/DW, SOA, Integration and Big Data Reduce complexities of processing Hadoop through graphical tooling Improves productivity when processing Big Data (Structured + Unstructured) Oracle Database Improving Productivity and Efficiency for Big Data 39
40 Oracle XQuery for Hadoop OXH is a transformation engine for Big Data XQuery language executed on the Hadoop XQuery for $ln in t ext:collect ion() let $f := t okenize($ln) where $f[1] = 'x' return text:put($f[2]) Map/Reduce Execut ion Plan M/R M/R M/R Oracle Big Data Connectors Oracle Data Integrator M/R Map/Reduce Worker Nodes OXH Engine HDFS Oracle Loader for Hadoop Acquire Organize Analyze 40
41 Oracle Data Integrator Simplify Map Reduce Automatically generates MapReduce code High performance loads into Data Warehouse leveraging both OLH and OSCH Manages the process across platforms Oracle Data Integrator OLH & OSCH 41
42 Oracle Advanced Analytics 42
43 Oracle Advanced Analytics Classification Function Algorithms Applicability Logistic Regression (GLM) Decision Trees Naïve Bayes Support Vector Machines (SVM) In-Database Data Mining Algorithms Classical statistical technique Popular / Rules / transparency Embedded app Wide / narrow data / text Regression Linear Regression (GLM) Support Vector Machine (SVM) Classical statistical technique Wide / narrow data / text Anomaly Detection One Class SVM Unknown fraud cases or anomalies Attribute Importance Association Rules A1 A2 A3 A4 A5 A6 A7 Minimum Description Length (MDL) Principal Components Analysis (PCA) Apriori Attribute reduction, Reduce data noise Market basket analysis / Next Best Offer Clustering Hierarchical k-means Hierarchical O-Cluster Expectation-Maximization Clustering (EM) Product grouping / Text mining Gene and protein analysis Feature Extraction F1 F2 F3 F4 Nonnegative Matrix Factorization (NMF) Singular Value Decomposition (SVD) Text analysis / Feature reduction 43
44 Oracle Advanced Analytics Oracle R Enterprise Compute Engines R Engine Other R packages Oracle R Enterprise packages User R Engine on desktop SQL Results Oracle Database R Open Source User tables?x Database Compute Engine R Results R Engine Other R packages Oracle R Enterprise packages R Engine(s) spawned by Oracle DB R-SQL Transparency Framework intercepts R functions for scalable in-database execution Function intercept for data transforms, statistical functions and advanced analytics Interactive display of graphical results and flow control as in standard R Submit entire R scripts for execution by database Scale to large datasets Access tables, views, and external tables, as well as data through DB LINKS Leverage database SQL parallelism Leverage new and existing in-database statistical and data mining capabilities Database can spawn multiple R engines for database-managed parallelism Efficient data transfer to spawned R engines Emulate map-reduce style algorithms and applications Enables lights-out execution of R scripts 44
45 Oracle Enabling Technologies Unified access model supporting all analysys capabilities: SQL, R & MR Unified Analytics API SQL R MR Hadoop RDBMS IB Management Framework and Tools Unified Analytics Processing Platform 45
46 Case Study 46
47 Current ( as-is ) Architecture Siloed Operational Systems with complex, heavy and slow data transformation and flows to Data Marts Current ( as-is ) architecture is based on a years 90s design: siloed datamarts with complex and expensive provision systems, unable to respond to the new business requirements with agility 1. Provision systems are complex, expensive and inefficient 2. Lack of business agility and very long time-to-market and time-to-value (6-12 months) 3. Business users are by-passing IT corporate systems 4. Datamarts are strongly siloed with no interoperability 5. Complex Operations with very limited backup/recovery and no HA capabilities 6. Unstructured information not managed 7. Lack of Advanced Analytic capabilities 47
48 MQ Data Factory Engine ODI MQFTE Data Factory Engine Data Factory Engine ODI BI Abstraction & Query Federation BI Server ODI MQFTE Data Pool Logical Architecture Source Data Layer Transformed data Data Marts SAP RRHH Information Management Staging & Raw Data Layer Level Performance Management Mainframe Interfases Otros Social/Text GG High Density Data Quality Low Density Foundation Layer Level High Density Low Density Access & Level 4 Performance Layer Embedded Data Marts Alerts, Dashboards, Reporting Services Sensors Diario Electronico Streaming Knowledge Discovery Area Analytical Discovery Sandbox Rapid Development Sandbox Advanced Analysis & Data Science (Discovery) Security and Metadata Data Integration : Data Factory Engine & ODI + Metadata Information Access 48
49 Data Pool Hardware Architecture UAT PRD DC1 BDR DC2 PRD UAT Oracle DataGuard IB IB IB IB SAN Replication SAN IB IB Data Pool Data Pool Backup ZS-3 Backup Oracle RMAN 10GbE TSM FC VTL VTL FC TSM 10GbE ZS-3 Backup Oracle RMAN Backup Snapshot ZFS Replication Snapshot 49
50
<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store
Oracle NoSQL Database A Distributed Key-Value Store Charles Lamb The following is intended to outline our general product direction. It is intended for information purposes only,
More informationOracle Big Data Connectors
Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process
More informationOracle NoSQL Database Enterprise Edition, Version 18.1
Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across
More informationCopyright 2013, Oracle and/or its affiliates. All rights reserved.
1 Oracle NoSQL Database: Release 3.0 What s new and why you care Dave Segleau NoSQL Product Manager The following is intended to outline our general product direction. It is intended for information purposes
More informationEvolving To The Big Data Warehouse
Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from
More informationOracle NoSQL Database Overview Marie-Anne Neimat, VP Development
Oracle NoSQL Database Overview Marie-Anne Neimat, VP Development June14, 2012 1 Copyright 2012, Oracle and/or its affiliates. All rights Agenda Big Data Overview Oracle NoSQL Database Architecture Technical
More informationOracle Big Data Fundamentals Ed 1
Oracle University Contact Us: +0097143909050 Oracle Big Data Fundamentals Ed 1 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, learn to use Oracle's Integrated Big Data
More informationOracle NoSQL Database Enterprise Edition, Version 18.1
Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across
More informationIntroduction to Oracle NoSQL Database
Introduction to Oracle NoSQL Database Anand Chandak Ashutosh Naik Agenda NoSQL Background Oracle NoSQL Database Overview Technical Features & Performance Use Cases 2 Why NoSQL? 1. The four V s of Big Data
More informationOracle Big Data. A NA LYT ICS A ND MA NAG E MENT.
Oracle Big Data. A NALYTICS A ND MANAG E MENT. Oracle Big Data: Redundância. Compatível com ecossistema Hadoop, HIVE, HBASE, SPARK. Integração com Cloudera Manager. Possibilidade de Utilização da Linguagem
More informationCopyright 2012, Oracle and/or its affiliates. All rights reserved.
1 Big Data Connectors: High Performance Integration for Hadoop and Oracle Database Melli Annamalai Sue Mavris Rob Abbott 2 Program Agenda Big Data Connectors: Brief Overview Connecting Hadoop with Oracle
More informationOracle Big Data Fundamentals Ed 2
Oracle University Contact Us: 1.800.529.0165 Oracle Big Data Fundamentals Ed 2 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, you learn about big data, the technologies
More informationBuilding an Integrated Big Data & Analytics Infrastructure September 25, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle
Building an Integrated Big Data & Analytics Infrastructure September 25, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to
More informationOracle NoSQL Database and Cisco- Collaboration that produces results. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.
Oracle NoSQL Database and Cisco- Collaboration that produces results 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. What is Big Data? SOCIAL BLOG SMART METER VOLUME VELOCITY VARIETY
More informationAgenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache
Databases on AWS 2017 Amazon Web Services, Inc. and its affiliates. All rights served. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon Web Services,
More informationQuestion: 1 You need to place the results of a PigLatin script into an HDFS output directory. What is the correct syntax in Apache Pig?
Volume: 72 Questions Question: 1 You need to place the results of a PigLatin script into an HDFS output directory. What is the correct syntax in Apache Pig? A. update hdfs set D as./output ; B. store D
More information5 Fundamental Strategies for Building a Data-centered Data Center
5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse
More informationSQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024
Current support level End Mainstream End Extended SQL Server 2005 SQL Server 2008 and 2008 R2 SQL Server 2012 SQL Server 2005 SP4 is in extended support, which ends on April 12, 2016 SQL Server 2008 and
More informationPrivate Cloud Database Consolidation Name, Title
Private Cloud Database Consolidation Name, Title Agenda Cloud Introduction Business Drivers Cloud Architectures Enabling Technologies Service Level Expectations Customer Case Studies Conclusions
More informationOracle 1Z Oracle Big Data 2017 Implementation Essentials.
Oracle 1Z0-449 Oracle Big Data 2017 Implementation Essentials https://killexams.com/pass4sure/exam-detail/1z0-449 QUESTION: 63 Which three pieces of hardware are present on each node of the Big Data Appliance?
More information1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. reserved. Insert Information Protection Policy Classification from Slide 8
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
More informationMapR Enterprise Hadoop
2014 MapR Technologies 2014 MapR Technologies 1 MapR Enterprise Hadoop Top Ranked Cloud Leaders 500+ Customers 2014 MapR Technologies 2 Key MapR Advantage Partners Business Services APPLICATIONS & OS ANALYTICS
More informationNext-Generation Cloud Platform
Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology
More information<Insert Picture Here> Introduction to Big Data Technology
Introduction to Big Data Technology The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into
More informationOracle Machine Learning Notebook
Oracle Machine Learning Notebook Included in Autonomous Data Warehouse Cloud Charlie Berger, MS Engineering, MBA Sr. Director Product Management, Machine Learning, AI and Cognitive Analytics charlie.berger@oracle.com
More informationAchieving Horizontal Scalability. Alain Houf Sales Engineer
Achieving Horizontal Scalability Alain Houf Sales Engineer Scale Matters InterSystems IRIS Database Platform lets you: Scale up and scale out Scale users and scale data Mix and match a variety of approaches
More information1Z Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions
1Z0-449 Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions Table of Contents Introduction to 1Z0-449 Exam on Oracle Big Data 2017 Implementation Essentials... 2 Oracle 1Z0-449
More informationCopyright 2011, Oracle and/or its affiliates. All rights reserved.
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
More informationOracle #1 RDBMS Vendor
Oracle #1 RDBMS Vendor IBM 20.7% Microsoft 18.1% Other 12.6% Oracle 48.6% Source: Gartner DataQuest July 2008, based on Total Software Revenue Oracle 2 Continuous Innovation Oracle 11g Exadata Storage
More informationNew Oracle NoSQL Database APIs that Speed Insertion and Retrieval
New Oracle NoSQL Database APIs that Speed Insertion and Retrieval O R A C L E W H I T E P A P E R F E B R U A R Y 2 0 1 6 1 NEW ORACLE NoSQL DATABASE APIs that SPEED INSERTION AND RETRIEVAL Introduction
More informationIntroduction to Database Services
Introduction to Database Services Shaun Pearce AWS Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Today s agenda Why managed database services? A non-relational
More informationBig Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara
Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case
More informationCIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )
Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL
More information<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure
MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for
More informationHow Apache Hadoop Complements Existing BI Systems. Dr. Amr Awadallah Founder, CTO Cloudera,
How Apache Hadoop Complements Existing BI Systems Dr. Amr Awadallah Founder, CTO Cloudera, Inc. Twitter: @awadallah, @cloudera 2 The Problems with Current Data Systems BI Reports + Interactive Apps RDBMS
More informationOracle Big Data SQL High Performance Data Virtualization Explained
Keywords: Oracle Big Data SQL High Performance Data Virtualization Explained Jean-Pierre Dijcks Oracle Redwood City, CA, USA Big Data SQL, SQL, Big Data, Hadoop, NoSQL Databases, Relational Databases,
More informationSecurity and Performance advances with Oracle Big Data SQL
Security and Performance advances with Oracle Big Data SQL Jean-Pierre Dijcks Oracle Redwood Shores, CA, USA Key Words SQL, Oracle, Database, Analytics, Object Store, Files, Big Data, Big Data SQL, Hadoop,
More informationOracle Big Data SQL. Release 3.2. Rich SQL Processing on All Data
Oracle Big Data SQL Release 3.2 The unprecedented explosion in data that can be made useful to enterprises from the Internet of Things, to the social streams of global customer bases has created a tremendous
More informationCopyright 2012, Oracle and/or its affiliates. All rights reserved.
1 Oracle NoSQL Database and Oracle Relational Database - A Perfect Fit Dave Rubin Director NoSQL Database Development 2 The following is intended to outline our general product direction. It is intended
More informationNetezza The Analytics Appliance
Software 2011 Netezza The Analytics Appliance Michael Eden Information Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 Information Management 2011IBM Corporation Thought for
More informationSafe Harbor Statement
Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment
More informationTop Trends in DBMS & DW
Oracle Top Trends in DBMS & DW Noel Yuhanna Principal Analyst Forrester Research Trend #1: Proliferation of data Data doubles every 18-24 months for critical Apps, for some its every 6 months Terabyte
More informationBig Data Infrastructures & Technologies
Big Data Infrastructures & Technologies Spark and MLLIB OVERVIEW OF SPARK What is Spark? Fast and expressive cluster computing system interoperable with Apache Hadoop Improves efficiency through: In-memory
More informationSpatial Analytics Built for Big Data Platforms
Spatial Analytics Built for Big Platforms Roberto Infante Software Development Manager, Spatial and Graph 1 Copyright 2011, Oracle and/or its affiliates. All rights Global Digital Growth The Internet of
More informationFast Innovation requires Fast IT
Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:
More informationVOLTDB + HP VERTICA. page
VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics
More informationOracle Big Data Appliance X7-2
Oracle Big Data Appliance X7-2 Oracle Big Data Appliance is a flexible, high-performance, secure platform for running diverse workloads on Hadoop, Kafka and NoSQL. With Oracle Big Data SQL, Oracle Big
More informationJargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems
Jargons, Concepts, Scope and Systems Key Value Stores, Document Stores, Extensible Record Stores Overview of different scalable relational systems Examples of different Data stores Predictions, Comparisons
More informationArchitecture of a Real-Time Operational DBMS
Architecture of a Real-Time Operational DBMS Srini V. Srinivasan Founder, Chief Development Officer Aerospike CMG India Keynote Thane December 3, 2016 [ CMGI Keynote, Thane, India. 2016 Aerospike Inc.
More informationApache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context
1 Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes
More informationManaging IoT and Time Series Data with Amazon ElastiCache for Redis
Managing IoT and Time Series Data with ElastiCache for Redis Darin Briskman, ElastiCache Developer Outreach Michael Labib, Specialist Solutions Architect 2016, Web Services, Inc. or its Affiliates. All
More informationOracle Exadata: Strategy and Roadmap
Oracle Exadata: Strategy and Roadmap - New Technologies, Cloud, and On-Premises Juan Loaiza Senior Vice President, Database Systems Technologies, Oracle Safe Harbor Statement The following is intended
More informationFlash Storage Complementing a Data Lake for Real-Time Insight
Flash Storage Complementing a Data Lake for Real-Time Insight Dr. Sanhita Sarkar Global Director, Analytics Software Development August 7, 2018 Agenda 1 2 3 4 5 Delivering insight along the entire spectrum
More informationModernizing Business Intelligence and Analytics
Modernizing Business Intelligence and Analytics Justin Erickson Senior Director, Product Management 1 Agenda What benefits can I achieve from modernizing my analytic DB? When and how do I migrate from
More informationDatabase Level 100. Rohit Rahi November Copyright 2018, Oracle and/or its affiliates. All rights reserved.
Database Level 100 Rohit Rahi November 2018 1 Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
More information<Insert Picture Here> Enterprise Data Management using Grid Technology
Enterprise Data using Grid Technology Kriangsak Tiawsirisup Sales Consulting Manager Oracle Corporation (Thailand) 3 Related Data Centre Trends. Service Oriented Architecture Flexibility
More informationIan Choy. Technology Solutions Professional
Ian Choy Technology Solutions Professional XML KPIs SQL Server 2000 Management Studio Mirroring SQL Server 2005 Compression Policy-Based Mgmt Programmability SQL Server 2008 PowerPivot SharePoint Integration
More informationOracle Big Data Science
Oracle Big Data Science Tim Vlamis and Dan Vlamis Vlamis Software Solutions 816-781-2880 www.vlamis.com @VlamisSoftware Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri
More informationFLORIDA DEPARTMENT OF TRANSPORTATION PRODUCTION BIG DATA PLATFORM
FLORIDA DEPARTMENT OF TRANSPORTATION PRODUCTION BIG DATA PLATFORM RECOMMENDATION AND JUSTIFACTION Executive Summary: VHB has been tasked by the Florida Department of Transportation District Five to design
More informationOracle Database 18c and Autonomous Database
Oracle Database 18c and Autonomous Database Maria Colgan Oracle Database Product Management March 2018 @SQLMaria Safe Harbor Statement The following is intended to outline our general product direction.
More informationOracle Big Data Science IOUG Collaborate 16
Oracle Big Data Science IOUG Collaborate 16 Session 4762 Tim and Dan Vlamis Tuesday, April 12, 2016 Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed 200+ Oracle
More information2014 年 3 月 13 日星期四. From Big Data to Big Value Infrastructure Needs and Huawei Best Practice
2014 年 3 月 13 日星期四 From Big Data to Big Value Infrastructure Needs and Huawei Best Practice Data-driven insight Making better, more informed decisions, faster Raw Data Capture Store Process Insight 1 Data
More informationOracle Secure Backup 12.1 Technical Overview
Oracle Secure Backup 12.1 Technical Overview February 12, 2015 Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and
More informationAzure Webinar. Resilient Solutions March Sander van den Hoven Principal Technical Evangelist Microsoft
Azure Webinar Resilient Solutions March 2017 Sander van den Hoven Principal Technical Evangelist Microsoft DX @svandenhoven 1 What is resilience? Client Client API FrontEnd Client Client Client Loadbalancer
More informationEnergy Management with AWS
Energy Management with AWS Kyle Hart and Nandakumar Sreenivasan Amazon Web Services August [XX], 2017 Tampa Convention Center Tampa, Florida What is Cloud? The NIST Definition Broad Network Access On-Demand
More informationDo-It-Yourself 1. Oracle Big Data Appliance 2X Faster than
Oracle Big Data Appliance 2X Faster than Do-It-Yourself 1 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such
More informationAurora, RDS, or On-Prem, Which is right for you
Aurora, RDS, or On-Prem, Which is right for you Kathy Gibbs Database Specialist TAM Katgibbs@amazon.com Santa Clara, California April 23th 25th, 2018 Agenda RDS Aurora EC2 On-Premise Wrap-up/Recommendation
More informationIntroduction to the Oracle Big Data Appliance - 1
Hello and welcome to this online, self-paced course titled Administering and Managing the Oracle Big Data Appliance (BDA). This course contains several lessons. This lesson is titled Introduction to the
More informationdocs.hortonworks.com
docs.hortonworks.com : Getting Started Guide Copyright 2012, 2014 Hortonworks, Inc. Some rights reserved. The, powered by Apache Hadoop, is a massively scalable and 100% open source platform for storing,
More informationSolaris Engineered Systems
Solaris Engineered Systems SPARC SuperCluster Introduction Andy Harrison andy.harrison@oracle.com Engineered Systems, Revenue Product Engineering The following is intended to outline
More informationEsgynDB Enterprise 2.0 Platform Reference Architecture
EsgynDB Enterprise 2.0 Platform Reference Architecture This document outlines a Platform Reference Architecture for EsgynDB Enterprise, built on Apache Trafodion (Incubating) implementation with licensed
More informationMellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions
Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions Providing Superior Server and Storage Performance, Efficiency and Return on Investment As Announced and Demonstrated at
More information@Pentaho #BigDataWebSeries
Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of
More informationOracle BDA: Working With Mammoth - 1
Hello and welcome to this online, self-paced course titled Administering and Managing the Oracle Big Data Appliance (BDA). This course contains several lessons. This lesson is titled Working With Mammoth.
More informationNoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu
NoSQL Databases MongoDB vs Cassandra Kenny Huynh, Andre Chik, Kevin Vu Introduction - Relational database model - Concept developed in 1970 - Inefficient - NoSQL - Concept introduced in 1980 - Related
More informationWas ist dran an einer spezialisierten Data Warehousing platform?
Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction
More informationCapture Business Opportunities from Systems of Record and Systems of Innovation
Capture Business Opportunities from Systems of Record and Systems of Innovation Amit Satoor, SAP March Hartz, SAP PUBLIC Big Data transformation powers digital innovation system Relevant nuggets of information
More informationIntegrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers
Oracle zsig Conference IBM LinuxONE and z System Servers Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers Sam Amsavelu Oracle on z Architect IBM Washington
More informationTaming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems
1 Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems The Defacto Choice For Convergence 2 ABSTRACT & SPEAKER BIO Dealing with enormous data growth is a key challenge for
More informationWhat is Gluent? The Gluent Data Platform
What is Gluent? The Gluent Data Platform The Gluent Data Platform provides a transparent data virtualization layer between traditional databases and modern data storage platforms, such as Hadoop, in the
More informationActual4Test. Actual4test - actual test exam dumps-pass for IT exams
Actual4Test http://www.actual4test.com Actual4test - actual test exam dumps-pass for IT exams Exam : 1z1-449 Title : Oracle Big Data 2017 Implementation Essentials Vendor : Oracle Version : DEMO Get Latest
More informationHighly Scalable, Non-RDMA NVMe Fabric. Bob Hansen,, VP System Architecture
A Cost Effective,, High g Performance,, Highly Scalable, Non-RDMA NVMe Fabric Bob Hansen,, VP System Architecture bob@apeirondata.com Storage Developers Conference, September 2015 Agenda 3 rd Platform
More informationOracle NoSQL Database at OOW 2017
Oracle NoSQL Database at OOW 2017 CON6544 Oracle NoSQL Database Cloud Service Monday 3:15 PM, Moscone West 3008 CON6543 Oracle NoSQL Database Introduction Tuesday, 3:45 PM, Moscone West 3008 CON6545 Oracle
More informationCloud Computing & Visualization
Cloud Computing & Visualization Workflows Distributed Computation with Spark Data Warehousing with Redshift Visualization with Tableau #FIUSCIS School of Computing & Information Sciences, Florida International
More informationOracle GoldenGate for Big Data
Oracle GoldenGate for Big Data The Oracle GoldenGate for Big Data 12c product streams transactional data into big data systems in real time, without impacting the performance of source systems. It streamlines
More informationHadoop & Big Data Analytics Complete Practical & Real-time Training
An ISO Certified Training Institute A Unit of Sequelgate Innovative Technologies Pvt. Ltd. www.sqlschool.com Hadoop & Big Data Analytics Complete Practical & Real-time Training Mode : Instructor Led LIVE
More informationStages of Data Processing
Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,
More informationMySQL Cluster Web Scalability, % Availability. Andrew
MySQL Cluster Web Scalability, 99.999% Availability Andrew Morgan @andrewmorgan www.clusterdb.com Safe Harbour Statement The following is intended to outline our general product direction. It is intended
More informationA BigData Tour HDFS, Ceph and MapReduce
A BigData Tour HDFS, Ceph and MapReduce These slides are possible thanks to these sources Jonathan Drusi - SCInet Toronto Hadoop Tutorial, Amir Payberah - Course in Data Intensive Computing SICS; Yahoo!
More informationVMware Virtual SAN Technology
VMware Virtual SAN Technology Today s Agenda 1 Hyper-Converged Infrastructure Architecture & Vmware Virtual SAN Overview 2 Why VMware Hyper-Converged Software? 3 VMware Virtual SAN Advantage Today s Agenda
More informationGain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved.
Gain Insights From Unstructured Data Using Pivotal HD 1 Traditional Enterprise Analytics Process 2 The Fundamental Paradigm Shift Internet age and exploding data growth Enterprises leverage new data sources
More informationAcquiring Big Data to Realize Business Value
Acquiring Big Data to Realize Business Value Agenda What is Big Data? Common Big Data technologies Use Case Examples Oracle Products in the Big Data space In Summary: Big Data Takeaways
More informationCISC 7610 Lecture 2b The beginnings of NoSQL
CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone
More informationBig Data with Hadoop Ecosystem
Diógenes Pires Big Data with Hadoop Ecosystem Hands-on (HBase, MySql and Hive + Power BI) Internet Live http://www.internetlivestats.com/ Introduction Business Intelligence Business Intelligence Process
More informationMODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS
MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS SUJEE MANIYAM FOUNDER / PRINCIPAL @ ELEPHANT SCALE www.elephantscale.com sujee@elephantscale.com HI, I M SUJEE MANIYAM Founder / Principal @ ElephantScale
More informationAppliances and DW Architecture. John O Brien President and Executive Architect Zukeran Technologies 1
Appliances and DW Architecture John O Brien President and Executive Architect Zukeran Technologies 1 OBJECTIVES To define an appliance Understand critical components of a DW appliance Learn how DW appliances
More informationAbstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight
ESG Lab Review InterSystems Data Platform: A Unified, Efficient Data Platform for Fast Business Insight Date: April 218 Author: Kerry Dolan, Senior IT Validation Analyst Abstract Enterprise Strategy Group
More informationTITLE. the IT Landscape
The Impact of Hyperconverged Infrastructure on the IT Landscape 1 TITLE Drivers for adoption Lower TCO Speed and Agility Scale Easily Operational Simplicity Hyper-converged Integrated storage & compute
More informationOracle Secure Backup: Achieve 75 % Cost Savings with Your Tape Backup
1 Oracle Secure Backup: Achieve 75 % Cost Savings with Your Tape Backup Donna Cooksey Oracle Principal Product Manager John Swallow Waters Corporation Sr. Infrastructure Architect Enterprise Software Solutions
More informationFlexPod. The Journey to the Cloud. Technical Presentation. Presented Jointly by NetApp and Cisco
FlexPod The Journey to the Cloud Technical Presentation Presented Jointly by NetApp and Cisco Agenda Alliance Highlights Introducing FlexPod One Shared Vision and Journey FlexPod for the Oracle base base
More information