MULTICORE IN DATA APPLIANCES. Gustavo Alonso Systems Group Dept. of Computer Science ETH Zürich, Switzerland

Size: px
Start display at page:

Download "MULTICORE IN DATA APPLIANCES. Gustavo Alonso Systems Group Dept. of Computer Science ETH Zürich, Switzerland"

Transcription

1 MULTICORE IN DATA APPLIANCES Gustavo Alonso Systems Group Dept. of Computer Science ETH Zürich, Switzerland SwissBox CREST Workshop March 2012

2 Systems Group = Enterprise Computing Center = Gustavo Alonso - Systems Group - ETH Zürich 2

3 The SwissBox project Build an open source data appliance Hardware Software Gustavo Alonso - Systems Group - ETH Zürich 3

4 Goals Robust by design Scalable by design Fully predictable Behavior immune to peak loads (read or write) Efficient use of modern hardware Cost efficiency Power/space/management complexity Gustavo Alonso - Systems Group - ETH Zürich 4

5 DATA APPLIANCE Database in a box Funny database Funny box Examples: Exascale (Oracle) Twin-Fin (Netezza IBM) NewDB (SAP) Teradata Gustavo Alonso - Systems Group - ETH Zürich 5

6 ORACLE EXADATA Intelligent storage manager Massive caching RAC based architecture Fast network interconnect Gustavo Alonso - Systems Group - ETH Zürich 6

7 The Multicore Challenge Gustavo Alonso - Systems Group - ETH Zürich 7

8 Data parallelism Relational model is highly parallel Independent tables Orthogonal operators Intra- and Inter-query parallelism Most successful commercial parallel systems And yet Gustavo Alonso - Systems Group - ETH Zürich 8

9 Throughput (TPS) Database engines and multicore Postgres TPC-WB 20GB DB cores cores cores TPCW-S Clients PG 48 PG-24 PG 8 Salomie, Subasu, Giceva, Alonso, EuroSys 2011 Gustavo Alonso - Systems Group - ETH Zürich 9

10 Thorughput(TPS) Database engines and multicore MySQL TPC-WB 20 GB DB 8 cores cores cores Clients MYSQL-48 MYSQL-24 MYSQL-12 Salomie, Subasu, Giceva, Alonso, EuroSys 2011 Gustavo Alonso - Systems Group - ETH Zürich 10

11 CLAIM #1 Adding resources to a troubled application does not necessarily lead to improvements Gustavo Alonso - Systems Group - ETH Zürich 11

12 Size matters The challenge of appliances is the unprecedented power available 64 cores AMD,256 GB memory, 10 Gb network + 3 TB NAS: ~14k CHF Imagine a rack full of those Is your job large enough? Gustavo Alonso - Systems Group - ETH Zürich 12

13 CLAIM #2 Large scale parallelism sensible only when looking at aggregated loads Gustavo Alonso - Systems Group - ETH Zürich 13

14 Load interaction In a highly parallel system, multiple jobs will get on each other s way: Synchronization Data movement Resource arbitrage Resource capping Heavy vs. light jobs Management and coordination Gustavo Alonso - Systems Group - ETH Zürich 14

15 Giannikis, Alono, Kossmann, PVLDB 2012 Load interaction in practice System X Gustavo Alonso - Systems Group - ETH Zürich 15

16 CLAIM #3 Robustness and performance can only be obtained by minimizing interaction Gustavo Alonso - Systems Group - ETH Zürich 16

17 Locality in the XXI st century Each package has: 12 cores 4HT ports 4 memory channels P0 P4 P2 P6 Each die has: 6 cores 4HT ports 2 memory channels P1 P3 P5 P7 Gustavo Alonso - Systems Group - ETH Zürich 17

18 CLAIM #4 Robustness and definitely performance can only be achieved on fixed data paths Gustavo Alonso - Systems Group - ETH Zürich 18

19 Architecture other data centers RACK RACK RACK RACK RACK multicore Parallel hardware accelerator Gustavo Alonso - Systems Group - ETH Zürich 19

20 CLAIM #5 Traditional layered architectures (HW/OS/VM/App) do not work in these environments Gustavo Alonso - Systems Group - ETH Zürich 20

21 CLAIM #6 Strong notions of consistency (serializability) and atomicity of complex programs no longer feasible Gustavo Alonso - Systems Group - ETH Zürich 21

22 SWISSBOX Alonso, Kossmann, Roscoe, CIDR 2011 Gustavo Alonso - Systems Group - ETH Zürich 22

23 SwissBox: the project Great opportunity for research Rethink the entire system software stack Redesign the operating system, database, and storage system architecture Software hardware co-design Gustavo Alonso - Systems Group - ETH Zürich 23

24 SwissBox: the product Direct collaboration and input from industry Great demand for tailored systems Big data Highly demanding applications Low power / high efficiency Gustavo Alonso - Systems Group - ETH Zürich 24

25 Claim # 1 => Deterministic behavior Adding resources to an application does not necessarily lead to a performance improvement System performance completely determined at design time through simple parameters Gustavo Alonso - Systems Group - ETH Zürich 25

26 Clock Scan QUERIES UPDATES Unterbrunner, Giannikis, Alonso, Kossmann, PVLDB 2009 BUILD QUERY INDEX FOR NEXT SCAN READ CURSOR WRITE CURSOR DATA IN CIRCULAR BUFFER (WIDE TABLE) Gustavo Alonso - Systems Group - ETH Zürich 26

27 Claim # 2 => Batch processing Large scale parallelism makes sense only when considering aggregated loads Execution proceeds in batches (1000 s of queries per batch) Gustavo Alonso - Systems Group - ETH Zürich 27

28 Giannikis, Alono, Kossmann, PVLDB 2012 Shared join Crescando runs selection and projections in one set of cores SharedDB runs joins on the streams from Crescando, thousands of queries at a time Gustavo Alonso - Systems Group - ETH Zürich 28

29 Claim # 3 => No load interaction Robustness and performance can only be obtained by minimizing interaction Operators are orthogonal and work on clearly delimited resources Gustavo Alonso - Systems Group - ETH Zürich 29

30 Giannikis, Alono, Kossmann, PVLDB 2012 Predictability, robustness Gustavo Alonso - Systems Group - ETH Zürich 30

31 Claim # 4 => No dynamic scheduling Robustness (and definitely performance) can only be achieved on fixed data paths No dynamic scheduling, operators are always on and at fixed locations Gustavo Alonso - Systems Group - ETH Zürich 31

32 Single plan: operator per core Gustavo Alonso - Systems Group - ETH Zürich 32

33 Giannikis, Alono, Kossmann, PVLDB 2012 Raw performance Gustavo Alonso - Systems Group - ETH Zürich 33

34 Claim # 5 => Open stack Traditional layered architectures (HW/OS/VM/App) do not work in these environments Operating System / Database co-design Gustavo Alonso - Systems Group - ETH Zürich 34

35 Claim # 6 => Consistency Strong notions of consistency (serializability) and atomicity of complex programs no longer feasible Snapshot isolation, multiversions, eventual consistency Gustavo Alonso - Systems Group - ETH Zürich 35

36 SWISSBOX Gustavo Alonso - Systems Group - ETH Zürich 36

37 Where are we? Fully predictable performance Accurate analytical model Easily tunable / scalable Tolerates high peaks of reads and updates without compromising SLA Intelligent storage engine Gustavo Alonso - Systems Group - ETH Zürich 37

38 Next steps Hardware acceleration Query optimizer Parallel operators OS / DB interfaces Gustavo Alonso - Systems Group - ETH Zürich 38

39 In the future Virtualized operators Flexible, elastic deployment through OS interaction Scalability through operator/plan replication across cores and machines Hardware acceleration by operator offloading and in-network data processing Operator parallelism Gustavo Alonso - Systems Group - ETH Zürich 39

40 SwissBox in a nutshell A new way to process data Parallel, predictable by design Not optimal but good enough Co-design at all levels Great opportunity for research Redo everything from scratch Gustavo Alonso - Systems Group - ETH Zürich 40

Rack-scale Data Processing System

Rack-scale Data Processing System Rack-scale Data Processing System Jana Giceva, Darko Makreshanski, Claude Barthels, Alessandro Dovis, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich Rack-scale Data Processing

More information

DATABASES AND THE CLOUD. Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland

DATABASES AND THE CLOUD. Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland DATABASES AND THE CLOUD Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland AVALOQ Conference Zürich June 2011 Systems Group www.systems.ethz.ch Enterprise Computing Center

More information

Performance in the Multicore Era

Performance in the Multicore Era Performance in the Multicore Era Gustavo Alonso Systems Group -- ETH Zurich, Switzerland Systems Group Enterprise Computing Center Performance in the multicore era 2 BACKGROUND - SWISSBOX SwissBox: An

More information

Crescando: Predictable Performance for Unpredictable Workloads

Crescando: Predictable Performance for Unpredictable Workloads Crescando: Predictable Performance for Unpredictable Workloads G. Alonso, D. Fauser, G. Giannikis, D. Kossmann, J. Meyer, P. Unterbrunner Amadeus S.A. ETH Zurich, Systems Group (Funded by Enterprise Computing

More information

Crazy little thing called hardware GUSTAVO ALONSO SYSTEMS GROUP DEPT. OF COMPUTER SCIENCE ETH ZURICH

Crazy little thing called hardware GUSTAVO ALONSO SYSTEMS GROUP DEPT. OF COMPUTER SCIENCE ETH ZURICH Crazy little thing called hardware GUSTAVO ALONSO SYSTEMS GROUP DEPT. OF COMPUTER SCIENCE ETH ZURICH HTDC 2014 Systems Group = www.systems.ethz.ch Enterprise Computing Center = www.ecc.ethz.ch Hardware

More information

Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions

Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions Providing Superior Server and Storage Performance, Efficiency and Return on Investment As Announced and Demonstrated at

More information

Data Processing on Emerging Hardware

Data Processing on Emerging Hardware Data Processing on Emerging Hardware Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland 3 rd International Summer School on Big Data, Munich, Germany, 2017 www.systems.ethz.ch

More information

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache Databases on AWS 2017 Amazon Web Services, Inc. and its affiliates. All rights served. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon Web Services,

More information

Oracle Exadata X7. Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer

Oracle Exadata X7. Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer Oracle Exadata X7 Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer 05.12.2017 Oracle Engineered Systems ZFS Backup Appliance Zero Data Loss Recovery Appliance Exadata Database

More information

Caribou: Intelligent Distributed Storage

Caribou: Intelligent Distributed Storage : Intelligent Distributed Storage Zsolt István, David Sidler, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich 1 Rack-scale thinking In the Cloud ToR Switch Compute Compute + Provisioning

More information

Oracle Exadata: Strategy and Roadmap

Oracle Exadata: Strategy and Roadmap Oracle Exadata: Strategy and Roadmap - New Technologies, Cloud, and On-Premises Juan Loaiza Senior Vice President, Database Systems Technologies, Oracle Safe Harbor Statement The following is intended

More information

Introduction to Database Services

Introduction to Database Services Introduction to Database Services Shaun Pearce AWS Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Today s agenda Why managed database services? A non-relational

More information

IBM FlashSystem. IBM FLiP Tool Wie viel schneller kann Ihr IBM i Power Server mit IBM FlashSystem 900 / V9000 Storage sein?

IBM FlashSystem. IBM FLiP Tool Wie viel schneller kann Ihr IBM i Power Server mit IBM FlashSystem 900 / V9000 Storage sein? FlashSystem Family 2015 IBM FlashSystem IBM FLiP Tool Wie viel schneller kann Ihr IBM i Power Server mit IBM FlashSystem 900 / V9000 Storage sein? PiRT - Power i Round Table 17 Sep. 2015 Daniel Gysin IBM

More information

Concurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration

Concurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration Concurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration Sina Meraji sinamera@ca.ibm.com Berni Schiefer schiefer@ca.ibm.com Tuesday March 17th at 12:00

More information

WHITEPAPER. MemSQL Enterprise Feature List

WHITEPAPER. MemSQL Enterprise Feature List WHITEPAPER MemSQL Enterprise Feature List 2017 MemSQL Enterprise Feature List DEPLOYMENT Provision and deploy MemSQL anywhere according to your desired cluster configuration. On-Premises: Maximize infrastructure

More information

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2 Introduction :- Today single CPU based architecture is not capable enough for the modern database that are required to handle more demanding and complex requirements of the users, for example, high performance,

More information

Oracle Exadata. Smart Database Platforms - Dramatic Performance and Cost Advantages. Juan Loaiza Senior Vice President Oracle Database Systems

Oracle Exadata. Smart Database Platforms - Dramatic Performance and Cost Advantages. Juan Loaiza Senior Vice President Oracle Database Systems Oracle Exadata Smart Database Platforms - Dramatic Performance and Cost Advantages Juan Loaiza Senior Vice President Oracle Database Systems Exadata X5-2 Exadata X5-8 SuperCluster M7-8 Exadata Vision Dramatically

More information

Massive Scalability With InterSystems IRIS Data Platform

Massive Scalability With InterSystems IRIS Data Platform Massive Scalability With InterSystems IRIS Data Platform Introduction Faced with the enormous and ever-growing amounts of data being generated in the world today, software architects need to pay special

More information

CISC 7610 Lecture 2b The beginnings of NoSQL

CISC 7610 Lecture 2b The beginnings of NoSQL CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone

More information

In-Memory Data Management

In-Memory Data Management In-Memory Data Management Martin Faust Research Assistant Research Group of Prof. Hasso Plattner Hasso Plattner Institute for Software Engineering University of Potsdam Agenda 2 1. Changed Hardware 2.

More information

Oracle Autonomous Database

Oracle Autonomous Database Oracle Autonomous Database Maria Colgan Master Product Manager Oracle Database Development August 2018 @SQLMaria #thinkautonomous Safe Harbor Statement The following is intended to outline our general

More information

In-Memory Data Management Jens Krueger

In-Memory Data Management Jens Krueger In-Memory Data Management Jens Krueger Enterprise Platform and Integration Concepts Hasso Plattner Intitute OLTP vs. OLAP 2 Online Transaction Processing (OLTP) Organized in rows Online Analytical Processing

More information

Oracle Exadata: The World s Fastest Database Machine

Oracle Exadata: The World s Fastest Database Machine 10 th of November Sheraton Hotel, Sofia Oracle Exadata: The World s Fastest Database Machine Daniela Milanova Oracle Sales Consultant Oracle Exadata Database Machine One architecture for Data Warehousing

More information

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Data Modeling and Databases Ch 14: Data Replication Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Database Replication What is database replication The advantages of

More information

IBM Db2 Analytics Accelerator Version 7.1

IBM Db2 Analytics Accelerator Version 7.1 IBM Db2 Analytics Accelerator Version 7.1 Delivering new flexible, integrated deployment options Overview Ute Baumbach (bmb@de.ibm.com) 1 IBM Z Analytics Keep your data in place a different approach to

More information

Technology Insight Series

Technology Insight Series IBM ProtecTIER Deduplication for z/os John Webster March 04, 2010 Technology Insight Series Evaluator Group Copyright 2010 Evaluator Group, Inc. All rights reserved. Announcement Summary The many data

More information

Consistency and Scalability

Consistency and Scalability COMP 150-IDS: Internet Scale Distributed Systems (Spring 2015) Consistency and Scalability Noah Mendelsohn Tufts University Email: noah@cs.tufts.edu Web: http://www.cs.tufts.edu/~noah Copyright 2015 Noah

More information

Accelerating Data Science. Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland

Accelerating Data Science. Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland Accelerating Data Science Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland Data processing today: Appliances (large machines) Data Centers (many machines) Databases are

More information

Fit for Purpose Platform Positioning and Performance Architecture

Fit for Purpose Platform Positioning and Performance Architecture Fit for Purpose Platform Positioning and Performance Architecture Joe Temple IBM Monday, February 4, 11AM-12PM Session Number 12927 Insert Custom Session QR if Desired. Fit for Purpose Categorized Workload

More information

VOLTDB + HP VERTICA. page

VOLTDB + HP VERTICA. page VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics

More information

Oracle: From Client Server to the Grid and beyond

Oracle: From Client Server to the Grid and beyond Oracle: From Client Server to the Grid and beyond Graham Wood Architect, RDBMS Development Oracle Corporation Continuous Innovation Oracle 6 Oracle 5 Oracle 2 Oracle 7 Data Warehousing Optimizations Parallel

More information

Was ist dran an einer spezialisierten Data Warehousing platform?

Was ist dran an einer spezialisierten Data Warehousing platform? Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction

More information

Reconfigurable hardware for big data. Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland

Reconfigurable hardware for big data. Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland Reconfigurable hardware for big data Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland www.systems.ethz.ch Systems Group 7 faculty ~40 PhD ~8 postdocs Researching all

More information

BatchDB: Efficient Isolated Execution of Hybrid OLTP+OLAP Workloads for Interactive Applications

BatchDB: Efficient Isolated Execution of Hybrid OLTP+OLAP Workloads for Interactive Applications BatchDB: Efficient Isolated Execution of Hybrid OLTP+OLAP Workloads for Interactive Applications Darko Makreshanski, Jana Giceva, Claude Barthels, Gustavo Alonso Systems Group, Department of Computer Science,

More information

Hewlett Packard Enterprise HPE GEN10 PERSISTENT MEMORY PERFORMANCE THROUGH PERSISTENCE

Hewlett Packard Enterprise HPE GEN10 PERSISTENT MEMORY PERFORMANCE THROUGH PERSISTENCE Hewlett Packard Enterprise HPE GEN10 PERSISTENT MEMORY PERFORMANCE THROUGH PERSISTENCE Digital transformation is taking place in businesses of all sizes Big Data and Analytics Mobility Internet of Things

More information

Oracle #1 for Data Warehousing. Data Warehouses Growing Rapidly Tripling In Size Every Two Years

Oracle #1 for Data Warehousing. Data Warehouses Growing Rapidly Tripling In Size Every Two Years Extreme Performance HP Oracle Machine & Exadata Storage Server October 16, 2008 Robert Stackowiak Vice President, EPM & Data Warehousing Solutions, Oracle ESG Oracle #1 for Data Warehousing Microsoft 14.8%

More information

SAP HANA. Jake Klein/ SVP SAP HANA June, 2013

SAP HANA. Jake Klein/ SVP SAP HANA June, 2013 SAP HANA Jake Klein/ SVP SAP HANA June, 2013 SAP 3 YEARS AGO Middleware BI / Analytics Core ERP + Suite 2013 WHERE ARE WE NOW? Cloud Mobile Applications SAP HANA Analytics D&T Changed Reality Disruptive

More information

Exadata X3 in action: Measuring Smart Scan efficiency with AWR. Franck Pachot Senior Consultant

Exadata X3 in action: Measuring Smart Scan efficiency with AWR. Franck Pachot Senior Consultant Exadata X3 in action: Measuring Smart Scan efficiency with AWR Franck Pachot Senior Consultant 16 March 2013 1 Exadata X3 in action: Measuring Smart Scan efficiency with AWR Exadata comes with new statistics

More information

Architectural challenges for building a low latency, scalable multi-tenant data warehouse

Architectural challenges for building a low latency, scalable multi-tenant data warehouse Architectural challenges for building a low latency, scalable multi-tenant data warehouse Mataprasad Agrawal Solutions Architect, Services CTO 2017 Persistent Systems Ltd. All rights reserved. Our analytics

More information

What is new in the cloud? Donald Kossmann ETH Zurich

What is new in the cloud? Donald Kossmann ETH Zurich What is new in the cloud? Donald Kossmann ETH Zurich http://systems.ethz.ch Acknowledgments Questions? Agenda Why? How? What? Simple Truths Power of data the more data the merrier (GB > TB > PB) data comes

More information

Future of Database. - Journey to the Cloud. Juan Loaiza Senior Vice President Oracle Database Systems

Future of Database. - Journey to the Cloud. Juan Loaiza Senior Vice President Oracle Database Systems Future of Database - Journey to the Cloud Juan Loaiza Senior Vice President Oracle Database Systems Copyright 2016, Oracle and/or its affiliates. All rights reserved. Safe Harbor Statement The following

More information

Next-Generation Cloud Platform

Next-Generation Cloud Platform Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology

More information

Netezza The Analytics Appliance

Netezza The Analytics Appliance Software 2011 Netezza The Analytics Appliance Michael Eden Information Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 Information Management 2011IBM Corporation Thought for

More information

Introduction to Data Management CSE 344

Introduction to Data Management CSE 344 Introduction to Data Management CSE 344 Lectures 23 and 24 Parallel Databases 1 Why compute in parallel? Most processors have multiple cores Can run multiple jobs simultaneously Natural extension of txn

More information

Evolving To The Big Data Warehouse

Evolving To The Big Data Warehouse Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from

More information

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Copyright 2012, Oracle and/or its affiliates. All rights reserved. 1 Big Data Connectors: High Performance Integration for Hadoop and Oracle Database Melli Annamalai Sue Mavris Rob Abbott 2 Program Agenda Big Data Connectors: Brief Overview Connecting Hadoop with Oracle

More information

Key Features. High-performance data replication. Optimized for Oracle Cloud. High Performance Parallel Delivery for all targets

Key Features. High-performance data replication. Optimized for Oracle Cloud. High Performance Parallel Delivery for all targets To succeed in today s competitive environment, you need real-time information. This requires a platform that can unite information from disparate systems across your enterprise without compromising availability

More information

Safe Harbor Statement

Safe Harbor Statement Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment

More information

Infrastructure Matters: POWER8 vs. Xeon x86

Infrastructure Matters: POWER8 vs. Xeon x86 Advisory Infrastructure Matters: POWER8 vs. Xeon x86 Executive Summary This report compares IBM s new POWER8-based scale-out Power System to Intel E5 v2 x86- based scale-out systems. A follow-on report

More information

Got Isilon? Need IOPS? Get Avere.

Got Isilon? Need IOPS? Get Avere. Got Isilon? Need IOPS? Get Avere. Scalable I/O Performance to Complement Any EMC Isilon Environment By: Jeff Tabor, Director of Product Marketing Achieving Performance Scaling Overcoming Random I/O and

More information

Oracle CoreTech Update OASC Opening 17. November 2014

Oracle CoreTech Update OASC Opening 17. November 2014 Oracle CoreTech Update OASC Opening 17. November 2014 Roger Wullschleger Senior Manager Sales Consulting CoreTech Oracle Software (Schweiz) GmbH Copyright 2014, Oracle and/or its affiliates. All rights

More information

Improve Web Application Performance with Zend Platform

Improve Web Application Performance with Zend Platform Improve Web Application Performance with Zend Platform Shahar Evron Zend Sr. PHP Specialist Copyright 2007, Zend Technologies Inc. Agenda Benchmark Setup Comprehensive Performance Multilayered Caching

More information

Building a Data Strategy for a Digital World

Building a Data Strategy for a Digital World Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service

More information

The Oracle Database Appliance I/O and Performance Architecture

The Oracle Database Appliance I/O and Performance Architecture Simple Reliable Affordable The Oracle Database Appliance I/O and Performance Architecture Tammy Bednar, Sr. Principal Product Manager, ODA 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved.

More information

Four Steps to Unleashing The Full Potential of Your Database

Four Steps to Unleashing The Full Potential of Your Database Four Steps to Unleashing The Full Potential of Your Database This insightful technical guide offers recommendations on selecting a platform that helps unleash the performance of your database. What s the

More information

Configuring Short RPO with Actifio StreamSnap and Dedup-Async Replication

Configuring Short RPO with Actifio StreamSnap and Dedup-Async Replication CDS and Sky Tech Brief Configuring Short RPO with Actifio StreamSnap and Dedup-Async Replication Actifio recommends using Dedup-Async Replication (DAR) for RPO of 4 hours or more and using StreamSnap for

More information

Data Management in the Cloud: Limitations and Opportunities. Daniel Abadi Yale University January 30 th, 2009

Data Management in the Cloud: Limitations and Opportunities. Daniel Abadi Yale University January 30 th, 2009 Data Management in the Cloud: Limitations and Opportunities Daniel Abadi Yale University January 30 th, 2009 Want milk with your breakfast? Buy a cow Big upfront cost Produces more (or less) milk than

More information

Oracle IaaS, a modern felhő infrastruktúra

Oracle IaaS, a modern felhő infrastruktúra Sárecz Lajos Cloud Platform Sales Consultant Oracle IaaS, a modern felhő infrastruktúra Copyright 2017, Oracle and/or its affiliates. All rights reserved. Azure Window collapsed Oracle Infrastructure as

More information

A Scalable Lock Manager for Multicores

A Scalable Lock Manager for Multicores A Scalable Lock Manager for Multicores Hyungsoo Jung Hyuck Han Alan Fekete NICTA Samsung Electronics University of Sydney Gernot Heiser NICTA Heon Y. Yeom Seoul National University @University of Sydney

More information

Exadata Implementation Strategy

Exadata Implementation Strategy Exadata Implementation Strategy BY UMAIR MANSOOB 1 Who Am I Work as Senior Principle Engineer for an Oracle Partner Oracle Certified Administrator from Oracle 7 12c Exadata Certified Implementation Specialist

More information

Migrating Oracle Databases To Cassandra

Migrating Oracle Databases To Cassandra BY UMAIR MANSOOB Why Cassandra Lower Cost of ownership makes it #1 choice for Big Data OLTP Applications. Unlike Oracle, Cassandra can store structured, semi-structured, and unstructured data. Cassandra

More information

FOEDUS: OLTP Engine for a Thousand Cores and NVRAM

FOEDUS: OLTP Engine for a Thousand Cores and NVRAM FOEDUS: OLTP Engine for a Thousand Cores and NVRAM Hideaki Kimura HP Labs, Palo Alto, CA Slides By : Hideaki Kimura Presented By : Aman Preet Singh Next-Generation Server Hardware? HP The Machine UC Berkeley

More information

Intel Enterprise Processors Technology

Intel Enterprise Processors Technology Enterprise Processors Technology Kosuke Hirano Enterprise Platforms Group March 20, 2002 1 Agenda Architecture in Enterprise Xeon Processor MP Next Generation Itanium Processor Interconnect Technology

More information

Lustre* - Fast Forward to Exascale High Performance Data Division. Eric Barton 18th April, 2013

Lustre* - Fast Forward to Exascale High Performance Data Division. Eric Barton 18th April, 2013 Lustre* - Fast Forward to Exascale High Performance Data Division Eric Barton 18th April, 2013 DOE Fast Forward IO and Storage Exascale R&D sponsored by 7 leading US national labs Solutions to currently

More information

Research Collection. Daedalus a distributed crescando system. Master Thesis. ETH Library. Author(s): Giannikis, Georgios. Publication Date: 2009

Research Collection. Daedalus a distributed crescando system. Master Thesis. ETH Library. Author(s): Giannikis, Georgios. Publication Date: 2009 Research Collection Master Thesis Daedalus a distributed crescando system Author(s): Giannikis, Georgios Publication Date: 2009 Permanent Link: https://doi.org/10.3929/ethz-a-005816890 Rights / License:

More information

Adapted from: TRENDS AND ATTRIBUTES OF HORIZONTAL AND VERTICAL COMPUTING ARCHITECTURES

Adapted from: TRENDS AND ATTRIBUTES OF HORIZONTAL AND VERTICAL COMPUTING ARCHITECTURES Adapted from: TRENDS AND ATTRIBUTES OF HORIZONTAL AND VERTICAL COMPUTING ARCHITECTURES Tom Atwood Business Development Manager Sun Microsystems, Inc. Takeaways Understand the technical differences between

More information

Shared Workload Optimization

Shared Workload Optimization Shared Workload Optimization Georgios Giannikis Darko Makreshanski Gustavo Alonso Donald Kossmann Systems Group, Department of Computer Science, ETH Zurich, Switzerland {giannikg,darkoma,alonso,kossmann}@inf.ethz.ch

More information

Scaling up analytical queries with column-stores

Scaling up analytical queries with column-stores Scaling up analytical queries with column-stores Ioannis Alagiannis, Manos Athanassoulis, Anastasia Ailamaki Ecole Polytechnique Fédérale de Lausanne Lausanne, VD, Switzerland {ioannis.alagiannis, manos.athanassoulis,

More information

#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru.

#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending Presented by: Trishla Maru Agenda Overview MultiSource Data Federation Use Cases Design Considerations Data

More information

朱义普. Resolving High Performance Computing and Big Data Application Bottlenecks with Application-Defined Flash Acceleration. Director, North Asia, HPC

朱义普. Resolving High Performance Computing and Big Data Application Bottlenecks with Application-Defined Flash Acceleration. Director, North Asia, HPC October 28, 2013 Resolving High Performance Computing and Big Data Application Bottlenecks with Application-Defined Flash Acceleration 朱义普 Director, North Asia, HPC DDN Storage Vendor for HPC & Big Data

More information

Amazon Aurora Relational databases reimagined.

Amazon Aurora Relational databases reimagined. Amazon Aurora Relational databases reimagined. Ronan Guilfoyle, Solutions Architect, AWS Brian Scanlan, Engineer, Intercom 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Current

More information

Chapter 18: Database System Architectures.! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems!

Chapter 18: Database System Architectures.! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Chapter 18: Database System Architectures! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Network Types 18.1 Centralized Systems! Run on a single computer system and

More information

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE Oracle Database Exadata Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE Oracle Database Exadata combines the best database with the best cloud platform. Exadata is the culmination of more

More information

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp.

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. 17-18 March, 2018 Beijing Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020 Today, 80% of organizations

More information

Under the Hood of Oracle Database Appliance. Alex Gorbachev

Under the Hood of Oracle Database Appliance. Alex Gorbachev Under the Hood of Oracle Database Appliance Alex Gorbachev Mountain View, CA 9-Nov-2011 http://bit.ly/pythianasmwebinar 2 Alex Gorbachev CTO, The Pythian Group Blogger OakTable Network member Oracle ACE

More information

IBM Db2 Warehouse on Cloud

IBM Db2 Warehouse on Cloud IBM Db2 Warehouse on Cloud February 01, 2018 Ben Hudson, Offering Manager Noah Kuttler, Product Marketing CALL LOGISTICS Data Warehouse Community Share. Solve. Do More. There are 2 options to listen to

More information

Finding a needle in Haystack: Facebook's photo storage

Finding a needle in Haystack: Facebook's photo storage Finding a needle in Haystack: Facebook's photo storage The paper is written at facebook and describes a object storage system called Haystack. Since facebook processes a lot of photos (20 petabytes total,

More information

A Single Source of Truth

A Single Source of Truth A Single Source of Truth is it the mythical creature of data management? In the world of data management, a single source of truth is a fully trusted data source the ultimate authority for the particular

More information

Traditional RDBMS Wisdom is All Wrong -- In Three Acts. Michael Stonebraker

Traditional RDBMS Wisdom is All Wrong -- In Three Acts. Michael Stonebraker Traditional RDBMS Wisdom is All Wrong -- In Three Acts Michael Stonebraker The Stonebraker Says Webinar Series The first three acts: 1. Why main memory is the answer for OLTP Recording available at VoltDB.com

More information

The InfoLibrarian Metadata Appliance Automated Cataloging System for your IT infrastructure.

The InfoLibrarian Metadata Appliance Automated Cataloging System for your IT infrastructure. Metadata Integration Appliance Times have changed and here is modern solution that delivers instant return on your investment. The InfoLibrarian Metadata Appliance Automated Cataloging System for your

More information

Towards Database / Operating System Co-Design

Towards Database / Operating System Co-Design Towards Database / Operating System Co-Design Jana Giceva, Adrian Schüpbach, Gustavo Alonso, Timothy Roscoe Systems Group, ETH Zurich ABSTRACT Trends in multicore processors pose serious structural challenges

More information

Introduction to Data Management CSE 344

Introduction to Data Management CSE 344 Introduction to Data Management CSE 344 Lecture 25: Parallel Databases CSE 344 - Winter 2013 1 Announcements Webquiz due tonight last WQ! J HW7 due on Wednesday HW8 will be posted soon Will take more hours

More information

Database Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu

Database Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu Database Architecture 2 & Storage Instructor: Matei Zaharia cs245.stanford.edu Summary from Last Time System R mostly matched the architecture of a modern RDBMS» SQL» Many storage & access methods» Cost-based

More information

What We Have Already Learned. DBMS Deployment: Local. Where We Are Headed Next. DBMS Deployment: 3 Tiers. DBMS Deployment: Client/Server

What We Have Already Learned. DBMS Deployment: Local. Where We Are Headed Next. DBMS Deployment: 3 Tiers. DBMS Deployment: Client/Server What We Have Already Learned CSE 444: Database Internals Lectures 19-20 Parallel DBMSs Overall architecture of a DBMS Internals of query execution: Data storage and indexing Buffer management Query evaluation

More information

Introduction to Database Systems CSE 414

Introduction to Database Systems CSE 414 Introduction to Database Systems CSE 414 Lecture 24: Parallel Databases CSE 414 - Spring 2015 1 Announcements HW7 due Wednesday night, 11 pm Quiz 7 due next Friday(!), 11 pm HW8 will be posted middle of

More information

Oracle Database Exadata Cloud Service

Oracle Database Exadata Cloud Service Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity Oracle Database Exadata Cloud Service (Exadata Service) delivers the world s best Cloud Database Platform by combining the world

More information

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB s C. Faloutsos A. Pavlo Lecture#23: Distributed Database Systems (R&G ch. 22) Administrivia Final Exam Who: You What: R&G Chapters 15-22

More information

Data Modeling and Databases Ch 10: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich

Data Modeling and Databases Ch 10: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Data Modeling and Databases Ch 10: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application

More information

Exadata. Presented by: Kerry Osborne. February 23, 2012

Exadata. Presented by: Kerry Osborne. February 23, 2012 Exadata Presented by: Kerry Osborne February 23, 2012 whoami Worked with Oracle Since 1982 (V2) Working with Exadata since early 2010 Work for Enkitec (www.enkitec.com) (Enkitec owns a Half Rack V2/X2)

More information

IBM DB2 Analytics Accelerator use cases

IBM DB2 Analytics Accelerator use cases IBM DB2 Analytics Accelerator use cases Ciro Puglisi Netezza Europe +41 79 770 5713 cpug@ch.ibm.com 1 Traditional systems landscape Applications OLTP Staging Area ODS EDW Data Marts ETL ETL ETL ETL Historical

More information

CIB Session 12th NoSQL Databases Structures

CIB Session 12th NoSQL Databases Structures CIB Session 12th NoSQL Databases Structures By: Shahab Safaee & Morteza Zahedi Software Engineering PhD Email: safaee.shx@gmail.com, morteza.zahedi.a@gmail.com cibtrc.ir cibtrc cibtrc 2 Agenda What is

More information

Data Modeling and Databases Ch 9: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich

Data Modeling and Databases Ch 9: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Data Modeling and Databases Ch 9: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application

More information

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp. Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020

More information

Distributed and Fault-Tolerant Execution Framework for Transaction Processing

Distributed and Fault-Tolerant Execution Framework for Transaction Processing Distributed and Fault-Tolerant Execution Framework for Transaction Processing May 30, 2011 Toshio Suganuma, Akira Koseki, Kazuaki Ishizaki, Yohei Ueda, Ken Mizuno, Daniel Silva *, Hideaki Komatsu, Toshio

More information

Advances in Data Management - NoSQL, NewSQL and Big Data A.Poulovassilis

Advances in Data Management - NoSQL, NewSQL and Big Data A.Poulovassilis Advances in Data Management - NoSQL, NewSQL and Big Data A.Poulovassilis 1 NoSQL So-called NoSQL systems offer reduced functionalities compared to traditional Relational DBMSs, with the aim of achieving

More information

AMD Opteron Processors In the Cloud

AMD Opteron Processors In the Cloud AMD Opteron Processors In the Cloud Pat Patla Vice President Product Marketing AMD DID YOU KNOW? By 2020, every byte of data will pass through the cloud *Source IDC 2 AMD Opteron In The Cloud October,

More information

Oracle Enterprise Architecture. Software. Hardware. Complete. Oracle Exalogic.

Oracle Enterprise Architecture. Software. Hardware. Complete. Oracle Exalogic. Oracle Enterprise Architecture Software. Hardware. Complete Oracle Exalogic edward.zhang@oracle.com Exalogic Exalogic Exalogic -- Exalogic Design Center Exalogic - Sun Oracle - - - CPU/Memory/Networking/Storage

More information

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case

More information

Workload Optimized Systems: The Wheel of Reincarnation. Michael Sporer, Netezza Appliance Hardware Architect 21 April 2013

Workload Optimized Systems: The Wheel of Reincarnation. Michael Sporer, Netezza Appliance Hardware Architect 21 April 2013 Workload Optimized Systems: The Wheel of Reincarnation Michael Sporer, Netezza Appliance Hardware Architect 21 April 2013 Outline Definition Technology Minicomputers Prime Workstations Apollo Graphics

More information

Oracle Server Benchmark with In-Memory SQL Processing Exadata X2-2 half-rack high-capacity

Oracle Server Benchmark with In-Memory SQL Processing Exadata X2-2 half-rack high-capacity Oracle Server Benchmark with In-Memory SQL Processing Exadata X2-2 half-rack high-capacity Benchware Performance Suite Release 8.4 (Build 130630) June 2013 Contents 1 Introduction to Tests 2 CPU and Server

More information