Crazy little thing called hardware GUSTAVO ALONSO SYSTEMS GROUP DEPT. OF COMPUTER SCIENCE ETH ZURICH
|
|
- Valentine Waters
- 5 years ago
- Views:
Transcription
1 Crazy little thing called hardware GUSTAVO ALONSO SYSTEMS GROUP DEPT. OF COMPUTER SCIENCE ETH ZURICH HTDC 2014
2 Systems Group = Enterprise Computing Center =
3 Hardware rules Multicore, Many core Transactional Memory SIMD, AVX, vectorization SSDs, persistent memory Infiniband, RDMA GPUs, FPGAs (hardware acceleration) Intelligent storage engines, main memory Database appliances Reacting to changes we do not control
4 Do we care? Software producer of performance instead of consumer of performance Mark D. Hill (U. of Wisconsin)
5 What does it mean? Many good ideas from decades ago are no longer good ideas (as conceived): General purpose solutions Threading, thread based parallelism Locking, shared state Centralized synchronization Concurrency control STM (sorry )
6 The take away message Ignoring hardware trends is irresponsible Software evolves more slowly than hardware Focus on the end (solve a problem), not on the means (the technique used) How to succeed in research: Make hardware your friend Influence hardware evolution
7 Outline 1. Hardware as a problem for system design a) A jungle of improvements b) An example (database joins) 2. The case for custom systems a) Economies of scale b) An example (Crescando) 3. Ideas and directions
8 1. Hardware as a problem for system design a) A jungle of improvements
9 Why is this happening? Single thread performance The Future of Computing Performance: Game Over or Next Level? Fuller and Millett (Eds) Presentation idea from Bob Colwell
10 Why is this happening? Clock frequency
11 Why is this happening? Power dissipation
12 Why is this happening? MULTICORE
13 Multicore is great: avoid distribution Nobody ever got fired for using Hadoop on a Cluster A. Rowstron, D. Narayanan, A. Donnely, G. O Shea, A. Douglas HotCDP 2012, Bern, Switzerland Analysis of MapReduce workloads: Microsoft: median job size < 14 GB Yahoo: median job size < 12.5 GB Facebook: 90% of jobs less than 100 GB Fit in main memory One server more efficient than a cluster Adding memory to a big server better than using a cluster
14 but multicore is dead MULTICORE DARK SILICON
15 Dark Silicon = heterogeneity Symmetric Asymmetric Dynamic (power) Specialized/dynamic Probabilistic
16 Heterogeneity is a mess Example: deployment on multicores Experiment setup 8GB datastore size SLA latency requirement 8s 4 different machines Min Cores Partition Size [GB] RT [s] Intel Nehalem AMD Barcelona AMD Shanghai AMD MagnyCours Jana Giceva, Tudor-Ioan Salomie, Adrian Schüpbach, Gustavo Alonso, Timothy Roscoe: COD: Database / Operating System Co-Design. CIDR 2013
17 Beyond multicore: manycore MULTICORE DARK SILICON MANY CORE
18 PCIe Manycore (example) normal CPU
19 Extreme manycore: Intelligent storage Louis Woods, Zsolt István, Gustavo Alonso: Hybrid FPGA-accelerated SQL query processing. FPL 2013
20 Don t need to trust the software stack Offload all sensitive operations to secure coprocessor Testing industry-standard benchmarks with modified SQL.exe, 0-60% perf overhead Keys, Plaintext Memory CPU Keys, Plaintext Secure Co-processor SLIDE COURTESY OF KEN EGURO, MICROSOFT RESEARCH 20
21 The take away message Ignoring hardware trends at your own peril You will be solving the wrong problem Your solution will irrelevant very quickly Your solution will be making the wrong assumptions Hardware might solve the problem that you are trying to solve (may have solved it already)
22 1. Hardware as a problem for system design b) An example (database joins)
23 The joy of joins "Main-Memory Hash Joins on Multi-Core CPUs: Tuning to the Underlying Hardware" by Cagri Balkesen, Jens Teubner, Gustavo Alonso, and Tamer Ozsu, ICDE 2013 Cagri Balkesen, Gustavo Alonso, Jens Teubner, M. Tamer Özsu: Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited. PVLDB 7(1): (2013)
24 Hardware conscious Joins are a complex and demanding operation Lots of work on implementing all kinds of joins In 2009 Kim, Sedlar et al. paper (PVLDB 2009) Radix join on multicore Sort Merge join on multicore Claim fastest implementation to date Key message: when SIMD wide enough, sort merge will be faster than radix join Performance in the multicore era 24
25 Hardware oblivious In 2011 Blanas, Li et al. (SIGMOD 2011) No partitioning join (vs. radix join version of Kim paper) Claim: Hardware is good enough, no need for careful tailoring to the underlying hardware Performance in the multicore era 25
26 Hardware confusion In 2012 Albutiu, Kemper et al. (PVLDB 2012) Sort merge joins (vs join version of Blanas) Claim: Sort merge already better and without using SIMD Performance in the multicore era 26
27 The basic hash join
28 Canonical Hash Join k R 1. Build phase hash table bucket 0 bucket 1 2. Probe phase k S hash(key) match hash(key) bucket n-1 Complexity: O( R + S ), i.e., O(N) Easy to parallelize Performance in the multicore era 28
29 Need for Speed Hardware-Conscious Hash Joins
30 Partitioned Hash Join (Shatdal et al. 1994) Idea: Partition input into disjoint chunks of cache size k R 1 h 2 (k) 1 k S h 1 (key)... p p h 1 (key) 1 Partition 2 Build 3 Probe No more cache misses during the join 1 Partition p > #TLB-entries TLB misses Problem: p can be too large! p > #Cache-entries Cache thrashing "Cache conscious algorithms for relational query processing", Shatdal et al, VLDB 94 Performance in the multicore era 30
31 Multi-Pass Radix Partitioning Problem: Hardware limits fan-out, i.e. T = #TLB-entries (typically ) Solution: Do the partitioning in multiple passes! input relation 1 st pass h 1 (key) nd Pass h 2 (key) 1 T partition i th pass... 1 st log 2 T bits of hash(key) T 2 nd Pass h 2 (key) 2 nd log 2 T bits of hash(key) 1 T i = log T p partition - T i TLB & Cache efficiency compensates multiple read/write passes «Database Architecture Optimized for the new Bottleneck: Memory Access», Manegold et al, VLDB 99 Performance in the multicore era 31
32 Partitioned Relation Parallel Radix Join Parallelizing the Partitioning: Pass - 1 Local Histograms Thread-1 Thread-2 Thread-3 Thread-N 1 st Scan Global Histogram & Prefix Sum T0 T1 T N T0 T1 T N T0 T1 T N T0 T1 T N Each thread scatters out its tuples based on the prefix sum 2 nd Scan Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs, VLDB 09 Performance in the multicore era 32
33 Relation Partitioned Parallel Radix Join Parallelizing the Partitioning: Pass - (2.. i) P 1 P 2 P 3 P 4 Histogram Thread-2 Thread-4 Thread-N... 1 st Scan Prefix Sum Each thread individually partitions sub-relations from pass-1 2 nd Scan P 11 P 12 P 13 P 14 P 21 P 22 P 23 P 24 P 31 P 32 P 33 P 41 P 42 P 43 Performance in the multicore era 33
34 Trust the force Hardware-Oblivious Hash Joins
35 Parallel Hash Join ( no partitioning join of Blanas et al.) R 1. Build Phase Thread-1 Thread-2 Thread-3 Thread-N latches L0 L1 shared hashtable bucket 0 bucket 1 2. Probe Phase 1. Acquire latch L(n-1) 2. Store in bucket bucket n-1 Compare & match Thread-1 Thread-2 Thread-3 Thread-N S Performance in the multicore era 35
36 And the winner is
37 Effect of Workloads Case 1 Workload A: 16M 256M, 16-byte tuples, i.e., 256MiB 4096MiB 16.5 cy/tpl 12.9 cy/tpl 1. Effective on-chip threading 2. Efficient sync. primitives (ldstub) 3. Larger page size Hardware oblivious vs. conscious with our optimized code -30% may seem in favor of hardware-oblivious, but Performance in the multicore era 37
38 Effect of Workloads Case 2 Workload B: Equal-sized tables, 977MiB 977MiB, 8-byte tuples 50 cy/tpl 3.5X 14 cy/tpl Picture radically changes: Hardware-conscious is better by 3.5X on Intel and 2.5X on others With larger build table, overhead of not being hardware-conscious is clearly visible Performance in the multicore era 38
39 Sort or Hash? 375M/sec 6.41 cy/tpl 299M/sec 8.03 cy/tpl 619M/sec 3.87 cy/tpl 305M/sec 7.99 cy/tpl Workload: 12.8GB 12.8GB Input size is an important parameter! Workload: 1GB 1GB Optimizations on radix: NUMA-aware data placement, partitioning with software-managed buffers Machine: Intel Sandy Bridge E4640, 2.4GHz, 32-cores, 64-threads Performance in the multicore era 39
40 Game on for academia The next several hundred papers 1. Pick an algorithm 2. Pick an architecture 3. Optimize 4. Publish 5. Go to 1 Algorithm can be used in parallel for increased throughput
41 2. The case for custom systems a) Economies of scale
42 Game changers Hardware evolution: One size does not fit all Cloud computing IT industry as service industry Big X (X=data volume, loads, system costs, requirements, scale) Demand for customized solutions
43 ORACLE EXADATA Intelligent storage manager Massive caching RAC based architecture Fast network interconnect
44 NETEZZA (IBM) TWINFIN No storage manager Distributed disks (per node) FPGA processing No indexing
45 Has been tried before The idea that database workloads are relevant enough to justify customization is not new Decades ago it was difficult to fight the progress of general purpose systems. Not any more Gamma Database Machine (Dewitt)
46 2. The case for custom systems b) An example (Crescando)
47 Amadeus Workload Passenger-Booking Database ~ 600 GB of raw data (two years of bookings) single table, denormalized ~ 50 attributes: flight-no, name, date,..., many flags Query Workload up to 4000 queries / second latency guarantees: 2 seconds today: only pre-canned queries allowed Update Workload avg. 600 updates per second (1 update per GB per sec) peak of updates per second data freshness guarantee: 2 seconds Problems with State-of-the Art Simple queries work only because of mat. views multi-month project to implement new query / process Complex queries do not work at all
48 Crescando: the Amadeus use case Remove load interaction Remove unpredictability Simplify design for scalability and modeling Treat a multicore machine as a collection of individual nodes (not as a parallel machine) Run only on main memory One thread per core Highly tune the code at each core
49 Scan on a core QUERIES UPDATES BUILD QUERY INDEX FOR NEXT SCAN READ CURSOR WRITE CURSOR DATA IN CIRCULAR BUFFER (WIDE TABLE)
50 Crescando on 1 Machine (N Cores) Scan Thread Scan Thread Input Queue (Operations) Split Scan Thread Scan Thread Merge Output Queue (Result Tuples)... Input Queue (Operations) Scan Thread Output Queue (Result Tuples)
51 Crescando in a Data Center (N Machines) External Clients... Crescando Aggregation Layers Replication Groups...
52 Why is this interesting? Fully predictable performance Response time determined by design regardless of load Only two parameters: Size of the scan Number of queries per scan Scalable to arbitrary numbers of nodes Philipp Unterbrunner, Georgios Giannikis, Gustavo Alonso, Dietmar Fauser, Donald Kossmann: Predictable Performance for Unpredictable Workloads. PVLDB 2(1): (2009)
53 Even more interesting On such a system (or a key value store, No- SQL database, etc.): Why using a general purpose CPU? Many other options available Smaller CPUs FPGAs ASIC Zsolt István, Gustavo Alonso, Michaela Blott, Kees A. Vissers: A flexible hash table design for 10GBPS key-value stores on FPGAS. FPL 2013
54 3. Ideas and directions
55 Not everything is parallel P. Roy, J. Teubner, G. Alonso Efficient Frequent Item Counting in Multi-Core Hardware, KDD 2012 Performance in the multicore era 55
56 Not everything is a CPU Louis Woods, Zsolt István, Gustavo Alonso: Hybrid FPGA-accelerated SQL query processing. FPL 2013
57 Hardware might solve your problem Louis Woods, Gustavo Alonso, Jens Teubner: Parallel Computation of Skyline Queries. FCCM 2013
58 Pipeline parallelism Georgios Giannikis, Gustavo Alonso, Donald Kossmann: SharedDB: Killing One Thousand Queries With One Stone. PVLDB 5(6): (2012) SharedDB does not run queries individually (each one in one thread). Instead, it runs operators that process queries in batches thousands of queries at a time
59 Shared DB can run TPC-W!
60 For the non-db people TPC-W has updates!!! Full consistency without conventional transaction manager Transactions are no longer what you read in textbooks Sequential execution Memory CoW (Hyder, TU Munich) Snapshot isolation
61 Raw performance
62 Predictability, robustness
63 Parallelism or distribution?
64 COD : Overview What is the knowledge we have? Who knows what? DBMS Application requirements and characteristics Insert interface here Hardware & architecture + System state and utilization of resources OS CIDR 13 COD: Database/Operating System co-design 64
65 Cod s Interface supports DB storage engine OS Policy Engine Push application-specific System-level facts: #Requests (in System-level a batch) facts Datastore size (#Tuples, and TupleSize) SLA response time DB requirement system-level Needed for: properties cost / utility functions: Application-specific DB-specific Facts & properties Cost functions CIDR 13 COD: Database/Operating System co-design 65
66 COD s key features Declarative interface Resource allocation for imperative requests Resource allocation based on cost functions Proactive interface Inform of system state Request releasing of resources Recommend reallocation of resources CIDR 13 COD: Database/Operating System co-design 66
67 Throughput [req/second] Experimental results Deployment in a noisy system Experiment setup 1200 System Performance -Throughput AMD MagnyCours 4 x 2.2GHz AMD Opteron 6174 processors total Datastore size 53GB Noise: another CPU-intensive task running on core what we get using COD, knowing the system state 48 cores - isolated cores - noisy Increasing load [#requests] CIDR 13 COD: Database/Operating System co-design 67
68 Throughput [req/second] Experimental results Deployment in a noisy system Experiment setup 1200 System Performance -Throughput AMD MagnyCours 4 x 2.2GHz AMD Opteron 6174 processors total Datastore size 53GB Noise: another CPU-intensive task running on core what we get using COD, knowing the system state 48 cores - isolated cores COD noisy cores - noisy Increasing load [#requests] CIDR 13 COD: Database/Operating System co-design 68
69 Latency [sec] Experimental results Adaptability to dynamic system state 9 Adaptability Latency Experiment setup AMD MagnyCours 4 x 2.2GHz AMD Opteron 6174 processors total Datastore size 53GB Noise: other CPU-intensive threads spawned every 4-5min on core SLA Elapsed time [min] CIDR 13 COD: Database/Operating System co-design 69
70 Latency [sec] Experimental results Adaptability to dynamic system state 9 Adaptability Latency Experiment setup AMD MagnyCours 4 x 2.2GHz AMD Opteron 6174 processors total Datastore size 53GB Noise: other CPU-intensive threads spawned every 4-5min on core Naïve datastore engine SLA Elapsed time [min] CIDR 13 COD: Database/Operating System co-design 70
71 Latency [sec] Experimental results Adaptability to dynamic system state 9 Adaptability Latency Experiment setup AMD MagnyCours 4 x 2.2GHz AMD Opteron 6174 processors total Datastore size 53GB Noise: other CPU-intensive threads spawned every 4-5min on core Naïve datastore engine SLA COD Elapsed time [min] CIDR 13 COD: Database/Operating System co-design 71
72 Conclusions
73 The opportunity is now Consensus on major crisis in hardware (from the sw perspective) Hardware not really improving, responsibility passed on to software Business models and IT systems moving towards specialization
Performance in the Multicore Era
Performance in the Multicore Era Gustavo Alonso Systems Group -- ETH Zurich, Switzerland Systems Group Enterprise Computing Center Performance in the multicore era 2 BACKGROUND - SWISSBOX SwissBox: An
More informationRack-scale Data Processing System
Rack-scale Data Processing System Jana Giceva, Darko Makreshanski, Claude Barthels, Alessandro Dovis, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich Rack-scale Data Processing
More informationMULTICORE IN DATA APPLIANCES. Gustavo Alonso Systems Group Dept. of Computer Science ETH Zürich, Switzerland
MULTICORE IN DATA APPLIANCES Gustavo Alonso Systems Group Dept. of Computer Science ETH Zürich, Switzerland SwissBox CREST Workshop March 2012 Systems Group = www.systems.ethz.ch Enterprise Computing Center
More informationDATABASES AND THE CLOUD. Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland
DATABASES AND THE CLOUD Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland AVALOQ Conference Zürich June 2011 Systems Group www.systems.ethz.ch Enterprise Computing Center
More informationData Processing on Emerging Hardware
Data Processing on Emerging Hardware Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland 3 rd International Summer School on Big Data, Munich, Germany, 2017 www.systems.ethz.ch
More informationCrescando: Predictable Performance for Unpredictable Workloads
Crescando: Predictable Performance for Unpredictable Workloads G. Alonso, D. Fauser, G. Giannikis, D. Kossmann, J. Meyer, P. Unterbrunner Amadeus S.A. ETH Zurich, Systems Group (Funded by Enterprise Computing
More informationData Modeling and Databases Ch 10: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 10: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationData Modeling and Databases Ch 9: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 9: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationHash Joins for Multi-core CPUs. Benjamin Wagner
Hash Joins for Multi-core CPUs Benjamin Wagner Joins fundamental operator in query processing variety of different algorithms many papers publishing different results main question: is tuning to modern
More informationJignesh M. Patel. Blog:
Jignesh M. Patel Blog: http://bigfastdata.blogspot.com Go back to the design Query Cache from Processing for Conscious 98s Modern (at Algorithms Hardware least for Hash Joins) 995 24 2 Processor Processor
More informationArchitecture-Conscious Database Systems
Architecture-Conscious Database Systems 2009 VLDB Summer School Shanghai Peter Boncz (CWI) Sources Thank You! l l l l Database Architectures for New Hardware VLDB 2004 tutorial, Anastassia Ailamaki Query
More informationReconfigurable hardware for big data. Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland
Reconfigurable hardware for big data Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland www.systems.ethz.ch Systems Group 7 faculty ~40 PhD ~8 postdocs Researching all
More informationDesigning Hybrid Data Processing Systems for Heterogeneous Servers
Designing Hybrid Data Processing Systems for Heterogeneous Servers Peter Pietzuch Large-Scale Distributed Systems (LSDS) Group Imperial College London http://lsds.doc.ic.ac.uk University
More informationCSCI-GA Multicore Processors: Architecture & Programming Lecture 10: Heterogeneous Multicore
CSCI-GA.3033-012 Multicore Processors: Architecture & Programming Lecture 10: Heterogeneous Multicore Mohamed Zahran (aka Z) mzahran@cs.nyu.edu http://www.mzahran.com Status Quo Previously, CPU vendors
More informationAccelerating Data Science. Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland
Accelerating Data Science Gustavo Alonso Systems Group Department of Computer Science ETH Zurich, Switzerland Data processing today: Appliances (large machines) Data Centers (many machines) Databases are
More informationV Conclusions. V.1 Related work
V Conclusions V.1 Related work Even though MapReduce appears to be constructed specifically for performing group-by aggregations, there are also many interesting research work being done on studying critical
More informationJoin Processing for Flash SSDs: Remembering Past Lessons
Join Processing for Flash SSDs: Remembering Past Lessons Jaeyoung Do, Jignesh M. Patel Department of Computer Sciences University of Wisconsin-Madison $/MB GB Flash Solid State Drives (SSDs) Benefits of
More informationCaribou: Intelligent Distributed Storage
: Intelligent Distributed Storage Zsolt István, David Sidler, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich 1 Rack-scale thinking In the Cloud ToR Switch Compute Compute + Provisioning
More informationResearch Collection. Daedalus a distributed crescando system. Master Thesis. ETH Library. Author(s): Giannikis, Georgios. Publication Date: 2009
Research Collection Master Thesis Daedalus a distributed crescando system Author(s): Giannikis, Georgios Publication Date: 2009 Permanent Link: https://doi.org/10.3929/ethz-a-005816890 Rights / License:
More informationAdvanced Database Systems
Lecture IV Query Processing Kyumars Sheykh Esmaili Basic Steps in Query Processing 2 Query Optimization Many equivalent execution plans Choosing the best one Based on Heuristics, Cost Will be discussed
More informationExadata X3 in action: Measuring Smart Scan efficiency with AWR. Franck Pachot Senior Consultant
Exadata X3 in action: Measuring Smart Scan efficiency with AWR Franck Pachot Senior Consultant 16 March 2013 1 Exadata X3 in action: Measuring Smart Scan efficiency with AWR Exadata comes with new statistics
More informationB.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2
Introduction :- Today single CPU based architecture is not capable enough for the modern database that are required to handle more demanding and complex requirements of the users, for example, high performance,
More informationMain-Memory Databases 1 / 25
1 / 25 Motivation Hardware trends Huge main memory capacity with complex access characteristics (Caches, NUMA) Many-core CPUs SIMD support in CPUs New CPU features (HTM) Also: Graphic cards, FPGAs, low
More informationDB2 is a complex system, with a major impact upon your processing environment. There are substantial performance and instrumentation changes in
DB2 is a complex system, with a major impact upon your processing environment. There are substantial performance and instrumentation changes in versions 8 and 9. that must be used to measure, evaluate,
More informationLecture 26: Multiprocessing continued Computer Architecture and Systems Programming ( )
Systems Group Department of Computer Science ETH Zürich Lecture 26: Multiprocessing continued Computer Architecture and Systems Programming (252-0061-00) Timothy Roscoe Herbstsemester 2012 Today Non-Uniform
More informationAdvanced Databases: Parallel Databases A.Poulovassilis
1 Advanced Databases: Parallel Databases A.Poulovassilis 1 Parallel Database Architectures Parallel database systems use parallel processing techniques to achieve faster DBMS performance and handle larger
More informationToward timely, predictable and cost-effective data analytics. Renata Borovica-Gajić DIAS, EPFL
Toward timely, predictable and cost-effective data analytics Renata Borovica-Gajić DIAS, EPFL Big data proliferation Big data is when the current technology does not enable users to obtain timely, cost-effective,
More informationHYRISE In-Memory Storage Engine
HYRISE In-Memory Storage Engine Martin Grund 1, Jens Krueger 1, Philippe Cudre-Mauroux 3, Samuel Madden 2 Alexander Zeier 1, Hasso Plattner 1 1 Hasso-Plattner-Institute, Germany 2 MIT CSAIL, USA 3 University
More informationNext-Generation Cloud Platform
Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology
More informationCSE 544: Principles of Database Systems
CSE 544: Principles of Database Systems Anatomy of a DBMS, Parallel Databases 1 Announcements Lecture on Thursday, May 2nd: Moved to 9am-10:30am, CSE 403 Paper reviews: Anatomy paper was due yesterday;
More informationGPUfs: Integrating a file system with GPUs
GPUfs: Integrating a file system with GPUs Mark Silberstein (UT Austin/Technion) Bryan Ford (Yale), Idit Keidar (Technion) Emmett Witchel (UT Austin) 1 Traditional System Architecture Applications OS CPU
More informationAdaptive Query Processing on Prefix Trees Wolfgang Lehner
Adaptive Query Processing on Prefix Trees Wolfgang Lehner Fachgruppentreffen, 22.11.2012 TU München Prof. Dr.-Ing. Wolfgang Lehner > Challenges for Database Systems Three things are important in the database
More informationBig and Fast. Anti-Caching in OLTP Systems. Justin DeBrabant
Big and Fast Anti-Caching in OLTP Systems Justin DeBrabant Online Transaction Processing transaction-oriented small footprint write-intensive 2 A bit of history 3 OLTP Through the Years relational model
More informationDesign Principles for End-to-End Multicore Schedulers
c Systems Group Department of Computer Science ETH Zürich HotPar 10 Design Principles for End-to-End Multicore Schedulers Simon Peter Adrian Schüpbach Paul Barham Andrew Baumann Rebecca Isaacs Tim Harris
More informationDatenbanksysteme II: Modern Hardware. Stefan Sprenger November 23, 2016
Datenbanksysteme II: Modern Hardware Stefan Sprenger November 23, 2016 Content of this Lecture Introduction to Modern Hardware CPUs, Cache Hierarchy Branch Prediction SIMD NUMA Cache-Sensitive Skip List
More informationWhy Parallel Architecture
Why Parallel Architecture and Programming? Todd C. Mowry 15-418 January 11, 2011 What is Parallel Programming? Software with multiple threads? Multiple threads for: convenience: concurrent programming
More informationInfrastructure Matters: POWER8 vs. Xeon x86
Advisory Infrastructure Matters: POWER8 vs. Xeon x86 Executive Summary This report compares IBM s new POWER8-based scale-out Power System to Intel E5 v2 x86- based scale-out systems. A follow-on report
More informationToday. SMP architecture. SMP architecture. Lecture 26: Multiprocessing continued Computer Architecture and Systems Programming ( )
Lecture 26: Multiprocessing continued Computer Architecture and Systems Programming (252-0061-00) Timothy Roscoe Herbstsemester 2012 Systems Group Department of Computer Science ETH Zürich SMP architecture
More informationBig Data Systems on Future Hardware. Bingsheng He NUS Computing
Big Data Systems on Future Hardware Bingsheng He NUS Computing http://www.comp.nus.edu.sg/~hebs/ 1 Outline Challenges for Big Data Systems Why Hardware Matters? Open Challenges Summary 2 3 ANYs in Big
More informationCSE544 Database Architecture
CSE544 Database Architecture Tuesday, February 1 st, 2011 Slides courtesy of Magda Balazinska 1 Where We Are What we have already seen Overview of the relational model Motivation and where model came from
More informationA Fast and High Throughput SQL Query System for Big Data
A Fast and High Throughput SQL Query System for Big Data Feng Zhu, Jie Liu, and Lijie Xu Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, China 100190
More informationA Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture
A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture By Gaurav Sheoran 9-Dec-08 Abstract Most of the current enterprise data-warehouses
More informationAccelerating Foreign-Key Joins using Asymmetric Memory Channels
Accelerating Foreign-Key Joins using Asymmetric Memory Channels Holger Pirk Stefan Manegold Martin Kersten holger@cwi.nl manegold@cwi.nl mk@cwi.nl Why? Trivia: Joins are important But: Many Joins are (Indexed)
More informationSomething to think about. Problems. Purpose. Vocabulary. Query Evaluation Techniques for large DB. Part 1. Fact:
Query Evaluation Techniques for large DB Part 1 Fact: While data base management systems are standard tools in business data processing they are slowly being introduced to all the other emerging data base
More informationIn-Memory Data Management
In-Memory Data Management Martin Faust Research Assistant Research Group of Prof. Hasso Plattner Hasso Plattner Institute for Software Engineering University of Potsdam Agenda 2 1. Changed Hardware 2.
More informationRed Fox: An Execution Environment for Relational Query Processing on GPUs
Red Fox: An Execution Environment for Relational Query Processing on GPUs Haicheng Wu 1, Gregory Diamos 2, Tim Sheard 3, Molham Aref 4, Sean Baxter 2, Michael Garland 2, Sudhakar Yalamanchili 1 1. Georgia
More informationParallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload)
Lecture 2: Parallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload) Visual Computing Systems Today Finishing up from last time Brief discussion of graphics workload metrics
More informationHuge market -- essentially all high performance databases work this way
11/5/2017 Lecture 16 -- Parallel & Distributed Databases Parallel/distributed databases: goal provide exactly the same API (SQL) and abstractions (relational tables), but partition data across a bunch
More informationParallel Algorithm Engineering
Parallel Algorithm Engineering Kenneth S. Bøgh PhD Fellow Based on slides by Darius Sidlauskas Outline Background Current multicore architectures UMA vs NUMA The openmp framework and numa control Examples
More informationVOLTDB + HP VERTICA. page
VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics
More informationParallel DBMS. Prof. Yanlei Diao. University of Massachusetts Amherst. Slides Courtesy of R. Ramakrishnan and J. Gehrke
Parallel DBMS Prof. Yanlei Diao University of Massachusetts Amherst Slides Courtesy of R. Ramakrishnan and J. Gehrke I. Parallel Databases 101 Rise of parallel databases: late 80 s Architecture: shared-nothing
More informationInformation Systems (Informationssysteme)
Information Systems (Informationssysteme) Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Summer 2018 c Jens Teubner Information Systems Summer 2018 1 Part IX B-Trees c Jens Teubner Information
More informationWhen MPPDB Meets GPU:
When MPPDB Meets GPU: An Extendible Framework for Acceleration Laura Chen, Le Cai, Yongyan Wang Background: Heterogeneous Computing Hardware Trend stops growing with Moore s Law Fast development of GPU
More informationModern Processor Architectures. L25: Modern Compiler Design
Modern Processor Architectures L25: Modern Compiler Design The 1960s - 1970s Instructions took multiple cycles Only one instruction in flight at once Optimisation meant minimising the number of instructions
More informationMaximum Performance. How to get it and how to avoid pitfalls. Christoph Lameter, PhD
Maximum Performance How to get it and how to avoid pitfalls Christoph Lameter, PhD cl@linux.com Performance Just push a button? Systems are optimized by default for good general performance in all areas.
More informationWhat We Have Already Learned. DBMS Deployment: Local. Where We Are Headed Next. DBMS Deployment: 3 Tiers. DBMS Deployment: Client/Server
What We Have Already Learned CSE 444: Database Internals Lectures 19-20 Parallel DBMSs Overall architecture of a DBMS Internals of query execution: Data storage and indexing Buffer management Query evaluation
More informationAnastasia Ailamaki. Performance and energy analysis using transactional workloads
Performance and energy analysis using transactional workloads Anastasia Ailamaki EPFL and RAW Labs SA students: Danica Porobic, Utku Sirin, and Pinar Tozun Online Transaction Processing $2B+ industry Characteristics:
More informationThe Google File System
The Google File System Sanjay Ghemawat, Howard Gobioff and Shun Tak Leung Google* Shivesh Kumar Sharma fl4164@wayne.edu Fall 2015 004395771 Overview Google file system is a scalable distributed file system
More informationCMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS
Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB s C. Faloutsos A. Pavlo Lecture#23: Distributed Database Systems (R&G ch. 22) Administrivia Final Exam Who: You What: R&G Chapters 15-22
More informationRevisiting the Past 25 Years: Lessons for the Future. Guri Sohi University of Wisconsin-Madison
Revisiting the Past 25 Years: Lessons for the Future Guri Sohi University of Wisconsin-Madison Outline VLIW OOO Superscalar Enhancing Superscalar And the future 2 Beyond pipelining to ILP Late 1980s to
More informationOverview of Data Exploration Techniques. Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri
Overview of Data Exploration Techniques Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri data exploration not always sure what we are looking for (until we find it) data has always been big volume
More informationChapter 12: Query Processing. Chapter 12: Query Processing
Chapter 12: Query Processing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 12: Query Processing Overview Measures of Query Cost Selection Operation Sorting Join
More informationOutline. Database Management and Tuning. Outline. Join Strategies Running Example. Index Tuning. Johann Gamper. Unit 6 April 12, 2012
Outline Database Management and Tuning Johann Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Unit 6 April 12, 2012 1 Acknowledgements: The slides are provided by Nikolaus Augsten
More informationParallelism. CS6787 Lecture 8 Fall 2017
Parallelism CS6787 Lecture 8 Fall 2017 So far We ve been talking about algorithms We ve been talking about ways to optimize their parameters But we haven t talked about the underlying hardware How does
More informationAdvanced Database Systems
Lecture II Storage Layer Kyumars Sheykh Esmaili Course s Syllabus Core Topics Storage Layer Query Processing and Optimization Transaction Management and Recovery Advanced Topics Cloud Computing and Web
More informationTowards Database / Operating System Co-Design
Towards Database / Operating System Co-Design Jana Giceva, Adrian Schüpbach, Gustavo Alonso, Timothy Roscoe Systems Group, ETH Zurich ABSTRACT Trends in multicore processors pose serious structural challenges
More informationGPU > CPU. FOR HIGH PERFORMANCE COMPUTING PRESENTATION BY - SADIQ PASHA CHETHANA DILIP
GPU > CPU. FOR HIGH PERFORMANCE COMPUTING PRESENTATION BY - SADIQ PASHA CHETHANA DILIP INTRODUCTION or With the exponential increase in computational power of todays hardware, the complexity of the problem
More information<Insert Picture Here> Controlling resources in an Exadata environment
Controlling resources in an Exadata environment Agenda Exadata Security Flash Cache and Log Storage Indexes Parallel Execution Agenda Exadata Security Flash Cache and Log Storage
More informationMQJoin: Efficient Shared Execution of Main-Memory Joins. Author(s): Makreshanski, Darko; Giannikis, Georgios; Alonso, Gustavo; Kossmann, Donald
Research Collection Report MQJoin: Efficient Shared Execution of Main-Memory Joins Author(s): Makreshanski, Darko; Giannikis, Georgios; Alonso, Gustavo; Kossmann, Donald Publication Date: 26 Permanent
More informationInexpensive Coordination in Hardware
Consensus in a Box Inexpensive Coordination in Hardware Zsolt István, David Sidler, Gustavo Alonso, Marko Vukolic * Systems Group, Department of Computer Science, ETH Zurich * Consensus IBM in Research,
More informationIntroduction to Data Management CSE 344
Introduction to Data Management CSE 344 Lecture 25: Parallel Databases CSE 344 - Winter 2013 1 Announcements Webquiz due tonight last WQ! J HW7 due on Wednesday HW8 will be posted soon Will take more hours
More informationUser Perspective. Module III: System Perspective. Module III: Topics Covered. Module III Overview of Storage Structures, QP, and TM
Module III Overview of Storage Structures, QP, and TM Sharma Chakravarthy UT Arlington sharma@cse.uta.edu http://www2.uta.edu/sharma base Management Systems: Sharma Chakravarthy Module I Requirements analysis
More informationIntroduction to Database Systems CSE 414
Introduction to Database Systems CSE 414 Lecture 24: Parallel Databases CSE 414 - Spring 2015 1 Announcements HW7 due Wednesday night, 11 pm Quiz 7 due next Friday(!), 11 pm HW8 will be posted middle of
More informationNVMFS: A New File System Designed Specifically to Take Advantage of Nonvolatile Memory
NVMFS: A New File System Designed Specifically to Take Advantage of Nonvolatile Memory Dhananjoy Das, Sr. Systems Architect SanDisk Corp. 1 Agenda: Applications are KING! Storage landscape (Flash / NVM)
More informationSystems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15
Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2014/15 Lecture X: Parallel Databases Topics Motivation and Goals Architectures Data placement Query processing Load balancing
More informationOutline Marquette University
COEN-4710 Computer Hardware Lecture 1 Computer Abstractions and Technology (Ch.1) Cristinel Ababei Department of Electrical and Computer Engineering Credits: Slides adapted primarily from presentations
More informationArchitecture of a Real-Time Operational DBMS
Architecture of a Real-Time Operational DBMS Srini V. Srinivasan Founder, Chief Development Officer Aerospike CMG India Keynote Thane December 3, 2016 [ CMGI Keynote, Thane, India. 2016 Aerospike Inc.
More informationLecture on Multicores. Darius Sidlauskas Post-doc. Darius Sidlauskas, 25/ /21
Lecture on Multicores Darius Sidlauskas Post-doc 1/21 Outline Part 1 Background Current multicore CPUs Part 2 To share or not to share Part 3 Demo War story 2/21 Outline Part 1 Background Current multicore
More informationConcurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration
Concurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration Sina Meraji sinamera@ca.ibm.com Berni Schiefer schiefer@ca.ibm.com Tuesday March 17th at 12:00
More informationThe Multikernel A new OS architecture for scalable multicore systems
Systems Group Department of Computer Science ETH Zurich SOSP, 12th October 2009 The Multikernel A new OS architecture for scalable multicore systems Andrew Baumann 1 Paul Barham 2 Pierre-Evariste Dagand
More informationCIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )
Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL
More informationData Processing on Modern Hardware
Data Processing on Modern Hardware Jens Teubner, TU Dortmund, DBIS Group jens.teubner@cs.tu-dortmund.de Summer 2014 c Jens Teubner Data Processing on Modern Hardware Summer 2014 1 Part V Execution on Multiple
More information<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store
Oracle NoSQL Database A Distributed Key-Value Store Charles Lamb The following is intended to outline our general product direction. It is intended for information purposes only,
More informationCISC 7610 Lecture 2b The beginnings of NoSQL
CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone
More informationModern Processor Architectures (A compiler writer s perspective) L25: Modern Compiler Design
Modern Processor Architectures (A compiler writer s perspective) L25: Modern Compiler Design The 1960s - 1970s Instructions took multiple cycles Only one instruction in flight at once Optimisation meant
More informationSAP HANA. Jake Klein/ SVP SAP HANA June, 2013
SAP HANA Jake Klein/ SVP SAP HANA June, 2013 SAP 3 YEARS AGO Middleware BI / Analytics Core ERP + Suite 2013 WHERE ARE WE NOW? Cloud Mobile Applications SAP HANA Analytics D&T Changed Reality Disruptive
More informationRed Fox: An Execution Environment for Relational Query Processing on GPUs
Red Fox: An Execution Environment for Relational Query Processing on GPUs Georgia Institute of Technology: Haicheng Wu, Ifrah Saeed, Sudhakar Yalamanchili LogicBlox Inc.: Daniel Zinn, Martin Bravenboer,
More informationInstant Recovery for Main-Memory Databases
Instant Recovery for Main-Memory Databases Ismail Oukid*, Wolfgang Lehner*, Thomas Kissinger*, Peter Bumbulis, and Thomas Willhalm + *TU Dresden SAP SE + Intel GmbH CIDR 2015, Asilomar, California, USA,
More informationOracle Exadata X7. Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer
Oracle Exadata X7 Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer 05.12.2017 Oracle Engineered Systems ZFS Backup Appliance Zero Data Loss Recovery Appliance Exadata Database
More informationIntroduction to parallel computers and parallel programming. Introduction to parallel computersand parallel programming p. 1
Introduction to parallel computers and parallel programming Introduction to parallel computersand parallel programming p. 1 Content A quick overview of morden parallel hardware Parallelism within a chip
More informationExadata Implementation Strategy
Exadata Implementation Strategy BY UMAIR MANSOOB 1 Who Am I Work as Senior Principle Engineer for an Oracle Partner Oracle Certified Administrator from Oracle 7 12c Exadata Certified Implementation Specialist
More informationPatterns of Parallel Programming with.net 4. Ade Miller Microsoft patterns & practices
Patterns of Parallel Programming with.net 4 Ade Miller (adem@microsoft.com) Microsoft patterns & practices Introduction Why you should care? Where to start? Patterns walkthrough Conclusions (and a quiz)
More informationOracle Exadata: Strategy and Roadmap
Oracle Exadata: Strategy and Roadmap - New Technologies, Cloud, and On-Premises Juan Loaiza Senior Vice President, Database Systems Technologies, Oracle Safe Harbor Statement The following is intended
More informationGPU ACCELERATED DATABASE MANAGEMENT SYSTEMS
CIS 601 - Graduate Seminar Presentation 1 GPU ACCELERATED DATABASE MANAGEMENT SYSTEMS PRESENTED BY HARINATH AMASA CSU ID: 2697292 What we will talk about.. Current problems GPU What are GPU Databases GPU
More informationBridging the Processor/Memory Performance Gap in Database Applications
Bridging the Processor/Memory Performance Gap in Database Applications Anastassia Ailamaki Carnegie Mellon http://www.cs.cmu.edu/~natassa Memory Hierarchies PROCESSOR EXECUTION PIPELINE L1 I-CACHE L1 D-CACHE
More informationdata parallelism Chris Olston Yahoo! Research
data parallelism Chris Olston Yahoo! Research set-oriented computation data management operations tend to be set-oriented, e.g.: apply f() to each member of a set compute intersection of two sets easy
More informationReducing Network Contention with Mixed Workloads on Modern Multicore Clusters
Reducing Network Contention with Mixed Workloads on Modern Multicore Clusters Matthew Koop 1 Miao Luo D. K. Panda matthew.koop@nasa.gov {luom, panda}@cse.ohio-state.edu 1 NASA Center for Computational
More informationHammer Slide: Work- and CPU-efficient Streaming Window Aggregation
Large-Scale Data & Systems Group Hammer Slide: Work- and CPU-efficient Streaming Window Aggregation Georgios Theodorakis, Alexandros Koliousis, Peter Pietzuch, Holger Pirk Large-Scale Data & Systems (LSDS)
More informationEvolving To The Big Data Warehouse
Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from
More informationThe Case for Heterogeneous HTAP
The Case for Heterogeneous HTAP Raja Appuswamy, Manos Karpathiotakis, Danica Porobic, and Anastasia Ailamaki Data-Intensive Applications and Systems Lab EPFL 1 HTAP the contract with the hardware Hybrid
More information