Models of Computation for Massive Data
|
|
- Asher Stafford
- 6 years ago
- Views:
Transcription
1 Models of Computation for Massive Data Jeff M. Phillips August 28, 2013
2 Outline Sequential: External Memory / (I/O)-Efficient Streaming Parallel: P and BSP MapReduce GP-GPU Distributed Computing runtime data size
3 Model model (Von Neumann Architecture): and Memory Operations (+,,,...) constant time Data stored as words, not bits. Read, Write take constant time.
4 Today s Reality Disk What your computer actually looks like: 164 GB 3+ layers of memory hierarchy. Small number of s. Many variations! bigger 2 GB 4 MB L2 faster L1 L1
5 Model model (Von Neumann Architecture): and Memory Operations (+,,,...) constant time Data stored as words, not bits. Read, Write take constant time.
6 External Memory Model D M block I/O P N = size of problem instance B = size of disk block M = number of items that fits in Memory T = number of items in output I/O = block move between Memory and Disk
7 External Memory Model D M block I/O P N = size of problem instance B = size of disk block M = number of items that fits in Memory T = number of items in output I/O = block move between Memory and Disk Advanced Data Structures: Sorting, Searching
8 Streaming Model memory length m word 2 [n] makes one pass on data Ordered set A = a 1, a 2,..., a m Each a i [n], size log n Compute f (A) or maintain f (A i ) for A i = a 1, a 2,..., a i. Space restricted to S = O(poly(log m, log n)). Updates O(poly(S)) for each a i.
9 Streaming Model memory length m word 2 [n] makes one pass on data Ordered set A = a 1, a 2,..., a m Each a i [n], size log n Compute f (A) or maintain f (A i ) for A i = a 1, a 2,..., a i. Space restricted to S = O(poly(log m, log n)). Updates O(poly(S)) for each a i. Advanced Algorithms: Approximate, Randomized
10 P Many (p) processors. Access shared memory: EREW : Exclusive Read Exclusive Write CREW : Concurrent Read Exclusive Write CRCW : Concurrent Read Concurrent Write Simple model, but has shortcomings......such as Synchronization. 1 2 p
11 P Many (p) processors. Access shared memory: EREW : Exclusive Read Exclusive Write CREW : Concurrent Read Exclusive Write CRCW : Concurrent Read Concurrent Write Simple model, but has shortcomings......such as Synchronization. 1 2 p Advanced Algorithms
12 Bulk Synchronous Parallel Each Processor has its own Memory Parallelism Procedes in Rounds: 1. Compute: Each processor computes on its own Data: w i. 2. Synchronize: Each processor sends messages to others: s i = MessSize CommCost. 3. Barrier: All processors wait until others done. Runtime: max w i + max s i Pro: Captures Parallelism and Synchronization Con: Ignores Locality.
13 Bulk Synchronous Parallel Each Processor has its own Memory Parallelism Procedes in Rounds: 1. Compute: Each processor computes on its own Data: w i. 2. Synchronize: Each processor sends messages to others: s i = MessSize CommCost. 3. Barrier: All processors wait until others done. Runtime: max w i + max s i Pro: Captures Parallelism and Synchronization Con: Ignores Locality.
14 Bulk Synchronous Parallel Each Processor has its own Memory Parallelism Procedes in Rounds: 1. Compute: Each processor computes on its own Data: w i. 2. Synchronize: Each processor sends messages to others: s i = MessSize CommCost. 3. Barrier: All processors wait until others done. Runtime: max w i + max s i Pro: Captures Parallelism and Synchronization Con: Ignores Locality.
15 MapReduce Each Processor has full hard drive, data items < key, value >. Parallelism Procedes in Rounds: Map: assigns items to processor by key. Reduce: processes all items using value. Usually combines many items with same key. Repeat M+R a constant number of times, often only one round. Optional post-processing step. MAP REDUCE Pro: Robust (duplication) and simple. Can harness Locality Con: Somewhat restrictive model
16 MapReduce Each Processor has full hard drive, data items < key, value >. Parallelism Procedes in Rounds: Map: assigns items to processor by key. Reduce: processes all items using value. Usually combines many items with same key. Repeat M+R a constant number of times, often only one round. Optional post-processing step. MAP REDUCE Pro: Robust (duplication) and simple. Can harness Locality Con: Somewhat restrictive model Advanced Algorithms
17 General Purpose GPU Massive parallelism on your desktop. Uses Graphics Processing Unit. Designed for efficient video rasterizing. Each processor corresponds to pixel p depth buffer: D(p) = min i x w i w i color buffer: C(p) = i α iχ i p... Pro: Fine grain, massive parallelism. Cheap. Harnesses Locality. Con: Somewhat restrictive model, hierarchy. Small memory. X
18 Distributed Computing Many small slow processors with data. Communication very expensive. Report to base station Merge tree base station Unorganized (peer-to-peer) Data collection or Distribution
19 Distributed Computing Many small slow processors with data. Communication very expensive. Report to base station Merge tree Unorganized (peer-to-peer) base station Data collection or Distribution
20 Distributed Computing Many small slow processors with data. Communication very expensive. Report to base station Merge tree Unorganized (peer-to-peer) Data collection or Distribution
21 Distributed Computing Many small slow processors with data. Communication very expensive. Report to base station Merge tree Unorganized (peer-to-peer) Data collection or Distribution Advanced Algorithms: Approximate, Randomized
22 Themes What are course goals? How to analyze algorithms in each model Taste of how to use each model When to use each model
23 Themes What are course goals? How to analyze algorithms in each model Taste of how to use each model When to use each model Work Plan: 1-3 weeks each model. Background and Model. Example algorithms analysis in each model. I/O Stream Parallel MapReduce GPU Distributed
24 Class Work 1 Credit Students: Attend Class. (some Fridays less important) Ask Questions. If above lacking, may have quizzes. Scribing Notes, Video-taping Lectures, or Giving Lectures. 3 Credit Students: Must also do a project! Project Proposal (Aug 30). Approved or Rejected by Sept 4. Intermediate Report (Oct 23). Presentations (Dec 11 or 13).
25 Sequential Review
26 Sequential Review Turing Machines (Alan Turing 1936) Single Tape: MoveL, MoveR, read, write each constant time content pointer memory, infinite tape (memory)
27 Sequential Review Turing Machines (Alan Turing 1936) Single Tape: MoveL, MoveR, read, write each constant time content pointer memory, infinite tape (memory) Von Neumann Architecture (Von Neumann + Eckert + Mauchly 1945) based on ENIAC + Memory (): read, write, op = constant time
28 Sequential Review Turing Machines (Alan Turing 1936) Single Tape: MoveL, MoveR, read, write each constant time content pointer memory, infinite tape (memory) Von Neumann Architecture (Von Neumann + Eckert + Mauchly 1945) based on ENIAC + Memory (): read, write, op = constant time How fast are the following? Scanning (max):
29 Sequential Review Turing Machines (Alan Turing 1936) Single Tape: MoveL, MoveR, read, write each constant time content pointer memory, infinite tape (memory) Von Neumann Architecture (Von Neumann + Eckert + Mauchly 1945) based on ENIAC + Memory (): read, write, op = constant time How fast are the following? Scanning (max): TM: O(n) VNA: O(n)
30 Sequential Review Turing Machines (Alan Turing 1936) Single Tape: MoveL, MoveR, read, write each constant time content pointer memory, infinite tape (memory) Von Neumann Architecture (Von Neumann + Eckert + Mauchly 1945) based on ENIAC + Memory (): read, write, op = constant time How fast are the following? Scanning (max): TM: O(n) VNA: O(n) Sorting:
31 Sequential Review Turing Machines (Alan Turing 1936) Single Tape: MoveL, MoveR, read, write each constant time content pointer memory, infinite tape (memory) Von Neumann Architecture (Von Neumann + Eckert + Mauchly 1945) based on ENIAC + Memory (): read, write, op = constant time How fast are the following? Scanning (max): TM: O(n) VNA: O(n) Sorting: TM: O(n 2 ) VNA: O(n log n)
32 Sequential Review Turing Machines (Alan Turing 1936) Single Tape: MoveL, MoveR, read, write each constant time content pointer memory, infinite tape (memory) Von Neumann Architecture (Von Neumann + Eckert + Mauchly 1945) based on ENIAC + Memory (): read, write, op = constant time How fast are the following? Scanning (max): TM: O(n) VNA: O(n) Sorting: TM: O(n 2 ) VNA: O(n log n) Searching:
33 Sequential Review Turing Machines (Alan Turing 1936) Single Tape: MoveL, MoveR, read, write each constant time content pointer memory, infinite tape (memory) Von Neumann Architecture (Von Neumann + Eckert + Mauchly 1945) based on ENIAC + Memory (): read, write, op = constant time How fast are the following? Scanning (max): TM: O(n) VNA: O(n) Sorting: TM: O(n 2 ) VNA: O(n log n) Searching: TM: O(n) VNA: O(log n)
34 Asymptotics How large (in seconds) is: Searching (log n) Max (n) Merge-Sort (n log n) Bubble-Sort (n 2 )... or Dynamic Programming n = Search
35 Asymptotics How large (in seconds) is: Searching (log n) Max (n) Merge-Sort (n log n) Bubble-Sort (n 2 )... or Dynamic Programming n = Search Max >15
36 Asymptotics How large (in seconds) is: Searching (log n) Max (n) Merge-Sort (n log n) Bubble-Sort (n 2 )... or Dynamic Programming n = Search Max >15 Merge
37 Asymptotics How large (in seconds) is: Searching (log n) Max (n) Merge-Sort (n log n) Bubble-Sort (n 2 )... or Dynamic Programming n = Search Max >15 Merge Bubble hour 9 days -
38 Asymptotics How large (in seconds) is: Searching (log n) Max (n) Merge-Sort (n log n) Bubble-Sort (n 2 )... or Dynamic Programming n = Search Max >15 Merge Bubble hour 9 days - Complexity Theory: log: poly log(n) = log c n (... need to load data)
39 Asymptotics How large (in seconds) is: Searching (log n) Max (n) Merge-Sort (n log n) Bubble-Sort (n 2 )... or Dynamic Programming n = Search Max >15 Merge Bubble hour 9 days - Complexity Theory: log: poly log(n) = log c n (... need to load data) P : poly(n) = n c (many cool algorithms)
40 Asymptotics How large (in seconds) is: Searching (log n) Max (n) Merge-Sort (n log n) Bubble-Sort (n 2 )... or Dynamic Programming n = Search Max >15 Merge Bubble hour 9 days - Complexity Theory: log: poly log(n) = log c n (... need to load data) P : poly(n) = n c (many cool algorithms) EXP: exp(n) = c n (usually hopeless... but n not bad)
41 Asymptotics How large (in seconds) is: Searching (log n) Max (n) Merge-Sort (n log n) Bubble-Sort (n 2 )... or Dynamic Programming n = Search Max >15 Merge Bubble hour 9 days - Complexity Theory: log: poly log(n) = log c n (... need to load data) P : poly(n) = n c (many cool algorithms) EXP: exp(n) = c n (usually hopeless... but n not bad) NP: verify solution in P, find solution conjectured EXP (If EXP number parallel machines, then in P time)
42 Data Group Data Group Meeting 12:15-1:30pm in LCR (to be confirmed)
Introduction to Parallel Algorithm Analysis
Introduction to Parallel Algorithm Analysis Jeff M. Phillips October 4, 2013 Petri Nets C. A. Petri [1962] introduced analysis model for concurrent systems. Flow chart Described data flow and dependencies.
More informationIntroduction to I/O Efficient Algorithms (External Memory Model)
Introduction to I/O Efficient Algorithms (External Memory Model) Jeff M. Phillips August 30, 2013 Von Neumann Architecture Model: CPU and Memory Read, Write, Operations (+,,,...) constant time polynomially
More informationIntroduction to MapReduce Algorithms and Analysis
Introduction to MapReduce Algorithms and Analysis Jeff M. Phillips October 25, 2013 Trade-Offs Massive parallelism that is very easy to program. Cheaper than HPC style (uses top of the line everything)
More informationChapter 2 Abstract Machine Models. Lectured by: Phạm Trần Vũ Prepared by: Thoại Nam
Chapter 2 Abstract Machine Models Lectured by: Phạm Trần Vũ Prepared by: Thoại Nam Parallel Computer Models (1) A parallel machine model (also known as programming model, type architecture, conceptual
More informationComplexity and Advanced Algorithms Monsoon Parallel Algorithms Lecture 2
Complexity and Advanced Algorithms Monsoon 2011 Parallel Algorithms Lecture 2 Trivia ISRO has a new supercomputer rated at 220 Tflops Can be extended to Pflops. Consumes only 150 KW of power. LINPACK is
More informationChapter 2 Parallel Computer Models & Classification Thoai Nam
Chapter 2 Parallel Computer Models & Classification Thoai Nam Faculty of Computer Science and Engineering HCMC University of Technology Chapter 2: Parallel Computer Models & Classification Abstract Machine
More informationCarnegie Mellon Univ. Dept. of Computer Science Database Applications. Why Sort? Why Sort? select... order by
Carnegie Mellon Univ. Dept. of Computer Science 15-415 - Database Applications Lecture 12: external sorting (R&G ch. 13) Faloutsos 15-415 1 Why Sort? Faloutsos 15-415 2 Why Sort? select... order by e.g.,
More informationPRAM (Parallel Random Access Machine)
PRAM (Parallel Random Access Machine) Lecture Overview Why do we need a model PRAM Some PRAM algorithms Analysis A Parallel Machine Model What is a machine model? Describes a machine Puts a value to the
More informationCOMP Parallel Computing. PRAM (4) PRAM models and complexity
COMP 633 - Parallel Computing Lecture 5 September 4, 2018 PRAM models and complexity Reading for Thursday Memory hierarchy and cache-based systems Topics Comparison of PRAM models relative performance
More informationCS256 Applied Theory of Computation
CS256 Applied Theory of Computation Parallel Computation IV John E Savage Overview PRAM Work-time framework for parallel algorithms Prefix computations Finding roots of trees in a forest Parallel merging
More informationFundamentals of. Parallel Computing. Sanjay Razdan. Alpha Science International Ltd. Oxford, U.K.
Fundamentals of Parallel Computing Sanjay Razdan Alpha Science International Ltd. Oxford, U.K. CONTENTS Preface Acknowledgements vii ix 1. Introduction to Parallel Computing 1.1-1.37 1.1 Parallel Computing
More informationParallel Algorithms for Geometric Graph Problems Grigory Yaroslavtsev
Parallel Algorithms for Geometric Graph Problems Grigory Yaroslavtsev http://grigory.us Appeared in STOC 2014, joint work with Alexandr Andoni, Krzysztof Onak and Aleksandar Nikolov. The Big Data Theory
More informationCS 598: Communication Cost Analysis of Algorithms Lecture 15: Communication-optimal sorting and tree-based algorithms
CS 598: Communication Cost Analysis of Algorithms Lecture 15: Communication-optimal sorting and tree-based algorithms Edgar Solomonik University of Illinois at Urbana-Champaign October 12, 2016 Defining
More informationAlgorithm Performance Factors. Memory Performance of Algorithms. Processor-Memory Performance Gap. Moore s Law. Program Model of Memory I
Memory Performance of Algorithms CSE 32 Data Structures Lecture Algorithm Performance Factors Algorithm choices (asymptotic running time) O(n 2 ) or O(n log n) Data structure choices List or Arrays Language
More informationParallel Algorithms for (PRAM) Computers & Some Parallel Algorithms. Reference : Horowitz, Sahni and Rajasekaran, Computer Algorithms
Parallel Algorithms for (PRAM) Computers & Some Parallel Algorithms Reference : Horowitz, Sahni and Rajasekaran, Computer Algorithms Part 2 1 3 Maximum Selection Problem : Given n numbers, x 1, x 2,, x
More informationCS542. Algorithms on Secondary Storage Sorting Chapter 13. Professor E. Rundensteiner. Worcester Polytechnic Institute
CS542 Algorithms on Secondary Storage Sorting Chapter 13. Professor E. Rundensteiner Lesson: Using secondary storage effectively Data too large to live in memory Regular algorithms on small scale only
More informationExternal Sorting. Chapter 13. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1
External Sorting Chapter 13 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Why Sort? A classic problem in computer science! Data requested in sorted order e.g., find students in increasing
More informationIE 495 Lecture 3. Septermber 5, 2000
IE 495 Lecture 3 Septermber 5, 2000 Reading for this lecture Primary Miller and Boxer, Chapter 1 Aho, Hopcroft, and Ullman, Chapter 1 Secondary Parberry, Chapters 3 and 4 Cosnard and Trystram, Chapter
More informationDatabase Applications (15-415)
Database Applications (15-415) DBMS Internals- Part V Lecture 15, March 15, 2015 Mohammad Hammoud Today Last Session: DBMS Internals- Part IV Tree-based (i.e., B+ Tree) and Hash-based (i.e., Extendible
More informationCS 310: Memory Hierarchy and B-Trees
CS 310: Memory Hierarchy and B-Trees Chris Kauffman Week 14-1 Matrix Sum Given an M by N matrix X, sum its elements M rows, N columns Sum R given X, M, N sum = 0 for i=0 to M-1{ for j=0 to N-1 { sum +=
More informationLast Class Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications
Last Class Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB Applications C. Faloutsos A. Pavlo Lecture#12: External Sorting (R&G, Ch13) Static Hashing Extendible Hashing Linear Hashing Hashing
More informationLecture Summary CSC 263H. August 5, 2016
Lecture Summary CSC 263H August 5, 2016 This document is a very brief overview of what we did in each lecture, it is by no means a replacement for attending lecture or doing the readings. 1. Week 1 2.
More informationIndexing. Jan Chomicki University at Buffalo. Jan Chomicki () Indexing 1 / 25
Indexing Jan Chomicki University at Buffalo Jan Chomicki () Indexing 1 / 25 Storage hierarchy Cache Main memory Disk Tape Very fast Fast Slower Slow (nanosec) (10 nanosec) (millisec) (sec) Very small Small
More informationChapter 2. OS Overview
Operating System Chapter 2. OS Overview Lynn Choi School of Electrical Engineering Class Information Lecturer Prof. Lynn Choi, School of Electrical Eng. Phone: 3290-3249, Kong-Hak-Kwan 411, lchoi@korea.ac.kr,
More informationParallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload)
Lecture 2: Parallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload) Visual Computing Systems Today Finishing up from last time Brief discussion of graphics workload metrics
More informationDPHPC: Performance Recitation session
SALVATORE DI GIROLAMO DPHPC: Performance Recitation session spcl.inf.ethz.ch Administrativia Reminder: Project presentations next Monday 9min/team (7min talk + 2min questions) Presentations
More informationParallel Models RAM. Parallel RAM aka PRAM. Variants of CRCW PRAM. Advanced Algorithms
Parallel Models Advanced Algorithms Piyush Kumar (Lecture 10: Parallel Algorithms) An abstract description of a real world parallel machine. Attempts to capture essential features (and suppress details?)
More informationData Modeling and Databases Ch 10: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 10: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationData Modeling and Databases Ch 9: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 9: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationCOMP 308 Parallel Efficient Algorithms. Course Description and Objectives: Teaching method. Recommended Course Textbooks. What is Parallel Computing?
COMP 308 Parallel Efficient Algorithms Course Description and Objectives: Lecturer: Dr. Igor Potapov Chadwick Building, room 2.09 E-mail: igor@csc.liv.ac.uk COMP 308 web-page: http://www.csc.liv.ac.uk/~igor/comp308
More informationParallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload)
Lecture 2: Parallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload) Visual Computing Systems Analyzing a 3D Graphics Workload Where is most of the work done? Memory Vertex
More information1. (a) O(log n) algorithm for finding the logical AND of n bits with n processors
1. (a) O(log n) algorithm for finding the logical AND of n bits with n processors on an EREW PRAM: See solution for the next problem. Omit the step where each processor sequentially computes the AND of
More informationDatabase Applications (15-415)
Database Applications (15-415) DBMS Internals- Part V Lecture 13, March 10, 2014 Mohammad Hammoud Today Welcome Back from Spring Break! Today Last Session: DBMS Internals- Part IV Tree-based (i.e., B+
More informationCS 318 Principles of Operating Systems
CS 318 Principles of Operating Systems Fall 2018 Lecture 16: Advanced File Systems Ryan Huang Slides adapted from Andrea Arpaci-Dusseau s lecture 11/6/18 CS 318 Lecture 16 Advanced File Systems 2 11/6/18
More informationWhat is Parallel Computing?
What is Parallel Computing? Parallel Computing is several processing elements working simultaneously to solve a problem faster. 1/33 What is Parallel Computing? Parallel Computing is several processing
More informationHuge market -- essentially all high performance databases work this way
11/5/2017 Lecture 16 -- Parallel & Distributed Databases Parallel/distributed databases: goal provide exactly the same API (SQL) and abstractions (relational tables), but partition data across a bunch
More informationBuilding an Inverted Index
Building an Inverted Index Algorithms Memory-based Disk-based (Sort-Inversion) Sorting Merging (2-way; multi-way) 2 Memory-based Inverted Index Phase I (parse and read) For each document Identify distinct
More informationL9: Hierarchical Clustering
L9: Hierarchical Clustering This marks the beginning of the clustering section. The basic idea is to take a set X of items and somehow partition X into subsets, so each subset has similar items. Obviously,
More informationParallelism with Haskell
Parallelism with Haskell T-106.6200 Special course High-Performance Embedded Computing (HPEC) Autumn 2013 (I-II), Lecture 5 Vesa Hirvisalo & Kevin Hammond (Univ. of St. Andrews) 2013-10-08 V. Hirvisalo
More informationChapter 12: Query Processing. Chapter 12: Query Processing
Chapter 12: Query Processing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 12: Query Processing Overview Measures of Query Cost Selection Operation Sorting Join
More informationMulti-core Computing Lecture 2
Multi-core Computing Lecture 2 MADALGO Summer School 2012 Algorithms for Modern Parallel and Distributed Models Phillip B. Gibbons Intel Labs Pittsburgh August 21, 2012 Multi-core Computing Lectures: Progress-to-date
More information: Parallel Algorithms Exercises, Batch 1. Exercise Day, Tuesday 18.11, 10:00. Hand-in before or at Exercise Day
184.727: Parallel Algorithms Exercises, Batch 1. Exercise Day, Tuesday 18.11, 10:00. Hand-in before or at Exercise Day Jesper Larsson Träff, Francesco Versaci Parallel Computing Group TU Wien October 16,
More informationCSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 10 Parallel Programming Models: Map Reduce and Spark
CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 10 Parallel Programming Models: Map Reduce and Spark Announcements HW2 due this Thursday AWS accounts Any success? Feel
More informationCOURSE OVERVIEW. Introduction to Computer Engineering 2015 Spring by Euiseong Seo
COURSE OVERVIEW Introduction to Computer Engineering 2015 Spring by Euiseong Seo Course Objectives Introduction to computer engineering For computer engineer-wannabe For students studying other fields
More informationConcurrent/Parallel Processing
Concurrent/Parallel Processing David May: April 9, 2014 Introduction The idea of using a collection of interconnected processing devices is not new. Before the emergence of the modern stored program computer,
More informationHigh Performance Comparison-Based Sorting Algorithm on Many-Core GPUs
High Performance Comparison-Based Sorting Algorithm on Many-Core GPUs Xiaochun Ye, Dongrui Fan, Wei Lin, Nan Yuan, and Paolo Ienne Key Laboratory of Computer System and Architecture ICT, CAS, China Outline
More informationOutline. Parallel Database Systems. Information explosion. Parallelism in DBMSs. Relational DBMS parallelism. Relational DBMSs.
Parallel Database Systems STAVROS HARIZOPOULOS stavros@cs.cmu.edu Outline Background Hardware architectures and performance metrics Parallel database techniques Gamma Bonus: NCR / Teradata Conclusions
More informationOnline Course Evaluation. What we will do in the last week?
Online Course Evaluation Please fill in the online form The link will expire on April 30 (next Monday) So far 10 students have filled in the online form Thank you if you completed it. 1 What we will do
More informationTuring Machines, continued
Previously: Turing Machines, continued CMPU 240 Language Theory and Computation Fall 2018 Introduce Turing machines Today: Assignment 5 back TM variants, relation to algorithms, history Later Exam 2 due
More informationReview: Memory, Disks, & Files. File Organizations and Indexing. Today: File Storage. Alternative File Organizations. Cost Model for Analysis
File Organizations and Indexing Review: Memory, Disks, & Files Lecture 4 R&G Chapter 8 "If you don't find it in the index, look very carefully through the entire catalogue." -- Sears, Roebuck, and Co.,
More informationCS302 Topic: Algorithm Analysis. Thursday, Sept. 22, 2005
CS302 Topic: Algorithm Analysis Thursday, Sept. 22, 2005 Announcements Lab 3 (Stock Charts with graphical objects) is due this Friday, Sept. 23!! Lab 4 now available (Stock Reports); due Friday, Oct. 7
More informationEXTERNAL SORTING. Sorting
EXTERNAL SORTING 1 Sorting A classic problem in computer science! Data requested in sorted order (sorted output) e.g., find students in increasing grade point average (gpa) order SELECT A, B, C FROM R
More informationDistributed Video Systems Chapter 3 Storage Technologies
Distributed Video Systems Chapter 3 Storage Technologies Jack Yiu-bun Lee Department of Information Engineering The Chinese University of Hong Kong Contents 3.1 Introduction 3.2 Magnetic Disks 3.3 Video
More informationAdministrivia. Tree-Structured Indexes. Review. Today: B-Tree Indexes. A Note of Caution. Introduction
Administrivia Tree-Structured Indexes Lecture R & G Chapter 9 Homeworks re-arranged Midterm Exam Graded Scores on-line Key available on-line If I had eight hours to chop down a tree, I'd spend six sharpening
More informationProcessors. Nils Jansen and Kasper Brink. (based on slides from Jeroen Keiren, Marc Seutter and David N. Jansen)
Processors Nils Jansen and Kasper Brink (based on slides from Jeroen Keiren, Marc Seutter and David N. Jansen) https://ocw.cs.ru.nl/nwi-ipc006 Student Assistants Jordi Riemens Jasper Haasdijk Niek Janssen
More informationPARALLEL PROCESSING UNIT 3. Dr. Ahmed Sallam
PARALLEL PROCESSING 1 UNIT 3 Dr. Ahmed Sallam FUNDAMENTAL GPU ALGORITHMS More Patterns Reduce Scan 2 OUTLINES Efficiency Measure Reduce primitive Reduce model Reduce Implementation and complexity analysis
More informationFast BVH Construction on GPUs
Fast BVH Construction on GPUs Published in EUROGRAGHICS, (2009) C. Lauterbach, M. Garland, S. Sengupta, D. Luebke, D. Manocha University of North Carolina at Chapel Hill NVIDIA University of California
More informationEE/CSCI 451 Spring 2018 Homework 8 Total Points: [10 points] Explain the following terms: EREW PRAM CRCW PRAM. Brent s Theorem.
EE/CSCI 451 Spring 2018 Homework 8 Total Points: 100 1 [10 points] Explain the following terms: EREW PRAM CRCW PRAM Brent s Theorem BSP model 1 2 [15 points] Assume two sorted sequences of size n can be
More informationCOSEQUENTIAL PROCESSING (SORTING LARGE FILES)
10 COSEQUENTIAL PROCESSING (SORTING LARGE FILES) Copyright 2004, Binnur Kurt Content Cosequential Processing and Multiway Merge Sorting Large Files (External Sorting) 255 Cosequential Processing & Multiway
More informationInformation Retrieval
Introduction to Information Retrieval Lecture 4: Index Construction 1 Plan Last lecture: Dictionary data structures Tolerant retrieval Wildcards Spell correction Soundex a-hu hy-m n-z $m mace madden mo
More informationCompSci 516: Database Systems
CompSci 516 Database Systems Lecture 9 Index Selection and External Sorting Instructor: Sudeepa Roy Duke CS, Fall 2017 CompSci 516: Database Systems 1 Announcements Private project threads created on piazza
More informationChapter 12: Query Processing
Chapter 12: Query Processing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Overview Chapter 12: Query Processing Measures of Query Cost Selection Operation Sorting Join
More informationDtb Database Systems. Announcement
Dtb Database Systems ( 資料庫系統 ) December 10, 2008 Lecture #11 1 Announcement Assignment #5 will be out on the course webpage today. 2 1 External Sorting Chapter 13 3 Why learn sorting again? O (n*n): bubble,
More informationSyllabus CSCI 405 Operating Systems Fall 2018
Syllabus CSCI 405 Operating Systems Fall 2018 1.0 General Information Class Time: Monday/Wednesday/Friday 11:00 AM - 11:50 AM Class Location: 317 Thompson Instructor: Dr. Deepti Joshi; Office: 224 Thompson;
More informationReview implementation of Stable Matching Survey of common running times. Turn in completed problem sets. Jan 18, 2019 Sprenkle - CSCI211
Objectives Review implementation of Stable Matching Survey of common running times Turn in completed problem sets Jan 18, 2019 Sprenkle - CSCI211 1 Review: Asymptotic Analysis of Gale-Shapley Alg Not explicitly
More informationQuery Processing. Debapriyo Majumdar Indian Sta4s4cal Ins4tute Kolkata DBMS PGDBA 2016
Query Processing Debapriyo Majumdar Indian Sta4s4cal Ins4tute Kolkata DBMS PGDBA 2016 Slides re-used with some modification from www.db-book.com Reference: Database System Concepts, 6 th Ed. By Silberschatz,
More informationAdministriva. CS 133: Databases. General Themes. Goals for Today. Fall 2018 Lec 11 10/11 Query Evaluation Prof. Beth Trushkowsky
Administriva Lab 2 Final version due next Wednesday CS 133: Databases Fall 2018 Lec 11 10/11 Query Evaluation Prof. Beth Trushkowsky Problem sets PSet 5 due today No PSet out this week optional practice
More informationLecture 8 Parallel Algorithms II
Lecture 8 Parallel Algorithms II Dr. Wilson Rivera ICOM 6025: High Performance Computing Electrical and Computer Engineering Department University of Puerto Rico Original slides from Introduction to Parallel
More informationExternal Sorting. Chapter 13. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1
External Sorting Chapter 13 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Why Sort? v A classic problem in computer science! v Data requested in sorted order e.g., find students in increasing
More informationDatabase Applications (15-415)
Database Applications (15-415) DBMS Internals- Part VI Lecture 17, March 24, 2015 Mohammad Hammoud Today Last Two Sessions: DBMS Internals- Part V External Sorting How to Start a Company in Five (maybe
More informationChapter 1 Computer System Overview
Operating Systems: Internals and Design Principles Chapter 1 Computer System Overview Seventh Edition By William Stallings Course Outline & Marks Distribution Hardware Before mid Memory After mid Linux
More informationFlash memory: Models and Algorithms. Guest Lecturer: Deepak Ajwani
Flash memory: Models and Algorithms Guest Lecturer: Deepak Ajwani Flash Memory Flash Memory Flash Memory Flash Memory Flash Memory Flash Memory Flash Memory Solid State Disks A data-storage device that
More informationThe Stored Program Computer
The Stored Program Computer 1 1945: John von Neumann Wrote a report on the stored program concept, known as the First Draft of a Report on EDVAC also Alan Turing Konrad Zuse Eckert & Mauchly The basic
More informationOut-Of-Core Sort-First Parallel Rendering for Cluster-Based Tiled Displays
Out-Of-Core Sort-First Parallel Rendering for Cluster-Based Tiled Displays Wagner T. Corrêa James T. Klosowski Cláudio T. Silva Princeton/AT&T IBM OHSU/AT&T EG PGV, Germany September 10, 2002 Goals Render
More informationPARALLEL & DISTRIBUTED DATABASES CS561-SPRING 2012 WPI, MOHAMED ELTABAKH
PARALLEL & DISTRIBUTED DATABASES CS561-SPRING 2012 WPI, MOHAMED ELTABAKH 1 INTRODUCTION In centralized database: Data is located in one place (one server) All DBMS functionalities are done by that server
More informationW4231: Analysis of Algorithms
W4231: Analysis of Algorithms 10/5/1999 (revised 10/6/1999) More hashing Binomial heaps Lectures Next Week No Lecture Tues. October 12 and Thurs. October 14. Extra lecture (2 1 2 hours) on Mon. October
More informationCompiling Parallel Algorithms to Memory Systems
Compiling Parallel Algorithms to Memory Systems Stephen A. Edwards Columbia University Presented at Jane Street, April 16, 2012 (λx.?)f = FPGA Parallelism is the Big Question Massive On-Chip Parallelism
More information2. Computer Evolution and Performance
2. Computer Evolution and Performance Spring 2016 Spring 2016 CS430 - Computer Architecture 1 Chapter 2: Computer Evolution and Performance Reading: pp. 16-49 Good Problems to Work: 2.1, 2.3, 2.4, 2.8,
More informationUmans Complexity Theory Lectures
Introduction Umans Complexity Theory Lectures Lecture 5: Boolean Circuits & NP: - Uniformity and Advice, - NC hierarchy Power from an unexpected source? we know P EXP, which implies no polytime algorithm
More informationThe Limits of Sorting Divide-and-Conquer Comparison Sorts II
The Limits of Sorting Divide-and-Conquer Comparison Sorts II CS 311 Data Structures and Algorithms Lecture Slides Monday, October 12, 2009 Glenn G. Chappell Department of Computer Science University of
More informationCS 179: GPU Programming LECTURE 5: GPU COMPUTE ARCHITECTURE FOR THE LAST TIME
CS 179: GPU Programming LECTURE 5: GPU COMPUTE ARCHITECTURE FOR THE LAST TIME 1 Last time... GPU Memory System Different kinds of memory pools, caches, etc Different optimization techniques 2 Warp Schedulers
More informationAn Incomplete History of Computation
An Incomplete History of Computation Charles Babbage 1791-1871 Lucasian Professor of Mathematics, Cambridge University, 1827-1839 First computer designer Ada Lovelace 1815-1852 First computer programmer
More informationGeneral introduction: GPUs and the realm of parallel architectures
General introduction: GPUs and the realm of parallel architectures GPU Computing Training August 17-19 th 2015 Jan Lemeire (jan.lemeire@vub.ac.be) Graduated as Engineer in 1994 at VUB Worked for 4 years
More informationComplexity and Advanced Algorithms. Introduction to Parallel Algorithms
Complexity and Advanced Algorithms Introduction to Parallel Algorithms Why Parallel Computing? Save time, resources, memory,... Who is using it? Academia Industry Government Individuals? Two practical
More informationIntroduction to Data Management. Lecture 15 (More About Indexing)
Introduction to Data Management Lecture 15 (More About Indexing) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Announcements v HW s and quizzes:
More informationConcurrency for data-intensive applications
Concurrency for data-intensive applications Dennis Kafura CS5204 Operating Systems 1 Jeff Dean Sanjay Ghemawat Dennis Kafura CS5204 Operating Systems 2 Motivation Application characteristics Large/massive
More informationCS4961 Parallel Programming. Lecture 4: Memory Systems and Interconnects 9/1/11. Administrative. Mary Hall September 1, Homework 2, cont.
CS4961 Parallel Programming Lecture 4: Memory Systems and Interconnects Administrative Nikhil office hours: - Monday, 2-3PM - Lab hours on Tuesday afternoons during programming assignments First homework
More informationAutomatic Scaling Iterative Computations. Aug. 7 th, 2012
Automatic Scaling Iterative Computations Guozhang Wang Cornell University Aug. 7 th, 2012 1 What are Non-Iterative Computations? Non-iterative computation flow Directed Acyclic Examples Batch style analytics
More informationStorage hierarchy. Textbook: chapters 11, 12, and 13
Storage hierarchy Cache Main memory Disk Tape Very fast Fast Slower Slow Very small Small Bigger Very big (KB) (MB) (GB) (TB) Built-in Expensive Cheap Dirt cheap Disks: data is stored on concentric circular
More informationLecture 15: Algorithms. AP Computer Science Principles
Lecture 15: Algorithms AP Computer Science Principles Algorithm algorithm: precise sequence of instructions to solve a computational problem. Search for a name in a phone s contact list. Sort emails by
More informationUser Perspective. Module III: System Perspective. Module III: Topics Covered. Module III Overview of Storage Structures, QP, and TM
Module III Overview of Storage Structures, QP, and TM Sharma Chakravarthy UT Arlington sharma@cse.uta.edu http://www2.uta.edu/sharma base Management Systems: Sharma Chakravarthy Module I Requirements analysis
More informationAlgorithm Performance Factors. Memory Performance of Algorithms. Processor-Memory Performance Gap. Moore s Law. Program Model of Memory II
Memory Performance of Algorithms CSE 32 Data Structures Lecture Algorithm Performance Factors Algorithm choices (asymptotic running time) O(n 2 ) or O(n log n) Data structure choices List or Arrays Language
More informationDatabase Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu
Database Architecture 2 & Storage Instructor: Matei Zaharia cs245.stanford.edu Summary from Last Time System R mostly matched the architecture of a modern RDBMS» SQL» Many storage & access methods» Cost-based
More informationCS 318 Principles of Operating Systems
CS 318 Principles of Operating Systems Fall 2017 Lecture 16: File Systems Examples Ryan Huang File Systems Examples BSD Fast File System (FFS) - What were the problems with the original Unix FS? - How
More informationAdvanced Database Systems
Lecture IV Query Processing Kyumars Sheykh Esmaili Basic Steps in Query Processing 2 Query Optimization Many equivalent execution plans Choosing the best one Based on Heuristics, Cost Will be discussed
More informationMapReduce Spark. Some slides are adapted from those of Jeff Dean and Matei Zaharia
MapReduce Spark Some slides are adapted from those of Jeff Dean and Matei Zaharia What have we learnt so far? Distributed storage systems consistency semantics protocols for fault tolerance Paxos, Raft,
More informationCSE 326: Data Structures B-Trees and B+ Trees
Announcements (2/4/09) CSE 26: Data Structures B-Trees and B+ Trees Midterm on Friday Special office hour: 4:00-5:00 Thursday in Jaech Gallery (6 th floor of CSE building) This is in addition to my usual
More informationEE/CSCI 451: Parallel and Distributed Computation
EE/CSCI 451: Parallel and Distributed Computation Lecture #7 2/5/2017 Xuehai Qian Xuehai.qian@usc.edu http://alchem.usc.edu/portal/xuehaiq.html University of Southern California 1 Outline From last class
More information1 Overview, Models of Computation, Brent s Theorem
CME 323: Distributed Algorithms and Optimization, Spring 2017 http://stanford.edu/~rezab/dao. Instructor: Reza Zadeh, Matroid and Stanford. Lecture 1, 4/3/2017. Scribed by Andreas Santucci. 1 Overview,
More informationTree-Structured Indexes
Tree-Structured Indexes CS 186, Fall 2002, Lecture 17 R & G Chapter 9 If I had eight hours to chop down a tree, I'd spend six sharpening my ax. Abraham Lincoln Introduction Recall: 3 alternatives for data
More information