COMP9318: Data Warehousing and Data Mining. L2: Data Warehousing and OLAP

Size: px
Start display at page:

Download "COMP9318: Data Warehousing and Data Mining. L2: Data Warehousing and OLAP"

Transcription

1 COMP9318: Data Warehousig ad Data Miig L2: Data Warehousig ad OLAP 1

2 Why ad What are Data Warehouses? 2

3 Data Aalysis Problems The same data foud i may differet systems Example: customer data across differet departmets The same cocept is defied differetly Heterogeeous sources Relatioal DBMS, OLie Trasactio Processig (OLTP) Ustructured data i files (e.g., MS Excel) ad documets (e.g., MS Word) 3

4 Data Aalysis Problems (Cot d) Data is suited for operatioal systems Accoutig,billig,etc. Do ot support aalysis across busiess fuctios Data quality is bad Missig data, imprecise data, differet use of systems Data are volatile Data deleted i operatioal systems (6moths) Data chage over time o historical iformatio 4

5 Solutio: Data Warehouse Defied i may differet ways, but ot rigorously. A decisio support database that is maitaied separately from the orgaizatio s operatioal database Support iformatio processig by providig a solid platform of cosolidated, historical data for aalysis. A data warehouse is a subject-orieted, itegrated, time-variat, ad ovolatile collectio of data i support of maagemet s decisio-makig process. W. H. Imo Data warehousig: The process of costructig ad usig data warehouses 5

6 Data Warehouse Subject-Orieted Orgaized aroud major subjects, such as customer, product, sales. Focusig o the modelig ad aalysis of data for decisio makers, ot o daily operatios or trasactio processig. Provide a simple ad cocise view aroud particular subject issues by excludig data that are ot useful i the decisio support process. 6

7 Data Warehouse Itegrated Costructed by itegratig multiple, heterogeeous data sources relatioal databases, flat files, o-lie trasactio records Data cleaig ad data itegratio techiques are applied. Esure cosistecy i amig covetios, ecodig structures, attribute measures, etc. amog differet data sources E.g., Hotel price: currecy, tax, breakfast covered, etc. Whe data is moved to the warehouse, it is coverted. 7

8 Data Warehouse Time Variat The time horizo for the data warehouse is sigificatly loger tha that of operatioal systems. Operatioal database: curret value data. Data warehouse data: provide iformatio from a historical perspective (e.g., past 5-10 years) Every key structure i the data warehouse Cotais a elemet of time, explicitly or implicitly But the key of operatioal data may or may ot cotai time elemet. 8

9 Data Warehouse No-Volatile 1. A physically separate store of data trasformed from the operatioal eviromet. 2. Operatioal update of data does ot occur i the data warehouse eviromet. Does ot require trasactio processig, recovery, ad cocurrecy cotrol mechaisms Requires oly two operatios i data accessig: iitial loadig of data ad access of data. 9

10 Data Warehouse Architecture Extract data from operatioal data sources clea, trasform Bulk load/refresh warehouse is offlie OLAP-server provides multidimesioal view Multidimesioal-olap (Essbase, oracle express) Relatioal-olap (Redbrick, Iformix, Sybase, SQL server) 10

11 Data Warehouse Architecture All subjects, itegrated Advaced aalysis Fuctio-orieted systems Subject-orieted systems 11

12 Why Separate Data Warehouse? High performace for both systems DBMS tued for OLTP: access methods, idexig, cocurrecy cotrol, recovery Warehouse tued for OLAP: complex OLAP queries, multidimesioal view, cosolidatio. Differet fuctios ad differet data: missig data: Decisio support requires historical data which operatioal DBs do ot typically maitai data cosolidatio: DS requires cosolidatio (aggregatio, summarizatio) of data from heterogeeous sources data quality: differet sources typically use icosistet data represetatios, codes ad formats which have to be recociled 12

13 Why OLAP Servers? Differet workload: OLTP (o-lie trasactio processig) Major task of traditioal relatioal DBMS Day-to-day operatios: purchasig, ivetory, bakig, maufacturig, payroll, registratio, accoutig, etc. OLAP (o-lie aalytical processig) Major task of data warehouse system Data aalysis ad decisio makig Queries hard/ifeasible for OLTP, e.g., Which week we have the largest sales? Does the sales of dairy products icrease over time? Geerate a spread sheet of total sales by state ad by year. Difficult to represet these queries by usig SQL ç Why? 13

14 OLTP vs. OLAP OLTP OLAP users clerk, IT professioal kowledge worker fuctio day to day operatios decisio support DB desig applicatio-orieted subject-orieted data curret, up-to-date detailed, flat relatioal isolated usage repetitive ad-hoc access read/write lots of scas idex/hash o prim. key uit of work short, simple trasactio complex query # records accessed tes millios #users thousads hudreds DB size 100MB-GB 100GB-TB historical, summarized, multidimesioal itegrated, cosolidated metric trasactio throughput query throughput, respose 14

15 Comparisos Databases Data Warehouses Purpose May purposes; Flexible ad geeral Oe purpose: Data aalysis Coceptual Model ER Multidimesioal Logical Model (Normalized) Relatioal Model (Deormalized) Star schema / Data cube/cuboids Physical Model Relatioal Tables ROLAP: Relatioal tables MOLAP: Multidimesioal arrays Query Laguage Query Processig SQL (hard for aalytical queries) B+-tree/hash idexes, Multiple joi optimizatio, Materialized MDX (easier for aalytical queries) Bitmap/Joi idexes, Star joi, Materialized data cube 15

16 The Multidimesioal Model 16

17 The Multidimesioal Model A data warehouse is based o a multidimesioal data model which views data i the form of a data cube, which is a multidimesioal geeralizatio of 2D spread sheet. Key cocepts: Facts: the subject it models Typically trasactios i this course; other types icludes sapshots, etc. Measures: umbers that ca be aggregated Dimesios: cotext of the measure Hierarchies: Provide cotexts of differet graularities (aka. grais) Goals for dimesioal modelig: Surroud facts with as much relevat cotext (dimesios) as possible ç Why? 17

18 Supermarket Example Subject: aalyze total sales ad profits Fact: Each Sales Trasactio Measure: Dollars_Sold, Amout_Sold, Cost Calculated Measure: Profit Dimesios: Store Product Time 18

19 Visualizig the Cubes A valid istace of the model is a data cube total Sales city product p1 p2 p3 p4 NY $ $925 LA $468 $ SD $296 - $240 - SF $652 - $540 $745 city product Cocepts: cell, fact (=o-empty cell), measure, dimesios Q: How to geeralize it to 3D? 19

20 3D Cube ad Hierarchies Cocepts: hierarchy (a tree of dimesio values), level Sales of book176 i NY i Ja ca be foud i this cell city DIMENSIONS PRODUCT LOCATION TIME NY Book176 ALL ALL ALL category regio year product product coutry quarter state moth week Ja Feb Mar moth city store day 20

21 Hierarchies Cocepts: hierarchy (a tree of dimesio values), level Which desig is better? Why? ALL Food Beverage Book Book176 ALL category product ALL category subcategory brad product 21

22 The (city, moth) Cuboid Sales of ALL_PROD i NY i Ja city NY Book176 DIMENSIONS PRODUCT LOCATION TIME ALL ALL ALL category regio year product product coutry quarter state moth week Ja Feb Mar moth city store day 22

23 All the Cuboids Assume: o other o-all levels o all dimesios. TV PC VCR Product Date 1Qtr 2Qtr 3Qtr 4Qtr U.S.A Caada Mexico Coutry 23

24 All the Cuboids /2 Assume: o other o-all levels o all dimesios. Product all TV PC VCR Date 1Qtr 2Qtr 3Qtr 4Qtr all Total aual sales of TV i U.S.A. U.S.A Caada Mexico Coutry all 24

25 Lattice of the cuboids Base cuboid product, quarter, coutry 3-dim cuboid product,quarter quarter, coutry product, store 2-dim cuboid quarter product coutry 1-dim cuboid 0-dim cuboid -dim cube ca be represeted as (D 1, D 2,, D d ), where D i is the set of allowed values o the i-th dimesio. if D i = L i (a particular level), the Di = all descedat dimesio values of L i. ALL ca be omitted ad hece reduces the effective dimesioality dy ( i + 1) A complete cube of d-dimesios cosists of i=1 cuboids, where i is the umber of levels (excludig ALL) o i-th dimesio. They collectively form a lattice. 25

26 Properties of Operatios All operatios are closed uder the multidimesioal model i.e., both iput ad output of a operatio is a cube So that they ca be composed Q: What s the aalogy i the Relatioal Model? 26

27 Commo OLAP Operatios Roll-up: move up the hierarchy Sales of book176 i NY i Q1 here city product NY Book176 Q: what should be its value? DIMENSIONS PRODUCT LOCATION TIME ALL ALL ALL category regio year product coutry quarter state moth week Q1 Q2 moth Q3 COMP9318: Data Warehousig ad Data Miig city store day 27

28 city Book176 productny quarter DIMENSIONS PRODUCT LOCATION TIME ALL ALL ALL product city NY Book176 category regio year product coutry quarter state moth week city day Ja Feb Mar moth store 28

29 Commo OLAP Operatios Drill-dow: move dow the hierarchy more fie-graied aggregatio 29

30 city Slice ad Dice Queries Slice ad Dice: select ad project o oe or more dimesio values product moth Product = Book176 The output cube has smaller dimesioality tha the iput cube 30

31 Pivotig city NY Book176 Pivotig: aggregate o selected dimesios usually 2 dims (crosstabulatio) product Sales (of all products) i NY i Q1 =sum(??? ) Q1 Q2 moth Q3 pivot o (city, moth) City NY Q1 Q2 Q3 moth 31

32 A Reflective Pause Let s review the defiitio of data cubes agai. Key message: Disetagle the object from its represetatio or implemetatio 32

33 Modelig Exercise 1: Mothly Phoe Service Billig Theme: aalyze the icome/reveue of Telstra 33

34 Solutio FACT MEASURE DIMENSIONS 34

35 35

36 The Logical Model 36

37 Logical Models Two mai approaches: Usig relatioal DB techology: Star schema, Sowflake schema, Fact costellatio Usig multidimesioal techology: Just as multidimesioal data cube 37

38 Uiversal Schema è Star Schema May data warehouses adopt a star schema to represet the multidimesioal model Each dimesio is represeted by a dimesio-table LOCATION (locatio_key, store, street_address, city, state, coutry, regio) dimesio tables are ot ormalized Trasactios are described through a fact-table each tuple cosists of a poiter to each of the dimesio-tables (foreig-key) ad a list of measures (e.g. sales $$$) The uiversal schema for supermarket Store City State Prod Brad Category $Sold #Sold Cost S136 Syd NSW 76Ha Nestle Biscuit S173 Melb Vic 76Ha Nestle Biscuit

39 The Star Schema TIME time_key day day_of_the_week moth quarter year SALES time_key product_key PRODUCT product_key product_ame category brad color supplier_ame locatio_key LOCATION uits_sold locatio_key store amout street_address city state coutry Thik why: regio (1) Deormalized oce from the uiversal schema (2) Cotrolled redudacy 39

40 Typical Models for Data Warehouses Modelig data warehouses: dimesios & measures Star schema: A fact table i the middle coected to a set of dimesio tables Sowflake schema: A refiemet of star schema where some dimesioal hierarchy is ormalized ito a set of smaller dimesio tables, formig a shape similar to sowflake Fact costellatios: Multiple fact tables share dimesio tables, viewed as a collectio of stars, therefore called galaxy schema or fact costellatio 40

41 Example of Star Schema time time_key day day_of_the_week moth quarter year brach brach_key brach_ame brach_type Measures Sales Fact Table time_key time_key item_key brach_key locatio_key uits_sold dollars_sold avg_sales item item_key item_ame brad type supplier_type locatio locatio_key street city state_or_provice coutry 41

42 Example of Sowflake Schema time time_key day day_of_the_week moth quarter year Sales Fact Table time_key item_key item item_key item_ame brad type supplier_key supplier supplier_key supplier_type brach brach_key brach_ame brach_type Measures brach_key locatio_key uits_sold dollars_sold avg_sales locatio locatio_key street city_key city city_key city state_or_provice coutry 42

43 Example of Fact Costellatio time time_key day day_of_the_week moth quarter year Sales Fact Table time_key item_key item_key brach_key item item_key item_ame brad type supplier_type Shippig Fact Table time_key item_key Shipper_key from_locatio brach brach_key brach_ame brach_type Measures locatio_key uits_sold dollars_sold avg_sales locatio locatio_key street city provice_or_state coutry to_locatio dollars_cost uits_shipped shipper shipper_key shipper_ame locatio_key shipper_type 43

44 Advatages of Star Schema Facts ad dimesios are clearly depicted dimesio tables are relatively static, data is loaded (apped mostly) ito fact table(s) easy to comprehed (ad write queries) Fid total sales per product-category i our stores i Europe SELECT PRODUCT.category, SUM(SALES.amout) FROM SALES, PRODUCT,LOCATION WHERE SALES.product_key = PRODUCT.product_key AND SALES.locatio_key = LOCATION.locatio_key AND LOCATION.regio= Europe GROUP BY PRODUCT.category Operatios: Slice (Loc.Regio.Europe) + Pivot (Prod.category) 44

45 Query Laguage 45

46 Query Laguage Two approaches: Usig relatioal DB techology: SQL (with extesios such as CUBE/PIVOT/UNPIVOT) Usig multidimesioal techology: MDX SELECT PRODUCT.category, SUM(SALES.amout) FROM SALES, PRODUCT,LOCATION WHERE SALES.product_key = PRODUCT.product_key AND SALES.locatio_key = LOCATION.locatio_key AND LOCATION.regio= Europe GROUP BY PRODUCT.category SELECT {[PRODUCT].[category]} o ROWS, {[MEASURES].[amout]} o COLUMNS FROM [SALES] WHERE ([LOCATION].[regio].[Europe]) Operatios: Slice (Loc.Regio.Europe) + Pivot (Prod.category, Measures.amt) 46

47 Physical Model + Query Processig Techiques 47

48 Physical Model + Query Processig Techiques Two mai approaches: Usig relatioal DB techology: ROLAP Usig multidimesioal techology: MOLAP Hybrid: HOLAP Base cuboid: ROLAP Other cuboids: MOLAP 48

49 Q1: Selectio o low-cardiality attributes TIME time_key day day_of_the_week moth quarter year measures { SALES time_key customer_key locatio_key uits_sold amout Igorig the fial GROUP BY for ow Omittig the Product dimesio S regio= Europe JOIN JOIN CUSTOMER customer_key customer_ame regio type LOCATION locatio_key store street_address city state coutry regio 49

50 Idexig OLAP Data: Bitmap Idex (1) BI o dimesio tables Idex o a attribute (colum) with low distict values Each distict values, v, is associated with a -bit vector ( = #rows) Chap 4.2 i [JPT10] The i-th bit is set if the i-th row of the table has the value v for the idexed colum Multiple BIs ca be efficietly combied to eable optimized sca of the table Custom Cust Regio Type C1 Asia Retail C2 Europe Dealer C3 Asia Dealer C4 America Retail C5 Europe Dealer BI o Customer.Regio v bitmap Asia Europe America

51 Idexig OLAP Data: Bitmap Idex /2 (1) Bitmap joi idex (BI o Fact Table Joied with Dimesio tables) Coceptually, perform a joi, map each dimesio value to the bitmap of correspodig fact table rows. -- ORACLE SYNTAX CREATE BITMAP INDEX sales_cust_regio_bjix ON sales(customer.cust_regio) FROM sales, customer WHERE sales.cust_id = customers.cust_id; 51

52 Idexig OLAP Data: Bitmap Idex /3 Sales time customer loc Sale 101 C C C C C C C C Customer Cust Regio Type C1 Asia Retail C2 Europe Dealer C3 Asia Dealer C4 America Retail C5 Europe Dealer BI o Sales(Customer.Regio) v bitmap Asia Europe America

53 Q2: Selectio o high-cardiality attributes TIME time_key day day_of_the_week moth quarter year SALES time_key customer_key JOIN CUSTOMER customer_key customer_ame regio type measures { locatio_key uits_sold amout JOIN S city= Kigsford LOCATION locatio_key store street_address city state coutry regio 53

54 Chap 4.3 i [JPT10] Idexig OLAP Data: Joi Idices Joi idex relates the values of the dimesios of a star schema to rows i the fact table. a joi idex o city maitais for each distict city a list of ROW-IDs of the tuples recordig the sales i the city Locatio city = Sydey city = Radwick city = Kigsford city = Coogee R102 R117 R118 Sales Joi idices ca spa multiple dimesios OR R124 1 ca be implemeted as bitmapidexes (per dimesio) use bit-op for multiple-jois Kigsford R102,R117,R118,R124 54

55 Q3: Arbitrary selectios o Dimesios TIME time_key day day_of_the_week moth quarter year SALES time_key product_key S regio = Asia JOIN Customer customer_key customer_ame regio type S isschoolholidy(day) measures { locatio_key uits_sold amout JOIN S city =~ /\S+ford/ LOCATION locatio_key store street_address city state coutry regio 55

56 Chap 4.4 i [JPT10] Star Query ad Star Joi (Cot.) Time Customer Loc Usually oly part of the dim tables because of the selectio predicates time 1 2 thousads of tuples X Cust X loc time cust loc millios of tuples Sales time cust loc sold Sales!" σ 1 (Time)!" σ 2 (Cust)!" σ 3 (Loc) è Sales!" (σ 1 (Time) X σ 2 (Cust) X σ 3 (Loc)) 56

57 Q4: Coarse-grai Aggregatios Fid total sales per customer type i our stores i Europe Joi-idex will prue ¾ of the data (uiform sales), but the remaiig ¼ is still large (several millios trasactios) Idex is uclustered High-level aggregatios are expesive!!!!! regio LOCATON ÞLog Query Respose Times ÞPre-computatio is ecessary ÞPre-computatio is most beeficial coutry state city store 57

58 Cuboids = GROUP BYs Multidimesioal aggregatio = selectio o correspodig cuboid GB (type, city) ( Sales!" σ 1 (Time)!" σ 2 (Cust)!" σ 3 (Loc)) ) σ 1 selects some Years, σ 2 selects some Brads, σ 3 selects some Cities, GB (type, city) ( σ (Cuboid(Year, Type, City)) ) Materialize some/all of the cuboids A complex decisio ivolvig cuboid sizes, query workload, ad physical orgaizatio 58

59 Two Issues How to store the materialized cuboids? How to compute the cuboids efficietly? 59

60 CUBE BY i ROLAP Store Sales Product ALL ALL Group-bys here: (store,product) (store) (product) () Need to write 4 queries!!! Compute them idepedetly Store Product_key sum(amout) ALL ALL ALL ALL 1937 ALL ALL ALL ALL ALL ALL 5120 SELECT LOCATION.store, SALES.product_key, SUM (amout) FROM SALES, LOCATION WHERE SALES.locatio_key=LOCATION.locatio_key CUBE BY SALES.product_key, LOCATION.store 60

61 Top-dow Approach Model depedecies amog the aggregates: product,quarter quarter Cuboid AB product,store,quarter store,quarter product oe A B M product, store store most detailed view ca be computed from view (product,store,quarter) by summig-up all quarterly sales Cuboid A A B M Cuboid B 1 (all)??? 2 (all)??? A B M (all) 1??? (all) 2??? (all) 3??? (all) 4??? 61

62 Bottom-Up Approach (BUC) BUC (Beyer & Ramakrisha, SIGMOD 99) Ideas Compute the cube from bottom up Divide-ad-coquer A simpler recursive versio: BUC-SR A B

63 Uderstadig Recursio /1 Powerful computig/problem-solvig techiques Examples Factorial: f() = 1, if = 1 f() = f(-1) *, if 1 Quick sort: Sort([x]) = [x] Sort([x1,, pivot, x]) = sort[ys] ++ sort[zs]), where ys = [ x x i xi, x pivot ] zs = [ x x <- xi, x > pivot ] f(0) = 0! =??? List comprehesio i Haskell or pytho 63

64 Uderstadig Recursio /2 Let C(, m) be the umber of ways to select m balls from umbered balls Show that C(, m) = C(-1, m-1) + C(-1, m) etire solutio space Example: m = 3, = 5 Cosider ay ball i the 5 balls, e.g., d?1?2?3 d?1?2?1 (o d)?2 (o d)?3 (o d)?i i { a b c d e } 64

65 Key Poits Sub-problems eed to be smaller, so that a simple/trivial boudary case ca be reached Divide-ad-coquer There may be multiple ways the etire solutio space ca be divided ito disjoit sub-spaces, each of which ca be coquered recursively. 65

66 Geometric Ituitio /1 Reduce Cube(i 2D) to Cube(i 1D) a1 a2 [a1] [a2] [*] b1 b2 b3 M11 M12 M13 [Step 1] M21 M22 M23 [Step 1] [Step 2] [Step 2] [Step 2] [Step 3] b1 b2 b3 M11 M12 M13 [Step 1] M21 M22 M23 [Step 1] [Step 2] [Step 2] [Step 2] [Step 3] 66

67 Geometric Ituitio /2 Reduce Cube(i 3D) to Cube(i 2D) A C B 67

68 Geometric Ituitio /3 A C B Reduce Cube(i 3D) to Cube(i 2D) A B C 68

69 You eed to fix the icomplete output part of the algorithm! BUC-SR (Simple Recursio)* BUC-SR(data, dims) If (dims is empty) Else Output (sum(data)) Boudary case: data is essetially a list of measure values Dims = [dim1, rest_of_dims] For each distict value v of dim1 slice_v = slice of data o dim1 = v BUC-SR(slice_v, rest_of_dims) data = Project(data, rest_of_dims) BUC-SR(data, rest_of_dims) Geeral case: 1)Slice o dim1. Call BUC-SR recursively for each slice 2)Project out dim1, ad call BUC-SR o it recursively 69

70 Iput Iteral Output Output Example [{r1-r4}, B] B M A B M B M * * * * [{r5}, B] B M A B M B M [{r1-r5}, AB] 1 50 * 50 2 * 50 r1 r2 r3 r4 r5 A B M [{r1 -r5 }, B] B M B M * 150 A B M * 1 80 * 2 30 * 3 40 * *

71 Try a 3D-Cube by Yourself [{r1-r5}, ABC] r1 r2 r3 r4 r5 6/3/18 A B C M

72 MOLAP (Sparse) array-based multidimesioal storage egie Pros: small size (esp. for dese cubes) fast i idexig ad query processig Cos: scalability coversio from relatioal data 72

73 Multidimesioal Array Step 2: A ijective map from cell to offset f(time, item) = 4*time + item time item dollars_sold Q1 Q2 Q3 Q4 home etertaimet 605 home etertaimet 680 home etertaimet 812 home etertaimet 927 Q1 computer 825 Q2 computer 952 Q3 computer 1023 Step 1 Mappigs time value Q1 0 Q2 1 Q3 2 Q4 3 time item dollars_s old offset Q4 computer Q1 phoe 14 Q2 phoe 31 Q3 phoe 30 Q4 phoe 38 Q1 security 400 Q2 security 512 Q3 security 501 Q4 security 580 item value home etertai met 0 computer 1 phoe 2 security

74 Step 3 : If sparse Multidimesioal Array Step 3: If dese, oly eed to store sorted slots offset dollars_sold Dese MD array Thik: how to decode a slot? Multidimesioal array is typically sparse Use sparse array (i.e., offset + value) Could use chuk to further reduce the space Space usage: (d+1)**4 vs 2**4 HOLAP: Store all o-base cuboid i MD array Assig a value for ALL

What are Information Systems?

What are Information Systems? Iformatio Systems Cocepts What are Iformatio Systems? Roma Kotchakov Birkbeck, Uiversity of Lodo Based o Chapter 1 of Beett, McRobb ad Farmer: Object Orieted Systems Aalysis ad Desig Usig UML, (4th Editio),

More information

Data Warehousing. Paper

Data Warehousing. Paper Data Warehousig Paper 28-25 Implemetig a fiacial balace scorecard o top of SAP R/3, usig CFO Visio as iterface. Ida Carapelle & Sophie De Baets, SOLID Parters, Brussels, Belgium (EUROPE) ABSTRACT Fiacial

More information

Introduction to Data Warehousing

Introduction to Data Warehousing ICS 321 Spring 2012 Introduction to Data Warehousing Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 4/23/2012 Lipyeow Lim -- University of Hawaii at Manoa

More information

Data Warehousing & OLAP

Data Warehousing & OLAP Data Warehousing & OLAP Data Mining: Concepts and Techniques Chapter 3 Jiawei Han and An Introduction to Database Systems C.J.Date, Eighth Eddition, Addidon Wesley, 4 1 What is Data Warehousing? What is

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 19 Query Optimizatio Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Query optimizatio Coducted by a query optimizer i a DBMS Goal:

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 14 : 18/11/2014 Data Mining: Concepts and Techniques (3 rd ed.) Chapter

More information

Chapter 4, Data Warehouse and OLAP Operations

Chapter 4, Data Warehouse and OLAP Operations CSI 4352, Introduction to Data Mining Chapter 4, Data Warehouse and OLAP Operations Young-Rae Cho Associate Professor Department of Computer Science Baylor University CSI 4352, Introduction to Data Mining

More information

What is a Data Warehouse?

What is a Data Warehouse? What is a Data Warehouse? COMP 465 Data Mining Data Warehousing Slides Adapted From : Jiawei Han, Micheline Kamber & Jian Pei Data Mining: Concepts and Techniques, 3 rd ed. Defined in many different ways,

More information

Task scenarios Outline. Scenarios in Knowledge Extraction. Proposed Framework for Scenario to Design Diagram Transformation

Task scenarios Outline. Scenarios in Knowledge Extraction. Proposed Framework for Scenario to Design Diagram Transformation 6-0-0 Kowledge Trasformatio from Task Scearios to View-based Desig Diagrams Nima Dezhkam Kamra Sartipi {dezhka, sartipi}@mcmaster.ca Departmet of Computig ad Software McMaster Uiversity CANADA SEKE 08

More information

Data Warehousing & On-Line Analytical Processing

Data Warehousing & On-Line Analytical Processing Data Warehousing & On-Line Analytical Processing Erwin M. Bakker & Stefan Manegold https://homepages.cwi.nl/~manegold/dbdm/ http://liacs.leidenuniv.nl/~bakkerem2/dbdm/ s.manegold@liacs.leidenuniv.nl e.m.bakker@liacs.leidenuniv.nl

More information

1 Enterprise Modeler

1 Enterprise Modeler 1 Eterprise Modeler Itroductio I BaaERP, a Busiess Cotrol Model ad a Eterprise Structure Model for multi-site cofiguratios are itroduced. Eterprise Structure Model Busiess Cotrol Models Busiess Fuctio

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 18 Strategies for Query Processig Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio DBMS techiques to process a query Scaer idetifies

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 26 Ehaced Data Models: Itroductio to Active, Temporal, Spatial, Multimedia, ad Deductive Databases Copyright 2016 Ramez Elmasri ad Shamkat B.

More information

A Multi-Dimensional Data Model

A Multi-Dimensional Data Model A Multi-Dimensional Data Model A Data Warehouse is based on a Multidimensional data model which views data in the form of a data cube A data cube, such as sales, allows data to be modeled and viewed in

More information

Interactive PMCube Explorer

Interactive PMCube Explorer Iteractive PMCube Explorer Documetatio ad User Maual Thomas Vogelgesag Carl vo Ossietzky Uiversität Oldeburg December 9, 206 Cotets Itroductio 3 2 Applicatio Overview 4 3 Data Preparatio 6 3. Data Warehouse

More information

ECE4050 Data Structures and Algorithms. Lecture 6: Searching

ECE4050 Data Structures and Algorithms. Lecture 6: Searching ECE4050 Data Structures ad Algorithms Lecture 6: Searchig 1 Search Give: Distict keys k 1, k 2,, k ad collectio L of records of the form (k 1, I 1 ), (k 2, I 2 ),, (k, I ) where I j is the iformatio associated

More information

By Mahesh R. Sanghavi Associate professor, SNJB s KBJ CoE, Chandwad

By Mahesh R. Sanghavi Associate professor, SNJB s KBJ CoE, Chandwad By Mahesh R. Sanghavi Associate professor, SNJB s KBJ CoE, Chandwad All the content of these PPTs were taken from PPTS of renown author via internet. These PPTs are only mean to share the knowledge among

More information

Data Warehousing 2. ICS 421 Spring Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa

Data Warehousing 2. ICS 421 Spring Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa ICS 421 Spring 2010 Data Warehousing 2 Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/30/2010 Lipyeow Lim -- University of Hawaii at Manoa 1 Data Warehousing

More information

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence _9.qxd // : AM Page Chapter 9 Sequeces, Series, ad Probability 9. Sequeces ad Series What you should lear Use sequece otatio to write the terms of sequeces. Use factorial otatio. Use summatio otatio to

More information

Dta Mining and Data Warehousing

Dta Mining and Data Warehousing CSCI6405 Fall 2003 Dta Mining and Data Warehousing Instructor: Qigang Gao, Office: CS219, Tel:494-3356, Email: q.gao@dal.ca Teaching Assistant: Christopher Jordan, Email: cjordan@cs.dal.ca Office Hours:

More information

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis

More information

CS 412 Intro. to Data Mining

CS 412 Intro. to Data Mining CS 412 Intro. to Data Mining Chapter 4. Data Warehousing and On-line Analytical Processing Jiawei Han, Computer Science, Univ. Illinois at Urbana -Champaign, 2017 1 2 3 Chapter 4: Data Warehousing and

More information

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design College of Computer ad Iformatio Scieces Departmet of Computer Sciece CSC 220: Computer Orgaizatio Uit 11 Basic Computer Orgaizatio ad Desig 1 For the rest of the semester, we ll focus o computer architecture:

More information

The Magma Database file formats

The Magma Database file formats The Magma Database file formats Adrew Gaylard, Bret Pikey, ad Mart-Mari Breedt Johaesburg, South Africa 15th May 2006 1 Summary Magma is a ope-source object database created by Chris Muller, of Kasas City,

More information

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396 Data Mining Data warehousing Hamid Beigy Sharif University of Technology Fall 1396 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1396 1 / 31 Table of contents 1 Introduction 2 Data warehousing

More information

CMSC Computer Architecture Lecture 12: Virtual Memory. Prof. Yanjing Li University of Chicago

CMSC Computer Architecture Lecture 12: Virtual Memory. Prof. Yanjing Li University of Chicago CMSC 22200 Computer Architecture Lecture 12: Virtual Memory Prof. Yajig Li Uiversity of Chicago A System with Physical Memory Oly Examples: most Cray machies early PCs Memory early all embedded systems

More information

Jarek Szlichta Acknowledgments: Jiawei Han, Micheline Kamber and Jian Pei, Data Mining - Concepts and Techniques

Jarek Szlichta  Acknowledgments: Jiawei Han, Micheline Kamber and Jian Pei, Data Mining - Concepts and Techniques Jarek Szlichta http://data.science.uoit.ca/ Acknowledgments: Jiawei Han, Micheline Kamber and Jian Pei, Data Mining - Concepts and Techniques Frequent Itemset Mining Methods Apriori Which Patterns Are

More information

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON Roberto Lopez ad Eugeio Oñate Iteratioal Ceter for Numerical Methods i Egieerig (CIMNE) Edificio C1, Gra Capitá s/, 08034 Barceloa, Spai ABSTRACT I this work

More information

Data Warehousing (1)

Data Warehousing (1) ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/2010 Lipyeow Lim -- University of Hawaii at Manoa 1 Motivation

More information

Data Warehousing & On-line Analytical Processing

Data Warehousing & On-line Analytical Processing Data Warehousing & On-line Analytical Processing Erwin M. Bakker & Stefan Manegold https://homepages.cwi.nl/~manegold/dbdm/ http://liacs.leidenuniv.nl/~bakkerem2/dbdm/ Chapter 4: Data Warehousing and On-line

More information

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394 Data Mining Data warehousing Hamid Beigy Sharif University of Technology Fall 1394 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1394 1 / 31 Table of contents 1 Introduction 2 Data warehousing

More information

Reminds on Data Warehousing

Reminds on Data Warehousing BUSINESS INTELLIGENCE Reminds on Data Warehousing (details at the Decision Support Database course) Business Informatics Degree BI Architecture 2 Business Intelligence Lab Star-schema datawarehouse 3 time

More information

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394 Data Mining Data warehousing Hamid Beigy Sharif University of Technology Fall 1394 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1394 1 / 22 Table of contents 1 Introduction 2 Data warehousing

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 07 : 06/11/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter

More information

Syllabus. Syllabus. Motivation Decision Support. Syllabus

Syllabus. Syllabus. Motivation Decision Support. Syllabus Presentation: Sophia Discussion: Tianyu Metadata Requirements and Conclusion 3 4 Decision Support Decision Making: Everyday, Everywhere Decision Support System: a class of computerized information systems

More information

Appendix D. Controller Implementation

Appendix D. Controller Implementation COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Appedix D Cotroller Implemetatio Cotroller Implemetatios Combiatioal logic (sigle-cycle); Fiite state machie (multi-cycle, pipelied);

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 20 Itroductio to Trasactio Processig Cocepts ad Theory Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Trasactio Describes local

More information

CS 111: Program Design I Lecture 15: Modules, Pandas again. Robert H. Sloan & Richard Warner University of Illinois at Chicago March 8, 2018

CS 111: Program Design I Lecture 15: Modules, Pandas again. Robert H. Sloan & Richard Warner University of Illinois at Chicago March 8, 2018 CS 111: Program Desig I Lecture 15: Modules, Padas agai Robert H. Sloa & Richard Warer Uiversity of Illiois at Chicago March 8, 2018 PYTHON STANDARD LIBRARY & BEYOND: MODULES Extedig Pytho Every moder

More information

Extending The Sleuth Kit and its Underlying Model for Pooled Storage File System Forensic Analysis

Extending The Sleuth Kit and its Underlying Model for Pooled Storage File System Forensic Analysis Extedig The Sleuth Kit ad its Uderlyig Model for Pooled File System Foresic Aalysis Frauhofer Istitute for Commuicatio, Iformatio Processig ad Ergoomics Ja-Niclas Hilgert* Marti Lambertz Daiel Plohma ja-iclas.hilgert@fkie.frauhofer.de

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 22 Database Recovery Techiques Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Recovery algorithms Recovery cocepts Write-ahead

More information

CS490D: Introduction to Data Mining Chris Clifton

CS490D: Introduction to Data Mining Chris Clifton CS490D: Introduction to Data Mining Chris Clifton January 16, 2004 Data Warehousing Data Warehousing and OLAP Technology for Data Mining What is a data warehouse? A multi-dimensional data model Data warehouse

More information

CS 111: Program Design I Lecture 15: Objects, Pandas, Modules. Robert H. Sloan & Richard Warner University of Illinois at Chicago October 13, 2016

CS 111: Program Design I Lecture 15: Objects, Pandas, Modules. Robert H. Sloan & Richard Warner University of Illinois at Chicago October 13, 2016 CS 111: Program Desig I Lecture 15: Objects, Padas, Modules Robert H. Sloa & Richard Warer Uiversity of Illiois at Chicago October 13, 2016 OBJECTS AND DOT NOTATION Objects (Implicit i Chapter 2, Variables,

More information

Lecture 1: Introduction and Strassen s Algorithm

Lecture 1: Introduction and Strassen s Algorithm 5-750: Graduate Algorithms Jauary 7, 08 Lecture : Itroductio ad Strasse s Algorithm Lecturer: Gary Miller Scribe: Robert Parker Itroductio Machie models I this class, we will primarily use the Radom Access

More information

n Explore virtualization concepts n Become familiar with cloud concepts

n Explore virtualization concepts n Become familiar with cloud concepts Chapter Objectives Explore virtualizatio cocepts Become familiar with cloud cocepts Chapter #15: Architecture ad Desig 2 Hypervisor Virtualizatio ad cloud services are becomig commo eterprise tools to

More information

Review: The ACID properties

Review: The ACID properties Recovery Review: The ACID properties A tomicity: All actios i the Xactio happe, or oe happe. C osistecy: If each Xactio is cosistet, ad the DB starts cosistet, it eds up cosistet. I solatio: Executio of

More information

Analysis of Documents Clustering Using Sampled Agglomerative Technique

Analysis of Documents Clustering Using Sampled Agglomerative Technique Aalysis of Documets Clusterig Usig Sampled Agglomerative Techique Omar H. Karam, Ahmed M. Hamad, ad Sheri M. Moussa Abstract I this paper a clusterig algorithm for documets is proposed that adapts a samplig-based

More information

Avid Interplay Bundle

Avid Interplay Bundle Avid Iterplay Budle Versio 2.5 Cofigurator ReadMe Overview This documet provides a overview of Iterplay Budle v2.5 ad describes how to ru the Iterplay Budle cofiguratio tool. Iterplay Budle v2.5 refers

More information

Algorithm Design Techniques. Divide and conquer Problem

Algorithm Design Techniques. Divide and conquer Problem Algorithm Desig Techiques Divide ad coquer Problem Divide ad Coquer Algorithms Divide ad Coquer algorithm desig works o the priciple of dividig the give problem ito smaller sub problems which are similar

More information

MapReduce and Hadoop. Debapriyo Majumdar Data Mining Fall 2014 Indian Statistical Institute Kolkata. November 10, 2014

MapReduce and Hadoop. Debapriyo Majumdar Data Mining Fall 2014 Indian Statistical Institute Kolkata. November 10, 2014 MapReduce ad Hadoop Debapriyo Majumdar Data Miig Fall 2014 Idia Statistical Istitute Kolkata November 10, 2014 Let s keep the itro short Moder data miig: process immese amout of data quickly Exploit parallelism

More information

Python Programming: An Introduction to Computer Science

Python Programming: An Introduction to Computer Science Pytho Programmig: A Itroductio to Computer Sciece Chapter 1 Computers ad Programs 1 Objectives To uderstad the respective roles of hardware ad software i a computig system. To lear what computer scietists

More information

Operating System Concepts. Operating System Concepts

Operating System Concepts. Operating System Concepts Chapter 4: Mass-Storage Systems Logical Disk Structure Logical Disk Structure Disk Schedulig Disk Maagemet RAID Structure Disk drives are addressed as large -dimesioal arrays of logical blocks, where the

More information

Chapter 4 Threads. Operating Systems: Internals and Design Principles. Ninth Edition By William Stallings

Chapter 4 Threads. Operating Systems: Internals and Design Principles. Ninth Edition By William Stallings Operatig Systems: Iterals ad Desig Priciples Chapter 4 Threads Nith Editio By William Stalligs Processes ad Threads Resource Owership Process icludes a virtual address space to hold the process image The

More information

GE FUNDAMENTALS OF COMPUTING AND PROGRAMMING UNIT III

GE FUNDAMENTALS OF COMPUTING AND PROGRAMMING UNIT III GE2112 - FUNDAMENTALS OF COMPUTING AND PROGRAMMING UNIT III PROBLEM SOLVING AND OFFICE APPLICATION SOFTWARE Plaig the Computer Program Purpose Algorithm Flow Charts Pseudocode -Applicatio Software Packages-

More information

Data Warehouse. Concepts and Techniques. Chapter 3. SS Chung. April 5, 2013 Data Mining: Concepts and Techniques 1

Data Warehouse. Concepts and Techniques. Chapter 3. SS Chung. April 5, 2013 Data Mining: Concepts and Techniques 1 Data Warehouse Concepts and Techniques Chapter 3 SS Chung April 5, 2013 Data Mining: Concepts and Techniques 1 Chapter 3: Data Warehousing and OLAP Technology: An Overview What is a data warehouse? A multi-dimensional

More information

Elementary Educational Computer

Elementary Educational Computer Chapter 5 Elemetary Educatioal Computer. Geeral structure of the Elemetary Educatioal Computer (EEC) The EEC coforms to the 5 uits structure defied by vo Neuma's model (.) All uits are preseted i a simplified

More information

Computer Graphics Hardware An Overview

Computer Graphics Hardware An Overview Computer Graphics Hardware A Overview Graphics System Moitor Iput devices CPU/Memory GPU Raster Graphics System Raster: A array of picture elemets Based o raster-sca TV techology The scree (ad a picture)

More information

Data Structures Week #9. Sorting

Data Structures Week #9. Sorting Data Structures Week #9 Sortig Outlie Motivatio Types of Sortig Elemetary (O( 2 )) Sortig Techiques Other (O(*log())) Sortig Techiques 21.Aralık.2010 Boraha Tümer, Ph.D. 2 Sortig 21.Aralık.2010 Boraha

More information

Modern Systems Analysis and Design Seventh Edition

Modern Systems Analysis and Design Seventh Edition Moder Systems Aalysis ad Desig Seveth Editio Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Desigig Databases Learig Objectives ü Cocisely defie each of the followig key database desig terms: relatio,

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 05(b) : 23/10/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter

More information

Improving Template Based Spike Detection

Improving Template Based Spike Detection Improvig Template Based Spike Detectio Kirk Smith, Member - IEEE Portlad State Uiversity petra@ee.pdx.edu Abstract Template matchig algorithms like SSE, Covolutio ad Maximum Likelihood are well kow for

More information

On-Line Analytical Processing (OLAP) Traditional OLTP

On-Line Analytical Processing (OLAP) Traditional OLTP On-Line Analytical Processing (OLAP) CSE 6331 / CSE 6362 Data Mining Fall 1999 Diane J. Cook Traditional OLTP DBMS used for on-line transaction processing (OLTP) order entry: pull up order xx-yy-zz and

More information

Goals of this Lecture Activity Diagram Example

Goals of this Lecture Activity Diagram Example Goals of this Lecture Activity Diagram Example Object-Orieted Aalysis ad Desig - Fall 998 Preset a example activity diagram Ð Relate to requiremets, use cases, ad class diagrams Also, respod to a questio

More information

Image Segmentation EEE 508

Image Segmentation EEE 508 Image Segmetatio Objective: to determie (etract) object boudaries. It is a process of partitioig a image ito distict regios by groupig together eighborig piels based o some predefied similarity criterio.

More information

Chapter 3 Classification of FFT Processor Algorithms

Chapter 3 Classification of FFT Processor Algorithms Chapter Classificatio of FFT Processor Algorithms The computatioal complexity of the Discrete Fourier trasform (DFT) is very high. It requires () 2 complex multiplicatios ad () complex additios [5]. As

More information

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 1 Itroductio to Computers ad C++ Programmig Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 1.1 Computer Systems 1.2 Programmig ad Problem Solvig 1.3 Itroductio to C++ 1.4 Testig

More information

CSE 417: Algorithms and Computational Complexity

CSE 417: Algorithms and Computational Complexity Time CSE 47: Algorithms ad Computatioal Readig assigmet Read Chapter of The ALGORITHM Desig Maual Aalysis & Sortig Autum 00 Paul Beame aalysis Problem size Worst-case complexity: max # steps algorithm

More information

BOOLEAN MATHEMATICS: GENERAL THEORY

BOOLEAN MATHEMATICS: GENERAL THEORY CHAPTER 3 BOOLEAN MATHEMATICS: GENERAL THEORY 3.1 ISOMORPHIC PROPERTIES The ame Boolea Arithmetic was chose because it was discovered that literal Boolea Algebra could have a isomorphic umerical aspect.

More information

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved.

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved. Chapter 11 Frieds, Overloaded Operators, ad Arrays i Classes Copyright 2014 Pearso Addiso-Wesley. All rights reserved. Overview 11.1 Fried Fuctios 11.2 Overloadig Operators 11.3 Arrays ad Classes 11.4

More information

Lazy Type Changes in Object-oriented Database. Shan Ming Woo and Barbara Liskov MIT Lab. for Computer Science December 1999

Lazy Type Changes in Object-oriented Database. Shan Ming Woo and Barbara Liskov MIT Lab. for Computer Science December 1999 Lazy Type Chages i Object-orieted Database Sha Mig Woo ad Barbara Liskov MIT Lab. for Computer Sciece December 1999 Backgroud wbehavior of OODB apps compose of behavior of persistet obj wbehavior of objects

More information

Analyse des Données. Master 2 IMAFA. Andrea G. B. Tettamanzi

Analyse des Données. Master 2 IMAFA. Andrea G. B. Tettamanzi Analyse des Données Master 2 IMAFA Andrea G. B. Tettamanzi Université Nice Sophia Antipolis UFR Sciences - Département Informatique andrea.tettamanzi@unice.fr Andrea G. B. Tettamanzi, 2016 1 CM - Séance

More information

Data Mining Concepts & Techniques

Data Mining Concepts & Techniques Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

Air Force Data Reference Architecture and Platform

Air Force Data Reference Architecture and Platform Headquarters U.S. Air Force Air Force Data Referece Architecture ad Platform Ms. Jackie Murray 11 Oct 2018 1 AF Data Challeges Large umber of legacy systems with umerous poit-to-poit iterfaces that are

More information

Computers and Scientific Thinking

Computers and Scientific Thinking Computers ad Scietific Thikig David Reed, Creighto Uiversity Chapter 15 JavaScript Strigs 1 Strigs as Objects so far, your iteractive Web pages have maipulated strigs i simple ways use text box to iput

More information

Basic allocator mechanisms The course that gives CMU its Zip! Memory Management II: Dynamic Storage Allocation Mar 6, 2000.

Basic allocator mechanisms The course that gives CMU its Zip! Memory Management II: Dynamic Storage Allocation Mar 6, 2000. 5-23 The course that gives CM its Zip Memory Maagemet II: Dyamic Storage Allocatio Mar 6, 2000 Topics Segregated lists Buddy system Garbage collectio Mark ad Sweep Copyig eferece coutig Basic allocator

More information

Τεχνολογία Λογισμικού

Τεχνολογία Λογισμικού ΕΘΝΙΚΟ ΜΕΤΣΟΒΙΟ ΠΟΛΥΤΕΧΝΕΙΟ Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών Τεχνολογία Λογισμικού, 7ο/9ο εξάμηνο 2018-2019 Τεχνολογία Λογισμικού Ν.Παπασπύρου, Αν.Καθ. ΣΗΜΜΥ, ickie@softlab.tua,gr

More information

The isoperimetric problem on the hypercube

The isoperimetric problem on the hypercube The isoperimetric problem o the hypercube Prepared by: Steve Butler November 2, 2005 1 The isoperimetric problem We will cosider the -dimesioal hypercube Q Recall that the hypercube Q is a graph whose

More information

IMP: Superposer Integrated Morphometrics Package Superposition Tool

IMP: Superposer Integrated Morphometrics Package Superposition Tool IMP: Superposer Itegrated Morphometrics Package Superpositio Tool Programmig by: David Lieber ( 03) Caisius College 200 Mai St. Buffalo, NY 4208 Cocept by: H. David Sheets, Dept. of Physics, Caisius College

More information

Computational Geometry

Computational Geometry Computatioal Geometry Chapter 4 Liear programmig Duality Smallest eclosig disk O the Ageda Liear Programmig Slides courtesy of Craig Gotsma 4. 4. Liear Programmig - Example Defie: (amout amout cosumed

More information

Outline. Research Definition. Motivation. Foundation of Reverse Engineering. Dynamic Analysis and Design Pattern Detection in Java Programs

Outline. Research Definition. Motivation. Foundation of Reverse Engineering. Dynamic Analysis and Design Pattern Detection in Java Programs Dyamic Aalysis ad Desig Patter Detectio i Java Programs Outlie Lei Hu Kamra Sartipi {hul4, sartipi}@mcmasterca Departmet of Computig ad Software McMaster Uiversity Caada Motivatio Research Problem Defiitio

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 21 Cocurrecy Cotrol Techiques Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Cocurrecy cotrol protocols Set of rules to guaratee

More information

System and Software Architecture Description (SSAD)

System and Software Architecture Description (SSAD) System ad Software Architecture Descriptio (SSAD) Diabetes Health Platform Team #6 Jasmie Berry (Cliet) Veerav Naidu (Project Maager) Mukai Nog (Architect) Steve South (IV&V) Vijaya Prabhakara (Quality

More information

Τεχνολογία Λογισμικού

Τεχνολογία Λογισμικού ΕΘΝΙΚΟ ΜΕΤΣΟΒΙΟ ΠΟΛΥΤΕΧΝΕΙΟ Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών Τεχνολογία Λογισμικού, 7ο/9ο εξάμηνο 2018-2019 Τεχνολογία Λογισμικού Ν.Παπασπύρου, Αν.Καθ. ΣΗΜΜΥ, ickie@softlab.tua,gr

More information

Multi-Threading. Hyper-, Multi-, and Simultaneous Thread Execution

Multi-Threading. Hyper-, Multi-, and Simultaneous Thread Execution Multi-Threadig Hyper-, Multi-, ad Simultaeous Thread Executio 1 Performace To Date Icreasig processor performace Pipeliig. Brach predictio. Super-scalar executio. Out-of-order executio. Caches. Hyper-Threadig

More information

A Boolean Query Processing with a Result Cache in Mediator Systems

A Boolean Query Processing with a Result Cache in Mediator Systems A Boolea Query Processig with a Result Cache i Mediator Systems Jae-heo Cheog ad Sag-goo Lee * Departmet of Computer Sciece Seoul Natioal Uiversity Sa 56-1 Shillim-dog Kwaak-gu, Seoul Korea {cjh, sglee}cygus.su.ac.kr

More information

Decision Support Systems

Decision Support Systems Decision Support Systems 2011/2012 Week 3. Lecture 6 Previous Class Dimensions & Measures Dimensions: Item Time Loca0on Measures: Quan0ty Sales TransID ItemName ItemID Date Store Qty T0001 Computer I23

More information

CSE 2320 Notes 8: Sorting. (Last updated 10/3/18 7:16 PM) Idea: Take an unsorted (sub)array and partition into two subarrays such that.

CSE 2320 Notes 8: Sorting. (Last updated 10/3/18 7:16 PM) Idea: Take an unsorted (sub)array and partition into two subarrays such that. CSE Notes 8: Sortig (Last updated //8 7:6 PM) CLRS 7.-7., 9., 8.-8. 8.A. QUICKSORT Cocepts Idea: Take a usorted (sub)array ad partitio ito two subarrays such that p q r x y z x y y z Pivot Customarily,

More information

Keywords Software Architecture, Object-oriented metrics, Reliability, Reusability, Coupling evaluator, Cohesion, efficiency

Keywords Software Architecture, Object-oriented metrics, Reliability, Reusability, Coupling evaluator, Cohesion, efficiency Volume 3, Issue 9, September 2013 ISSN: 2277 128X Iteratioal Joural of Advaced Research i Computer Sciece ad Software Egieerig Research Paper Available olie at: www.ijarcsse.com Couplig Evaluator to Ehace

More information

Chapter 5. Functions for All Subtasks. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 5. Functions for All Subtasks. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 5 Fuctios for All Subtasks Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 5.1 void Fuctios 5.2 Call-By-Referece Parameters 5.3 Usig Procedural Abstractio 5.4 Testig ad Debuggig

More information

DATA WAREHOUSING & DATA MINING. by: Prof. Asha Ambhaikar

DATA WAREHOUSING & DATA MINING. by: Prof. Asha Ambhaikar DATA WAREHOUSING & DATA MINING by: Prof. Asha Ambhaikar 1 UNIT-I Overview and Concepts 2 Contents of Unit-I Need for data warehousing, Basic elements of data warehousing, Trends in data warehousing. Planning

More information

Message Integrity and Hash Functions. TELE3119: Week4

Message Integrity and Hash Functions. TELE3119: Week4 Message Itegrity ad Hash Fuctios TELE3119: Week4 Outlie Message Itegrity Hash fuctios ad applicatios Hash Structure Popular Hash fuctios 4-2 Message Itegrity Goal: itegrity (ot secrecy) Allows commuicatig

More information

Designing a learning system

Designing a learning system CS 75 Machie Learig Lecture Desigig a learig system Milos Hauskrecht milos@cs.pitt.edu 539 Seott Square, x-5 people.cs.pitt.edu/~milos/courses/cs75/ Admiistrivia No homework assigmet this week Please try

More information

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro

More information

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control EE 459/500 HDL Based Digital Desig with Programmable Logic Lecture 13 Cotrol ad Sequecig: Hardwired ad Microprogrammed Cotrol Refereces: Chapter s 4,5 from textbook Chapter 7 of M.M. Mao ad C.R. Kime,

More information

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments Ruig Time Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects. The

More information

Overview. Chapter 18 Vectors and Arrays. Reminder. vector. Bjarne Stroustrup

Overview. Chapter 18 Vectors and Arrays. Reminder. vector. Bjarne Stroustrup Chapter 18 Vectors ad Arrays Bjare Stroustrup Vector revisited How are they implemeted? Poiters ad free store Destructors Iitializatio Copy ad move Arrays Array ad poiter problems Chagig size Templates

More information

COSC 1P03. Ch 7 Recursion. Introduction to Data Structures 8.1

COSC 1P03. Ch 7 Recursion. Introduction to Data Structures 8.1 COSC 1P03 Ch 7 Recursio Itroductio to Data Structures 8.1 COSC 1P03 Recursio Recursio I Mathematics factorial Fiboacci umbers defie ifiite set with fiite defiitio I Computer Sciece sytax rules fiite defiitio,

More information

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation Improvemet of the Orthogoal Code Covolutio Capabilities Usig FPGA Implemetatio Naima Kaabouch, Member, IEEE, Apara Dhirde, Member, IEEE, Saleh Faruque, Member, IEEE Departmet of Electrical Egieerig, Uiversity

More information

Ones Assignment Method for Solving Traveling Salesman Problem

Ones Assignment Method for Solving Traveling Salesman Problem Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:

More information

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments Ruig Time ( 3.1) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step- by- step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.

More information

Analysis of Algorithms

Analysis of Algorithms Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Ruig Time Most algorithms trasform iput objects ito output objects. The

More information