CSE 344 JANUARY 3 RD - INTRODUCTION

Similar documents
Your New App. Motivation. Data Management is Universal. Staff. Introduction to Data Management (Database Systems) CSE 414. Lecture 1: Introduction

Introduction to Data Management CSE 344. Lecture 1: Introduction

Introduction to Data Management (Database Systems) CSE 414

CS 564: DATABASE MANAGEMENT SYSTEMS. Spring 2018

CMPT 354: Database System I. Lecture 1. Course Introduction

Introduction to Database S ystems Systems CSE 444 Lecture 1 Introduction CSE Summer

Introduction to Database Systems CSE 444. Lecture 1 Introduction

Introduction to Database Systems CSE 444. Lecture #1 March 26, 2007

Introduction to Database S ystems Systems CSE 444 Lecture 1 Introduction CSE Summer

Lecture Notes CPSC 321 (Fall 2018) Today... Survey. Course Overview. Homework. HW1 (out) S. Bowers 1 of 8

Introduc)on to Database Systems CSE 444. Lecture #1 March 29, 2010

CMPS 182: Introduction to Database Management Systems. Instructor: David Martin TA: Avi Kaushik. Syllabus

CS 245: Database System Principles

Database Management Systems CS Spring 2017

CSE 544 Principles of Database Management Systems

INF 315E Introduction to Databases School of Information Fall 2015

EECS3421 Introduction to Database Management Systems. Thanks to John Mylopoulos and Ryan Johnson for material in these slides

CS157a Fall 2018 Sec3 Home Page/Syllabus

Course Introduction. CSC343 - Introduction to Databases Manos Papagelis

Introduction to Databases Fall-Winter 2010/11. Syllabus

San José State University Computer Science Department CS157A: Introduction to Database Management Systems Sections 5 and 6, Fall 2015

CPS352 Database Systems Syllabus Fall 2012

What s a database anyway?

Introduction to Databases Fall-Winter 2009/10. Syllabus

Course and Contact Information. Course Description. Course Objectives

Rochester Institute of Technology Golisano College of Computing and Information Sciences Department of Information Sciences and Technologies

CSE 344 JANUARY 5 TH INTRO TO THE RELATIONAL DATABASE

CPS352 - DATABASE SYSTEMS. Professor: Russell C. Bjork Spring semester, Office: KOSC 242 x4377

Overview of the Class and Introduction to DB schemas and queries. Lois Delcambre

Database Management System Implementation. Who am I? Who is the teaching assistant? TR, 10:00am-11:20am NTRP B 140 Instructor: Dr.

CAS CS 460/660 Introduction to Database Systems. Fall

Course and Contact Information. Course Description. Course Objectives

Outline. Quick Introduction to Database Systems. Data Manipulation Tasks. What do they all have in common? CSE142 Wi03 G-1

CS 4604: Introduction to Database Management Systems. B. Aditya Prakash Lecture #1: Introduction

CS 4400 Database Systems. Meeting 1: Introduction Brandon Myers University of Iowa

Course Web Site. 445 Staff and Mailing Lists. Textbook. Databases and DBMS s. Outline. CMPSCI445: Information Systems. Yanlei Diao and Haopeng Zhang

Introduction to CS 4604

San José State University Science/Computer Science Database Management System I

COMP-421: Database Systems. Joseph D silva McConnel Engg. 102

San José State University College of Science / Department of Computer Science Introduction to Database Management Systems, CS157A-3-4, Fall 2017

CMPSCI445: Information Systems

COMP-421: Database Systems. Joseph D silva McConnel Engg. 102

Avi Silberschatz, Henry F. Korth, S. Sudarshan, Database System Concept, McGraw- Hill, ISBN , 6th edition.

EECE.2160: ECE Application Programming Spring 2017

CS564: Database Management Systems. Lecture 1: Course Overview. Acks: Chris Ré 1

CMPUT 391 Database Management Systems. Fall Semester 2006, Section A1, Dr. Jörg Sander. Introduction

COMP.3090/3100 Database I & II. Textbook

CSC 261/461 Database Systems. Fall 2017 MW 12:30 pm 1:45 pm CSB 601

Course: Database Management Systems. Lê Thị Bảo Thu

BBM371- Data Management. Lecture 1: Course policies, Introduction to DBMS

CSE 303: Database. Teaching Staff. Lecture 01. Lectures: 1 st half - from a user s perspective. Lectures: 2 nd half - understanding how it works

Announcements. 1. Forms to return today after class:

Outline. Database Management Systems (DBMS) Database Management and Organization. IT420: Database Management and Organization

CMPT 354 Database Systems I. Spring 2012 Instructor: Hassan Khosravi

Outline. Databases and DBMS s. Recent Database Applications. Earlier Database Applications. CMPSCI445: Information Systems.

CSC 4710 / CSC 6710 Database Systems. Rao Casturi

Database Concepts. CS 377: Database Systems

Introduction to Information Systems SSC, Semester 6

Course and Contact Information. Catalog Description. Course Objectives

Databases TDA357/DIT620. Niklas Broberg

Credits. Principles of Database Management Systems. Isn t Implementing a Database System Simple? Megatron 3000 Implementation Details

CPS352 - DATABASE SYSTEMS. Professor: Russell C. Bjork Spring semester, Office: KOSC 242 x4377

EECS 647: Introduction to Database Systems

Introduction. Random things to do after this course. Course roadmap. CPS 116 Introduction to Database Systems

CMPSCI 645 Database Design & Implementation

Advanced Topics in Database Systems Spring 2016

San José State University Department of Computer Science CS-174, Server-side Web Programming, Section 2, Spring 2018

Can you name one application that does not need any data? Can you name one application that does not need organized data?

Advanced Relational Database Management MISM Course S A3 Spring 2019 Carnegie Mellon University

Database Systems Management

Introduction to Data Management. Lecture #1 (The Course Trailer )

Course Title: Computer Networking 2. Course Section: CNS (Winter 2018) FORMAT: Face to Face

San Jose State University College of Science Department of Computer Science CS185C, Introduction to NoSQL databases, Spring 2017

Syllabus COSC-051-x - Computer Science I Fall Office Hours: Daily hours will be entered on Course calendar (or by appointment)

Introduction to Database Systems

Introduction to Databases

CSE 336. Introduction to Programming. for Electronic Commerce. Why You Need CSE336

Database Systems: Concepts, design, and implementation ISE 382 (3 Units)

Introduction to Databases

CSE 332: Data Abstractions. Ruth Anderson Spring 2014 Lecture 1

Introduction to Data Management. Lecture #2 (Big Picture, Cont.)

Why are you here? Introduction. Course roadmap. Course goals. What do you want from a DBMS? What is a database system? Aren t databases just

Welcome to CO 572: Advanced Databases

CS 525 Advanced Database Organization - Spring 2017 Mon + Wed 1:50-3:05 PM, Room: Stuart Building 111

9/8/2018. Prerequisites. Grading. People & Contact Information. Textbooks. Course Info. CS430/630 Database Management Systems Fall 2018

Modern Database Systems CS-E4610

Databases 1. Daniel POP

CSc 2310 Principles of Programming (Java) Jyoti Islam

IST659 Spring2015 M001 Wang Syllabus Data Administration Concepts and Database Management

Welcome to CS 241 Systems Programming at Illinois

Operating Systems CMPSC 473. Introduction January 15, Lecture 1 Instructor: Trent Jaeger

CMSC Introduction to Database Systems

CS 241 Data Organization using C

Data! CS 133: Databases. Goals for Today. So, what is a database? What is a database anyway? From the textbook:

Course Book Academic Year

AAAF (School of CS, Manchester) Advanced DBMSs / 19

CMSC 132: Object-Oriented Programming II. Administrivia

Note that your facilitator may provide additional required readings.

San José State University Department of Computer Science CS151, Section 04 Object Oriented Design Spring 2018

CS 564: DATABASE MANAGEMENT SYSTEMS

Transcription:

CSE 344 JANUARY 3 RD - INTRODUCTION

COURSE FORMAT Lectures Location: SIG 134 Please attend Sections: Content: exercises, tutorials, questions, new materials (occasionally) Locations: see web Please attend Bring your laptop 8 homework assignments 7 web quizzes Midterm and final

GRADING Homeworks 30% Web quizzes 10% Midterm 25% Final 35% This is all subject to change

COMMUNICATIONS Web page: http://www.cs.washington.edu/344 Syllabus (course information) Lecture/section notes will be available there Homework assignments will be available there Link to web quizzes is there Piazza Sign up: Link Soon THE place to ask course-related questions Log in today and enable notifications Class mailing list You are automatically subscribed Low traffic, only important announcements

TEXTBOOK Main textbook, available at the bookstore: Database Systems: The Complete Book, Hector Garcia-Molina, Jeffrey Ullman, Jennifer Widom Second edition.

EIGHT HOMEWORK ASSIGNMENTS H1: Sqlite intro (1 wk) H2: Sqlite basics (1 wk) H3: Advanced SQL on Azure (1⅔ weeks) H4: Datalog and Relational Algebra (1⅓weeks) H5: NoSQL: Json/SQL++ (1 wk) H6: Spark on AWS (1⅓weeks) H7: Schema Design (1wk) H8: Transactional Application (1⅓weeks) New this year: submit via git

ABOUT THE ASSIGNMENTS You will learn/practice the course material: SQL, RA, parallel db, transactions,... You will also learn lots of new technology Cloud computing: Azure, Cloud9, AWS NoSQL: AsterixDB, LogicBlox Git The time spent learning the new technology is very useful: write everything on your CV!

DEADLINES AND LATE DAYS Assignments are expected to be done on time, but things happen, so You have up to 3 late days No more than 2 on any one assignment Used in 24-hour chunks Late days = safety net, not convenience! You should not plan on using them If you use all 3 you are doing it wrong

SEVEN WEB QUIZZES http://newgradiance.com/ Create account; please use the same ID as your UW ID Token to be provided to course email Short tests, take many times, best score counts No late days closes at 11:00 deadline Provide explanations for wrong answers

EXAMS Midterm (TBA Early February) Final, Thursday, March 15 th, 230-4:20 Closed book. No computers, phones, watches,... Location: in class

ABOUT ME Evan McCarty (ejmcc@cs.washington.edu) Theory and Algorithms research Data Scientist for Partners for Our Children Lecture notes posted after class Panopto recordings Part-time Faculty On campus MWF Available by email Office hours Monday and Friday 4:30 6:00 or by email.

ABOUT STAFF TAs Joshua Bean Allison Chou Colin Evans Jayanth Garlapati Jonathan Leang Cindy Suripto James Wang First resource for coding / setup problems

ABOUT YOU Expect most are CSE majors (Hopefully) registered If not https://docs.google.com/forms/d/e/ 1FAIpQLSf4hqZmELivR1_lby_WmpgT66OM78K-EdsuebQTI84B0SLow/viewform Academic Honesty and Participation Piazza and help

CLASS GOALS The world is drowning in data! Need computer scientists to help manage this data Help domain scientists achieve new discoveries Help companies provide better services (e.g., Facebook) Help governments (and universities!) become more efficient Welcome to 344: Introduction to Data Management Existing tools PLUS data management principles This is not just a class on SQL!

WHY DATABASE MANAGEMENT? This course was my least favorite topic in undergrad

WHY DATABASE MANAGEMENT? This course was my least favorite topic in undergrad Now, I work with databases

WHY DATABASE MANAGEMENT? This course was my least favorite topic in undergrad Now, I work with databases Intelligent design and organization of data allows important work and research to occur efficiently and correctly

WHY DATABASE MANAGEMENT? This course was my least favorite topic in undergrad Now, I work with databases Intelligent design and organization of data allows important work and research to occur efficiently and correctly Organizations need a diverse set of skills, you may not ever need to manage a DB, but you will certainly be interfacing with one

WHY DATABASE MANAGEMENT? This course was my least favorite topic in undergrad Now, I work with databases Intelligent design and organization of data allows important work and research to occur efficiently and correctly Organizations need a diverse set of skills, you may not ever need to manage a DB, but you will certainly be interfacing with one Decisions made in setting up a DB (or even a query) can affect performance going forward

WHY DATABASE MANAGEMENT? Disk and magnetic tape are linear storage We can access elements throughout them, but there is a continuous serialization of this data. Data itself is rarely one dimensional Imagine storing all data about UW students on disk

WHY DATABASE MANAGEMENT? Disk and magnetic tape are linear storage We can access elements throughout them, but there is a continuous serialization of this data. Data itself is rarely one dimensional Imagine storing all data about UW students on disk What is their order? Are students related?

WHY DATABASE MANAGEMENT? Disk and magnetic tape are linear storage We can access elements throughout them, but there is a continuous serialization of this data. Data itself is rarely one dimensional Imagine storing all data about UW students on disk What is their order? Are students related? Related relative to other data? Why store students at all?

DATABASE What is a database?

DATABASE What is a database? A collection of files storing related data Give examples of databases

DATABASE What is a database? A collection of files storing related data Give examples of databases Accounts database; payroll database; UW s students database; Amazon s products database; airline reservation database

DATABASE MANAGEMENT SYSTEM What is a DBMS?

DATABASE MANAGEMENT SYSTEM What is a DBMS? A big program written by someone else that allows us to manage efficiently a large database and allows it to persist over long periods of time Examples of DBMSs Oracle, IBM DB2, Microsoft SQL Server, Vertica, Teradata Open source: MySQL (Sun/Oracle), PostgreSQL, CouchDB Open source library: SQLite We will focus on relational DBMSs most quarter

AN EXAMPLE: ONLINE BOOKSELLER What data do we need? Data about books, customers, pending orders, order histories, trends, preferences, etc. Data about sessions (clicks, pages, searches) Note: data must be persistent! Outlive application Also note that data is large won t fit all in memory What capabilities on the data do we need? Insert/remove books, find books by author/title/etc., analyze past order history, recommend books, Data must be accessed efficiently, by many users Data must be safe from failures and malicious users

CHALLENGES FOR A DBMS Alice and Bob receive a $200 gift certificate as wedding gift

CHALLENGES FOR A DBMS Alice and Bob receive a $200 gift certificate as wedding gift Alice @ her office orders "The Selfish Gene Bob @ home orders Guns, germs, and steel $80 $100

CHALLENGES FOR A DBMS Alice and Bob receive a $200 gift certificate as wedding gift Alice @ her office orders "The Selfish Gene Bob @ home orders Guns, germs, and steel $80 $100 Questions: What is the ending credit? What if second book costs $130? What if system crashes?

CHALLENGES FOR A DBMS Alice and Bob receive a $200 gift certificate as wedding gift Alice @ her office orders "The Selfish Gene Bob @ home orders Guns, germs, and steel $80 Questions: What is the ending credit? What if second book costs $130? What if system crashes? Lesson: a DBMS needs to handle various scenarios $100

WHAT A DBMS DOES Describe real-world entities in terms of stored data Persistently store large datasets Efficiently query & update Must handle complex questions about data Must handle sophisticated updates Performance matters Change structure (e.g., add attributes) Concurrency control: enable simultaneous updates Crash recovery Security and integrity

THE PLAYERS DB application developer: writes programs that query and modify data (344) DB designer: establishes schema (344) DB administrator: loads data, tunes system, keeps whole thing running (344, 444) Data analyst: data mining, data integration (344, 446) DBMS implementor: builds the DBMS (444)

WHAT IS THIS CLASS ABOUT? Unit 1: Intro (today) Unit 2: Relational Data Models and Query Languages Unit 3: Non-relational data Unit 4: RDMBS internals and query optimization Unit 5: Parallel query processing Unit 6: DBMS usability, conceptual design Unit 7: Transactions Unit 8: Advanced topics (time permitting)

WHAT TO EXPECT SOON Course Website Syllabus Git tutorial / help The first HW assignment Piazza page Canvas page Link for online quizzes