Informatics 1: Data & Analysis

Similar documents
Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Part I: Structured Data

Database Systems (INFR10070) Dr Paolo Guagliardo. University of Edinburgh. Fall 2016

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis

Introduction to Databases Fall-Winter 2010/11. Syllabus

Tutorial 8: Practice Exam Questions

Overview of the Class and Introduction to DB schemas and queries. Lois Delcambre

Part I: Structured Data

Course and Contact Information. Course Description. Course Objectives

Introduction to Databases Fall-Winter 2009/10. Syllabus

City University of Hong Kong Course Syllabus. offered by Department of Computer Science with effect from Semester A 2017/18

LIS 2680: Database Design and Applications

Course and Contact Information. Course Description. Course Objectives

CMPUT 391 Database Management Systems. Fall Semester 2006, Section A1, Dr. Jörg Sander. Introduction

INSTITUTE OF AERONAUTICAL ENGINEERING

TEACHING & ASSESSMENT PLAN

San José State University College of Science / Department of Computer Science Introduction to Database Management Systems, CS157A-3-4, Fall 2017

College Algebra. Cartesian Coordinates and Graphs. Dr. Nguyen August 22, Department of Mathematics UK

UCD School of Information and Library Studies. IS30020: Web Publishing

Fundamentals of Databases

Database Systems Management

Math 3820 Project. 1 Typeset or handwritten? Guidelines

Unit 4 Relational Algebra (Using SQL DML Syntax): Data Manipulation Language For Relations Zvi M. Kedem 1

PROGRAMME SPECIFICATION POSTGRADUATE PROGRAMMES

Grande Prairie Regional College

16. How quickly will the Level 1 and Level 2 examinations be marked? NOCN markers have a five-day turnaround for marking examination papers.

Part II: Semistructured Data

Guide for candidates

Database Design and Management - BADM 352 Fall 2009 Syllabus and Schedule

TITLE OF COURSE SYLLABUS, SEMESTER, YEAR

Course Design Document: IS202 Data Management. Version 4.5

Advanced Database Organization INF613

The exam. The exam. The exam 10. Sitting a City & Guilds online examination 11. Frequently asked questions 18. Exam content 20

KOMAR UNIVERSITY OF SCIENCE AND TECHNOLOGY (KUST)

Outline. Database Management Systems (DBMS) Database Management and Organization. IT420: Database Management and Organization

One of the main selling points of a database engine is the ability to make declarative queries---like SQL---that specify what should be done while

POSTGRADUATE DIPLOMA IN RESEARCH METHODS

Digital Communication and Aesthetics,

CS Final Exam Review Suggestions

Rochester Institute of Technology Golisano College of Computing and Information Sciences Department of Information Sciences and Technologies

Introduction to CS 4604

CSE 344 Final Review. August 16 th

Instructor Manual Contents

PROGRAMME SUMMARY You are required to take eight core modules in terms one and two as outlined in the module list.

CSE 336. Introduction to Programming. for Electronic Commerce. Why You Need CSE336

Training Plan 15 - Months MS Office, Business English, Business Math, Customer Service

APMP. The APM Project Management Qualification. Guide for Candidates

Project Risk Management Single Subject Certificate Level 2. Guide for candidates

Chapter 13 XML: Extensible Markup Language

Grande Prairie Regional College

Assessment Tools in D2L, Part 2: Dropbox

Welcome. Orientation to online CPS102 Computer Science 2 (Java 2)

Al al-bayt University Prince Hussein Bin Abdullah College for Information Technology Computer Science Department

Introduction, Functions

INFO-445: Advanced Database Design, Management, and Maintenance

UNIT OUTLINE. Network Engineering 304. Mr Iain Murray. Department of Electrical and Computer Engineering Curtin Engineering

Ministry of Higher Education and Scientific research

Frequently asked questions on the Exam Period

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer DBMS

Course Introduction. CSC343 - Introduction to Databases Manos Papagelis

APMP The APM Project Management Qualification. Guide for Candidates DRAFT

CSE111 Introduction to Computer Applications

Credit: The lecture slides are created based on previous lecture slides by Dan Zingaro.

General Office, GED Attainment 12 Months

Grade Point Scales Standard Honors AP/College A B C D F Sample file

CSE 544 Principles of Database Management Systems

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016

Relational Algebra for sets Introduction to relational algebra for bags

About this Course. Blackboard Student Orientation Course. About the Portal. Logging In. About the Course Layout. Showing the Course Menu

Unit 4 Relational Algebra (Using SQL DML Syntax): Data Manipulation Language For Relations Zvi M. Kedem 1

: 65% to 84% - M for Merit : 50% to 64% - P for Pass : 0% to 49% - R for Referral

GCE APPLIED ICT OCR UNIT 3. ICT Solutions for Individuals & Society. Student Workbook

Sage CRM elearning Sales, Marketing, Customer Service modules for Sage BMS

CS 4604: Introduction to Database Management Systems. B. Aditya Prakash Lecture #5: Entity/Relational Models---Part 1

San José State University Computer Science Department CS157A: Introduction to Database Management Systems Sections 5 and 6, Fall 2015

CSC 407 Database System I COURSE PARTICULARS COURSE INSTRUCTORS COURSE DESCRIPTION

Introduction to Access 97/2000

Civil Engineering, Civil Engineering with Industrial Placement. School of Mathematics, Computer Science and Engineering

CS34800, Fall 2016, Assignment 4

Master of Accounting (CPA Program)* Master of Advanced Professional Accounting

APMP. The APM Project Management Qualification. Guide for Candidates

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1

User Guide CDU HE Learnline Master Template

COURSE FILE INDEX ITEM DESCRIPTION

Course Requirements. Prerequisites Miscellaneous

Blackboard 5 Level One Student Manual

COURSE SYLLABUS FOR. COMP-297 Web Page Design: Adobe Dreamweaver

EDIT202 Spreadsheet Lab Prep Sheet

Transcription:

Informatics 1: Data & Analysis Lecture 20: Course Review Ian Stark School of Informatics The University of Edinburgh Tuesday 4 April 2017 Semester 2 Week 11 https://blog.inf.ed.ac.uk/da17

Plan! This is Teaching Week 11 of Semester 2 and the teaching block ends this Friday, 7 April. Your final Inf1-DA tutorial is this week, in which your tutor will return your coursework assignment with marks and feedback. But only if you go to the correct tutorial group! Find the course web page, Tutorial tab, group membership link. The rest of this lecture includes: a review of exam arrangements; advice on preparation and practice; and a summary of topics covered in the course. The final lecture, on Friday, will review specific past exam questions.

Exam: Date, Time, Place! Informatics 1: Data & Analysis has a single two-hour written examination. Date Tuesday 16 May 2017 Time 0930 1130 Place The Pleasance Sports Hall Code INFR08015 https://blog.inf.ed.ac.uk/da17/exam#dates This information is current at 2017-04-03; please check the link above closer to the date to check this and to confirm all of your exams.

Exam: Format As in previous years, the exam will have three compulsory questions. Read all questions before beginning the paper You don t need to do the questions in order Don t assume a question is only using one part of the course If you get stuck on one question: don t waste too much time on it; do go on to the next question; come back later. Don t give up! All questions include information about marks available for each section. Adding up the marks you achieve for each question gives your total mark for the exam, and for Inf1-DA overall.

Exam: Marking The exam is marked by me together with one or more of the course tutors. Marking is anonymous: we don t see your name or UUN All marking is positive: marking starts at zero and goes up for each item successfully completed or partially completed. Spelling and grammar are not part of the assessment; although it is important that the markers can read what you have written. Marks are not curved or scaled. You keep all the marks you get. There is no ranking your mark and grade do not in any way depend on how other students do in the course. See the course web pages for more on exam marking and results.

Exam: Grades The Inf1-DA exam is graded on the standard University of Edinburgh scale: Mark Grade Description 70 100 A Excellent 60 69 B Very Good 50 59 C Good 40 49 D Satisfactory 30 39 E Marginal fail 20 29 F Clear fail 10 19 G Bad fail 0 9 H Bad fail Students who do not pass at this sitting may resit in August. Passing this course is essential for progression to second-year study. All passing marks and grades are recorded on your transcript. However, final degree grade is based only on performance in Year 3 onwards.

Exam: Calculators You may use a scientific calculator in the Inf1-DA exam, and I recommend that you do take one with you. There is a standard list of permitted calculators, linked from the Inf1-DA web pages. Most of these calculators perform some algebra, but they are not programmable and do not draw graphs. If you need a different type of calculator (or other assistive technology) then please do get in touch with either me, your personal tutor, the student disability service, or the ITO. The FX-83GT Plus is the UK s Number 1 Scientific Calculator allowed in every UK exam where a calculator can be used. Casio UK

No other electronic devices You may take in a calculator from the authorised list, but no other devices. Exam Hall Regulations... 9. The use of mobile devices/personal electronic equipment is not permitted. Mobile devices must be switched off during an examination. These should be placed in your bag and should not be on your person. Mobile devices are those which store/display data or connect to the internet, such as a mobile telephone, smart watches, smart glasses or any other communications equipment.... Take care with this: make sure you do not accidentally carry in any connected device.

Exam: Closed Book The Inf1-DA exam is closed book you may not take in any books, notes, or other reference material into the examination hall. If you wish to take an English-language dictionary into the exam then please contact the ITO to arrange for permission. There is no formula sheet or book provided. However, where necessary questions will include tables of critical values or other essential material. I recommended you write your exam answers in blue or black ink, with diagrams in pencil. However, markers will always work to do their best with whatever is submitted.

Past Exam Papers Past Questions All tutorial exercises have Examples and Solutions from past exam papers. These are chosen to match each course topic. Solutions include comments about different possible answers, and notes on important things to include. Past Papers The University Library keeps a full set of past papers online. http://exampapers.ed.ac.uk For Inf1-DA, the ITO web pages also have sets of sample solutions. In each year there are two Inf1-DA exams, from the main and resit diets. All questions since the 2009 exam are appropriate for the current syllabus.

Using Past Exam Papers Past papers are a good source of practice material, and I strongly recommend you attempt as many of these questions as you can. However, please note the following. The key exam preparation task is to master the course material: understand it and be able to apply it in practice. Conveniently, that s also the best possible outcome for you from this course Memorizing the answers to previous questions is not a helpful way to tackle future ones. Attempting past exam questions yourself, to test your knowledge, and identify any gaps, is a much more effective way to learn. If you are puzzled by a past question or the solution provided, ask on Piazza or email me directly.

Exam Content Examinable Material Unless otherwise specified, all of the following material is examinable: Topics covered in lectures Directed reading distributed in lectures Topics covered in the weekly exercise sheets All non-examinable additional material in lectures is marked by being on purple-shaded slides with a + in the top right-hand corner. Limits of Computation + Matter organised to provide the greatest possible computing power is fancifully known as computronium. In the 1960 s Hans-Joachim Bremermann was one of the first people to estimate upper limits to computation. His Bremermann limit is the computation which could be performed using the earth, over the period of its existence so far. This is around 10 93 bits of computation. That s enough to solve the travelling salesman problem for 300 cities. But just the once. Ian Stark Inf1-DA / Lecture 8 2017-02-10

Burning Questions Will this be on the exam? Is this examinable? Do I need to know this? Is it important?

Topic Summary The entity-relationship model, ER diagrams. The relational model, SQL DDL. Translating an ER model into a relational one. Relational algebra, tuple-relational calculus, SQL queries; translating between all three. Semistructured data models and the XPath data tree. XML documents. Schema languages and DTDs. Relational data converted into XML. XPath as a query language. Corpora: what they are and how they are made; examples. Annotations and tagging. Concordances, frequencies, n-grams, collocations. Methods for machine translation. Information retrieval: what it is, evaluating and comparing performance of IR systems; tf-idf; the vector space model and cosine similarity measure. Summary statistics, population vs. sample; hypothesis testing and χ 2 ; significance, p-values; correlation does not imply causation.

Some Specific Items Corpora In general it is the principles of corpora that are examinable, rather then the precise details of individual corpora. Similarly, you should be familiar with the principles underlying POS-tagging and syntactic annotation, but you do not need to know detailed linguistics or specific tag sets. You should however, be able to give examples of a corpus or a POS tag. The CQP tool was used in a tutorial, so is examinable although again for general principles and use, not every detail of syntax. Statistics You are not expected to memorize critical value tables; however, you should be able to use one if provided. You are expected to know the formulas for the various statistics used, and to be able to calculate with them.

XML and XPath XML trees Nodes: root, internal, leaves. Parents, children, ancestors, descendants, siblings. XPath node types: root, element, text, attribute. XML documents Nested start and end tags; text content; tag attributes. Ordering of nodes matters; of attributes doesn t. Well-formed documents. Validation Specifying a hierarchy and attribute set. Validating document D against schema S. Schema languages, such as DTD. XPath navigation Path expressions identify a set of nodes. Navigation: axes, node tests, predicates.

Relational Database Modelling Relational models Relations: Tables matching schemas Schema: A set of field names and their domains Table: A set of tuples of values for these fields Student uun name age email s0456780 John 18 john@inf s0378435 Helen 20 helen@phys s0412375 Mary 18 mary@inf s0189034 Peter 22 peter@math Takes uun code mark s0456780 inf1 71 s0412375 math1 82 s0412375 geo1 64 s0189034 math1 56 Course code title year inf1 Informatics 1 1 math1 Mathematics 1 1 geo1 Geology 1 1 dbs Database Systems 3 adbs Advanced Databases 4

Top Hat Course Code 018138 ACID Transactions for Reliable Multiuser Databases Match each letter to its expansion and meaning A C I D Atomicity: A transaction changes just a single value in the database Atomicity: A transaction runs either to completion or not at all Consistency: A transaction always maintains database integrity Consistency: A transaction always does the same thing Isolation: Concurrent transactions act as if done sequentially Isolation: Transactions can only affect one table at a time Durability: A transaction can be reused repeatedly Durability: Once complete, a transaction will not be undone

Tuple-Relational Calculus The tuple-relational calculus (TRC) is a mathematical language for expressing queries over a relational database. Standard TRC Idioms To pick out some tuples from an existing table: { R Table S OtherTable, T FurtherTable. Test } To obtain tuples not directly appearing in any other table: { R S OtherTable, T FurtherTable. Test } In both cases Test is a Boolean expression using R, S, T,... and including assertions like (S.field = R.field)... to match R with other records.

Example: Records from Existing Table All records for students more than 19 years old { S Student S.age > 19 } The set of all tuples S in the table Student with field age greater than 19. All records for students taking math1 { S Student T Takes. S.uun = T.uun T.code = math1 } The set of all tuples S in the table Student for which there is a tuple T in Takes linking the UUN of S to course math1.

Example: Building New Records Results for a single course { R T Takes. T.code = "math1" R.uun = T.uun R.mark = T.mark } The set of all tuples R where there is a Takes tuple T with code math1 such that R and T have matching UUN and mark. Students on courses { R S Student, T Takes, C Course. S.uun = T.uun T.code = C.code R.name = S.name R.title = C.title } The set of all tuples R where there is a Student tuple S, a Takes tuple T, and a Course tuple C with matching UUNs and course codes, and where R takes student name from tuple S and course title from tuple C.

Instructions Quite separately from the technical content of your degree studies, the following is an important high-level skill which it s important that you can demonstrate by the time you graduate. Given a task to be completed: 1 Find the instructions. 2 Read the instructions. 3 Choose from: Either Follow the instructions as directed Or Recognise that you are not following the instructions, and be prepared to address any consequences. Both of these are viable choices. Do not underestimate this skill: it is much harder than many expect.

SpaceX Falcon 9 Going Up Ian Stark Inf1-DA / Lecture 20 2017-04-04

SpaceX Falcon 9 Launch Profile

SpaceX Falcon 9 Coming Down

Rocket Science

RTQ The most important rule of every examination ever: Read The Question Read the question. Answer the question. Go back and read the question again to make sure you have answered the right question. Repeat. Don t be fooled: this is easy to forget. Sometimes you will get it wrong.

What Next? Watch https://is.gd/diydiscocalc! Follow These Instructions Download the Inf1-DA exam papers from May and August 2016. Read the following two questions May 2016 Question 1 August 2016 Question 3 The first of these runs on to a second page. Write out answers to both questions. Bring your solutions along on Friday. In the lecture on Friday I shall work through and explain sample solutions for these past questions.