Computational statistics Jamie Griffin. Semester B 2018 Lecture 1

Size: px
Start display at page:

Download "Computational statistics Jamie Griffin. Semester B 2018 Lecture 1"

Transcription

1 Computational statistics Jamie Griffin Semester B 2018 Lecture 1

2 Course overview This course is not: Statistical computing Programming This course is: Computational statistics Statistical methods that use computation to replace certain assumptions Programming language R is used to implement them Main focus is on understanding the methods

3 Main topics covered Estimating probability densities Non-parametric tests, e.g. permutation tests Cross-validation Bootstrap This is all in the frequentist framework. Computational methods for Bayesian inference are not covered.

4 Lecture notes Slides will be put on QMplus before each lecture. More formal set of notes (single pdf file) will be updated as term progresses. More extensive proofs than covered in the lectures.

5 Time-table Lectures: Friday, Laws 1.19 Lectures are mainly about methods, not R programming Computer practicals: Tuesday, Engineering W128D.2 These use R Office hours: Friday, Queens CB202 Better to me beforehand Week 7 No lecture or regular practical. Possibly a revision practical session, if there is demand for this.

6 Assessment Final exam 70% Entirely written, not computer-based In May or June 3 hours for level 7, 2 hours for level 6 Coursework 30% Weekly exercise sheets

7 Exercise sheets Some is pen and paper, some uses R One question to be handed in each week One per week (9 in total) Available before each tutorial, to be handed in at next tutorial None in first week, so first one to be handed in 30th January

8 Books Gentle, J.E. Elements of Computational Statistics (2002) Covers most of the material on the course Library has book and electronic version Computational Statistics (2009) by the same author - expanded version, extra material is not in this module Efron, B. and Tibshirani, R.J. An Introduction to the Bootstrap (1994) Covers some of the later material (bootstrap, cross-validation) Library only has hard copy, not electronic Davison, A. C., Hinkley, D. V. Bootstrap methods and their application (1997) Library has book and electronic version

9 What is R? Free software system for data analysis Initially developed by R. Ihaka and R. Gentleman (1996) Currently developed by the R Core Team (around 20 people) Largest collection of tools for statistics and data analysis (1,000s of contributors)

10 R is free No need to pay Source code is freely available Anyone can re-use, modify and distribute the code

11 Who uses R? Academics and other researchers Increasingly, in business Companies using R New York Times Lloyds Google

12 Popularity of R This webpage has several sources of data on how widely used different statistical environments are: Popularity of data science software According to Google Scholar hits, R recently ( 2016) overtook SAS to become the second most widely used for research articles. SPSS was top, but declining rapidly. R was most widely used according to survey of data scientists.

13 Where can you get it? google "R" The R project: distribution on CRAN: available on Windows, MacOSX, Linux

14 What does it do? Base, core packages (30), and additional packages (> 12,000): probability distributions statistical tests linear/non-linear modelling multivariate analysis time series spatial statistics networks maps... See "task views":

15 R GUI Part of R Simple graphical interface Not menu-driven

16 R Studio Free interface to R that looks nicer than R GUI Syntax highlighting Debugging Command completion Also not menu-driven for running code May need to select version of R when first running it

17 Running code R is an interpreted programming language, derived from S-plus Interactively in command window In script file, select text, then CTRL + R Run a script in batch mode (we re not covering this)

18 Getting help Different ways of getting help:?foo or help("foo"): access the help page of foo (if you know the name of the command)??foo or help.search("foo"): look for "foo" in help pages RSiteSearch("foo"): search foo in help pages and forum archives Cross validated - general statistical questions, use tag "r" to ask or search for questions about R. Mailing lists Search main mailing list Book by R core team

19 Numbers Set a equal to 10+2 and then display the result: a = 10+2 a Variable and function names are case-sensitive. Operators have the usual meaning: + - * / ˆ

20 Assignment a = 8 b = 10 a == b a = b == checks if a and b are equal. = assigns the value of b to a. In R, can also use <- for assignment: a <- b Many R users prefer <-, as there are some places in which = does not work, but we will not meet them in this course.

21 Boolean variables Boolean variables are known as logicals in R. a = TRUE b = FALSE a/2 And, or, not: a & b a b!a Any non-zero value maps to TRUE, zero maps to FALSE.

22 Vectors A vector is a list of objects of the same kind. In R these can be numeric, logical, characters. There are multiple ways to specify vectors: u = vector(length=5, mode="numeric") v = c(1, 3, 4) w = rep(3, length.out=4) x = rep(3, times=4) y = seq(from=2, by=4, length.out=5) z = seq(from=2, to=10, by=2) u = 1:5 s = c("a", "b", "e")

23 Functions on vectors u = 1:5 length(u) sum(u) Inbuilt functions like exp, log, sin can act on single numbers or on vectors. exp(u) sin(u*pi/2) Operators we saw previously act on each element of a vector.

24 A = matrix(1:15,nrow=3) A Transpose t(a) Matrix multiplication B = matrix(1:15,ncol=3) A %*% B B %*% A u = 1:5 A %*% u u %*% B Matrices

25 Data frames A data frame is like a matrix, but different variables (columns) can have different types: d = data.frame(age=c(10,54,3), sex=c("m","f","m")) summary(d) Show first few rows of data frame: head(d) $ to access named element: d$age

26 v = 1:10 v[1] v[1:3] v[c(1, 2, 4)] v[c(0, 2, 4)] Using logicals v<5 v[v<5] Subsetting vectors

27 These have intuitive names: sum mean var sd median min max Summary statistics

28 Probability distribution functions Normal distribution: dnorm(x, mean=2, sd=3) probability density function (pdf) at x pnorm cumulative distribution function (cdf) qnorm quantile function (inverse of cdf) rnorm(n, mean=1, sd=2) generate a vector of N random numbers. To set random number seed, if you may need to reproduce the results. set.seed(n) where n is a positive integer.

29 Other distributions Similar functions exist for a range of distributions: runif: uniform distribution rpois: Poisson rbinom: binomial rgamma: gamma rbeta: beta rexp: exponential rgeom: geometric rcauchy: Cauchy mvrnorm: multivariate Normal

Description/History Objects/Language Description Commonly Used Basic Functions. More Specific Functionality Further Resources

Description/History Objects/Language Description Commonly Used Basic Functions. More Specific Functionality Further Resources R Outline Description/History Objects/Language Description Commonly Used Basic Functions Basic Stats and distributions I/O Plotting Programming More Specific Functionality Further Resources www.r-project.org

More information

LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT

LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT NAVAL POSTGRADUATE SCHOOL LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT Statistics (OA3102) Lab #2: Sampling, Sampling Distributions, and the Central Limit Theorem Goal: Use R to demonstrate sampling

More information

R Programming Basics - Useful Builtin Functions for Statistics

R Programming Basics - Useful Builtin Functions for Statistics R Programming Basics - Useful Builtin Functions for Statistics Vectorized Arithmetic - most arthimetic operations in R work on vectors. Here are a few commonly used summary statistics. testvect = c(1,3,5,2,9,10,7,8,6)

More information

Scientific Computing: Lecture 1

Scientific Computing: Lecture 1 Scientific Computing: Lecture 1 Introduction to course, syllabus, software Getting started Enthought Canopy, TextWrangler editor, python environment, ipython, unix shell Data structures in Python Integers,

More information

Numeric Vectors STAT 133. Gaston Sanchez. Department of Statistics, UC Berkeley

Numeric Vectors STAT 133. Gaston Sanchez. Department of Statistics, UC Berkeley Numeric Vectors STAT 133 Gaston Sanchez Department of Statistics, UC Berkeley gastonsanchez.com github.com/gastonstat/stat133 Course web: gastonsanchez.com/stat133 Data Types and Structures To make the

More information

Getting started with simulating data in R: some helpful functions and how to use them Ariel Muldoon August 28, 2018

Getting started with simulating data in R: some helpful functions and how to use them Ariel Muldoon August 28, 2018 Getting started with simulating data in R: some helpful functions and how to use them Ariel Muldoon August 28, 2018 Contents Overview 2 Generating random numbers 2 rnorm() to generate random numbers from

More information

An Introduction to R- Programming

An Introduction to R- Programming An Introduction to R- Programming Hadeel Alkofide, Msc, PhD NOT a biostatistician or R expert just simply an R user Some slides were adapted from lectures by Angie Mae Rodday MSc, PhD at Tufts University

More information

Short Introduction to R

Short Introduction to R Short Introduction to R Paulino Pérez 1 José Crossa 2 1 ColPos-México 2 CIMMyT-México June, 2015. CIMMYT, México-SAGPDB Short Introduction to R 1/51 Contents 1 Introduction 2 Simple objects 3 User defined

More information

CREATING SIMULATED DATASETS Edition by G. David Garson and Statistical Associates Publishing Page 1

CREATING SIMULATED DATASETS Edition by G. David Garson and Statistical Associates Publishing Page 1 Copyright @c 2012 by G. David Garson and Statistical Associates Publishing Page 1 @c 2012 by G. David Garson and Statistical Associates Publishing. All rights reserved worldwide in all media. No permission

More information

Revising CS-M41. Oliver Kullmann Computer Science Department Swansea University. Linux Lab Swansea, December 13, 2011.

Revising CS-M41. Oliver Kullmann Computer Science Department Swansea University. Linux Lab Swansea, December 13, 2011. Computer Science Department Swansea University Linux Lab Swansea, December 13, 2011 How to use the revision lecture The purpose of this lecture (and the slides) is to emphasise the main topics of this

More information

Al al-bayt University Prince Hussein Bin Abdullah College for Information Technology Computer Science Department

Al al-bayt University Prince Hussein Bin Abdullah College for Information Technology Computer Science Department Al al-bayt University Prince Hussein Bin Abdullah College for Information Technology Computer Science Department 0901212 Python Programming 1 st Semester 2014/2015 Course Catalog This course introduces

More information

Random Number Generation and Monte Carlo Methods

Random Number Generation and Monte Carlo Methods James E. Gentle Random Number Generation and Monte Carlo Methods With 30 Illustrations Springer Contents Preface vii 1 Simulating Random Numbers from a Uniform Distribution 1 1.1 Linear Congruential Generators

More information

An introduction to R WS 2013/2014

An introduction to R WS 2013/2014 An introduction to R WS 2013/2014 Dr. Noémie Becker (AG Metzler) Dr. Sonja Grath (AG Parsch) Special thanks to: Dr. Martin Hutzenthaler (previously AG Metzler, now University of Frankfurt) course development,

More information

Revising CS-M41. Oliver Kullmann Computer Science Department Swansea University. Robert Recorde room Swansea, December 13, 2013.

Revising CS-M41. Oliver Kullmann Computer Science Department Swansea University. Robert Recorde room Swansea, December 13, 2013. Computer Science Department Swansea University Robert Recorde room Swansea, December 13, 2013 How to use the revision lecture The purpose of this lecture (and the slides) is to emphasise the main topics

More information

The R statistical computing environment

The R statistical computing environment The R statistical computing environment Luke Tierney Department of Statistics & Actuarial Science University of Iowa June 17, 2011 Luke Tierney (U. of Iowa) R June 17, 2011 1 / 27 Introduction R is a language

More information

Advanced Econometric Methods EMET3011/8014

Advanced Econometric Methods EMET3011/8014 Advanced Econometric Methods EMET3011/8014 Lecture 2 John Stachurski Semester 1, 2011 Announcements Missed first lecture? See www.johnstachurski.net/emet Weekly download of course notes First computer

More information

An introduction to R WS 2013/2014

An introduction to R WS 2013/2014 An introduction to R WS 2013/2014 Dr. Noémie Becker (AG Metzler) Dr. Sonja Grath (AG Parsch) Special thanks to: Dr. Martin Hutzenthaler (previously AG Metzler, now University of Frankfurt) course development,

More information

A Quick Introduction to R

A Quick Introduction to R Math 4501 Fall 2012 A Quick Introduction to R The point of these few pages is to give you a quick introduction to the possible uses of the free software R in statistical analysis. I will only expect you

More information

Today s Lecture. Factors & Sampling. Quick Review of Last Week s Computational Concepts. Numbers we Understand. 1. A little bit about Factors

Today s Lecture. Factors & Sampling. Quick Review of Last Week s Computational Concepts. Numbers we Understand. 1. A little bit about Factors Today s Lecture Factors & Sampling Jarrett Byrnes September 8, 2014 1. A little bit about Factors 2. Sampling 3. Describing your sample Quick Review of Last Week s Computational Concepts Numbers we Understand

More information

Introduction to Queueing Theory for Computer Scientists

Introduction to Queueing Theory for Computer Scientists Introduction to Queueing Theory for Computer Scientists Raj Jain Washington University in Saint Louis Jain@eecs.berkeley.edu or Jain@wustl.edu A Mini-Course offered at UC Berkeley, Sept-Oct 2012 These

More information

Introduction to machine learning, pattern recognition and statistical data modelling Coryn Bailer-Jones

Introduction to machine learning, pattern recognition and statistical data modelling Coryn Bailer-Jones Introduction to machine learning, pattern recognition and statistical data modelling Coryn Bailer-Jones What is machine learning? Data interpretation describing relationship between predictors and responses

More information

Chapter 3: Dynamic Testing Techniques

Chapter 3: Dynamic Testing Techniques Chapter 3: Dynamic Testing Techniques " The system was not fully tested to a satisfactory level of quality and resilience before full implementation on 26 October 1992." Extract from the main conclusions

More information

A Short Introduction to R

A Short Introduction to R A Short Introduction to R 1.1 The R initiative There are many commercial statistical softwares available. Well-known examples include SAS, SPSS, S-Plus, Minitab, Statgraphics, GLIM, and Genstat. Usually

More information

Lecture 3 - Object-oriented programming and statistical programming examples

Lecture 3 - Object-oriented programming and statistical programming examples Lecture 3 - Object-oriented programming and statistical programming examples Björn Andersson (w/ Ronnie Pingel) Department of Statistics, Uppsala University February 1, 2013 Table of Contents 1 Some notes

More information

An Introduction to the Bootstrap

An Introduction to the Bootstrap An Introduction to the Bootstrap Bradley Efron Department of Statistics Stanford University and Robert J. Tibshirani Department of Preventative Medicine and Biostatistics and Department of Statistics,

More information

Advanced R Programming - Lecture 1

Advanced R Programming - Lecture 1 Advanced R Programming - Lecture 1 Krzysztof Bartoszek (slides by Leif Jonsson and Måns Magnusson) Linköping University krzysztof.bartoszek@liu.se 29 August 2017 1/ 43 Today 1 Aim of the course 2 3 4 5

More information

R Short Course Session 1

R Short Course Session 1 R Short Course Session 1 Daniel Zhao, PhD Sixia Chen, PhD Department of Biostatistics and Epidemiology College of Public Health, OUHSC 10/23/2015 Outline Overview of the 5 sessions Pre-requisite requirements

More information

R is a programming language of a higher-level Constantly increasing amount of packages (new research) Free of charge Website:

R is a programming language of a higher-level Constantly increasing amount of packages (new research) Free of charge Website: Introduction to R R R is a programming language of a higher-level Constantly increasing amount of packages (new research) Free of charge Website: http://www.r-project.org/ Code Editor: http://rstudio.org/

More information

CSc 520. Course Outline (Subject to change) Course Outline (Subject to change)... Principles of Programming Languages. Christian Collberg

CSc 520. Course Outline (Subject to change) Course Outline (Subject to change)... Principles of Programming Languages. Christian Collberg Slide 0 2 Course Outline (Subject to change) This course will define, analyze and evaluate important concepts found in current programming languages. Its goals are to build an ability to evaluate and compare

More information

Introduction to Programming in C Department of Computer Science and Engineering. Lecture No. #16 Loops: Matrix Using Nested for Loop

Introduction to Programming in C Department of Computer Science and Engineering. Lecture No. #16 Loops: Matrix Using Nested for Loop Introduction to Programming in C Department of Computer Science and Engineering Lecture No. #16 Loops: Matrix Using Nested for Loop In this section, we will use the, for loop to code of the matrix problem.

More information

1/22/2018. Multivariate Applications in Ecology (BSC 747) Ecological datasets are very often large and complex

1/22/2018. Multivariate Applications in Ecology (BSC 747) Ecological datasets are very often large and complex Multivariate Applications in Ecology (BSC 747) Ecological datasets are very often large and complex Modern integrative approaches have allowed for collection of more data, challenge is proper integration

More information

CSE 336. Introduction to Programming. for Electronic Commerce. Why You Need CSE336

CSE 336. Introduction to Programming. for Electronic Commerce. Why You Need CSE336 CSE 336 Introduction to Programming for Electronic Commerce Why You Need CSE336 Concepts like bits and bytes, domain names, ISPs, IPAs, RPCs, P2P protocols, infinite loops, and cloud computing are strictly

More information

Getting Started. Slides R-Intro: R-Analytics: R-HPC:

Getting Started. Slides R-Intro:   R-Analytics:   R-HPC: Getting Started Download and install R + Rstudio http://www.r-project.org/ https://www.rstudio.com/products/rstudio/download2/ TACC ssh username@wrangler.tacc.utexas.edu % module load Rstats %R Slides

More information

Stochastic Models. Introduction to R. Walt Pohl. February 28, Department of Business Administration

Stochastic Models. Introduction to R. Walt Pohl. February 28, Department of Business Administration Stochastic Models Introduction to R Walt Pohl Universität Zürich Department of Business Administration February 28, 2013 What is R? R is a freely-available general-purpose statistical package, developed

More information

COMP Data Structures

COMP Data Structures Shahin Kamali Topic 1 - Introductions University of Manitoba Based on notes by S. Durocher. 1 / 35 Introduction Introduction 1 / 35 Introduction In a Glance... Data structures are building blocks for designing

More information

Linear transformations Affine transformations Transformations in 3D. Graphics 2009/2010, period 1. Lecture 5: linear and affine transformations

Linear transformations Affine transformations Transformations in 3D. Graphics 2009/2010, period 1. Lecture 5: linear and affine transformations Graphics 2009/2010, period 1 Lecture 5 Linear and affine transformations Vector transformation: basic idea Definition Examples Finding matrices Compositions of transformations Transposing normal vectors

More information

Organisation. Assessment

Organisation. Assessment Week 1 s s Getting Started 1 3 4 5 - - Lecturer Dr Lectures Tuesday 1-13 Fulton House Lecture room Tuesday 15-16 Fulton House Lecture room Thursday 11-1 Fulton House Lecture room Friday 10-11 Glyndwr C

More information

Business Statistics: R tutorials

Business Statistics: R tutorials Business Statistics: R tutorials Jingyu He September 29, 2017 Install R and RStudio R is a free software environment for statistical computing and graphics. Download free R and RStudio for Windows/Mac:

More information

Stat Wk 5. Random number generation. Special variables in data steps. Setting labels.

Stat Wk 5. Random number generation. Special variables in data steps. Setting labels. Stat 342 - Wk 5 Random number generation. Special variables in data steps. Setting labels. Do loops and data step behaviour. Example questions for the midterm. Stat 342 Notes. Week 3, Page 1 / 38 Random

More information

COMP Data Structures

COMP Data Structures COMP 2140 - Data Structures Shahin Kamali Topic 1 - Introductions University of Manitoba Based on notes by S. Durocher. COMP 2140 - Data Structures 1 / 35 Introduction COMP 2140 - Data Structures 1 / 35

More information

1 Introduction. 1.1 What is Statistics?

1 Introduction. 1.1 What is Statistics? 1 Introduction 1.1 What is Statistics? MATH1015 Biostatistics Week 1 Statistics is a scientific study of numerical data based on natural phenomena. It is also the science of collecting, organising, interpreting

More information

In this course, you need to use Pearson etext. Go to "Pearson etext and Video Notes".

In this course, you need to use Pearson etext. Go to Pearson etext and Video Notes. **Disclaimer** This syllabus is to be used as a guideline only. The information provided is a summary of topics to be covered in the class. Information contained in this document such as assignments, grading

More information

Visual Programming (CBVP2103) This course is worth 3 credit hours Will be covered in weeks Total 13 topics Assessment

Visual Programming (CBVP2103) This course is worth 3 credit hours Will be covered in weeks Total 13 topics Assessment CBVP2103 Visual Programming (CBVP2103) This course is worth 3 credit hours Will be covered in 12-14 weeks Total 13 topics Assessment Assignment 30% (submit by 8 th week) Final Exam 65% Class Participation

More information

CS 3030 Scripting Languages Syllabus

CS 3030 Scripting Languages Syllabus General Information CS 3030 Scripting Languages Semester: Fall 2017 Textbook: Location: Instructor Info: None. We will use freely available resources from the Internet. Online Ted Cowan tedcowan@weber.edu

More information

A VERY BRIEF INTRODUCTION TO R

A VERY BRIEF INTRODUCTION TO R CS 432/532 INTRODUCTION TO WEB SCIENCE A VERY BRIEF INTRODUCTION TO R SCOTT G. AINSWORTH OLD DOMINION UNIVERSITY WHO AM I? Scott G. Ainsworth Former sailor Worked for several consulting firms Computer

More information

1 Pencil and Paper stuff

1 Pencil and Paper stuff Spring 2008 - Stat C141/ Bioeng C141 - Statistics for Bioinformatics Course Website: http://www.stat.berkeley.edu/users/hhuang/141c-2008.html Section Website: http://www.stat.berkeley.edu/users/mgoldman

More information

An introduction to R: Organisation and Basics of Algorithmics

An introduction to R: Organisation and Basics of Algorithmics An introduction to R: Organisation and Basics of Algorithmics Noémie Becker, Benedikt Holtmann & Dirk Metzler 1 nbecker@bio.lmu.de - holtmann@bio.lmu.de Winter semester 2016-17 1 Special thanks to: Prof.

More information

An Introduction to R. Subhajit Dutta Stat-Math Unit. Indian Statistical Institute, Kolkata October 17, 2012

An Introduction to R. Subhajit Dutta Stat-Math Unit. Indian Statistical Institute, Kolkata October 17, 2012 An Introduction to R Subhajit Dutta Stat-Math Unit Indian Statistical Institute, Kolkata October 17, 2012 Why R? It is FREE!! Basic as well as specialized data analysis technique at your fingertips. Highly

More information

Vectors and Matrices Flow Control Plotting Functions Simulating Systems Installing Packages Getting Help Assignments. R Tutorial

Vectors and Matrices Flow Control Plotting Functions Simulating Systems Installing Packages Getting Help Assignments. R Tutorial R Tutorial Anup Aprem aaprem@ece.ubc.ca September 14, 2017 Installation Installing R: https://www.r-project.org/ Recommended to also install R Studio: https://www.rstudio.com/ Vectors Basic element is

More information

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015 STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, tsvv@steno.dk, Steno Diabetes Center June 11, 2015 Contents 1 Introduction 1 2 Recap: Variables 2 3 Data Containers 2 3.1 Vectors................................................

More information

Intro Intro.3

Intro Intro.3 Intro.1 Intro.2 Introduction to R Much of the content here is from Appendix A of my Analysis of Categorical Data with R book (www.chrisbilder.com/ categorical). All R code is available in AppendixInitialExamples.R

More information

CSE 20. Lecture 4: Number System and Boolean Function. CSE 20: Lecture2

CSE 20. Lecture 4: Number System and Boolean Function. CSE 20: Lecture2 CSE 20 Lecture 4: Number System and Boolean Function Next Weeks Next week we will do Unit:NT, Section 1. There will be an assignment set posted today. It is just for practice. Boolean Functions and Number

More information

Biostatistics & SAS programming. Kevin Zhang

Biostatistics & SAS programming. Kevin Zhang Biostatistics & SAS programming Kevin Zhang February 27, 2017 Random variables and distributions 1 Data analysis Simulation study Apply existing methodologies to your collected samples, with the hope to

More information

Introduction to R: Part I

Introduction to R: Part I Introduction to R: Part I Jeffrey C. Miecznikowski March 26, 2015 R impact R is the 13th most popular language by IEEE Spectrum (2014) Google uses R for ROI calculations Ford uses R to improve vehicle

More information

Introduction to Scientific Python, CME 193 Jan. 9, web.stanford.edu/~ermartin/teaching/cme193-winter15

Introduction to Scientific Python, CME 193 Jan. 9, web.stanford.edu/~ermartin/teaching/cme193-winter15 1 LECTURE 1: INTRO Introduction to Scientific Python, CME 193 Jan. 9, 2014 web.stanford.edu/~ermartin/teaching/cme193-winter15 Eileen Martin Some slides are from Sven Schmit s Fall 14 slides 2 Course Details

More information

On R for Statistics. Subhajit Dutta Stat-Math Unit. Indian Statistical Institute, Kolkata September 16, 2011

On R for Statistics. Subhajit Dutta Stat-Math Unit. Indian Statistical Institute, Kolkata September 16, 2011 On R for Statistics Subhajit Dutta Stat-Math Unit Indian Statistical Institute, Kolkata September 16, 2011 Why R? It is FREE!! Basic as well as specialized data analysis technique at your fingertips. Highly

More information

Package capwire. February 19, 2015

Package capwire. February 19, 2015 Type Package Package capwire February 19, 2015 Title Estimates population size from non-invasive sampling Version 1.1.4 Date 2012-08-16 Author and Craig R. Miller Maintainer Fits

More information

Package simed. November 27, 2017

Package simed. November 27, 2017 Version 1.0.3 Title Simulation Education Author Barry Lawson, Larry Leemis Package simed November 27, 2017 Maintainer Barry Lawson Imports graphics, grdevices, methods, stats, utils

More information

Course Name: Database Systems - 1 Course Code: IS211

Course Name: Database Systems - 1 Course Code: IS211 Course Name: Database Systems - 1 Course Code: IS211 I. Basic Course Information Major or minor element of program: General Department offering the course: Information Systems Department Academic level:

More information

7 Control Structures, Logical Statements

7 Control Structures, Logical Statements 7 Control Structures, Logical Statements 7.1 Logical Statements 1. Logical (true or false) statements comparing scalars or matrices can be evaluated in MATLAB. Two matrices of the same size may be compared,

More information

Algorithms and Data Structures. Algorithms and Data Structures. Algorithms and Data Structures. Algorithms and Data Structures

Algorithms and Data Structures. Algorithms and Data Structures. Algorithms and Data Structures. Algorithms and Data Structures Richard Mayr Slides adapted from Mary Cryan (2015/16) with some changes. School of Informatics University of Edinburgh ADS (2018/19) Lecture 1 slide 1 ADS (2018/19) Lecture 1 slide 3 ADS (2018/19) Lecture

More information

Introduction to R. Course in Practical Analysis of Microarray Data Computational Exercises

Introduction to R. Course in Practical Analysis of Microarray Data Computational Exercises Introduction to R Course in Practical Analysis of Microarray Data Computational Exercises 2010 March 22-26, Technischen Universität München Amin Moghaddasi, Kurt Fellenberg 1. Installing R. Check whether

More information

Course Name: Database Design Course Code: IS414

Course Name: Database Design Course Code: IS414 Course Name: Database Design Course Code: IS414 I. Basic Course Information Major or minor element of program: Both Major Minor Department offering the course: Information Systems Department Academic level:

More information

Mails : ; Document version: 14/09/12

Mails : ; Document version: 14/09/12 Mails : leslie.regad@univ-paris-diderot.fr ; gaelle.lelandais@univ-paris-diderot.fr Document version: 14/09/12 A freely available language and environment Statistical computing Graphics Supplementary

More information

CS 3030 Scripting Languages Syllabus

CS 3030 Scripting Languages Syllabus General Information CS 3030 Scripting Languages Semester: Summer 2013 Textbook: Location: Instructor Info: Website: None. We will use freely available resources from the Internet. Online Ted Cowan tedcowan@weber.edu

More information

STA 313: Topics in Statistics

STA 313: Topics in Statistics Al Nosedal. University of Toronto. Fall 2015 essentially, all models are wrong, but some are useful George E. P. Box (one of the great statistical minds of the 20th century). What is R? R language essentials

More information

An introduction to R 1 / 29

An introduction to R 1 / 29 An introduction to R 1 / 29 What is R? R is an integrated suite of software facilities for data manipulation, calculation and graphical display. Among other things it has: an effective data handling and

More information

R programming Philip J Cwynar University of Pittsburgh School of Information Sciences and Intelligent Systems Program

R programming Philip J Cwynar University of Pittsburgh School of Information Sciences and Intelligent Systems Program R programming Philip J Cwynar University of Pittsburgh School of Information Sciences and Intelligent Systems Program Background R is a programming language and software environment for statistical analysis,

More information

MS in Applied Statistics: Study Guide for the Data Science concentration Comprehensive Examination. 1. MAT 456 Applied Regression Analysis

MS in Applied Statistics: Study Guide for the Data Science concentration Comprehensive Examination. 1. MAT 456 Applied Regression Analysis MS in Applied Statistics: Study Guide for the Data Science concentration Comprehensive Examination. The Part II comprehensive examination is a three-hour closed-book exam that is offered on the second

More information

Introduction to R. Daniel Berglund. 9 November 2017

Introduction to R. Daniel Berglund. 9 November 2017 Introduction to R Daniel Berglund 9 November 2017 1 / 15 R R is available at the KTH computers If you want to install it yourself it is available at https://cran.r-project.org/ Rstudio an IDE for R is

More information

Columbus State Community College Mathematics Department Public Syllabus. Course and Number: MATH 1172 Engineering Mathematics A

Columbus State Community College Mathematics Department Public Syllabus. Course and Number: MATH 1172 Engineering Mathematics A Columbus State Community College Mathematics Department Public Syllabus Course and Number: MATH 1172 Engineering Mathematics A CREDITS: 5 CLASS HOURS PER WEEK: 5 PREREQUISITES: MATH 1151 with a C or higher

More information

Why use R? Getting started. Why not use R? Introduction to R: It s hard to use at first. To perform inferential statistics (e.g., use a statistical

Why use R? Getting started. Why not use R? Introduction to R: It s hard to use at first. To perform inferential statistics (e.g., use a statistical Why use R? Introduction to R: Using R for statistics ti ti and data analysis BaRC Hot Topics November 2013 George W. Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ To perform inferential

More information

Finite Math - J-term Homework. Section Inverse of a Square Matrix

Finite Math - J-term Homework. Section Inverse of a Square Matrix Section.5-77, 78, 79, 80 Finite Math - J-term 017 Lecture Notes - 1/19/017 Homework Section.6-9, 1, 1, 15, 17, 18, 1, 6, 9, 3, 37, 39, 1,, 5, 6, 55 Section 5.1-9, 11, 1, 13, 1, 17, 9, 30 Section.5 - Inverse

More information

UP School of Statistics Student Council Education and Research

UP School of Statistics Student Council Education and Research w UP School of Statistics Student Council Education and Research erho.weebly.com 0 erhomyhero@gmail.com f /erhoismyhero t @erhomyhero S133_HOA_001 Statistics 133 Bayesian Statistical Inference Use of R

More information

Introduction to R Benedikt Brors Dept. Intelligent Bioinformatics Systems German Cancer Research Center

Introduction to R Benedikt Brors Dept. Intelligent Bioinformatics Systems German Cancer Research Center Introduction to R Benedikt Brors Dept. Intelligent Bioinformatics Systems German Cancer Research Center What is R? R is a statistical computing environment with graphics capabilites It is fully scriptable

More information

Software Testing Prof. Meenakshi D Souza Department of Computer Science and Engineering International Institute of Information Technology, Bangalore

Software Testing Prof. Meenakshi D Souza Department of Computer Science and Engineering International Institute of Information Technology, Bangalore Software Testing Prof. Meenakshi D Souza Department of Computer Science and Engineering International Institute of Information Technology, Bangalore Lecture 04 Software Test Automation: JUnit as an example

More information

Introduction to Databases Fall-Winter 2010/11. Syllabus

Introduction to Databases Fall-Winter 2010/11. Syllabus Introduction to Databases Fall-Winter 2010/11 Syllabus Werner Nutt Syllabus Lecturer Werner Nutt, nutt@inf.unibz.it, Room POS 2.09 Office hours: Tuesday, 14:00 16:00 and by appointment (If you want to

More information

GEOMETRY. Teacher: LAIRD JONAS ADDRESS:

GEOMETRY. Teacher: LAIRD JONAS  ADDRESS: GEOMETRY Teacher: LAIRD JONAS EMAIL ADDRESS: ljonas@erhsnyc.net EMILY MELNICK emelnick@erhsnyc.net Year: 2015-2016 Course Description In Geometry, we will explore the relationships between points, lines,

More information

Outline EXPERIENCE WITH TWO OOP LANGUAGES IN ONE COURSE. HISTORY Methodology and learning design of the course Experience from classes

Outline EXPERIENCE WITH TWO OOP LANGUAGES IN ONE COURSE. HISTORY Methodology and learning design of the course Experience from classes EXPERIENCE WITH TWO OOP LANGUAGES IN ONE COURSE Ana Madevska Bogdanova, FCSE, Skopje, Macedoia Outline HISTORY Methodology and learning design of the course Experience from classes Lectures exercises Questionnaire

More information

This document is designed to get you started with using R

This document is designed to get you started with using R An Introduction to R This document is designed to get you started with using R We will learn about what R is and its advantages over other statistics packages the basics of R plotting data and graphs What

More information

Package RegressionFactory

Package RegressionFactory Type Package Package RegressionFactory September 8, 2016 Title Expander Functions for Generating Full Gradient and Hessian from Single-Slot and Multi-Slot Base Distributions Version 0.7.2 Date 2016-09-07

More information

Algorithms and Data Structures

Algorithms and Data Structures Algorithms and Data Structures or, Classical Algorithms of the 50s, 60s, 70s Richard Mayr Slides adapted from Mary Cryan (2015/16) with small changes. School of Informatics University of Edinburgh ADS

More information

COS 333: Advanced Programming Techniques

COS 333: Advanced Programming Techniques COS 333: Advanced Programming Techniques Robert M. Dondero, Ph.D. Princeton University Please pick up handouts at the back of the room 1 COS 333: Course Overview Copyright 2018 by Robert M. Dondero, Ph.D.

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) SAS Adv. Analytics or Predictive Modelling:- Class Room: Training Fee & Duration : 30K & 3 Months Online Training Fee & Duration : 33K & 3 Months Learning SAS:

More information

In this course, you need to use Pearson etext. Go to "Pearson etext and Video Notes".

In this course, you need to use Pearson etext. Go to Pearson etext and Video Notes. **Disclaimer** This syllabus is to be used as a guideline only. The information provided is a summary of topics to be covered in the class. Information contained in this document such as assignments, grading

More information

RNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University

RNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University RNA-Seq Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University joshua.ainsley@tufts.edu Day four Quantifying expression Intro to R Differential expression

More information

CS 4800: Algorithms & Data. Lecture 1 January 10, 2017

CS 4800: Algorithms & Data. Lecture 1 January 10, 2017 CS 4800: Algorithms & Data Lecture 1 January 10, 2017 Huy L. Nguyen Email: hu.nguyen@northeastern.edu Office hours: Tuesday 1:20 3:20, WVH 358 Research: Algorithms for massive data sets ( big data ) Theoretical

More information

CS240: Programming in C

CS240: Programming in C CS240: Programming in C Lecture 1: Class overview. Cristina Nita-Rotaru Lecture 1/ Fall 2013 1 WELCOME to CS240 Cristina Nita-Rotaru Lecture 1/ Fall 2013 2 240 Team Instructor: Cristina Nita-Rotaru Special

More information

Database Systems (INFR10070) Dr Paolo Guagliardo. University of Edinburgh. Fall 2016

Database Systems (INFR10070) Dr Paolo Guagliardo. University of Edinburgh. Fall 2016 Database Systems (INFR10070) Dr Paolo Guagliardo University of Edinburgh Fall 2016 Databases are everywhere Electronic commerce, websites (e.g., Wordpress blogs) Banking applications, booking systems,

More information

CSCI455: Introduction to Programming System Design

CSCI455: Introduction to Programming System Design CSCI455: Introduction to Programming System Design Claire Bono bono@usc.edu Spring 2019 http://bytes.usc.edu/cs455/ 455 Intro [Bono] 1 Today s topics Course overview and logistics Academic integrity Java

More information

PIANOS requirements specifications

PIANOS requirements specifications PIANOS requirements specifications Group Linja Helsinki 7th September 2005 Software Engineering Project UNIVERSITY OF HELSINKI Department of Computer Science Course 581260 Software Engineering Project

More information

Monte Carlo Simula/on and Copula Func/on. by Gerardo Ferrara

Monte Carlo Simula/on and Copula Func/on. by Gerardo Ferrara Monte Carlo Simula/on and Copula Func/on by Gerardo Ferrara Introduc)on A Monte Carlo method is a computational algorithm that relies on repeated random sampling to compute its results. In a nutshell,

More information

The History and Use of R. Joseph Kambourakis

The History and Use of R. Joseph Kambourakis The History and Use of R Joseph Kambourakis Ground Rules Interrupt me These are all my opinions and not of EMC or Big Data Analytics, Discovery & Visualization Meetup Slides will be available Joseph

More information

Weekly Discussion Sections & Readings

Weekly Discussion Sections & Readings Weekly Discussion Sections & Readings Teaching Fellows (TA) Name Office Email Mengting Gu Bass 437 mengting.gu (at) yale.edu Paul Muir Bass437 Paul.muir (at) yale.edu Please E-mail cbb752@gersteinlab.org

More information

Teaching Manual Math 2131

Teaching Manual Math 2131 Math 2131 Linear Algebra Labs with MATLAB Math 2131 Linear algebra with Matlab Teaching Manual Math 2131 Contents Week 1 3 1 MATLAB Course Introduction 5 1.1 The MATLAB user interface...........................

More information

Last time. Reasoning about programs. Coming up. Project Final Presentations. This Thursday, Nov 30: 4 th in-class exercise

Last time. Reasoning about programs. Coming up. Project Final Presentations. This Thursday, Nov 30: 4 th in-class exercise Last time Reasoning about programs Coming up This Thursday, Nov 30: 4 th in-class exercise sign up for group on moodle bring laptop to class Final projects: final project presentations: Tue Dec 12, in

More information

Reasoning about programs

Reasoning about programs Reasoning about programs Last time Coming up This Thursday, Nov 30: 4 th in-class exercise sign up for group on moodle bring laptop to class Final projects: final project presentations: Tue Dec 12, in

More information

limma: A brief introduction to R

limma: A brief introduction to R limma: A brief introduction to R Natalie P. Thorne September 5, 2006 R basics i R is a command line driven environment. This means you have to type in commands (line-by-line) for it to compute or calculate

More information

STAT 135 Lab 1 Solutions

STAT 135 Lab 1 Solutions STAT 135 Lab 1 Solutions January 26, 2015 Introduction To complete this lab, you will need to have access to R and RStudio. If you have not already done so, you can download R from http://cran.cnr.berkeley.edu/,

More information

CSE 167: Introduction to Computer Graphics. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2016

CSE 167: Introduction to Computer Graphics. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2016 CSE 167: Introduction to Computer Graphics Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2016 Today Course organization Course overview 2 Course Staff Instructor Jürgen Schulze,

More information