Apply. A. Michelle Lawing Ecosystem Science and Management Texas A&M University College Sta,on, TX

Size: px
Start display at page:

Download "Apply. A. Michelle Lawing Ecosystem Science and Management Texas A&M University College Sta,on, TX"

Transcription

1 Apply A. Michelle Lawing Ecosystem Science and Management Texas A&M University College Sta,on, TX

2 Schedule for today My presenta,on Review New stuff Mixed, Fixed, and Random Models presenta,on Mixed, Fixed, and Random Models demonstra,on BREAK Mixed, Fixed, and Random Models tutorial

3 Homework - Programming Create a func,on that plots a random walk by using rnorm. The func,on should read in the ini,al trait value and the number of,me steps.

4 Review: Model Selec,on Use a CRITERION for evalua,ng and comparing models Use a STRATEGY for searching all the possibili,es or a subset of reasonable possibili,es

5 Review: Model Selec,on We learned about several criteria for model selec,on R2 AIC BIC Mallow s Cp Stepwise Procedures Cross Valida,on Strategy to construct models

6 Review: Bootstrap What do you do if the assump,ons of the method to evaluate data are violated? Ogen we use computer intensive approaches (the power of the computer is used to create a sampling distribu,on) Es,mate with a bootstrap Hypothesis test with a permuta,on

7 Review: Bootstrap Bootstrap is primarily used in es,ma,on Value of a parameter Probability Standard error Confidence interval

8 Permuta,on Test Generated a null distribu,on for the associa,on between two or more variables Repeated random rearrangements of one of the variables

9 Permuta,on Test Rank tests are permuta,on tests e.g., Mann- Whitney U- test Data are replaced by their ranks Ranks are permuted repeatedly to generate null distribu,on Exact probability distribu,on is known

10 Permuta,on Test Why replace data with ranks? Permute the data themselves Probability distribu,on is unknown, so we generate a large number of permuta,ons instead

11 Example of a Permuta,on Test Male sage crickets offer females their hind wings to nibble on during ma,ng. Females receive nutri,on from feeding on the wings. Johnson et al. (1999) asked Are females more likely to mate if they are hungry?

12 Sage Cricket Ma,ng

13 Example of a Permuta,on Test 24 females were divided into 2 groups 11 were starved for two days 13 were fed Frequency Starved Wai,ng,me to ma,ng was recorded Data are not normally distributed Frequency Fed Hours to feeding

14 Example of a Permuta,on Test Null hypothesis is that the mean,me to ma,ng is the same for the starved and the fed groups Alterna,ve hypothesis is that the mean,me to ma,ng is NOT the same between the two groups

15 Example of a Permuta,on Test

16 Example of a Permuta,on Test

17 Example of a Permuta,on Test

18 Permuta,on Assump,ons Samples are random Assumes the distribu,on of variables is similar in every popula,on Robust to departure from equal- shape assump,on when sample size is large These tests have lower power than parametric tests with sample size is small. Power is similar at large sample sizes.

19 Why you should only use permuta,ons as a last resort Only provide p- value Provides no es,mate of a useful parameter

20 Review: Mul,variate Ordina,on Exploratory data analysis Orders objects so that similar ones are near to each other and dissimilar ones are far away There are many ordina,on techniques

21 Review: Mul,variate Ordina,on Ordina,on math goes both ways Eigenvectors describe how to transform data from original coordinates to PCs and back again Singular vectors are the same for CA.

22 Review: Mul,variate Ordina,on Mul,ply PC scores by eigenvector matrix and add back X and Y mean to get the original X and Y scores

23 apply

24 apply Q: How can I use a loop to [ insert task here ]? A: Don t. Use one of the apply func@ons. So, what is apply and how does it work?

25 Geqng Help?apply Apply Func,ons Over Array Margins?by Apply a Func,on to a Data Frame Split by Factors?eapply?lapply?mapply?rapply?tapply Apply a Func,on Over Values in an Environment Apply a Func,on over a List or Vector Apply a Func,on to Mul,ple List or Vector Arguments Recursively Apply a Func,on to a List Apply a Func,on Over a Ragged Array

26 apply Returns a vector or array or list of values obtained by applying a func@on to margins of an array or matrix. We know about vectors/arrays and func,ons, but what are these margins? Simple: either the rows (1), the columns (2) or both (1:2). By both, we mean apply the func,on to each individual value. An example:

27 by by is an object- oriented wrapper for tapply applied to data frames. The by func,on is a lisle more complex. The documenta,on tells you that a data frame is split by row into data frames subsesed by the values of one or more factors, and func,on FUN is applied to each subset in turn. So, we use this one where factors are involved. To illustrate, we use iris :

28 eapply eapply applies FUN to the named values from an environment and returns the results as a list. This one is a lisle trickier, since you need to know something about environments in R. An environment, as the name suggests, is a self- contained object with its own variables and func,ons. To con,nue using our very simple example: I don t ogen create my own environments, but they re commonly used by R packages such as so it s good to know how to handle them.

29 lapply lapply returns a list of the same length as X, each element of which is the result of applying FUN to the corresponding element of X. That s a nice, clear descrip,on which makes lapply one of the easier apply func,ons to understand. A simple example: The lapply documenta,on tells us to consult further documenta,on for sapply, vapply and replicate. Let s do that.

30 sapply sapply is a user- friendly version of lapply by default returning a vector or matrix if appropriate. That simply means that if lapply would have returned a list with elements $a and $b, sapply will return either a vector, with elements [[ a ]] and [[ b ]], or a matrix with column names a and b. Returning to our previous simple example:

31 vapply vapply is similar to sapply, but has a pre- specified type of return value, so it can be safer (and some@mes faster) to use. A third argument is supplied to vapply, which you can think of as a kind of template for the output. The documenta,on uses the fivenum func,on as an example, so let s go with that: So, vapply returned a matrix, where the column names correspond to the original list elements and the row names to the output template. Nice!

32 replicate replicate is a wrapper for the common use of sapply for repeated evalua@on of an expression (which will usually involve random number genera@on). The replicate func,on is very useful. Give it two mandatory arguments: the number of replica,ons and the func,on to replicate; a third op,onal argument, simplify = T, tries to simplify the result to a vector or matrix. An example let s simulate 10 normal distribu,ons, each with 10 observa,ons: > replicate(10, rnorm(10))

33 mapply mapply is a mul@variate version of sapply. mapply applies FUN to the first elements of each ( ) argument, the second elements, the third elements, and so on. The mapply documenta,on is full of quite complex examples, but here s a simple, silly one: Here, we sum l1$a[1] + l1$b[1] + l2$c[1] + l2$d[1] ( ) to get 64, the first element of the returned list. All the way through to l1$a[10] + l1$b[10] + l2$c[10] + l2$d[10] ( ) = 100, the last element.

34 tapply Apply a func@on to each cell of a ragged array, that is to each (non- empty) group of values given by a unique combina@on of the levels of certain factors. That sounds complicated. It becomes clearer when the required arguments are described. Usage is tapply(x, INDEX, FUN = NULL,, simplify = TRUE), where X is an atomic object, typically a vector and INDEX is a list of factors, each of same length as X.

35 Summary of apply Things to consider What class is my input data? vector, matrix, data frame On which subsets of that data do I want the func,on to act? rows, columns, all values What class will the func,on return? How is the original data structure transformed? It s the usual input- process- output story: what do I have, what do I want and what lies in between?

Sta$s$cs & Experimental Design with R. Barbara Kitchenham Keele University

Sta$s$cs & Experimental Design with R. Barbara Kitchenham Keele University Sta$s$cs & Experimental Design with R Barbara Kitchenham Keele University 1 Comparing two or more groups Part 5 2 Aim To cover standard approaches for independent and dependent groups For two groups Student

More information

The Grammar of Graphics

The Grammar of Graphics The Grammar of Graphics A. Michelle Lawing Ecosystem Science and Management Texas A&M University College Sta,on, TX 77843 alawing@tamu.edu michellelawing.info/rcode ANNOUNCEMENT Ecological Integra,on Symposium

More information

Package future.apply

Package future.apply Version 1.0.0 Package future.apply June 20, 2018 Title Apply Function to Elements in Parallel using Futures Depends R (>= 3.2.0), future (>= 1.8.1) Imports globals (>= 0.12.0) Suggests datasets, stats,

More information

Introduction to the R Language

Introduction to the R Language Introduction to the R Language Loop Functions Biostatistics 140.776 1 / 32 Looping on the Command Line Writing for, while loops is useful when programming but not particularly easy when working interactively

More information

Intermediate Programming in R Session 4: Avoiding Loops. Olivia Lau, PhD

Intermediate Programming in R Session 4: Avoiding Loops. Olivia Lau, PhD Intermediate Programming in R Session 4: Avoiding Loops Olivia Lau, PhD Outline Thinking in Parallel Vectorization Avoiding Loops with Homogenous Data Structures Avoiding Loops with Heterogenous Data Structures

More information

the R environment The R language is an integrated suite of software facilities for:

the R environment The R language is an integrated suite of software facilities for: the R environment The R language is an integrated suite of software facilities for: Data Handling and storage Matrix Math: Manipulating matrices, vectors, and arrays Statistics: A large, integrated set

More information

Topics. Data Types, Control Flow, Func9ons & Programming. Data Types. Vectoriza9on 8/22/10

Topics. Data Types, Control Flow, Func9ons & Programming. Data Types. Vectoriza9on 8/22/10 Topics Data Types, Control Flow, Func9ons & Programming Data types Vectoriza9on Missing values Func9on calls and seman9cs copying values, lazy evalua9on, scope & symbol evalua9on. Control flow Wri9ng func9ons

More information

Lecture 14: The Split-Apply-Combine Paradigm Statistical Computing, Monday November 9, 2015

Lecture 14: The Split-Apply-Combine Paradigm Statistical Computing, Monday November 9, 2015 Lecture 14: The Split-Apply-Combine Paradigm Statistical Computing, 36-350 Monday November 9, 2015 Outline A quick reminder of what R can do How to make life easier with repeated tasks on large data sets

More information

Search Engines. Informa1on Retrieval in Prac1ce. Annota1ons by Michael L. Nelson

Search Engines. Informa1on Retrieval in Prac1ce. Annota1ons by Michael L. Nelson Search Engines Informa1on Retrieval in Prac1ce Annota1ons by Michael L. Nelson All slides Addison Wesley, 2008 Evalua1on Evalua1on is key to building effec$ve and efficient search engines measurement usually

More information

Sta$s$cs & Experimental Design with R. Barbara Kitchenham Keele University

Sta$s$cs & Experimental Design with R. Barbara Kitchenham Keele University Sta$s$cs & Experimental Design with R Barbara Kitchenham Keele University 1 Analysis of Variance Mul$ple groups with Normally distributed data 2 Experimental Design LIST Factors you may be able to control

More information

R workshop. Ac,on! (Operators and Func,ons)

R workshop. Ac,on! (Operators and Func,ons) R workshop Ac,on! (Operators and Func,ons) Ac,ng on variables In the previous sec,on, we talked about objects and data sets Now let s do something with them Verbs Operators Func,ons Simple calcula,on Operator

More information

Minimum Redundancy and Maximum Relevance Feature Selec4on. Hang Xiao

Minimum Redundancy and Maximum Relevance Feature Selec4on. Hang Xiao Minimum Redundancy and Maximum Relevance Feature Selec4on Hang Xiao Background Feature a feature is an individual measurable heuris4c property of a phenomenon being observed In character recogni4on: horizontal

More information

CSCI 599 Class Presenta/on. Zach Levine. Markov Chain Monte Carlo (MCMC) HMM Parameter Es/mates

CSCI 599 Class Presenta/on. Zach Levine. Markov Chain Monte Carlo (MCMC) HMM Parameter Es/mates CSCI 599 Class Presenta/on Zach Levine Markov Chain Monte Carlo (MCMC) HMM Parameter Es/mates April 26 th, 2012 Topics Covered in this Presenta2on A (Brief) Review of HMMs HMM Parameter Learning Expecta2on-

More information

Thinking Induc,vely. COS 326 David Walker Princeton University

Thinking Induc,vely. COS 326 David Walker Princeton University Thinking Induc,vely COS 326 David Walker Princeton University slides copyright 2017 David Walker permission granted to reuse these slides for non-commercial educa,onal purposes Administra,on 2 Assignment

More information

Repetition Through Recursion

Repetition Through Recursion Fundamentals of Computer Science I (CS151.02 2007S) Repetition Through Recursion Summary: In many algorithms, you want to do things again and again and again. For example, you might want to do something

More information

STA 4273H: Sta-s-cal Machine Learning

STA 4273H: Sta-s-cal Machine Learning STA 4273H: Sta-s-cal Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! h0p://www.cs.toronto.edu/~rsalakhu/ Lecture 3 Parametric Distribu>ons We want model the probability

More information

Bootstrapped and Means Trimmed One Way ANOVA and Multiple Comparisons in R

Bootstrapped and Means Trimmed One Way ANOVA and Multiple Comparisons in R Bootstrapped and Means Trimmed One Way ANOVA and Multiple Comparisons in R Another way to do a bootstrapped one-way ANOVA is to use Rand Wilcox s R libraries. Wilcox (2012) states that for one-way ANOVAs,

More information

CSE 473: Ar+ficial Intelligence Uncertainty and Expec+max Tree Search

CSE 473: Ar+ficial Intelligence Uncertainty and Expec+max Tree Search CSE 473: Ar+ficial Intelligence Uncertainty and Expec+max Tree Search Instructors: Luke ZeDlemoyer Univeristy of Washington [These slides were adapted from Dan Klein and Pieter Abbeel for CS188 Intro to

More information

Stat 579: List Objects

Stat 579: List Objects Stat 579: List Objects Ranjan Maitra 2220 Snedecor Hall Department of Statistics Iowa State University. Phone: 515-294-7757 maitra@iastate.edu, 1/10 Example: Eigenvalues of a matrix mm

More information

CS 6140: Machine Learning Spring 2017

CS 6140: Machine Learning Spring 2017 CS 6140: Machine Learning Spring 2017 Instructor: Lu Wang College of Computer and Informa@on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Logis@cs Grades

More information

Discussion Notes 3 Stepwise Regression and Model Selection

Discussion Notes 3 Stepwise Regression and Model Selection Discussion Notes 3 Stepwise Regression and Model Selection Stepwise Regression There are many different commands for doing stepwise regression. Here we introduce the command step. There are many arguments

More information

file:///users/williams03/a/workshops/2015.march/final/intro_to_r.html

file:///users/williams03/a/workshops/2015.march/final/intro_to_r.html Intro to R R is a functional programming language, which means that most of what one does is apply functions to objects. We will begin with a brief introduction to R objects and how functions work, and

More information

CS60092: Informa0on Retrieval

CS60092: Informa0on Retrieval Introduc)on to CS60092: Informa0on Retrieval Sourangshu Bha1acharya Today s lecture hypertext and links We look beyond the content of documents We begin to look at the hyperlinks between them Address ques)ons

More information

Ways to implement a language

Ways to implement a language Interpreters Implemen+ng PLs Most of the course is learning fundamental concepts for using PLs Syntax vs. seman+cs vs. idioms Powerful constructs like closures, first- class objects, iterators (streams),

More information

CS 6140: Machine Learning Spring Final Exams. What we learned Final Exams 2/26/16

CS 6140: Machine Learning Spring Final Exams. What we learned Final Exams 2/26/16 Logis@cs CS 6140: Machine Learning Spring 2016 Instructor: Lu Wang College of Computer and Informa@on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Assignment

More information

CS 6140: Machine Learning Spring 2016

CS 6140: Machine Learning Spring 2016 CS 6140: Machine Learning Spring 2016 Instructor: Lu Wang College of Computer and Informa?on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Logis?cs Assignment

More information

Special Topic: Missing Values. Missing Can Mean Many Things. Missing Values Common in Real Data

Special Topic: Missing Values. Missing Can Mean Many Things. Missing Values Common in Real Data Special Topic: Missing Values Missing Values Common in Real Data Pneumonia: 6.3% of attribute values are missing one attribute is missing in 61% of cases C-Section: only about 1/2% of attribute values

More information

Information Retrieval and Web Search Engines

Information Retrieval and Web Search Engines Information Retrieval and Web Search Engines Lecture 7: Document Clustering December 4th, 2014 Wolf-Tilo Balke and José Pinto Institut für Informationssysteme Technische Universität Braunschweig The Cluster

More information

Unit 5: Estimating with Confidence

Unit 5: Estimating with Confidence Unit 5: Estimating with Confidence Section 8.3 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Unit 5 Estimating with Confidence 8.1 8.2 8.3 Confidence Intervals: The Basics Estimating

More information

Ar#ficial Intelligence

Ar#ficial Intelligence Ar#ficial Intelligence Advanced Searching Prof Alexiei Dingli Gene#c Algorithms Charles Darwin Genetic Algorithms are good at taking large, potentially huge search spaces and navigating them, looking for

More information

Network Analysis Integra2ve Genomics module

Network Analysis Integra2ve Genomics module Network Analysis Integra2ve Genomics module Michael Inouye Centre for Systems Genomics University of Melbourne, Australia Summer Ins@tute in Sta@s@cal Gene@cs 2016 SeaBle, USA @minouye271 inouyelab.org

More information

Model selection Outline for today

Model selection Outline for today Model selection Outline for today The problem of model selection Choose among models by a criterion rather than significance testing Criteria: Mallow s C p and AIC Search strategies: All subsets; stepaic

More information

SPSS 11.5 for Windows Assignment 2

SPSS 11.5 for Windows Assignment 2 1 SPSS 11.5 for Windows Assignment 2 Material covered: Generating frequency distributions and descriptive statistics, converting raw scores to standard scores, creating variables using the Compute option,

More information

Lecture 3 - Object-oriented programming and statistical programming examples

Lecture 3 - Object-oriented programming and statistical programming examples Lecture 3 - Object-oriented programming and statistical programming examples Björn Andersson (w/ Ronnie Pingel) Department of Statistics, Uppsala University February 1, 2013 Table of Contents 1 Some notes

More information

Lab 1- Introduction to Motion

Lab 1- Introduction to Motion Partner : Purpose Partner 2: Lab - Section: The purpose of this lab is to learn via a motion detector the relationship between position and velocity. Remember that this device measures the position of

More information

3. Replace any row by the sum of that row and a constant multiple of any other row.

3. Replace any row by the sum of that row and a constant multiple of any other row. Math Section. Section.: Solving Systems of Linear Equations Using Matrices As you may recall from College Algebra or Section., you can solve a system of linear equations in two variables easily by applying

More information

Using The foreach Package

Using The foreach Package Steve Weston doc@revolutionanalytics.com December 9, 2017 1 Introduction One of R s most useful features is its interactive interpreter. This makes it very easy to learn and experiment with R. It allows

More information

Local defini1ons. Func1on mul1ples- of

Local defini1ons. Func1on mul1ples- of Local defini1ons The func1ons and special forms we ve seen so far can be arbitrarily nested except define and check- expect. So far, defini.ons have to be made at the top level, outside any expression.

More information

Lecture 6: Sequential Sorting

Lecture 6: Sequential Sorting 15-150 Lecture 6: Sequential Sorting Lecture by Dan Licata February 2, 2012 Today s lecture is about sorting. Along the way, we ll learn about divide and conquer algorithms, the tree method, and complete

More information

Information Retrieval and Web Search Engines

Information Retrieval and Web Search Engines Information Retrieval and Web Search Engines Lecture 7: Document Clustering May 25, 2011 Wolf-Tilo Balke and Joachim Selke Institut für Informationssysteme Technische Universität Braunschweig Homework

More information

Overview. A mathema5cal proof technique Proves statements about natural numbers 0,1,2,... (or more generally, induc+vely defined objects) Merge Sort

Overview. A mathema5cal proof technique Proves statements about natural numbers 0,1,2,... (or more generally, induc+vely defined objects) Merge Sort Goals for Today Induc+on Lecture 22 Spring 2011 Be able to state the principle of induc+on Iden+fy its rela+onship to recursion State how it is different from recursion Be able to understand induc+ve proofs

More information

REDCap Data Dic+onary

REDCap Data Dic+onary REDCap Data Dic+onary ITHS Biomedical Informa+cs Core iths_redcap_admin@uw.edu Bas de Veer MS Research Consultant REDCap version: 6.2.1 Last updated December 9, 2014 1 Goals & Agenda Goals CraDing your

More information

Mul$-objec$ve Visual Odometry Hsiang-Jen (Johnny) Chien and Reinhard Kle=e

Mul$-objec$ve Visual Odometry Hsiang-Jen (Johnny) Chien and Reinhard Kle=e Mul$-objec$ve Visual Odometry Hsiang-Jen (Johnny) Chien and Reinhard Kle=e Centre for Robo+cs & Vision Dept. of Electronic and Electric Engineering School of Engineering, Computer, and Mathema+cal Sciences

More information

Cross- Valida+on & ROC curve. Anna Helena Reali Costa PCS 5024

Cross- Valida+on & ROC curve. Anna Helena Reali Costa PCS 5024 Cross- Valida+on & ROC curve Anna Helena Reali Costa PCS 5024 Resampling Methods Involve repeatedly drawing samples from a training set and refibng a model on each sample. Used in model assessment (evalua+ng

More information

MHPE 494: Data Analysis. Welcome! The Analytic Process

MHPE 494: Data Analysis. Welcome! The Analytic Process MHPE 494: Data Analysis Alan Schwartz, PhD Department of Medical Education Memoona Hasnain,, MD, PhD, MHPE Department of Family Medicine College of Medicine University of Illinois at Chicago Welcome! Your

More information

2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments;

2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments; A. Goals of Exercise Biology 164 Laboratory Using Comparative Statistics in Biology "Statistics" is a mathematical tool for analyzing and making generalizations about a population from a number of individual

More information

Automa'c Test Genera'on

Automa'c Test Genera'on Automa'c Test Genera'on First, about Purify Paper about Purify (and PurifyPlus) posted How do you monitor reads and writes: insert statements before and a?er reads, writes in code can s'll be done with

More information

1. The Normal Distribution, continued

1. The Normal Distribution, continued Math 1125-Introductory Statistics Lecture 16 10/9/06 1. The Normal Distribution, continued Recall that the standard normal distribution is symmetric about z = 0, so the area to the right of zero is 0.5000.

More information

Lecture 3: Basics of R Programming

Lecture 3: Basics of R Programming Lecture 3: Basics of R Programming This lecture introduces you to how to do more things with R beyond simple commands. Outline: 1. R as a programming language 2. Grouping, loops and conditional execution

More information

Architectural Requirements Phase. See Sommerville Chapters 11, 12, 13, 14, 18.2

Architectural Requirements Phase. See Sommerville Chapters 11, 12, 13, 14, 18.2 Architectural Requirements Phase See Sommerville Chapters 11, 12, 13, 14, 18.2 1 Architectural Requirements Phase So7ware requirements concerned construc>on of a logical model Architectural requirements

More information

10601 Machine Learning. Model and feature selection

10601 Machine Learning. Model and feature selection 10601 Machine Learning Model and feature selection Model selection issues We have seen some of this before Selecting features (or basis functions) Logistic regression SVMs Selecting parameter value Prior

More information

Decision Trees, Random Forests and Random Ferns. Peter Kovesi

Decision Trees, Random Forests and Random Ferns. Peter Kovesi Decision Trees, Random Forests and Random Ferns Peter Kovesi What do I want to do? Take an image. Iden9fy the dis9nct regions of stuff in the image. Mark the boundaries of these regions. Recognize and

More information

STAT 540: R: Sections Arithmetic in R. Will perform these on vectors, matrices, arrays as well as on ordinary numbers

STAT 540: R: Sections Arithmetic in R. Will perform these on vectors, matrices, arrays as well as on ordinary numbers Arithmetic in R R can be viewed as a very fancy calculator Can perform the ordinary mathematical operations: + - * / ˆ Will perform these on vectors, matrices, arrays as well as on ordinary numbers With

More information

61A LECTURE 8 SEQUENCES, ITERABLES

61A LECTURE 8 SEQUENCES, ITERABLES 61A LECTURE 8 SEQUENCES, ITERABLES Steven Tang and Eric Tzeng July 8, 013 Announcements Homework 4 due tonight Homework 5 is out, due Friday Midterm is Thursday, 7pm Thanks for coming to the potluck! What

More information

61A LECTURE 8 SEQUENCES, ITERABLES. Steven Tang and Eric Tzeng July 8, 2013

61A LECTURE 8 SEQUENCES, ITERABLES. Steven Tang and Eric Tzeng July 8, 2013 61A LECTURE 8 SEQUENCES, ITERABLES Steven Tang and Eric Tzeng July 8, 2013 Announcements Homework 4 due tonight Homework 5 is out, due Friday Midterm is Thursday, 7pm Thanks for coming to the potluck!

More information

Business Statistics: R tutorials

Business Statistics: R tutorials Business Statistics: R tutorials Jingyu He September 29, 2017 Install R and RStudio R is a free software environment for statistical computing and graphics. Download free R and RStudio for Windows/Mac:

More information

Thinking Induc,vely. COS 326 David Walker Princeton University

Thinking Induc,vely. COS 326 David Walker Princeton University Thinking Induc,vely COS 326 David Walker Princeton University slides copyright 2013-2015 David Walker and Andrew W. Appel permission granted to reuse these slides for non-commercial educa,onal purposes

More information

Introduction to SPSS Faiez Mossa 2 nd Class

Introduction to SPSS Faiez Mossa 2 nd Class Introduction to SPSS 16.0 Faiez Mossa 2 nd Class 1 Outline Review of Concepts (stats and scales) Data entry (the workspace and labels) By hand Import Excel Running an analysis- frequency, central tendency,

More information

Using Formulas and Functions

Using Formulas and Functions Using Formulas and Functions Formulas... 1 Using operators in formulas... 1 Creating formulas... 2 Good Practice: The easy way to create formulas... 2 Copying formulas... 3 Operators... 3 Formula error

More information

CS395T Visual Recogni5on and Search. Gautam S. Muralidhar

CS395T Visual Recogni5on and Search. Gautam S. Muralidhar CS395T Visual Recogni5on and Search Gautam S. Muralidhar Today s Theme Unsupervised discovery of images Main mo5va5on behind unsupervised discovery is that supervision is expensive Common tasks include

More information

Correlation. January 12, 2019

Correlation. January 12, 2019 Correlation January 12, 2019 Contents Correlations The Scattterplot The Pearson correlation The computational raw-score formula Survey data Fun facts about r Sensitivity to outliers Spearman rank-order

More information

The scope Package. April 19, 2006

The scope Package. April 19, 2006 The Package April 19, 2006 Type Package Title Data Manipulation Using Arbitrary Row and Column Criteria Version 1.0-2 Date 2006-04-17 Author Tim Bergsma Maintainer Tim Bergsma Calculate,

More information

Getting started with simulating data in R: some helpful functions and how to use them Ariel Muldoon August 28, 2018

Getting started with simulating data in R: some helpful functions and how to use them Ariel Muldoon August 28, 2018 Getting started with simulating data in R: some helpful functions and how to use them Ariel Muldoon August 28, 2018 Contents Overview 2 Generating random numbers 2 rnorm() to generate random numbers from

More information

PPI Network Alignment Advanced Topics in Computa8onal Genomics

PPI Network Alignment Advanced Topics in Computa8onal Genomics PPI Network Alignment 02-715 Advanced Topics in Computa8onal Genomics PPI Network Alignment Compara8ve analysis of PPI networks across different species by aligning the PPI networks Find func8onal orthologs

More information

Install RStudio from - use the standard installation.

Install RStudio from   - use the standard installation. Session 1: Reading in Data Before you begin: Install RStudio from http://www.rstudio.com/ide/download/ - use the standard installation. Go to the course website; http://faculty.washington.edu/kenrice/rintro/

More information

Map Math and StaMsMcs

Map Math and StaMsMcs Map Math and StaMsMcs A. Michelle Lawing Ecosystem Science and Management Texas A&M University College StaMon, TX 77843 alawing@tamu.edu ObjecMves What is "Map Algebra or Map Math Images are data DescripMve

More information

(Updated 29 Oct 2016)

(Updated 29 Oct 2016) (Updated 29 Oct 2016) 1 Class Maker 2016 Program Description Creating classes for the new school year is a time consuming task that teachers are asked to complete each year. Many schools offer their students

More information

DART Tutorial Sec'on 21: Observa'on Types and Observing System Design

DART Tutorial Sec'on 21: Observa'on Types and Observing System Design DART Tutorial Sec'on 21: Observa'on Types and Observing System Design UCAR 2014 The Na'onal Center for Atmospheric Research is sponsored by the Na'onal Science Founda'on. Any opinions, findings and conclusions

More information

BBM 101 Introduc/on to Programming I Fall 2014, Lecture 7. Aykut Erdem, Erkut Erdem, Fuat Akal

BBM 101 Introduc/on to Programming I Fall 2014, Lecture 7. Aykut Erdem, Erkut Erdem, Fuat Akal BBM 101 Introduc/on to Programming I Fall 2014, Lecture 7 Aykut Erdem, Erkut Erdem, Fuat Akal 1 Today Func/ons Defini4ons Invoca4on Parameter Lists Return Values Prototypes Recursion Recursion Induc4ve

More information

Remedial Java - Excep0ons 3/09/17. (remedial) Java. Jars. Anastasia Bezerianos 1

Remedial Java - Excep0ons 3/09/17. (remedial) Java. Jars. Anastasia Bezerianos 1 (remedial) Java anastasia.bezerianos@lri.fr Jars Anastasia Bezerianos 1 Disk organiza0on of Packages! Packages are just directories! For example! class3.inheritancerpg is located in! \remedialjava\src\class3\inheritencerpg!

More information

Excel Tips and FAQs - MS 2010

Excel Tips and FAQs - MS 2010 BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my

More information

Group Sta*s*cs in MEG/EEG

Group Sta*s*cs in MEG/EEG Group Sta*s*cs in MEG/EEG Will Woods NIF Fellow Brain and Psychological Sciences Research Centre Swinburne University of Technology A Cau*onary tale. A Cau*onary tale. A Cau*onary tale. Overview Introduc*on

More information

Introduc)on to Probabilis)c Latent Seman)c Analysis. NYP Predic)ve Analy)cs Meetup June 10, 2010

Introduc)on to Probabilis)c Latent Seman)c Analysis. NYP Predic)ve Analy)cs Meetup June 10, 2010 Introduc)on to Probabilis)c Latent Seman)c Analysis NYP Predic)ve Analy)cs Meetup June 10, 2010 PLSA A type of latent variable model with observed count data and nominal latent variable(s). Despite the

More information

Feature Selec+on. Machine Learning Fall 2018 Kasthuri Kannan

Feature Selec+on. Machine Learning Fall 2018 Kasthuri Kannan Feature Selec+on Machine Learning Fall 2018 Kasthuri Kannan Interpretability vs. Predic+on Types of feature selec+on Subset selec+on/forward/backward Shrinkage (Lasso/Ridge) Best model (CV) Feature selec+on

More information

Graphing on Excel. Open Excel (2013). The first screen you will see looks like this (it varies slightly, depending on the version):

Graphing on Excel. Open Excel (2013). The first screen you will see looks like this (it varies slightly, depending on the version): Graphing on Excel Open Excel (2013). The first screen you will see looks like this (it varies slightly, depending on the version): The first step is to organize your data in columns. Suppose you obtain

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ when the population standard deviation is known and population distribution is normal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses

More information

R for sta)s)cs: automate your analysis

R for sta)s)cs: automate your analysis For the first part of this presenta)on see: hcp://www.peteraldhous.com/car/aldhous_car2011_rforstats.pdf This presenta)on is available at: hcp://jacobfenton.s3.amazonaws.com/automate- r.pdf R for sta)s)cs:

More information

Math 2B Linear Algebra Test 2 S13 Name Write all responses on separate paper. Show your work for credit.

Math 2B Linear Algebra Test 2 S13 Name Write all responses on separate paper. Show your work for credit. Math 2B Linear Algebra Test 2 S3 Name Write all responses on separate paper. Show your work for credit.. Construct a matrix whose a. null space consists of all combinations of (,3,3,) and (,2,,). b. Left

More information

Exsys RuleBook Selector Tutorial. Copyright 2004 EXSYS Inc. All right reserved. Printed in the United States of America.

Exsys RuleBook Selector Tutorial. Copyright 2004 EXSYS Inc. All right reserved. Printed in the United States of America. Exsys RuleBook Selector Tutorial Copyright 2004 EXSYS Inc. All right reserved. Printed in the United States of America. This documentation, as well as the software described in it, is furnished under license

More information

Homework 1 Excel Basics

Homework 1 Excel Basics Homework 1 Excel Basics Excel is a software program that is used to organize information, perform calculations, and create visual displays of the information. When you start up Excel, you will see the

More information

CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on. Instructor: Wei-Min Shen

CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on. Instructor: Wei-Min Shen CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on Instructor: Wei-Min Shen Status Check and Review Status check Have you registered in Piazza? Have you run the Project-1?

More information

MATLAB TUTORIAL WORKSHEET

MATLAB TUTORIAL WORKSHEET MATLAB TUTORIAL WORKSHEET What is MATLAB? Software package used for computation High-level programming language with easy to use interactive environment Access MATLAB at Tufts here: https://it.tufts.edu/sw-matlabstudent

More information

About the Course. Reading List. Assignments and Examina5on

About the Course. Reading List. Assignments and Examina5on Uppsala University Department of Linguis5cs and Philology About the Course Introduc5on to machine learning Focus on methods used in NLP Decision trees and nearest neighbor methods Linear models for classifica5on

More information

R Programming: Worksheet 6

R Programming: Worksheet 6 R Programming: Worksheet 6 Today we ll study a few useful functions we haven t come across yet: all(), any(), `%in%`, match(), pmax(), pmin(), unique() We ll also apply our knowledge to the bootstrap.

More information

Bioconductor s stepnorm package

Bioconductor s stepnorm package Bioconductor s stepnorm package Yuanyuan Xiao 1 and Yee Hwa Yang 2 October 18, 2004 Departments of 1 Biopharmaceutical Sciences and 2 edicine University of California, San Francisco yxiao@itsa.ucsf.edu

More information

STA121: Applied Regression Analysis

STA121: Applied Regression Analysis STA121: Applied Regression Analysis Variable Selection - Chapters 8 in Dielman Artin Department of Statistical Science October 23, 2009 Outline Introduction 1 Introduction 2 3 4 Variable Selection Model

More information

DART Tutorial Sec'on 16: Diagnos'c Output

DART Tutorial Sec'on 16: Diagnos'c Output DART Tutorial Sec'on 16: Diagnos'c Output UCAR The Na'onal Center for Atmospheric Research is sponsored by the Na'onal Science Founda'on. Any opinions, findings and conclusions or recommenda'ons expressed

More information

DART Tutorial Sec'on 16: Diagnos'c Output

DART Tutorial Sec'on 16: Diagnos'c Output DART Tutorial Sec'on 16: Diagnos'c Output UCAR 214 The Na'onal Center for Atmospheric Research is sponsored by the Na'onal Science Founda'on. Any opinions, findings and conclusions or recommenda'ons expressed

More information

Microsoft Office Excel Use Excel s functions. Tutorial 2 Working With Formulas and Functions

Microsoft Office Excel Use Excel s functions. Tutorial 2 Working With Formulas and Functions Microsoft Office Excel 2003 Tutorial 2 Working With Formulas and Functions 1 Use Excel s functions You can easily calculate the sum of a large number of cells by using a function. A function is a predefined,

More information

How to sleep *ght and keep your applica*ons running on IPv6 transi*on. The importance of IPv6 Applica*on Tes*ng

How to sleep *ght and keep your applica*ons running on IPv6 transi*on. The importance of IPv6 Applica*on Tes*ng How to sleep *ght and keep your applica*ons running on IPv6 transi*on The importance of IPv6 Applica*on Tes*ng About this presenta*on It presents a generic methodology to test the IPv6 func*onality of

More information

Depending on the computer you find yourself in front of, here s what you ll need to do to open SPSS.

Depending on the computer you find yourself in front of, here s what you ll need to do to open SPSS. 1 SPSS 11.5 for Windows Introductory Assignment Material covered: Opening an existing SPSS data file, creating new data files, generating frequency distributions and descriptive statistics, obtaining printouts

More information

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation:

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation: Topic 11. Unbalanced Designs [ST&D section 9.6, page 219; chapter 18] 11.1 Definition of missing data Accidents often result in loss of data. Crops are destroyed in some plots, plants and animals die,

More information

Chapter 3. Bootstrap. 3.1 Introduction. 3.2 The general idea

Chapter 3. Bootstrap. 3.1 Introduction. 3.2 The general idea Chapter 3 Bootstrap 3.1 Introduction The estimation of parameters in probability distributions is a basic problem in statistics that one tends to encounter already during the very first course on the subject.

More information

Lastly, in case you don t already know this, and don t have Excel on your computers, you can get it for free through IT s website under software.

Lastly, in case you don t already know this, and don t have Excel on your computers, you can get it for free through IT s website under software. Welcome to the EASE workshop series, part of the STEM Gateway program. Before we begin, I want to make sure we are clear that this is by no means meant to be an all inclusive class in Excel. At each step,

More information

Introduction to R. Nishant Gopalakrishnan, Martin Morgan January, Fred Hutchinson Cancer Research Center

Introduction to R. Nishant Gopalakrishnan, Martin Morgan January, Fred Hutchinson Cancer Research Center Introduction to R Nishant Gopalakrishnan, Martin Morgan Fred Hutchinson Cancer Research Center 19-21 January, 2011 Getting Started Atomic Data structures Creating vectors Subsetting vectors Factors Matrices

More information

Assignment 6 - Model Building

Assignment 6 - Model Building Assignment 6 - Model Building your name goes here Due: Wednesday, March 7, 2018, noon, to Sakai Summary Primarily from the topics in Chapter 9 of your text, this homework assignment gives you practice

More information

DOWNLOAD PDF MICROSOFT EXCEL ALL FORMULAS LIST WITH EXAMPLES

DOWNLOAD PDF MICROSOFT EXCEL ALL FORMULAS LIST WITH EXAMPLES Chapter 1 : Examples of commonly used formulas - Office Support A collection of useful Excel formulas for sums and counts, dates and times, text manipularion, conditional formatting, percentages, Excel

More information

Last &me: Javascript (forms and func&ons)

Last &me: Javascript (forms and func&ons) Let s debug some code together: hkp://www.clsp.jhu.edu/~anni/cs103/test_before.html hkp://www.clsp.jhu.edu/~anni/cs103/test_arer.html

More information

ML4Bio Lecture #1: Introduc3on. February 24 th, 2016 Quaid Morris

ML4Bio Lecture #1: Introduc3on. February 24 th, 2016 Quaid Morris ML4Bio Lecture #1: Introduc3on February 24 th, 216 Quaid Morris Course goals Prac3cal introduc3on to ML Having a basic grounding in the terminology and important concepts in ML; to permit self- study,

More information