book 2014/5/6 15:21 page v #3 List of figures List of tables Preface to the second edition Preface to the first edition

Size: px
Start display at page:

Download "book 2014/5/6 15:21 page v #3 List of figures List of tables Preface to the second edition Preface to the first edition"

Transcription

1 book 2014/5/6 15:21 page v #3 Contents List of figures List of tables Preface to the second edition Preface to the first edition xvii xix xxi xxiii 1 Data input and output Input Native dataset Fixed format text files Other fixed files Reading more complex text files Comma separated value (CSV) files Read sheets from an Excel file Read data from R into SAS Read data from SAS into R Reading datasets in other formats Reading data with a variable number of words in a field Read a file byte by byte Access data from a URL Read an XML-formatted file Manual data entry Output Displaying data Number of digits to display Save a native dataset Creating datasets in text format Creating Excel spreadsheets Creating files for use by other packages Creating HTML formatted output Creating XML datasets and output Further resources Data management Structure and meta-data Access variables from a dataset Names of variables and their types Values of variables in a dataset v

2 book 2014/5/6 15:21 page vi #4 vi Label variables Add comment to a dataset or variable Derived variables and data manipulation Add derived variable to a dataset Rename variables in a dataset Create string variables from numeric variables Create categorical variables from continuous variables Recode a categorical variable Create a categorical variable using logic Create numeric variables from string variables Extract characters from string variables Length of string variables Concatenate string variables Set operations Find strings within string variables Find approximate strings Replace strings within string variables Split strings into multiple strings Remove spaces around string variables Upper to lower case Lagged variable Formatting values of variables Perl interface Accessing databases using SQL (structured query language) Merging, combining, and subsetting datasets Subsetting observations Drop or keep variables in a dataset Random sample of a dataset Observation number Keep unique values Identify duplicated values Convert from wide to long (tall) format Convert from long (tall) to wide format Concatenate and stack datasets Sort datasets Merge datasets Date and time variables Create date variable Extract weekday Extract month Extract year Extract quarter Create time variable Further resources Examples Data input and output Data display Derived variables and data manipulation Sorting and subsetting datasets

3 book 2014/5/6 15:21 page vii #5 vii 3 Statistical and mathematical functions Probability distributions and random number generation Probability density function Quantiles of a probability density function Setting the random number seed Uniform random variables Multinomial random variables Normal random variables Multivariate normal random variables Truncated multivariate normal random variables Exponential random variables Other random variables Mathematical functions Basic functions Trigonometric functions Special functions Integer functions Comparisons of floating point variables Complex numbers Derivatives Integration Optimization problems Matrix operations Create matrix from vector Combine vectors or matrices Matrix addition Transpose matrix Find the dimension of a matrix or dataset Matrix multiplication Invert matrix Component-wise multiplication Create submatrix Create a diagonal matrix Create a vector of diagonal elements Create a vector from a matrix Calculate the determinant Find eigenvalues and eigenvectors Find the singular value decomposition Examples Probability distributions Programming and operating system interface Control flow, programming, and data generation Looping Conditional execution Sequence of values or patterns Referring to a range of variables Perform an action repeatedly over a set of variables Grid of values Debugging Error recovery

4 book 2014/5/6 15:21 page viii #6 viii 4.2 Functions and macros SAS macros R functions Interactions with the operating system Timing commands Suspend execution for a time interval Execute a command in the operating system Command history Find working directory Change working directory List and access files Common statistical procedures Summary statistics Means and other summary statistics Other moments Trimmed mean Quantiles Centering, normalizing, and scaling Mean and 95% confidence interval Proportion and 95% confidence interval Maximum likelihood estimation of parameters Bivariate statistics Epidemiologic statistics Test characteristics Correlation Kappa (agreement) Contingency tables Display cross-classification table Displaying missing value categories in a table Pearson chi-square statistic Cochran Mantel Haenszel test Cramér s V Fisher s exact test McNemar s test Tests for continuous variables Tests for normality Student s t test Test for equal variances Nonparametric tests Permutation test Logrank test Analytic power and sample size calculations Further resources Examples Summary statistics and exploratory data analysis Bivariate relationships Contingency tables Two sample tests of continuous variables Survival analysis: logrank test

5 book 2014/5/6 15:21 page ix #7 ix 6 Linear regression and ANOVA Model fitting Linear regression Linear regression with categorical covariates Changing the reference category Parameterization of categorical covariates Linear regression with no intercept Linear regression with interactions One-way analysis of variance Analysis of variance with two or more factors Tests, contrasts, and linear functions of parameters Joint null hypotheses: several parameters equal Joint null hypotheses: sum of parameters Tests of equality of parameters Multiple comparisons Linear combinations of parameters Model diagnostics Predicted values Residuals Standardized and Studentized residuals Leverage Cook s D DFFITS Diagnostic plots Heteroscedasticity tests Model parameters and results Parameter estimates Standardized regression coefficients Standard errors of parameter estimates Confidence interval for parameter estimates Confidence limits for the mean Prediction limits R-squared Design and information matrix Covariance matrix of parameter estimates Correlation matrix of parameter estimates Further resources Examples Scatterplot with smooth fit Linear regression with interaction Regression diagnostics Fitting the regression model separately for each value of another variable Two-way ANOVA Multiple comparisons Contrasts

6 book 2014/5/6 15:21 page x #8 x 7 Regression generalizations and modeling Generalized linear models Logistic regression model Conditional logistic regression model Exact logistic regression Ordered logistic model Generalized logistic model Poisson model Negative binomial model Log-linear model Further generalizations Zero-inflated Poisson model Zero-inflated negative binomial model Generalized additive model Nonlinear least squares model Robust methods Quantile regression model Robust regression model Ridge regression model Models for correlated data Linear models with correlated outcomes Linear mixed models with random intercepts Linear mixed models with random slopes More complex random coefficient models Multilevel models Generalized linear models with correlated outcomes Generalized linear mixed models Generalized estimating equations MANOVA Time series model Survival analysis Proportional hazards (Cox) regression model Proportional hazards (Cox) model with frailty Nelson Aalen estimate of cumulative hazard Testing the proportionality of the Cox model Cox model with time-varying predictors Multivariate statistics and discriminant procedures Cronbach s α Factor analysis Recursive partitioning Linear discriminant analysis Latent class analysis Hierarchical clustering Complex survey design Model selection and assessment Compare two models Log-likelihood Akaike Information Criterion (AIC) Bayesian Information Criterion (BIC) LASSO model Hosmer Lemeshow goodness of fit

7 book 2014/5/6 15:21 page xi #9 xi Goodness of fit for count models Further resources Examples Logistic regression Poisson regression Zero-inflated Poisson regression Negative binomial regression Quantile regression Ordered logistic Generalized logistic model Generalized additive model Reshaping a dataset for longitudinal regression Linear model for correlated data Linear mixed (random slope) model Generalized estimating equations Generalized linear mixed model Cox proportional hazards model Cronbach s α Factor analysis Recursive partitioning Linear discriminant analysis Hierarchical clustering A graphical compendium Univariate plots Barplot Stem-and-leaf plot Dotplot Histogram Density plots Empirical cumulative probability density plot Boxplot Violin plots Univariate plots by grouping variable Side-by-side histograms Side-by-side boxplots Overlaid density plots Bar chart with error bars Bivariate plots Scatterplot Scatterplot with multiple y values Scatterplot with binning Transparent overplotting scatterplot Bivariate density plot Scatterplot with marginal histograms Multivariate plots Matrix of scatterplots Conditioning plot Contour plots D plots Special purpose plots

8 book 2014/5/6 15:21 page xii #10 xii Choropleth maps Interaction plots Plots for categorical data Circular plot Plot an arbitrary function Normal quantile-quantile plot Receiver operating characteristic (ROC) curve Plot confidence intervals for the mean Plot prediction limits from a simple linear regression Plot predicted lines for each value of a variable Kaplan Meier plot Hazard function plotting Mean-difference plots Further resources Examples Scatterplot with multiple axes Conditioning plot Scatterplot with marginal histograms Kaplan Meier plot ROC curve Pairs plot Visualize correlation matrix Graphical options and configuration Adding elements Arbitrary straight line Plot symbols Add points to an existing graphic Jitter points Regression line fit to points Smoothed line Normal density Marginal rug plot Titles Footnotes Text Mathematical symbols Arrows and shapes Add grid Legend Identifying and locating points Options and parameters Graph size Grid of plots per page More general page layouts Fonts Point and text size Box around plots Size of margins Graphical settings Axis range and style

9 book 2014/5/6 15:21 page xiii #11 xiii Axis labels, values, and tick marks Line styles Line widths Colors Log scale Omit axes Saving graphs PDF Postscript RTF JPEG Windows Metafile (WMF) Bitmap image file (BMP) Tagged image file format (TIFF) Portable Network Graphics (PNG) Closing a graphic device Simulation Generating data Generate categorical data Generate data from a logistic regression Generate data from a generalized linear mixed model Generate correlated binary data Generate data from a Cox model Sampling from a challenging distribution Simulation applications Simulation study of Student s t test Diploma (or hat-check) problem Monty Hall problem Further resources Special topics Processing by group Simulation-based power calculations Reproducible analysis and output Advanced statistical methods Bayesian methods Propensity scores Bootstrapping Missing data Finite mixture models with concomitant variables Further resources Case studies Data management and related tasks Finding two closest values in a vector Tabulate binomial probabilities Calculate and plot a running average Create a Fibonacci sequence Read variable format files Plotting maps

10 book 2014/5/6 15:21 page xiv #12 xiv Massachusetts counties, continued Bike ride plot Choropleth maps Data scraping and visualization Scraping data from HTML files Reading data with two lines per observation Plotting time series data URL APIs and truly random numbers Manipulating bigger datasets Constrained optimization: the knapsack problem A Introduction to SAS 341 A.1 Installation A.2 Running SAS and a sample session A.3 Learning SAS and getting help A.4 Fundamental elements of SAS syntax A.5 Work process: The cognitive style of SAS A.6 Useful SAS background A.6.1 Dataset options A.6.2 Subsetting A.6.3 Formats and informats A.7 Output Delivery System A.7.1 Saving output as datasets and controlling output A.7.2 Output file types and ODS destinations A.8 SAS macro variables A.9 Miscellanea B Introduction to R and RStudio 357 B.1 Installation B.1.1 Installation under Windows B.1.2 Installation under Mac OS X B.1.3 RStudio B.1.4 Other graphical interfaces B.2 Running R and sample session B.2.1 Replicating examples from the book and sourcing commands B.2.2 Batch mode B.3 Learning R and getting help B.4 Fundamental structures and objects B.4.1 Objects and vectors B.4.2 Indexing B.4.3 Operators B.4.4 Lists B.4.5 Matrices B.4.6 Dataframes B.4.7 Attributes and classes B.4.8 Options B.5 Functions B.5.1 Calling functions B.5.2 The apply family of functions B.6 Add-ons: packages B.6.1 Introduction to packages

11 book 2014/5/6 15:21 page xv #13 xv B.6.2 CRAN task views B.6.3 Installed libraries and packages B.6.4 Packages referenced in this book B.6.5 Datasets available with R B.7 Support and bugs C The HELP study dataset 379 C.1 Background on the HELP study C.2 Roadmap to analyses of the HELP dataset C.3 Detailed description of the dataset References 385 Subject index 399 SAS index 419 R index 431

Technical Support Minitab Version Student Free technical support for eligible products

Technical Support Minitab Version Student Free technical support for eligible products Technical Support Free technical support for eligible products All registered users (including students) All registered users (including students) Registered instructors Not eligible Worksheet Size Number

More information

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions Technical Support Free technical support Worksheet Size All registered users, including students Registered instructors Number of worksheets Limited only by system resources 5 5 Number of cells per worksheet

More information

Minitab 18 Feature List

Minitab 18 Feature List Minitab 18 Feature List * New or Improved Assistant Measurement systems analysis * Capability analysis Graphical analysis Hypothesis tests Regression DOE Control charts * Graphics Scatterplots, matrix

More information

Minitab 17 commands Prepared by Jeffrey S. Simonoff

Minitab 17 commands Prepared by Jeffrey S. Simonoff Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

JMP Book Descriptions

JMP Book Descriptions JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked

More information

Learn What s New. Statistical Software

Learn What s New. Statistical Software Statistical Software Learn What s New Upgrade now to access new and improved statistical features and other enhancements that make it even easier to analyze your data. The Assistant Data Customization

More information

Acknowledgments. Acronyms

Acknowledgments. Acronyms Acknowledgments Preface Acronyms xi xiii xv 1 Basic Tools 1 1.1 Goals of inference 1 1.1.1 Population or process? 1 1.1.2 Probability samples 2 1.1.3 Sampling weights 3 1.1.4 Design effects. 5 1.2 An introduction

More information

Statistical Methods for the Analysis of Repeated Measurements

Statistical Methods for the Analysis of Repeated Measurements Charles S. Davis Statistical Methods for the Analysis of Repeated Measurements With 20 Illustrations #j Springer Contents Preface List of Tables List of Figures v xv xxiii 1 Introduction 1 1.1 Repeated

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) SAS Adv. Analytics or Predictive Modelling:- Class Room: Training Fee & Duration : 30K & 3 Months Online Training Fee & Duration : 33K & 3 Months Learning SAS:

More information

Fathom Dynamic Data TM Version 2 Specifications

Fathom Dynamic Data TM Version 2 Specifications Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 6 Available Statistical

More information

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA Examples: Mixture Modeling With Cross-Sectional Data CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA Mixture modeling refers to modeling with categorical latent variables that represent

More information

Product Catalog. AcaStat. Software

Product Catalog. AcaStat. Software Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,

More information

SPSS Modules Features

SPSS Modules Features SPSS Modules Features Core System Functionality (included in every license) Data access and management Data Prep features: Define Variable properties tool; copy data properties tool, Visual Bander, Identify

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use

More information

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office) SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon

More information

SAS and R. Ken Kleinman and Nicholas J. Horton SECOND EDITION. Kleinman and Horton

SAS and R. Ken Kleinman and Nicholas J. Horton SECOND EDITION. Kleinman and Horton Statistics This edition now covers Studio, a powerful and easy-to-use interface for. It incorporates a number of additional topics, including application program interfaces (APIs), database management

More information

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113 Index A Add-on packages information page, 186 187 Linux users, 191 Mac users, 189 mirror sites, 185 Windows users, 187 aggregate function, 62 Analysis of variance (ANOVA), 152 anova function, 152 as.data.frame

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION Introduction CHAPTER 1 INTRODUCTION Mplus is a statistical modeling program that provides researchers with a flexible tool to analyze their data. Mplus offers researchers a wide choice of models, estimators,

More information

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer Ludwig Fahrmeir Gerhard Tute Statistical odelling Based on Generalized Linear Model íecond Edition. Springer Preface to the Second Edition Preface to the First Edition List of Examples List of Figures

More information

COPYRIGHTED MATERIAL CONTENTS

COPYRIGHTED MATERIAL CONTENTS PREFACE ACKNOWLEDGMENTS LIST OF TABLES xi xv xvii 1 INTRODUCTION 1 1.1 Historical Background 1 1.2 Definition and Relationship to the Delta Method and Other Resampling Methods 3 1.2.1 Jackknife 6 1.2.2

More information

JMP 10 Student Edition Quick Guide

JMP 10 Student Edition Quick Guide JMP 10 Student Edition Quick Guide Instructions presume an open data table, default preference settings and appropriately typed, user-specified variables of interest. RMC = Click Right Mouse Button Graphing

More information

Contents. Tutorials Section 1. About SAS Enterprise Guide ix About This Book xi Acknowledgments xiii

Contents. Tutorials Section 1. About SAS Enterprise Guide ix About This Book xi Acknowledgments xiii Contents About SAS Enterprise Guide ix About This Book xi Acknowledgments xiii Tutorials Section 1 Tutorial A Getting Started with SAS Enterprise Guide 3 Starting SAS Enterprise Guide 3 SAS Enterprise

More information

8. MINITAB COMMANDS WEEK-BY-WEEK

8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

A Beginner's Guide to. Randall E. Schumacker. The University of Alabama. Richard G. Lomax. The Ohio State University. Routledge

A Beginner's Guide to. Randall E. Schumacker. The University of Alabama. Richard G. Lomax. The Ohio State University. Routledge A Beginner's Guide to Randall E. Schumacker The University of Alabama Richard G. Lomax The Ohio State University Routledge Taylor & Francis Group New York London About the Authors Preface xv xvii 1 Introduction

More information

Random Number Generation and Monte Carlo Methods

Random Number Generation and Monte Carlo Methods James E. Gentle Random Number Generation and Monte Carlo Methods With 30 Illustrations Springer Contents Preface vii 1 Simulating Random Numbers from a Uniform Distribution 1 1.1 Linear Congruential Generators

More information

STATISTICS (STAT) 200 Level Courses. 300 Level Courses. Statistics (STAT) 1

STATISTICS (STAT) 200 Level Courses. 300 Level Courses. Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) 200 Level Courses STAT 250: Introductory Statistics I. 3 credits. Elementary introduction to statistics. Topics include descriptive statistics, probability, and estimation

More information

SAS High-Performance Analytics Products

SAS High-Performance Analytics Products Fact Sheet What do SAS High-Performance Analytics products do? With high-performance analytics products from SAS, you can develop and process models that use huge amounts of diverse data. These products

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

GraphPad Prism Features

GraphPad Prism Features GraphPad Prism Features GraphPad Prism 4 is available for both Windows and Macintosh. The two versions are very similar. You can open files created on one platform on the other platform with no special

More information

Time Series Analysis by State Space Methods

Time Series Analysis by State Space Methods Time Series Analysis by State Space Methods Second Edition J. Durbin London School of Economics and Political Science and University College London S. J. Koopman Vrije Universiteit Amsterdam OXFORD UNIVERSITY

More information

Analysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS

Analysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS Analysis of Panel Data Third Edition Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS Contents Preface to the ThirdEdition Preface to the Second Edition Preface to the First Edition

More information

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1 Preface to the Second Edition Preface to the First Edition vii xi 1 Introduction 1 2 Overview of Supervised Learning 9 2.1 Introduction... 9 2.2 Variable Types and Terminology... 9 2.3 Two Simple Approaches

More information

STATISTICS (STAT) 200 Level Courses Registration Restrictions: STAT 250: Required Prerequisites: not Schedule Type: Mason Core: STAT 346:

STATISTICS (STAT) 200 Level Courses Registration Restrictions: STAT 250: Required Prerequisites: not Schedule Type: Mason Core: STAT 346: Statistics (STAT) 1 STATISTICS (STAT) 200 Level Courses STAT 250: Introductory Statistics I. 3 credits. Elementary introduction to statistics. Topics include descriptive statistics, probability, and estimation

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information

SPSS: AN OVERVIEW. V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi

SPSS: AN OVERVIEW. V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi SPSS: AN OVERVIEW V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi-110012 The abbreviation SPSS stands for Statistical Package for the Social Sciences and is a comprehensive system

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) SAS Analytics:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic

More information

An Introduction to the R Commander

An Introduction to the R Commander An Introduction to the R Commander BIO/MAT 460, Spring 2011 Christopher J. Mecklin Department of Mathematics & Statistics Biomathematics Research Group Murray State University Murray, KY 42071 christopher.mecklin@murraystate.edu

More information

Using the DATAMINE Program

Using the DATAMINE Program 6 Using the DATAMINE Program 304 Using the DATAMINE Program This chapter serves as a user s manual for the DATAMINE program, which demonstrates the algorithms presented in this book. Each menu selection

More information

Introduction to Mplus

Introduction to Mplus Introduction to Mplus May 12, 2010 SPONSORED BY: Research Data Centre Population and Life Course Studies PLCS Interdisciplinary Development Initiative Piotr Wilk piotr.wilk@schulich.uwo.ca OVERVIEW Mplus

More information

Predict Outcomes and Reveal Relationships in Categorical Data

Predict Outcomes and Reveal Relationships in Categorical Data PASW Categories 18 Specifications Predict Outcomes and Reveal Relationships in Categorical Data Unleash the full potential of your data through predictive analysis, statistical learning, perceptual mapping,

More information

Python for Data Analysis

Python for Data Analysis Python for Data Analysis Wes McKinney O'REILLY 8 Beijing Cambridge Farnham Kb'ln Sebastopol Tokyo Table of Contents Preface xi 1. Preliminaries " 1 What Is This Book About? 1 Why Python for Data Analysis?

More information

Excel for Chemists. Second Edition

Excel for Chemists. Second Edition Excel for Chemists Second Edition This page intentionally left blank ExceL for Chemists A Comprehensive Guide Second Edition E. Joseph Billo Department of Chemistry Boston College Chestnut Hill, Massachusetts

More information

Study Guide. Module 1. Key Terms

Study Guide. Module 1. Key Terms Study Guide Module 1 Key Terms general linear model dummy variable multiple regression model ANOVA model ANCOVA model confounding variable squared multiple correlation adjusted squared multiple correlation

More information

Generalized least squares (GLS) estimates of the level-2 coefficients,

Generalized least squares (GLS) estimates of the level-2 coefficients, Contents 1 Conceptual and Statistical Background for Two-Level Models...7 1.1 The general two-level model... 7 1.1.1 Level-1 model... 8 1.1.2 Level-2 model... 8 1.2 Parameter estimation... 9 1.3 Empirical

More information

From Getting Started with the Graph Template Language in SAS. Full book available for purchase here.

From Getting Started with the Graph Template Language in SAS. Full book available for purchase here. From Getting Started with the Graph Template Language in SAS. Full book available for purchase here. Contents About This Book... xi About The Author... xv Acknowledgments...xvii Chapter 1: Introduction

More information

Generalized Additive Models

Generalized Additive Models :p Texts in Statistical Science Generalized Additive Models An Introduction with R Simon N. Wood Contents Preface XV 1 Linear Models 1 1.1 A simple linear model 2 Simple least squares estimation 3 1.1.1

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming

More information

IBM SPSS Statistics Traditional License packages and features

IBM SPSS Statistics Traditional License packages and features IBM SPSS Statistics Traditional License packages and features 1 2 3 The includes the following features: Data access and management Compare two data files for compatibility Data prep features: Define Variable

More information

SPSS Statistics 19.0 Fix Pack 2 Fix List Release notes Abstract Content Number Description

SPSS Statistics 19.0 Fix Pack 2 Fix List Release notes Abstract Content Number Description SPSS Statistics 19.0 Fix Pack 2 Fix List Release notes Abstract A comprehensive list of defect corrections for the SPSS Statistics 19.0 Fix Pack 2. Details of the fixes are listed below. If you have questions

More information

Base package The Base subscription includes the following features:

Base package The Base subscription includes the following features: IBM SPSS Statistics Subscription: Base, add-ons and features 1 2 3 Base package The Base subscription includes the following features: Data access and management Compare two data files for compatibility

More information

Table Of Contents. Table Of Contents

Table Of Contents. Table Of Contents Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store

More information

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V)

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Based on Industry Cases, Live Exercises, & Industry Executed Projects Module (I) Analytics Essentials 81 hrs 1. Statistics

More information

Brief Guide on Using SPSS 10.0

Brief Guide on Using SPSS 10.0 Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new

More information

STATA 13 INTRODUCTION

STATA 13 INTRODUCTION STATA 13 INTRODUCTION Catherine McGowan & Elaine Williamson LONDON SCHOOL OF HYGIENE & TROPICAL MEDICINE DECEMBER 2013 0 CONTENTS INTRODUCTION... 1 Versions of STATA... 1 OPENING STATA... 1 THE STATA

More information

Handbook of Statistical Modeling for the Social and Behavioral Sciences

Handbook of Statistical Modeling for the Social and Behavioral Sciences Handbook of Statistical Modeling for the Social and Behavioral Sciences Edited by Gerhard Arminger Bergische Universität Wuppertal Wuppertal, Germany Clifford С. Clogg Late of Pennsylvania State University

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

Contents of SAS Programming Techniques

Contents of SAS Programming Techniques Contents of SAS Programming Techniques Chapter 1 About SAS 1.1 Introduction 1.1.1 SAS modules 1.1.2 SAS module classification 1.1.3 SAS features 1.1.4 Three levels of SAS techniques 1.1.5 Chapter goal

More information

Excel Scientific and Engineering Cookbook

Excel Scientific and Engineering Cookbook Excel Scientific and Engineering Cookbook David M. Bourg O'REILLY* Beijing Cambridge Farnham Koln Paris Sebastopol Taipei Tokyo Preface xi 1. Using Excel 1 1.1 Navigating the Interface 1 1.2 Entering Data

More information

An introduction to SPSS

An introduction to SPSS An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible

More information

Getting Started with JMP at ISU

Getting Started with JMP at ISU Getting Started with JMP at ISU 1 Introduction JMP (pronounced like jump ) is the new campus-wide standard statistical package for introductory statistics courses at Iowa State University. JMP is produced

More information

Data Science with Python Course Catalog

Data Science with Python Course Catalog Enhance Your Contribution to the Business, Earn Industry-recognized Accreditations, and Develop Skills that Help You Advance in Your Career March 2018 www.iotintercon.com Table of Contents Syllabus Overview

More information

Linear Methods for Regression and Shrinkage Methods

Linear Methods for Regression and Shrinkage Methods Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors

More information

Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design

Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design Randomized the Clinical Trails About the Uncontrolled Trails The protocol Development The

More information

Regression III: Advanced Methods

Regression III: Advanced Methods Lecture 3: Distributions Regression III: Advanced Methods William G. Jacoby Michigan State University Goals of the lecture Examine data in graphical form Graphs for looking at univariate distributions

More information

JMP Clinical. Release Notes. Version 5.0

JMP Clinical. Release Notes. Version 5.0 JMP Clinical Version 5.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics PASW Complex Samples 17.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see

More information

Chapter 1 Introduction. Chapter Contents

Chapter 1 Introduction. Chapter Contents Chapter 1 Introduction Chapter Contents OVERVIEW OF SAS/STAT SOFTWARE................... 17 ABOUT THIS BOOK.............................. 17 Chapter Organization............................. 17 Typographical

More information

Excel 2010 with XLSTAT

Excel 2010 with XLSTAT Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with

More information

SPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL

SPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL SPSS QM II SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform some statistical analyses in SPSS. Details around a certain function/analysis method not covered

More information

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400 Statistics (STAT) 1 Statistics (STAT) STAT 1200: Introductory Statistical Reasoning Statistical concepts for critically evaluation quantitative information. Descriptive statistics, probability, estimation,

More information

Introduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core)

Introduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core) Introduction to Data Science What is Analytics and Data Science? Overview of Data Science and Analytics Why Analytics is is becoming popular now? Application of Analytics in business Analytics Vs Data

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) Clinical SAS:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic

More information

Release Notes. Release The real voyage of discovery consists not in seeking new landscapes, but in having new eyes.

Release Notes. Release The real voyage of discovery consists not in seeking new landscapes, but in having new eyes. Release 8.0.1 The real voyage of discovery consists not in seeking new landscapes, but in having new eyes. Marcel Proust Release Notes JMP, A Business Unit of SAS SAS Campus Drive Cary, NC 27513 The correct

More information

Credit card Fraud Detection using Predictive Modeling: a Review

Credit card Fraud Detection using Predictive Modeling: a Review February 207 IJIRT Volume 3 Issue 9 ISSN: 2396002 Credit card Fraud Detection using Predictive Modeling: a Review Varre.Perantalu, K. BhargavKiran 2 PG Scholar, CSE, Vishnu Institute of Technology, Bhimavaram,

More information

SPSS Statistics 20.0 GA Fix List. Release notes. Abstract

SPSS Statistics 20.0 GA Fix List. Release notes. Abstract SPSS Statistics 20.0 GA Fix List Release notes Abstract A comprehensive list of defect corrections for SPSS Statistics 20 GA. Details of the fixes are listed below. If you have questions about a particular

More information

Data Science. Data Analyst. Data Scientist. Data Architect

Data Science. Data Analyst. Data Scientist. Data Architect Data Science Data Analyst Data Analysis in Excel Programming in R Introduction to Python/SQL/Tableau Data Visualization in R / Tableau Exploratory Data Analysis Data Scientist Inferential Statistics &

More information

Reproducible Research with R and RStudio

Reproducible Research with R and RStudio The R Series Reproducible Research with R and RStudio Christopher Gandrud C\ CRC Press cj* Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Group an informa

More information

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &

More information

StatCalc User Manual. Version 9 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved.

StatCalc User Manual. Version 9 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved. StatCalc User Manual Version 9 for Mac and Windows Copyright 2018, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents Introduction... 4 Getting Help... 4 Uninstalling StatCalc...

More information

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS TO SAS NEED FOR SAS WHO USES SAS WHAT IS SAS? OVERVIEW OF BASE SAS SOFTWARE DATA MANAGEMENT FACILITY STRUCTURE OF SAS DATASET SAS PROGRAM PROGRAMMING LANGUAGE ELEMENTS OF THE SAS LANGUAGE RULES FOR SAS

More information

DATA SCIENCE INTRODUCTION QSHORE TECHNOLOGIES. About the Course:

DATA SCIENCE INTRODUCTION QSHORE TECHNOLOGIES. About the Course: DATA SCIENCE About the Course: In this course you will get an introduction to the main tools and ideas which are required for Data Scientist/Business Analyst/Data Analyst/Analytics Manager/Actuarial Scientist/Business

More information

Chapter 17: INTERNATIONAL DATA PRODUCTS

Chapter 17: INTERNATIONAL DATA PRODUCTS Chapter 17: INTERNATIONAL DATA PRODUCTS After the data processing and data analysis, a series of data products were delivered to the OECD. These included public use data files and codebooks, compendia

More information

CREATING SIMULATED DATASETS Edition by G. David Garson and Statistical Associates Publishing Page 1

CREATING SIMULATED DATASETS Edition by G. David Garson and Statistical Associates Publishing Page 1 Copyright @c 2012 by G. David Garson and Statistical Associates Publishing Page 1 @c 2012 by G. David Garson and Statistical Associates Publishing. All rights reserved worldwide in all media. No permission

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

1 RefresheR. Figure 1.1: Soy ice cream flavor preferences

1 RefresheR. Figure 1.1: Soy ice cream flavor preferences 1 RefresheR Figure 1.1: Soy ice cream flavor preferences 2 The Shape of Data Figure 2.1: Frequency distribution of number of carburetors in mtcars dataset Figure 2.2: Daily temperature measurements from

More information

An Introduction to Preparing Data for Analysis with JMP. Full book available for purchase here. About This Book... ix About The Author...

An Introduction to Preparing Data for Analysis with JMP. Full book available for purchase here. About This Book... ix About The Author... An Introduction to Preparing Data for Analysis with JMP. Full book available for purchase here. Contents About This Book... ix About The Author... xiii Chapter 1: Data Management in the Analytics Process...

More information

What s new in MINITAB English R14

What s new in MINITAB English R14 What s new in MINITAB English R14 Overview Graphics Interface enhancements Statistical Enhancements Graphics Enhanced appearance Easy to create Easy to edit Easy to layout Paneling Automatic updating Rotating

More information

Overview 14 Table Definitions and Style Definitions 16 Output Objects and Output Destinations 18 ODS References and Resources 20

Overview 14 Table Definitions and Style Definitions 16 Output Objects and Output Destinations 18 ODS References and Resources 20 Contents Acknowledgments xiii About This Book xv Part 1 Introduction 1 Chapter 1 Why Use ODS? 3 Limitations of SAS Listing Output 4 Difficulties with Importing Standard Listing Output into a Word Processor

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Exploratory Data Analysis Dr. David Koop What is Exploratory Data Analysis? "Detective work" to summarize and explore datasets Includes: - Data acquisition and input

More information

Simulation Modeling and Analysis

Simulation Modeling and Analysis Simulation Modeling and Analysis FOURTH EDITION Averill M. Law President Averill M. Law & Associates, Inc. Tucson, Arizona, USA www. averill-law. com Boston Burr Ridge, IL Dubuque, IA New York San Francisco

More information

Enterprise Miner Tutorial Notes 2 1

Enterprise Miner Tutorial Notes 2 1 Enterprise Miner Tutorial Notes 2 1 ECT7110 E-Commerce Data Mining Techniques Tutorial 2 How to Join Table in Enterprise Miner e.g. we need to join the following two tables: Join1 Join 2 ID Name Gender

More information

Modelling and Quantitative Methods in Fisheries

Modelling and Quantitative Methods in Fisheries SUB Hamburg A/553843 Modelling and Quantitative Methods in Fisheries Second Edition Malcolm Haddon ( r oc) CRC Press \ y* J Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint of

More information

Intro to Stata. University of Virginia Library data.library.virginia.edu. September 16, 2014

Intro to Stata. University of Virginia Library data.library.virginia.edu. September 16, 2014 to 1/12 Intro to University of Virginia Library data.library.virginia.edu September 16, 2014 Getting to Know to 2/12 Strengths Available A full-featured statistical programming language For Windows, Mac

More information

CLAREMONT MCKENNA COLLEGE. Fletcher Jones Student Peer to Peer Technology Training Program. Basic Statistics using Stata

CLAREMONT MCKENNA COLLEGE. Fletcher Jones Student Peer to Peer Technology Training Program. Basic Statistics using Stata CLAREMONT MCKENNA COLLEGE Fletcher Jones Student Peer to Peer Technology Training Program Basic Statistics using Stata An Introduction to Stata A Comparison of Statistical Packages... 3 Opening Stata...

More information

Antrix Academy of Data Science TM

Antrix Academy of Data Science TM TM Preparing for MOST Promising Career Opportunities in Data Analytics... Excel Tableau SAS Excel & SQL IBM SPSS Business Analytics COURSES # Duration* 1 Excel Proficiency 5 Hrs 2 Data Analytics with SAS

More information