The grbase Package


August 13, 2005

Version
Date
Title grbase - a package for graphical modelling in R
Author Søren Højsgaard <sorenh@agrsci.dk>, Claus Dethlefsen <aas.claus.dethlefsen@nja.dk>
Maintainer Søren Højsgaard <sorenh@agrsci.dk>
Depends R (>= 2.1.0), MASS, dynamicgraph
Description This package defines S3-classes gmdata (graphical meta data), gmodel (graphical model) and illustrates how hierarchical log-linear models (hllm) may be implemented and combined with dynamicgraph.
License GPL version 2 or newer
URL

R topics documented:
  cadcomplete, carcass, dietox, gmodel, grbase-internal, grbase, grfit, gmdata, hllm, mathmark, rats, reinis, validvartypes

cadcomplete    Coronary artery disease data

Description
  Coronary artery disease data

Usage
  data(cadcomplete)

Format
  A data frame with 236 observations on the following 14 variables: Sex, AngPec, AMI, QWave, QWavecode, STcode, STchange, SuffHeartF, Hypertrophi, Hyperchol, Smoker, Inherit, Heartfail, CAD

Examples
  data(cadcomplete)


carcass    Lean meat contents of pig carcasses

Description
  Measurement of lean meat percentage and thickness of meat and fat layers at different locations of 344 pig carcasses.

Usage
  data(carcass)

Format
  A data frame with 7 variables and 344 observations. LMP is the lean meat percentage determined by dissection. F11, F12 and F13 are measurements of fat depth at three different locations on the carcass and M11, M12 and M13 are measurements of meat depth at the same locations.

Source
  Busk, Hans

dietox    Growth curves of pigs in a 3x3 factorial experiment

Description
  The dietox data frame has 861 rows and 7 columns.

Usage
  data(dietox)

Format
  This data frame contains the following columns: Weight, Feed, Time, Pig, Evit, Cu, Litter.

Source
  Lauridsen, C., Højsgaard, S., Sørensen, M.T. (1999) Influence of Dietary Rapeseed Oil, Vitamin E, and Copper on Performance and Antioxidant and Oxidative Status of Pigs. J. Anim. Sci. 77:

Examples
  data(dietox)


gmodel    Class "gmodel" - graphical models

Description
  The general class gmodel contains a formula object and a gmdata object. Implementations of specific graphical model classes can inherit from this class and provide methods for parsing the formula. This is illustrated in the implementation of a class for hierarchical log-linear models, hllm.

Usage
  gmodel(formula, gmdata)
  addedge(object, name.1, name.2)
  dropedge(object, name.1, name.2)
  addvertex(object, name)
  dropvertex(object, name)

Arguments
  formula   an object of class formula.
  gmdata    an object of class gmdata.
  object    an object of class gmodel.
  name      a string with the name of a variable available in gmdata, to be added to or removed from the model formula.
  name.1    a string with the name of a variable available in gmdata, marking the first node of the edge to be added to or dropped from the model.
  name.2    a string with the name of a variable available in gmdata, marking the second node of the edge to be added to or dropped from the model.

Value
  gmodel creates an object of class gmodel with the two components formula and gmdata. These components can be retrieved or replaced using the accessor functions of the same names. Also, a gmodel object may be manipulated using the dynamicgraph interface.

Author(s)
  Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, aas.claus.dethlefsen@nja.dk

See Also
  gmdata, grfit, hllm, dynamic.graph.

Examples
  data(rats)
  rats <- as.gmdata(rats)
  m1 <- gmodel(~.^., rats)
  m1.form <- formula(m1)
  m1.data <- gmdata(m1)
  observations(gmdata(m1)) <- observations(rats)[1:10,]


grbase-internal    Internal grbase functions

Description
  Internal functions called by other functions.
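The gmodel entry above describes an object with two components, a formula and a data description, each with an accessor and a replacement function. The following is a minimal base R sketch of that pattern only. It is not the package's implementation: the names toy_gmodel, model_formula and model_data are hypothetical and chosen so that they do not clash with grbase.

```r
# Hypothetical constructor: bundles a formula and a data object in an S3 class.
toy_gmodel <- function(formula, data) {
  stopifnot(inherits(formula, "formula"))
  structure(list(formula = formula, data = data), class = "toy_gmodel")
}

# Hypothetical accessors for the two components.
model_formula <- function(x) x$formula
model_data <- function(x) x$data

# Hypothetical replacement function, used as model_data(m) <- value.
`model_data<-` <- function(x, value) {
  x$data <- value
  x
}

# Build a model object, then replace its data with the first row only.
m <- toy_gmodel(~ Sex * Drug,
                data.frame(Sex = c("M", "F"), Drug = c("D1", "D2")))
model_data(m) <- model_data(m)[1, , drop = FALSE]

print(all.vars(model_formula(m)))
print(nrow(model_data(m)))
```

Keeping the formula and the data description as separate components is what allows one of them to be swapped while the other stays fixed, as the last line of the example in the gmodel entry does with observations.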

grbase    The package grbase: summary information

Description
  This package provides a basis for graphical modelling in R.

Details
  grbase provides the general framework for setting up data and model structures and provides examples of fitting hierarchical log-linear models for contingency tables and graphical Gaussian models for the multivariate normal distribution.
  Other graphical model software on CRAN includes: mimr, deal, CoCo, ggm and SIN.
  The package is intended as a contribution to the gr-project described by Lauritzen (2002).

Authors
  Søren Højsgaard, Biometry Research Unit, Danish Institute of Agricultural Sciences, DK-8830 Tjele, Denmark
  Claus Dethlefsen, Center for Cardiovascular Research, Aalborg Hospital, Århus University Hospital, DK-9000 Aalborg, Denmark

Acknowledgements
  Thanks to the other members of the gr initiative, in particular to David Edwards for providing functions for formula manipulation.

References
  Lauritzen, S. L. (2002). gRaphical Models in R. R News, 3(2)39.


grfit    Class "grfit" - fitted graphical models

Description
  Objects of class grfit are created when the function fit is applied to a gmodel object. When adding new types of gmodel objects, one must also supply the appropriate fit function. The grfit object contains the output of the fit, which can be accessed by getfit. Separate print and summary methods exist for grfit objects.

Usage
  fit(m, engine)
  getfit(x)

Arguments
  m        an object of class gmodel.
  engine   a string naming the fitting engine, which makes it possible to implement different fitting engines for the same gmodels.
  x        an object of class grfit, as created by fit applied to a gmodel object.

Value
  fit creates an object of class grfit. getfit returns the fit information created by the fitting algorithm.

Author(s)
  Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, aas.claus.dethlefsen@nja.dk

See Also
  gmodel.

Examples
  data(reinis)
  reinis <- as.gmdata(reinis)
  m1 <- hllm(~.^., reinis)
  m1 <- fit(m1,engine="loglm")


gmdata    Class "gmdata" graphical meta data

Description
  A common class for representing data. No matter the actual representation of data, the important characteristics are contained in a graphical metadata object.

Usage
  newgmdata(varnames,
            vartypes=rep(validvartypes()[1],length(varnames)),
            nlevels=NA,
            latent=NA,
            valuelabels=NULL,
            observations=NULL,
            description=NA,
            shortnames=c(letters,LETTERS)
            )

Arguments
  varnames       a vector of strings with names of variables.
  vartypes       a vector of strings with values from validvartypes giving the types of the variables.
  nlevels        a numeric vector with integer values for discrete or ordinal variables giving the number of levels.
  latent         a vector of strings with names of the latent variables.
  valuelabels    a list of vectors of strings with names of the levels for each discrete or ordinal variable.
  observations   an object containing the observations, e.g. a dataframe or a table.
  description    a string describing the origin of the data.
  shortnames     a vector of strings giving a short name of each variable.

Value
  An object of class gmdata holds information about the data, which can be retrieved and changed by accessor functions.

Objects from the Class
  Objects can be created by calls of the form newgmdata(varnames, vartypes, nlevels, latent, valuelabels, observations, description). More often, gmdata objects will be created from a data.frame or table.

  A gmdata object contains the abstraction of data into a meta data object including variable names, types, etc. However, the actual data might not be present or may be represented by a reference to data, such as a database file. Also, it may be possible to work without data, which may be valuable if the point of interest is in the model alone. Separating the specification of the variables from data has the benefit that some properties of a model can be investigated without any reference to data, for example decomposability and collapsibility.

Author(s)
  Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, dethlef@math.aau.dk

See Also
  demo(gmdata)


hllm    Hierarchical log-linear models

Description
  An implementation of hierarchical log-linear models using the framework of grbase. A model object is defined using hllm and fitted using fit (which calls loglm), and a model search is performed using stepwise. The models may be displayed and manipulated using the grbase functions, e.g. dynamic.graph.

Usage
  hllm(formula = ~.^1, gmdata, marginal)

Arguments
  formula    an object of class formula. The right hand side of the formula is a list of the generators separated by +. A generator is specified by variable names separated by *. Commonly used models have shorthand notations: saturated model (~.^.), main effects (~.^1), all k-th order interactions (~.^k).
  gmdata     an object of class gmdata.
  marginal   an optional argument specifying a subset of the variables from the gmdata object.

Value
  hllm returns an object of class hllm, inheriting from the superclass gmodel.

Author(s)
  Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, aas.claus.dethlefsen@nja.dk

See Also
  gmdata, grfit, ggm, dynamic.graph

Examples
  data(reinis)
  reinis <- as.gmdata(reinis)
  m2 <- hllm(~smoke*phys*protein+mental*phys+mental*family+smoke*systol*protein,
             reinis)
  m2 <- fit(m2,engine="loglm")
  ## Not run:
  dynamic.graph(m2)


mathmark    Mathematics marks for students

Description
  The mathmark data frame has 88 rows and 5 columns.

Usage
  data(mathmark)

Format
  This data frame contains the following columns: mechanics, vectors, algebra, analysis, statistics.
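The generator notation described in the hllm entry can be tried without grbase. The sketch below is an illustration under stated assumptions, not what hllm or fit do internally: it uses loglin from the stats package that ships with R instead of loglm, and the built-in HairEyeColor table instead of reinis. Each generator becomes one vector of margin indices.

```r
# HairEyeColor is a built-in 4 x 4 x 2 table with dimensions Hair, Eye, Sex.
tab <- HairEyeColor

# Main effects model (the ~.^1 shorthand): one generator per variable.
fit_main <- loglin(tab, margin = list(1, 2, 3), fit = TRUE, print = FALSE)

# Generators Hair*Eye + Sex: the first two dimensions interact, Sex is separate.
fit_gen <- loglin(tab, margin = list(c(1, 2), 3), fit = TRUE, print = FALSE)

# Saturated model (the ~.^. shorthand): a single generator with all variables.
fit_sat <- loglin(tab, margin = list(c(1, 2, 3)), fit = TRUE, print = FALSE)

# Likelihood ratio statistic and degrees of freedom for each model.
for (f in list(main = fit_main, generators = fit_gen, saturated = fit_sat)) {
  cat("lrt:", round(f$lrt, 2), " df:", f$df, "\n")
}
```

The models are nested, so the likelihood ratio statistic shrinks as generators grow, and the saturated model reproduces the observed table exactly.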

Author(s)
  Søren Højsgaard, sorenh@agrsci.dk

References
  David Edwards, An Introduction to Graphical Modelling, Second Edition, Springer Verlag, 2000

Examples
  data(mathmark)


rats    Weightloss of rats

Description
  An artificial dataset. 24 rats (12 female, 12 male) have been randomized to use one of three drugs (products for losing weight). The weightloss for each rat is noted after one and two weeks.

Usage
  data(rats)

Format
  A dataframe with 4 variables. Sex: "M" (male), "F" (female). Drug: "D1", "D2", "D3" (three types). W1: weightloss, week one. W2: weightloss, week two.

References
  Morrison, D.F. (1976). Multivariate Statistical Methods. McGraw-Hill, USA.
  Edwards, D. (1995). Introduction to Graphical Modelling, Springer-Verlag. New York.


reinis    Risk factors for coronary heart disease.

Description
  Data collected at the beginning of a 15 year follow-up study of probable risk factors for coronary thrombosis. Data are from all men employed in a car factory.

Usage
  data(reinis)

Format
  A table with 6 discrete variables. A: smoking, B: strenuous mental work, D: strenuous physical work, E: systolic blood pressure, F: ratio of lipoproteins, G: family anamnesis of coronary heart disease.

References
  Edwards and Havranek (1985): A fast procedure for model search in multidimensional contingency tables. Biometrika, 72:
  Reinis et al (1981): Prognostic significance of the risk profile in the prevention of coronary heart disease. Bratis. lek. Listy. 76:


validvartypes    Admissible variable types in gmdata objects

Description
  The variable types in a gmdata object must be from a vector of predefined types, which may be inspected by the command validvartypes(). The available types may be extended by package developers as demonstrated in the example. The types of the variables are important for the way they are displayed using the package dynamicgraph. The type is also important when the models are fitted to data.

Usage
  validvartypes()

Value
  A character vector with the names of the admissible variable types.

Author(s)
  Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, aas.claus.dethlefsen@nja.dk

See Also
  gmdata, dynamic.graph

Examples
  oldtypes <- validvartypes()
  validvartypes <- function() c(oldtypes, "MyVarType")
  validvartypes()
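As a rough illustration of how variable metadata can be separated from the observations and checked against a list of admissible types, the base R sketch below derives names, types and level counts from a data frame. Everything in it is an assumption made for illustration: the helper describe_vars, the vector allowed_types and its three type names are hypothetical and are not taken from grbase.

```r
# Assumed list of admissible types; the list returned by validvartypes() may differ.
allowed_types <- c("Discrete", "Ordinal", "Continuous")

# Hypothetical helper: one row of metadata per column of a data frame.
describe_vars <- function(df, allowed = allowed_types) {
  types <- vapply(df, function(col) {
    if (is.ordered(col)) {
      "Ordinal"
    } else if (is.factor(col) || is.character(col)) {
      "Discrete"
    } else {
      "Continuous"
    }
  }, character(1))
  # Every derived type must be one of the admissible types.
  stopifnot(all(types %in% allowed))
  # Number of distinct values for categorical columns, NA otherwise.
  nlev <- vapply(df, function(col) {
    if (is.factor(col) || is.character(col)) length(unique(col)) else NA_integer_
  }, integer(1))
  data.frame(varnames = names(df), vartypes = unname(types),
             nlevels = unname(nlev), stringsAsFactors = FALSE)
}

# A small made-up data frame shaped like the rats data (values are invented).
toy <- data.frame(Sex = factor(c("M", "F", "M")),
                  Drug = factor(c("D1", "D2", "D3")),
                  W1 = c(5, 7, 6))
meta <- describe_vars(toy)
print(meta)
```

The resulting table carries no observations, so questions that depend only on the variables and their types can be asked of it without access to the data.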

Index

Topic datasets: cadcomplete; carcass; dietox; mathmark; rats; reinis
Topic graphs: grbase
Topic internal: grbase-internal
Topic models: gmdata; gmodel; grbase; grfit; hllm; validvartypes
Topic multivariate: grbase

Entries (documenting topic in parentheses, entries separated by semicolons):
add.edge; addedge (gmodel); addvertex (gmodel); all.subsets; as.gmdata (gmdata); cadcomplete; cadincomplete (cadcomplete); carcass; contains; corr.matrix; delete.edge; description (gmdata); description<- (gmdata); dietox; dropedge (gmodel); dropvertex (gmodel); dual.rep; dynamic.gr.graph; dynamic.graph; dynamic.graph,gmodel-method; dynamic.graph,grfit-method; ell; extract.power; fit (grfit); fit.ggm; fit.hllm (hllm); formula; formula (gmodel); formula<- (gmodel); fun; getfit (grfit); getfit<- (grfit); ggm; gmdata; gmdata (gmodel); gmdata<- (gmodel); gmodel; gmodel-class (gmodel); gmodeltestclass-class; grbase; grbase-internal; gredges; grfit; grfit-class; grvariable; hllm; hllm-class (hllm); in.list; ips; is.cont; is.graphical; label

label,gmodeltestclass-method; label-methods; LabelAllEdges; latent (gmdata); latent<- (gmdata); loglm; mathmark; modifymodel; modifymodel,gmodel-method; newgmdata (gmdata); nlevels (gmdata); nlevels<- (gmdata); obs (gmdata); observations (gmdata); observations<- (gmdata); out; partial.corr.matrix; print.gmdata (gmdata); print.gmodel (gmodel); print.grfit (grfit); processformula (grbase-internal); rats; readf; readg; reinis; remove.redundant; selectorder; shortnames (gmdata); shortnames<- (gmdata); showf; showg; stepwise; stepwise.hllm (hllm); subsetof; summary.gmdata (gmdata); summary.grfit (grfit); testedge; testedge,gmodel-method; UserMenus; validvartypes; valuelabels (gmdata); valuelabels<- (gmdata); varnames (gmdata); varnames<- (gmdata); varset; vartypes (gmdata); vartypes<- (gmdata); width; width,gmodeltestclass-method; width-methods

The grbase Package. October 25, 2007

The grbase Package. October 25, 2007 The grbase Package October 25, 2007 Version 0.2.1 Title A package for graphical modelling in R Author Søren Højsgaard , Claus Dethlefsen Maintainer Claus Dethlefsen

More information

mimr A package for graphical modelling in R

mimr A package for graphical modelling in R DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ mimr A package for graphical modelling in R Søren Højsgaard Abstract The mimr package for graphical modelling in

More information

Aalborg Universitet. Graphical modelling software in R - status Detlefsen, Claus; Højsgaard, Søren; Lauritzen, Steffen L

Aalborg Universitet. Graphical modelling software in R - status Detlefsen, Claus; Højsgaard, Søren; Lauritzen, Steffen L Aalborg Universitet Graphical modelling software in R - status Detlefsen, Claus; Højsgaard, Søren; Lauritzen, Steffen L Published in: Proceedings of DSC 2007 Publication date: 2007 Document Version Publisher's

More information

Inference for loglinear models (contd):

Inference for loglinear models (contd): Stat 504, Lecture 25 1 Inference for loglinear models (contd): Loglinear/Logit connection Intro to Graphical Models Stat 504, Lecture 25 2 Loglinear Models no distinction between response and explanatory

More information

On the usage of the grim package

On the usage of the grim package On the usage of the grim package Søren Højsgaard grim version 0.2-0 as of 2017-03-31 Contents 1 Introduction 2 2 Introductory examples 2 2.1 A Discrete Model................................ 2 2.2 Model

More information

Draft: More complex graph computations for undirected graphical models by the CoCo bundle for R

Draft: More complex graph computations for undirected graphical models by the CoCo bundle for R DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ Draft: More complex graph computations for undirected graphical models by the CoCo bundle for R Jens Henrik Badsberg

More information

LIMIDs for decision support in pig production

LIMIDs for decision support in pig production LIMIDs for decision support in pig production Merete Stenner Hansen Anders Ringgaard Kristensen Department of Large Animal Sciences, Royal Veterinary and Agricultural University Grønnegårdsvej 2, DK-1870

More information

Introduction to SPSS Edward A. Greenberg, PhD

Introduction to SPSS Edward A. Greenberg, PhD Introduction to SPSS Edward A. Greenberg, PhD ASU HEALTH SOLUTIONS DATA LAB JANUARY 7, 2013 Files for this workshop Files can be downloaded from: http://www.public.asu.edu/~eagle/spss or (with less typing):

More information

CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing. University of Florida, CISE Department Prof.

CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing. University of Florida, CISE Department Prof. CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing University of Florida, CISE Department Prof. Daisy Zhe Wang Data Visualization Value of Visualization Data And Image Models

More information

Package quickreg. R topics documented:

Package quickreg. R topics documented: Package quickreg September 28, 2017 Title Build Regression Models Quickly and Display the Results Using 'ggplot2' Version 1.5.0 A set of functions to extract results from regression models and plot the

More information

Package grbase. August 7, 2018

Package grbase. August 7, 2018 Version 1.8-3.4 Title A Package for Graphical Modelling in R Author Søren Højsgaard Package grbase August 7, 2018 Maintainer Søren Højsgaard The 'grbase' package

More information

INTRODUCTION to SAS STATISTICAL PACKAGE LAB 3

INTRODUCTION to SAS STATISTICAL PACKAGE LAB 3 Topics: Data step Subsetting Concatenation and Merging Reference: Little SAS Book - Chapter 5, Section 3.6 and 2.2 Online documentation Exercise I LAB EXERCISE The following is a lab exercise to give you

More information

Keywords- Classification algorithm, Hypertensive, K Nearest Neighbor, Naive Bayesian, Data normalization

Keywords- Classification algorithm, Hypertensive, K Nearest Neighbor, Naive Bayesian, Data normalization GLOBAL JOURNAL OF ENGINEERING SCIENCE AND RESEARCHES APPLICATION OF CLASSIFICATION TECHNIQUES TO DETECT HYPERTENSIVE HEART DISEASE Tulasimala B. N* 1, Elakkiya S 2 & Keerthana N 3 *1 Assistant Professor,

More information

Statistical Tests for Variable Discrimination

Statistical Tests for Variable Discrimination Statistical Tests for Variable Discrimination University of Trento - FBK 26 February, 2015 (UNITN-FBK) Statistical Tests for Variable Discrimination 26 February, 2015 1 / 31 General statistics Descriptional:

More information

Classification using Weka (Brain, Computation, and Neural Learning)

Classification using Weka (Brain, Computation, and Neural Learning) LOGO Classification using Weka (Brain, Computation, and Neural Learning) Jung-Woo Ha Agenda Classification General Concept Terminology Introduction to Weka Classification practice with Weka Problems: Pima

More information

Package ordinalforest

Package ordinalforest Type Package Package ordinalforest July 16, 2018 Title Ordinal Forests: Prediction and Variable Ranking with Ordinal Target Variables Version 2.2 Date 2018-07-16 Author Roman Hornung Maintainer Roman Hornung

More information

Generalized additive models II

Generalized additive models II Generalized additive models II Patrick Breheny October 13 Patrick Breheny BST 764: Applied Statistical Modeling 1/23 Coronary heart disease study Today s lecture will feature several case studies involving

More information

Package doby. March 31, 2016

Package doby. March 31, 2016 Version 4.5-15 Package doby March 31, 2016 Title Groupwise Statistics, LSmeans, Linear Contrasts, Utilities Author Søren Højsgaard and Ulrich Halekoh Maintainer

More information

Package DSBayes. February 19, 2015

Package DSBayes. February 19, 2015 Type Package Title Bayesian subgroup analysis in clinical trials Version 1.1 Date 2013-12-28 Copyright Ravi Varadhan Package DSBayes February 19, 2015 URL http: //www.jhsph.edu/agingandhealth/people/faculty_personal_pages/varadhan.html

More information

Strategies for Graphical Model Selection. David Madigan, University of Washington. Adrian E. Raftery, University of Washington

Strategies for Graphical Model Selection. David Madigan, University of Washington. Adrian E. Raftery, University of Washington Strategies for Graphical Model Selection David Madigan, University of Washington Adrian E. Raftery, University of Washington Jeremy C. York, Carnegie-Mellon University Jerey M. Bradshaw, Fred Hutchinson

More information

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables Select Cases GRAPHS The Select Cases command excludes from further analysis all those cases that do not meet specified selection criteria. Select Cases For a subset of the datafile, use Select Cases. In

More information

Package mathgraph. February 20, 2015

Package mathgraph. February 20, 2015 Version 0.9-11 Date 2012-10-30 Title Directed and undirected graphs Package mathgraph February 20, 2015 Author Original S code by Patrick J. Burns. Ported to R by Nick Efthymiou. Adapted to new R releases

More information

Package DFP. February 2, 2018

Package DFP. February 2, 2018 Type Package Title Gene Selection Version 1.36.0 Date 2009-07-22 Package DFP February 2, 2018 Author R. Alvarez-Gonzalez, D. Glez-Pena, F. Diaz, F. Fdez-Riverola Maintainer Rodrigo Alvarez-Glez

More information

The Bootstrap and Jackknife

The Bootstrap and Jackknife The Bootstrap and Jackknife Summer 2017 Summer Institutes 249 Bootstrap & Jackknife Motivation In scientific research Interest often focuses upon the estimation of some unknown parameter, θ. The parameter

More information

Create a SAS Program to create the following files from the PREC2 sas data set created in LAB2.

Create a SAS Program to create the following files from the PREC2 sas data set created in LAB2. Topics: Data step Subsetting Concatenation and Merging Reference: Little SAS Book - Chapter 5, Section 3.6 and 2.2 Online documentation Exercise I LAB EXERCISE The following is a lab exercise to give you

More information

Package OLScurve. August 29, 2016

Package OLScurve. August 29, 2016 Type Package Title OLS growth curve trajectories Version 0.2.0 Date 2014-02-20 Package OLScurve August 29, 2016 Maintainer Provides tools for more easily organizing and plotting individual ordinary least

More information

Package madsim. December 7, 2016

Package madsim. December 7, 2016 Type Package Package madsim December 7, 2016 Title A Flexible Microarray Data Simulation Model Version 1.2.1 Date 2016-12-07 Author Doulaye Dembele Maintainer Doulaye Dembele Description

More information

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data. Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting

More information

Mission Lipid Data Management Software User s Guide

Mission Lipid Data Management Software User s Guide Mission Lipid Data Management Software User s Guide V1.0 September 2018 Table of Contents 1. Overview...1 1.1 About the Mission Lipid Data Management Software...1 1.2 System Requirements...1 1.3 Materials

More information

Package zebu. R topics documented: October 24, 2017

Package zebu. R topics documented: October 24, 2017 Type Package Title Local Association Measures Version 0.1.2 Date 2017-10-21 Author Olivier M. F. Martin [aut, cre], Michel Ducher [aut] Package zebu October 24, 2017 Maintainer Olivier M. F. Martin

More information

An introduction to ggplot: An implementation of the grammar of graphics in R

An introduction to ggplot: An implementation of the grammar of graphics in R An introduction to ggplot: An implementation of the grammar of graphics in R Hadley Wickham 00-0-7 1 Introduction Currently, R has two major systems for plotting data, base graphics and lattice graphics

More information

The mathgraph Package

The mathgraph Package The mathgraph Package April 19, 2007 Version 0.9-8 Date 2007-04-19 Title Directed and undirected graphs Author Original S code by Patrick J. Burns. Ported to R by. Adapted to new R releases by Claus Dethlefsen.

More information

Package modmarg. R topics documented:

Package modmarg. R topics documented: Package modmarg February 1, 2018 Title Calculating Marginal Effects and Levels with Errors Version 0.9.2 Calculate predicted levels and marginal effects, using the delta method to calculate standard errors.

More information

Unit 1 Lesson 4 Representing Data. Copyright Houghton Mifflin Harcourt Publishing Company

Unit 1 Lesson 4 Representing Data. Copyright Houghton Mifflin Harcourt Publishing Company Florida Benchmarks SC.6.N.1.1 Define a problem from the sixth grade curriculum, use appropriate reference materials to support scientific understanding, plan and carry out scientific investigation of various

More information

Learning Bayesian Networks with R

Learning Bayesian Networks with R Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003 March 20 22, Vienna, Austria ISSN 1609-395X Kurt Hornik, Friedrich Leisch & Achim Zeileis (eds. http://www.ci.tuwien.ac.at/conferences/dsc-2003/

More information

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO

More information

Data organization. So what kind of data did we collect?

Data organization. So what kind of data did we collect? Data organization Suppose we go out and collect some data. What do we do with it? First we need to figure out what kind of data we have. To illustrate, let s do a simple experiment and collect the height

More information

Chapter Two: Descriptive Methods 1/50

Chapter Two: Descriptive Methods 1/50 Chapter Two: Descriptive Methods 1/50 2.1 Introduction 2/50 2.1 Introduction We previously said that descriptive statistics is made up of various techniques used to summarize the information contained

More information

Package lmesplines. R topics documented: February 20, Version

Package lmesplines. R topics documented: February 20, Version Version 1.1-10 Package lmesplines February 20, 2015 Title Add smoothing spline modelling capability to nlme. Author Rod Ball Maintainer Andrzej Galecki

More information

Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion

Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion PROBLEM Decimal Fraction Percent Ratio 1. 0.05 2. 3. 45% 4. 1. Complete row 1 in the table above., 5%, 1:20 DIF: Application REF: Ratios

More information

36-720: Graphical Models

36-720: Graphical Models 36-720: Graphical Models Brian Junker September 17, 2007 Undirected Graphs and Conditional Independence Generators and Graphs Graphical Models Log-Linear Graphical Models Example Decomposabe Models 1 36-720

More information

Chapter 6. The Normal Distribution. McGraw-Hill, Bluman, 7 th ed., Chapter 6 1

Chapter 6. The Normal Distribution. McGraw-Hill, Bluman, 7 th ed., Chapter 6 1 Chapter 6 The Normal Distribution McGraw-Hill, Bluman, 7 th ed., Chapter 6 1 Bluman, Chapter 6 2 Chapter 6 Overview Introduction 6-1 Normal Distributions 6-2 Applications of the Normal Distribution 6-3

More information

B. Graphing Representation of Data

B. Graphing Representation of Data B Graphing Representation of Data The second way of displaying data is by use of graphs Although such visual aids are even easier to read than tables, they often do not give the same detail It is essential

More information

Selected Introductory Statistical and Data Manipulation Procedures. Gordon & Johnson 2002 Minitab version 13.

Selected Introductory Statistical and Data Manipulation Procedures. Gordon & Johnson 2002 Minitab version 13. Minitab@Oneonta.Manual: Selected Introductory Statistical and Data Manipulation Procedures Gordon & Johnson 2002 Minitab version 13.0 Minitab@Oneonta.Manual: Selected Introductory Statistical and Data

More information

Creating a data file and entering data

Creating a data file and entering data 4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that

More information

Data analysis using Microsoft Excel

Data analysis using Microsoft Excel Introduction to Statistics Statistics may be defined as the science of collection, organization presentation analysis and interpretation of numerical data from the logical analysis. 1.Collection of Data

More information

Package FWDselect. December 19, 2015

Package FWDselect. December 19, 2015 Title Selecting Variables in Regression Models Version 2.1.0 Date 2015-12-18 Author Marta Sestelo [aut, cre], Nora M. Villanueva [aut], Javier Roca-Pardinas [aut] Maintainer Marta Sestelo

More information

Package influence.sem

Package influence.sem Type Package Package influence.sem April 14, 2018 Title Case Influence in Structural Equation Models Version 2.2 Date 2018-04-14 Author Massimiliano Pastore & Gianmarco Altoe' Depends lavaan Suggests tcltk

More information

JMP Book Descriptions

JMP Book Descriptions JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked

More information

> glucose = c(81, 85, 93, 93, 99, 76, 75, 84, 78, 84, 81, 82, 89, + 81, 96, 82, 74, 70, 84, 86, 80, 70, 131, 75, 88, 102, 115, + 89, 82, 79, 106)

> glucose = c(81, 85, 93, 93, 99, 76, 75, 84, 78, 84, 81, 82, 89, + 81, 96, 82, 74, 70, 84, 86, 80, 70, 131, 75, 88, 102, 115, + 89, 82, 79, 106) This document describes how to use a number of R commands for plotting one variable and for calculating one variable summary statistics Specifically, it describes how to use R to create dotplots, histograms,
