The grbase Package. October 25, 2007

Size: px
Start display at page:

Download "The grbase Package. October 25, 2007"

Transcription

1 The grbase Package October 25, 2007 Version Title A package for graphical modelling in R Author Søren Højsgaard <sorenh@agrsci.dk>, Claus Dethlefsen <cld@rn.dk> Maintainer Claus Dethlefsen <cld@rn.dk> This package defines S3-classes gmdata (graphical meta data), gmodel (graphical model). Several other graphical modelling packages rely on these fundamental constructs. grbase illustrates how hierarchical log-linear models (hllm) may be implemented. License GPL version 2 or newer URL ZipData no Depends R (>= 2.1.0), MASS Encoding latin1 R topics documented: cadcomplete carcass dietox gmodel grbase-internal grbase grfit gmdata hllm mathmark rats reinis validvartypes Index 12 1

2 2 carcass cadcomplete Coronary artery disease data Coronary artery disease data data(cadcomplete) Format A data frame with 236 observations on the following 14 variables: Sex, AngPec, AMI, QWave, QWavecode, STcode, STchange, SuffHeartF, Hypertrophi, Hyperchol, Smoker, Inherit, Heartfail, CAD data(cadcomplete) carcass lean meat contents of pig carcasses Measurement of lean meat percentage and thickness of meat and fat layers at different locations of 344 pig carcasses. data(carcass) Format A data frame with 7 variables and 344 observations. LMP is the lean meat percentage determined by dissection. F11, F12 and F13 are measurements of fat depth at three different locations on the carcass and M11, M12 and M13 are measurements of meat depth at the same locations. Source Busk, Hans

3 dietox 3 dietox Growth curves of pigs in a 3x3 factorial experiment The dietox data frame has 861 rows and 7 columns. data(dietox) Format This data frame contains the following columns: Weight, Feed, Time, Pig, Evit, Cu, Litter. Source Lauridsen, C., Højsgaard, S.,Sørensen, M.T. C. (1999) Influence of Dietary Rapeseed Oli, Vitamin E, and Copper on Performance and Antioxidant and Oxidative Status of Pigs. J. Anim. Sci.77: data(dietox) gmodel Class "gmodel" - graphical models The general class gmodel contains a formula object and a gmdata object. Implementations of different specific graphical model classes can inherit from this class and provide methods for parsing the formula. This is illustrated in the implementation of a class for hierarchical log linear models, hllm. gmodel(formula, gmdata) Arguments formula gmdata an object of class formula. an object of class gmdata.

4 4 grbase Value gmodel creates an object of class gmodel with the two components formula and gmdata. These components can be retrieved or replaced using the accessor functions of the same names. Also, a gmodel object may be manipulated using the dynamicgraph interface. Author(s) Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, cld@rn.dk See Also gmdata, grfit, hllm. data(rats) rats <- as.gmdata(rats) m1 <- gmodel(~.^., rats) m1.form <- formula(m1) m1.data <- gmdata(m1) observations(gmdata(m1)) <- observations(rats)[1:10,] grbase-internal Internal grbase functions Internal functions called by other functions. grbase The package grbase : summary information Details This package provides a basis for graphical modelling in R grbase provides the general framework for setting up data and model structures and provide examples for fitting hierarchical log linear models for contingency tables and graphical Gaussian models for the multivariate normal distribution. Other graphical model software on CRAN includes: mimr, deal, CoCo, ggm and SIN. The package is intended as a contribution to the gr-project described by Lauritzen (2002).

5 grfit 5 Authors Søren Højsgaard, Aarhus University, DK-8830 Tjele, Denmark Claus Dethlefsen, Center for Cardiovascular Research, Aalborg Hospital, Århus University Hospital, DK-9000 Aalborg, Denmark Acknowledgements Thanks to the other members of the gr initiative, in particular to David Edwards for providing functions for formula-manipulation. References Lauritzen, S. L. (2002). graphical Models in R. R News, 3(2)39. grfit Class "grfit" - fitted graphical models Objects of class grfit are created when the function fit is applied to a gmodel object. When adding new types of gmodel objects, one must also supply the appropritate fit function. The grfit object contains the output of the fit which can be accessed by getfit. Separate print and summary methods exist for grfit objects. fit(m, engine) getfit(x) Arguments m engine x an object of class gmodel a string with the possibility of implementing different fitting engines for the same gmodels. an object of class grfit as created from fit applied to a gmodel object. Value fit creates an object of class grfit. getfit returns the fit information created by the fitting algorithm. Author(s) Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, cld@rn.dk

6 6 gmdata See Also gmodel. data(reinis) reinis <- as.gmdata(reinis) m1 <- hllm(~.^., reinis) m1 <- fit(m1,engine="loglm") gmdata Class "gmdata" graphical meta data A common class for representing data. No matter the actual representation of data, the important characteristics are contained in a graphical metadata object. newgmdata(varnames, vartypes=rep(validvartypes()[1],length(varnames)), nlevels=null, latent=null, valuelabels=null, observations=null, description=null, shortnames=null ) Arguments varnames vartypes nlevels latent valuelabels a vector of strings with names of variables. a vector of strings with values from validvartypes giving the types of the variables; typical types are "Discrete","Ordinal","Continuous", but others can be defined. The types can be abbreviated. a numeric vector with integer values for discrete or ordinal variables giving the number of levels. a vector of strings with names of the latent variables. a list of vectors of strings with names of the levels for each discrete or ordinal variable. observations an object containing the observations, eg. a dataframe or a table. description shortnames a string describing the origin of the data. a vector of strings giving a short name of each variable.

7 gmdata 7 Details Value If neither nlevels nor valuelabels are given, then all categorical variables are assumed to be binary. If valuelabels are given then nlevels are infered from these. valuelabels / nlevels are recycled if necessary. An object of class gmdata holds information about the data and can be retrieved and changed by accessor functions. Objects from the Class Objects can be created by calls of the form newgmdata( varnames, vartypes, nlevels, latent, valuelabels, observations, description). More often, gmdata objects will be created from a data.frame or table. A gmdata object contains the abstraction of data into a meta data object including variable names and types etc. However, the actual data might not be present or may be represented by a reference to data, such as a database file. Also, it may be possible to work without data, which may be valuable if the point of interest is in the model alone. Separating the specification of the variables from data has the benefit, that some properties of a model can be investigated without any reference to data, for example decomposability and collapsibility. Author(s) Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, dethlef@math.aau.dk See Also demo(gmdata) vn <- c("a","b","c","d") z<-newgmdata(vn,vartypes=c("dis","dis","con","con")) summary(z) z<-newgmdata(vn,vartypes=c("dis","dis","con","con"),nlevels=c(4,3,na,na)) summary(z) z<-newgmdata(vn,vartypes=c("dis","dis","con","con"),nlevels=c(4,na,na,na)) summary(z) z<-newgmdata(vn,vartypes=c("dis","dis","ord","con"),valuelabels=list("a"=1:2, "b"=1:4)) summary(z) ccnames <- c("asia", "smoke", "tub", "lung", "bronc", "either", "xray", "dysp") gmd <- newgmdata(ccnames,valuelabels=c("yes","no"), description="chest clinic") summary(gmd) data(mathmark) as.gmdata(mathmark)

8 8 hllm data(haireyecolor) as.gmdata(haireyecolor) hllm Hierarchical log-linear models An implementation of hierarchical log-linear models using the framework of grbase. A model object is defined using hllm, fitted using fit (which calls loglm) and a model search performed using stepwise. The models may be displayed and manipulated using the grbase. hllm(formula = ~.^1, gmdata, marginal) Arguments formula gmdata marginal an object of class formula. The right hand side of the formula is a list of the generators separated by +. A generator is specified by variable names with separated by *. Commonly used models have short hand notations: saturated model (~.^.), main effects (~.^1), all k th order interactions (~.^k). an object of class gmdata. an optional argument specifying a subset of the variables from the gmdata object. Value hllm returns an object of class hllm, inheriting from the superclass gmodel. Author(s) Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, cld@rn.dk See Also gmdata, grfit, ggm data(reinis) reinis <- as.gmdata(reinis) m2 <- hllm(~smoke*phys*protein+mental*phys+mental*family+smoke*systol*protein, reinis) m2 <- fit(m2,engine="loglm") ## plot(m2)

9 mathmark 9 mathmark Mathematics marks for students The mathmark data frame has 88 rows and 5 columns. data(mathmark) Format This data frame contains the following columns: mechanics, vectors, algebra, analysis, statistics. Author(s) Søren Højsgaard, sorenh@agrsci.dk References David Edwards, An Introduction to Graphical Modelling, Second Edition, Springer Verlag, 2000 data(mathmark) rats Weightloss of rats Format An artificial dataset. 24 rats (12 female, 12 male) have been randomized to use one of three drugs (products for loosing weight). The weightloss for each rat is noted after one and two weeks. data(rats) A dataframe with 4 variables. Sex: "M" (male), "F" (female). Drug: "D1", "D2", "D3" (three types). W1 weightloss, week one. W2 weightloss, week 2. References Morrison, D.F. (1976). Multivariate Statistical Methods. McGraw-Hill, USA. Edwards, D. (1995). Introduction to Graphical Modelling, Springer-Verlag. New York.

10 10 validvartypes reinis Risk factors for coronary heart disease. Format Data collected at the beginning of a 15 year follow-up study of probable risk factors for coronary thrombosis. Data are from all men employed in a car factory. data(reinis) A table with 6 discrete variables. A: smoking, B: strenous mental work, D: strenuous physical work, E: systolic blood pressure, F: ratio of lipoproteins, G: Family anamnesis of coronary heart disease. References Edwards and Havranek (1985): A fast procedure for model search in multidimensional contingency tables. Biometrika, 72: Reinis et al (1981): Prognostic significance of the risk profile in the prevention of coronary heart disease. Bratis. lek. Listy. 76: validvartypes Admissible variable types in gmdata objects Value The variable types in a gmdata object must be from a vector predefined types which may be inspected by the command validvartypes(). The available types may be extended by the package developers as demonstrated in the example. validvartypes() A character vector with the names of the admissible variable types. Author(s) Søren Højsgaard, sorenh@agrsci.dk, Claus Dethlefsen, cld@rn.dk

11 validvartypes 11 See Also gmdata oldtypes <- validvartypes() validvartypes <- function() c(oldtypes, "MyVarType") validvartypes()

12 Index Topic datasets cadcomplete, 1 carcass, 2 dietox, 2 mathmark, 9 rats, 9 reinis, 10 Topic graphs grbase, 4 Topic internal grbase-internal, 4 Topic models gmdata, 6 gmodel, 3 grbase, 4 grfit, 5 hllm, 8 validvartypes, 10 Topic multivariate grbase, 4 add.edge (grbase-internal), 4 addedge (grbase-internal), 4 all.subsets (grbase-internal), 4 as.gmdata (gmdata), 6 cad1 (cadcomplete), 1 cad2 (cadcomplete), 1 cadcomplete, 1 cadincomplete (cadcomplete), 1 carcass, 2 contains (grbase-internal), 4 corr.matrix (grbase-internal), 4 dataorigin (gmdata), 6 delete.edge (grbase-internal), 4 description (gmdata), 6 description<- (gmdata), 6 dietox, 2 dropedge (grbase-internal), 4 dual.rep (grbase-internal), 4 dynamic.gr.graph (grbase-internal), 4 ell (grbase-internal), 4 extract.power (grbase-internal), 4 fit (grfit), 5 fit.ggm (grbase-internal), 4 fit.hllm (hllm), 8 formula, 3, 8 formula (gmodel), 3 formula<- (gmodel), 3 fun (grbase-internal), 4 getfit (grfit), 5 getfit<- (grfit), 5 ggm, 8 ggm (grbase-internal), 4 gmdata, 3, 6, 8, 10, 11 gmdata (gmodel), 3 gmdata<- (gmodel), 3 gmodel, 3, 5 gmodel-class (gmodel), 3 grbase, 4, 8 grbase-internal, 4 gredges (grbase-internal), 4 grfit, 3, 5, 8 grfit-class (grbase-internal), 4 grvariable (grbase-internal), 4 hllm, 3, 8 hllm-class (hllm), 8 in.list (grbase-internal), 4 ips (grbase-internal), 4 is.cont (grbase-internal), 4 is.graphical (grbase-internal), 4 LabelAllEdges (grbase-internal), 4 12

13 INDEX 13 latent (gmdata), 6 latent<- (gmdata), 6 loglm, 8 math (mathmark), 9 mathmark, 9 newgmdata (gmdata), 6 nlevels (gmdata), 6 nlevels<- (gmdata), 6 nominal (gmdata), 6 nominal<- (gmdata), 6 valuelabels (gmdata), 6 valuelabels<- (gmdata), 6 varnames (gmdata), 6 varnames<- (gmdata), 6 varset (grbase-internal), 4 vartypes (gmdata), 6 vartypes<- (gmdata), 6 obs (gmdata), 6 observations (gmdata), 6 observations<- (gmdata), 6 ordinal (gmdata), 6 ordinal<- (gmdata), 6 out (grbase-internal), 4 partial.corr.matrix (grbase-internal), 4 print.gmdata (gmdata), 6 print.gmodel (gmodel), 3 print.grfit (grfit), 5 processformula (grbase-internal), 4 rats, 9 readf (grbase-internal), 4 readg (grbase-internal), 4 reinis, 10 remove.redundant (grbase-internal), 4 selectorder (grbase-internal), 4 shortnames (gmdata), 6 shortnames<- (gmdata), 6 showf (grbase-internal), 4 showg (grbase-internal), 4 stepwise (grbase-internal), 4 stepwise.hllm (hllm), 8 subsetof (grbase-internal), 4 summary.gmdata (gmdata), 6 summary.grfit (grfit), 5 update.gmodel (grbase-internal), 4 UserMenus (grbase-internal), 4 validvartypes, 6, 10

The grbase Package. August 13, Author Søren Højsgaard Claus Dethlefsen

The grbase Package. August 13, Author Søren Højsgaard Claus Dethlefsen The grbase Package August 13, 2005 Version 0.1.23 Date 2005-08-12 Title grbase - a package for graphical modelling in R Author Søren Højsgaard , Claus Dethlefsen

More information

mimr A package for graphical modelling in R

mimr A package for graphical modelling in R DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ mimr A package for graphical modelling in R Søren Højsgaard Abstract The mimr package for graphical modelling in

More information

Aalborg Universitet. Graphical modelling software in R - status Detlefsen, Claus; Højsgaard, Søren; Lauritzen, Steffen L

Aalborg Universitet. Graphical modelling software in R - status Detlefsen, Claus; Højsgaard, Søren; Lauritzen, Steffen L Aalborg Universitet Graphical modelling software in R - status Detlefsen, Claus; Højsgaard, Søren; Lauritzen, Steffen L Published in: Proceedings of DSC 2007 Publication date: 2007 Document Version Publisher's

More information

Inference for loglinear models (contd):

Inference for loglinear models (contd): Stat 504, Lecture 25 1 Inference for loglinear models (contd): Loglinear/Logit connection Intro to Graphical Models Stat 504, Lecture 25 2 Loglinear Models no distinction between response and explanatory

More information

On the usage of the grim package

On the usage of the grim package On the usage of the grim package Søren Højsgaard grim version 0.2-0 as of 2017-03-31 Contents 1 Introduction 2 2 Introductory examples 2 2.1 A Discrete Model................................ 2 2.2 Model

More information

CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing. University of Florida, CISE Department Prof.

CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing. University of Florida, CISE Department Prof. CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing University of Florida, CISE Department Prof. Daisy Zhe Wang Data Visualization Value of Visualization Data And Image Models

More information

Draft: More complex graph computations for undirected graphical models by the CoCo bundle for R

Draft: More complex graph computations for undirected graphical models by the CoCo bundle for R DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ Draft: More complex graph computations for undirected graphical models by the CoCo bundle for R Jens Henrik Badsberg

More information

INTRODUCTION to SAS STATISTICAL PACKAGE LAB 3

INTRODUCTION to SAS STATISTICAL PACKAGE LAB 3 Topics: Data step Subsetting Concatenation and Merging Reference: Little SAS Book - Chapter 5, Section 3.6 and 2.2 Online documentation Exercise I LAB EXERCISE The following is a lab exercise to give you

More information

LIMIDs for decision support in pig production

LIMIDs for decision support in pig production LIMIDs for decision support in pig production Merete Stenner Hansen Anders Ringgaard Kristensen Department of Large Animal Sciences, Royal Veterinary and Agricultural University Grønnegårdsvej 2, DK-1870

More information

Statistical Tests for Variable Discrimination

Statistical Tests for Variable Discrimination Statistical Tests for Variable Discrimination University of Trento - FBK 26 February, 2015 (UNITN-FBK) Statistical Tests for Variable Discrimination 26 February, 2015 1 / 31 General statistics Descriptional:

More information

Generalized additive models II

Generalized additive models II Generalized additive models II Patrick Breheny October 13 Patrick Breheny BST 764: Applied Statistical Modeling 1/23 Coronary heart disease study Today s lecture will feature several case studies involving

More information

Package grbase. August 7, 2018

Package grbase. August 7, 2018 Version 1.8-3.4 Title A Package for Graphical Modelling in R Author Søren Højsgaard Package grbase August 7, 2018 Maintainer Søren Højsgaard The 'grbase' package

More information

Introduction to SPSS Edward A. Greenberg, PhD

Introduction to SPSS Edward A. Greenberg, PhD Introduction to SPSS Edward A. Greenberg, PhD ASU HEALTH SOLUTIONS DATA LAB JANUARY 7, 2013 Files for this workshop Files can be downloaded from: http://www.public.asu.edu/~eagle/spss or (with less typing):

More information

Chapter 1: Thinking Critically Lecture notes Math 1030 Section C

Chapter 1: Thinking Critically Lecture notes Math 1030 Section C Section C.1: Sets and Venn Diagrams Definition of a set A set is a collection of objects and its objects are called members. Ex.1 Days of the week. Ex.2 Students in this class. Ex.3 Letters of the alphabet.

More information

Keywords- Classification algorithm, Hypertensive, K Nearest Neighbor, Naive Bayesian, Data normalization

Keywords- Classification algorithm, Hypertensive, K Nearest Neighbor, Naive Bayesian, Data normalization GLOBAL JOURNAL OF ENGINEERING SCIENCE AND RESEARCHES APPLICATION OF CLASSIFICATION TECHNIQUES TO DETECT HYPERTENSIVE HEART DISEASE Tulasimala B. N* 1, Elakkiya S 2 & Keerthana N 3 *1 Assistant Professor,

More information

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data. Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting

More information

Package quickreg. R topics documented:

Package quickreg. R topics documented: Package quickreg September 28, 2017 Title Build Regression Models Quickly and Display the Results Using 'ggplot2' Version 1.5.0 A set of functions to extract results from regression models and plot the

More information

Package zebu. R topics documented: October 24, 2017

Package zebu. R topics documented: October 24, 2017 Type Package Title Local Association Measures Version 0.1.2 Date 2017-10-21 Author Olivier M. F. Martin [aut, cre], Michel Ducher [aut] Package zebu October 24, 2017 Maintainer Olivier M. F. Martin

More information

Package mathgraph. February 20, 2015

Package mathgraph. February 20, 2015 Version 0.9-11 Date 2012-10-30 Title Directed and undirected graphs Package mathgraph February 20, 2015 Author Original S code by Patrick J. Burns. Ported to R by Nick Efthymiou. Adapted to new R releases

More information

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student Organizing data Learning Outcome 1. make an array 2. divide the array into class intervals 3. describe the characteristics of a table 4. construct a frequency distribution table 5. constructing a composite

More information

An introduction to ggplot: An implementation of the grammar of graphics in R

An introduction to ggplot: An implementation of the grammar of graphics in R An introduction to ggplot: An implementation of the grammar of graphics in R Hadley Wickham 00-0-7 1 Introduction Currently, R has two major systems for plotting data, base graphics and lattice graphics

More information

Package doby. March 31, 2016

Package doby. March 31, 2016 Version 4.5-15 Package doby March 31, 2016 Title Groupwise Statistics, LSmeans, Linear Contrasts, Utilities Author Søren Højsgaard and Ulrich Halekoh Maintainer

More information

Package DSBayes. February 19, 2015

Package DSBayes. February 19, 2015 Type Package Title Bayesian subgroup analysis in clinical trials Version 1.1 Date 2013-12-28 Copyright Ravi Varadhan Package DSBayes February 19, 2015 URL http: //www.jhsph.edu/agingandhealth/people/faculty_personal_pages/varadhan.html

More information

The mathgraph Package

The mathgraph Package The mathgraph Package April 19, 2007 Version 0.9-8 Date 2007-04-19 Title Directed and undirected graphs Author Original S code by Patrick J. Burns. Ported to R by. Adapted to new R releases by Claus Dethlefsen.

More information

Creating summary tables using the sumtable command

Creating summary tables using the sumtable command Creating summary tables using the sumtable command Lauren Scott and Chris Rogers University of Bristol Clinical Trials and Evaluation Unit 2016 London Stata Users Group meeting Scott LJ, Rogers CA. Creating

More information

The Bootstrap and Jackknife

The Bootstrap and Jackknife The Bootstrap and Jackknife Summer 2017 Summer Institutes 249 Bootstrap & Jackknife Motivation In scientific research Interest often focuses upon the estimation of some unknown parameter, θ. The parameter

More information

Package woebinning. December 15, 2017

Package woebinning. December 15, 2017 Type Package Package woebinning December 15, 2017 Title Supervised Weight of Evidence Binning of Numeric Variables and Factors Version 0.1.5 Date 2017-12-14 Author Thilo Eichenberg Maintainer Thilo Eichenberg

More information

JMP Book Descriptions

JMP Book Descriptions JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked

More information

Package ordinalforest

Package ordinalforest Type Package Package ordinalforest July 16, 2018 Title Ordinal Forests: Prediction and Variable Ranking with Ordinal Target Variables Version 2.2 Date 2018-07-16 Author Roman Hornung Maintainer Roman Hornung

More information

Strategies for Graphical Model Selection. David Madigan, University of Washington. Adrian E. Raftery, University of Washington

Strategies for Graphical Model Selection. David Madigan, University of Washington. Adrian E. Raftery, University of Washington Strategies for Graphical Model Selection David Madigan, University of Washington Adrian E. Raftery, University of Washington Jeremy C. York, Carnegie-Mellon University Jerey M. Bradshaw, Fred Hutchinson

More information

Houghton Mifflin MATHEMATICS Level 1 correlated to NCTM Standard

Houghton Mifflin MATHEMATICS Level 1 correlated to NCTM Standard Number and Operations Standard Understand numbers, ways of representing numbers, relationships among numbers, and number systems count with understanding and recognize TE: 191A 195B, 191 195, 201B, 201

More information

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset. Glossary of data mining terms: Accuracy Accuracy is an important factor in assessing the success of data mining. When applied to data, accuracy refers to the rate of correct values in the data. When applied

More information

Monday, 12 November 12. Matrices

Monday, 12 November 12. Matrices Matrices Matrices Matrices are convenient way of storing multiple quantities or functions They are stored in a table like structure where each element will contain a numeric value that can be the result

More information

Data analysis using Microsoft Excel

Data analysis using Microsoft Excel Introduction to Statistics Statistics may be defined as the science of collection, organization presentation analysis and interpretation of numerical data from the logical analysis. 1.Collection of Data

More information

Package CBCgrps. R topics documented: July 27, 2018

Package CBCgrps. R topics documented: July 27, 2018 Package CBCgrps July 27, 2018 Type Package Title Compare Baseline Characteristics Between Groups Version 2.3 Date 2018-07-27 Author Zhongheng Zhang, Sir Run-Run Shaw hospital, Zhejiang university school

More information

From the Grade 8, High School, Functions Progression Document, pp. 7-8:

From the Grade 8, High School, Functions Progression Document, pp. 7-8: Time Frame: Approximately 2-3 weeks Connections to Previous Learning: Students use their knowledge of expressions and equations to evaluate functions and to rewrite when necessary. They also use rigid

More information

DEPARTMENT OF HEALTH AND HUMAN SCIENCES HS900 RESEARCH METHODS

DEPARTMENT OF HEALTH AND HUMAN SCIENCES HS900 RESEARCH METHODS DEPARTMENT OF HEALTH AND HUMAN SCIENCES HS900 RESEARCH METHODS Using SPSS Topics addressed today: 1. Accessing data from CMR 2. Starting SPSS 3. Getting familiar with SPSS 4. Entering data 5. Saving data

More information

Data organization. So what kind of data did we collect?

Data organization. So what kind of data did we collect? Data organization Suppose we go out and collect some data. What do we do with it? First we need to figure out what kind of data we have. To illustrate, let s do a simple experiment and collect the height

More information

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables Select Cases GRAPHS The Select Cases command excludes from further analysis all those cases that do not meet specified selection criteria. Select Cases For a subset of the datafile, use Select Cases. In

More information

Package FWDselect. December 19, 2015

Package FWDselect. December 19, 2015 Title Selecting Variables in Regression Models Version 2.1.0 Date 2015-12-18 Author Marta Sestelo [aut, cre], Nora M. Villanueva [aut], Javier Roca-Pardinas [aut] Maintainer Marta Sestelo

More information

B. Graphing Representation of Data

B. Graphing Representation of Data B Graphing Representation of Data The second way of displaying data is by use of graphs Although such visual aids are even easier to read than tables, they often do not give the same detail It is essential

More information

Classification using Weka (Brain, Computation, and Neural Learning)

Classification using Weka (Brain, Computation, and Neural Learning) LOGO Classification using Weka (Brain, Computation, and Neural Learning) Jung-Woo Ha Agenda Classification General Concept Terminology Introduction to Weka Classification practice with Weka Problems: Pima

More information

Package modmarg. R topics documented:

Package modmarg. R topics documented: Package modmarg February 1, 2018 Title Calculating Marginal Effects and Levels with Errors Version 0.9.2 Calculate predicted levels and marginal effects, using the delta method to calculate standard errors.

More information

Package binomlogit. February 19, 2015

Package binomlogit. February 19, 2015 Type Package Title Efficient MCMC for Binomial Logit Models Version 1.2 Date 2014-03-12 Author Agnes Fussl Maintainer Agnes Fussl Package binomlogit February 19, 2015 Description The R package

More information

Package rpst. June 6, 2017

Package rpst. June 6, 2017 Type Package Title Recursive Partitioning Survival Trees Version 1.0.0 Date 2017-6-6 Author Maintainer Package rpst June 6, 2017 An implementation of Recursive Partitioning Survival Trees

More information

PSS718 - Data Mining

PSS718 - Data Mining Lecture 3 Hacettepe University, IPS, PSS October 10, 2016 Data is important Data -> Information -> Knowledge -> Wisdom Dataset a collection of data, a.k.a. matrix, table. Observation a row of a dataset,

More information

Learning Bayesian Networks with R

Learning Bayesian Networks with R Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003 March 20 22, Vienna, Austria ISSN 1609-395X Kurt Hornik, Friedrich Leisch & Achim Zeileis (eds. http://www.ci.tuwien.ac.at/conferences/dsc-2003/

More information

Creating a data file and entering data

Creating a data file and entering data 4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that

More information

Create a SAS Program to create the following files from the PREC2 sas data set created in LAB2.

Create a SAS Program to create the following files from the PREC2 sas data set created in LAB2. Topics: Data step Subsetting Concatenation and Merging Reference: Little SAS Book - Chapter 5, Section 3.6 and 2.2 Online documentation Exercise I LAB EXERCISE The following is a lab exercise to give you

More information

Package catdap. R topics documented: March 20, 2018

Package catdap. R topics documented: March 20, 2018 Version 1.3.4 Title Categorical Data Analysis Program Package Author The Institute of Statistical Mathematics Package catdap March 20, 2018 Maintainer Masami Saga Depends R (>=

More information

1 Introduction. 1.1 What is Statistics?

1 Introduction. 1.1 What is Statistics? 1 Introduction 1.1 What is Statistics? MATH1015 Biostatistics Week 1 Statistics is a scientific study of numerical data based on natural phenomena. It is also the science of collecting, organising, interpreting

More information

Package madsim. December 7, 2016

Package madsim. December 7, 2016 Type Package Package madsim December 7, 2016 Title A Flexible Microarray Data Simulation Model Version 1.2.1 Date 2016-12-07 Author Doulaye Dembele Maintainer Doulaye Dembele Description

More information

Package JGEE. November 18, 2015

Package JGEE. November 18, 2015 Type Package Package JGEE November 18, 2015 Title Joint Generalized Estimating Equation Solver Version 1.1 Date 2015-11-17 Author Gul Inan Maintainer Gul Inan Fits two different joint

More information

Mission Lipid Data Management Software User s Guide

Mission Lipid Data Management Software User s Guide Mission Lipid Data Management Software User s Guide V1.0 September 2018 Table of Contents 1. Overview...1 1.1 About the Mission Lipid Data Management Software...1 1.2 System Requirements...1 1.3 Materials

More information

Vine Medical Group Patient Registration Form Your Information

Vine Medical Group Patient Registration Form Your Information Your Information Welcome to Vine Medical Group. In order for us to offer you the high standards of clinical care we give to our patients, we ask that you complete this registration form. Before we are

More information

Package qicharts2. March 3, 2018

Package qicharts2. March 3, 2018 Title Quality Improvement Charts Version 0.4.0 Date 2018-03-03 Package qicharts2 March 3, 2018 Functions for making run charts, Shewhart control charts and Pareto charts for continuous quality improvement.

More information

Package compeir. February 19, 2015

Package compeir. February 19, 2015 Type Package Package compeir February 19, 2015 Title Event-specific incidence rates for competing risks data Version 1.0 Date 2011-03-09 Author Nadine Grambauer, Andreas Neudecker Maintainer Nadine Grambauer

More information

What are we working with? Data Abstractions. Week 4 Lecture A IAT 814 Lyn Bartram

What are we working with? Data Abstractions. Week 4 Lecture A IAT 814 Lyn Bartram What are we working with? Data Abstractions Week 4 Lecture A IAT 814 Lyn Bartram Munzner s What-Why-How What are we working with? DATA abstractions, statistical methods Why are we doing it? Task abstractions

More information

Package biomformat. April 11, 2018

Package biomformat. April 11, 2018 Version 1.7.0 Date 2016-04-16 Package biomformat April 11, 2018 Maintainer Paul J. McMurdie License GPL-2 Title An interface package for the BIOM file format Type Package Author

More information

Array operations in the grbase package

Array operations in the grbase package Array operations in the grbase package Søren Højsgaard grbase version 1.8-3.4 as of 2018-08-07 Contents 1 Introduction 1 2 Arrays/tables in R 2 2.1 Cross classified data - contingency tables........................

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Exploratory Data Analysis Dr. David Koop What is Exploratory Data Analysis? "Detective work" to summarize and explore datasets Includes: - Data acquisition and input

More information

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 8. ATTRIBUTE DATA INPUT AND MANAGEMENT 8.1 Attribute Data in GIS 8.1.1 Type of Attribute Table 8.1.2 Database Management 8.1.3 Type of Attribute Data Box 8.1 Categorical and Numeric Data 8.2 The

More information

A comprehensive modelling framework and a multiple-imputation approach to haplotypic analysis of unrelated individuals

A comprehensive modelling framework and a multiple-imputation approach to haplotypic analysis of unrelated individuals A comprehensive modelling framework and a multiple-imputation approach to haplotypic analysis of unrelated individuals GUI Release v1.0.2: User Manual January 2009 If you find this software useful, please

More information

Data Mining. Part 1. Introduction. 1.4 Input. Spring Instructor: Dr. Masoud Yaghini. Input

Data Mining. Part 1. Introduction. 1.4 Input. Spring Instructor: Dr. Masoud Yaghini. Input Data Mining Part 1. Introduction 1.4 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Instances Attributes References Instances Instance: Instances Individual, independent example of the concept to be

More information

STAT 1291: Data Science Lecture 4 - Data Visualization: Composing/dissecting Data Graphics Sungkyu Jung

STAT 1291: Data Science Lecture 4 - Data Visualization: Composing/dissecting Data Graphics Sungkyu Jung STAT 1291: Data Science Lecture 4 - Data Visualization: Composing/dissecting Data Graphics Sungkyu Jung Where are we? What is Data Science? How do we learn Data Science? Data visualization: What is a good

More information

Chapter Two: Descriptive Methods 1/50

Chapter Two: Descriptive Methods 1/50 Chapter Two: Descriptive Methods 1/50 2.1 Introduction 2/50 2.1 Introduction We previously said that descriptive statistics is made up of various techniques used to summarize the information contained

More information

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO

More information

Table of Contents. Introduction.*.. 7. Part /: Getting Started With MATLAB 5. Chapter 1: Introducing MATLAB and Its Many Uses 7

Table of Contents. Introduction.*.. 7. Part /: Getting Started With MATLAB 5. Chapter 1: Introducing MATLAB and Its Many Uses 7 MATLAB Table of Contents Introduction.*.. 7 About This Book 1 Foolish Assumptions 2 Icons Used in This Book 3 Beyond the Book 3 Where to Go from Here 4 Part /: Getting Started With MATLAB 5 Chapter 1:

More information

7 th Grade HONORS Year at a Glance

7 th Grade HONORS Year at a Glance These skills will be incorporated into at least 75% of the test questions in reporting categories 1 5 and will be identified along with content standards. 7.13A identify and apply mathematics to everyday

More information

The partial Package. R topics documented: October 16, Version 0.1. Date Title partial package. Author Andrea Lehnert-Batar

The partial Package. R topics documented: October 16, Version 0.1. Date Title partial package. Author Andrea Lehnert-Batar The partial Package October 16, 2006 Version 0.1 Date 2006-09-21 Title partial package Author Andrea Lehnert-Batar Maintainer Andrea Lehnert-Batar Depends R (>= 2.0.1),e1071

More information

Package geecc. October 9, 2015

Package geecc. October 9, 2015 Type Package Package geecc October 9, 2015 Title Gene set Enrichment analysis Extended to Contingency Cubes Version 1.2.0 Date 2014-12-31 Author Markus Boenn Maintainer Markus Boenn

More information

Excel Functions & Tables

Excel Functions & Tables Excel Functions & Tables Winter 2012 Winter 2012 CS130 - Excel Functions & Tables 1 Review of Functions Quick Mathematics Review As it turns out, some of the most important mathematics for this course

More information

Package lmesplines. R topics documented: February 20, Version

Package lmesplines. R topics documented: February 20, Version Version 1.1-10 Package lmesplines February 20, 2015 Title Add smoothing spline modelling capability to nlme. Author Rod Ball Maintainer Andrzej Galecki

More information

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015 STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, tsvv@steno.dk, Steno Diabetes Center June 11, 2015 Contents 1 Introduction 1 2 Recap: Variables 2 3 Data Containers 2 3.1 Vectors................................................

More information

Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator

Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator R.Saravanan 1, J.Sivapriya 2, M.Shahidha 3 1 Assisstant Professor, Department of IT,SMVEC, Puducherry, India 2,3 UG student, Department

More information

Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion

Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion PROBLEM Decimal Fraction Percent Ratio 1. 0.05 2. 3. 45% 4. 1. Complete row 1 in the table above., 5%, 1:20 DIF: Application REF: Ratios

More information

Math 183 Statistical Methods

Math 183 Statistical Methods Math 183 Statistical Methods Eddie Aamari S.E.W. Assistant Professor eaamari@ucsd.edu math.ucsd.edu/~eaamari/ AP&M 5880A 1 / 24 Math 183 Statistical Methods Eddie Aamari S.E.W. Assistant Professor eaamari@ucsd.edu

More information

LASER s Level 2 Maths Course - Summary

LASER s Level 2 Maths Course - Summary LASER s Level 2 Maths Course - Summary Unit Code Unit Title Credits Level Status SER945 Shape, Space and Measurement 3 2 Mandatory SER946 Collecting, Recording and Analysing Data 3 2 Mandatory SER947 Development

More information

R For Sql Developers. Kiran Math

R For Sql Developers. Kiran Math R For Sql Developers Kiran Math - kiranmath@outlook.com 2017-09-22 R Figure 1: R Download R CRAN The Comprehensive R Archive Network ~ OpenSource R MRAN Micrsoft R Open ~ Enhanced R Distribution Microsoft

More information

Automated Lesion Detection Methods for 2D and 3D Chest X-Ray Images

Automated Lesion Detection Methods for 2D and 3D Chest X-Ray Images Automated Lesion Detection Methods for 2D and 3D Chest X-Ray Images Takeshi Hara, Hiroshi Fujita,Yongbum Lee, Hitoshi Yoshimura* and Shoji Kido** Department of Information Science, Gifu University Yanagido

More information

STAT10010 Introductory Statistics Lab 2

STAT10010 Introductory Statistics Lab 2 STAT10010 Introductory Statistics Lab 2 1. Aims of Lab 2 By the end of this lab you will be able to: i. Recognize the type of recorded data. ii. iii. iv. Construct summaries of recorded variables. Calculate

More information

Package samplesizelogisticcasecontrol

Package samplesizelogisticcasecontrol Package samplesizelogisticcasecontrol February 4, 2017 Title Sample Size Calculations for Case-Control Studies Version 0.0.6 Date 2017-01-31 Author Mitchell H. Gail To determine sample size for case-control

More information

The GLMMGibbs Package

The GLMMGibbs Package The GLMMGibbs Package April 22, 2002 Version 0.5-1 Author Jonathan Myles and David Clayton Maintainer Jonathan Myles Depends R (>= 1.0) Date 2001/22/01 Title

More information

Data Preprocessing. Slides by: Shree Jaswal

Data Preprocessing. Slides by: Shree Jaswal Data Preprocessing Slides by: Shree Jaswal Topics to be covered Why Preprocessing? Data Cleaning; Data Integration; Data Reduction: Attribute subset selection, Histograms, Clustering and Sampling; Data

More information

36-720: Graphical Models

36-720: Graphical Models 36-720: Graphical Models Brian Junker September 17, 2007 Undirected Graphs and Conditional Independence Generators and Graphs Graphical Models Log-Linear Graphical Models Example Decomposabe Models 1 36-720

More information

Package Metatron. R topics documented: February 19, Version Date

Package Metatron. R topics documented: February 19, Version Date Version 0.1-1 Date 2013-10-16 Package Metatron February 19, 2015 Title Meta-analysis for Classification Data and Correction to Imperfect Reference Author Huiling Huang Maintainer Huiling Huang

More information

Package PTE. October 10, 2017

Package PTE. October 10, 2017 Type Package Title Personalized Treatment Evaluator Version 1.6 Date 2017-10-9 Package PTE October 10, 2017 Author Adam Kapelner, Alina Levine & Justin Bleich Maintainer Adam Kapelner

More information

Counting shapes 1.4.6

Counting shapes 1.4.6 GRADE R_TERM 1 WEEK TOPIC CONTENT CAMI KEYSTROKE CAMI Program Count in ones 1.1.1.1; 1.1.1.2; 1.1.1.3 1.1.1.4 Cami Math Count pictures 1.1.3.1; 1.1.3.2; 1 & 2 Counting 1.1.3.3; 1.1.3.4; Counting in units

More information

Chapter 6. The Normal Distribution. McGraw-Hill, Bluman, 7 th ed., Chapter 6 1

Chapter 6. The Normal Distribution. McGraw-Hill, Bluman, 7 th ed., Chapter 6 1 Chapter 6 The Normal Distribution McGraw-Hill, Bluman, 7 th ed., Chapter 6 1 Bluman, Chapter 6 2 Chapter 6 Overview Introduction 6-1 Normal Distributions 6-2 Applications of the Normal Distribution 6-3

More information

Multiple-imputation analysis using Stata s mi command

Multiple-imputation analysis using Stata s mi command Multiple-imputation analysis using Stata s mi command Yulia Marchenko Senior Statistician StataCorp LP 2009 UK Stata Users Group Meeting Yulia Marchenko (StataCorp) Multiple-imputation analysis using mi

More information

Package OLScurve. August 29, 2016

Package OLScurve. August 29, 2016 Type Package Title OLS growth curve trajectories Version 0.2.0 Date 2014-02-20 Package OLScurve August 29, 2016 Maintainer Provides tools for more easily organizing and plotting individual ordinary least

More information

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data Smoothing Dissimilarities for Cluster Analysis: Binary Data and unctional Data David B. University of South Carolina Department of Statistics Joint work with Zhimin Chen University of South Carolina Current

More information

Department Curriculum Map (new GCSE)

Department Curriculum Map (new GCSE) Department Curriculum Map 2014-15 (new GCSE) Department Mathematics required in Year 11 Foundation 1. Structure and calculation: N1 to N9 Fractions, decimals and percentages: N10 to N12 Measure and accuracy:

More information

Package geojsonsf. R topics documented: January 11, Type Package Title GeoJSON to Simple Feature Converter Version 1.3.

Package geojsonsf. R topics documented: January 11, Type Package Title GeoJSON to Simple Feature Converter Version 1.3. Type Package Title GeoJSON to Simple Feature Converter Version 1.3.0 Date 2019-01-11 Package geojsonsf January 11, 2019 Converts Between GeoJSON and simple feature objects. License GPL-3 Encoding UTF-8

More information

Package misclassglm. September 3, 2016

Package misclassglm. September 3, 2016 Type Package Package misclassglm September 3, 2016 Title Computation of Generalized Linear Models with Misclassified Covariates Using Side Information Version 0.2.0 Date 2016-09-02 Author Stephan Dlugosz

More information

Opening a Data File in SPSS. Defining Variables in SPSS

Opening a Data File in SPSS. Defining Variables in SPSS Opening a Data File in SPSS To open an existing SPSS file: 1. Click File Open Data. Go to the appropriate directory and find the name of the appropriate file. SPSS defaults to opening SPSS data files with

More information

The Basics of Graphical Models

The Basics of Graphical Models The Basics of Graphical Models David M. Blei Columbia University September 30, 2016 1 Introduction (These notes follow Chapter 2 of An Introduction to Probabilistic Graphical Models by Michael Jordan.

More information

Package DFP. February 2, 2018

Package DFP. February 2, 2018 Type Package Title Gene Selection Version 1.36.0 Date 2009-07-22 Package DFP February 2, 2018 Author R. Alvarez-Gonzalez, D. Glez-Pena, F. Diaz, F. Fdez-Riverola Maintainer Rodrigo Alvarez-Glez

More information

Standard Safety Visualization Set-up Using Spotfire

Standard Safety Visualization Set-up Using Spotfire Paper SD08 Standard Safety Visualization Set-up Using Spotfire Michaela Mertes, F. Hoffmann-La Roche, Ltd., Basel, Switzerland ABSTRACT Stakeholders are requesting real-time access to clinical data to

More information