Dynamic Documents. David Allen University of Kentucky. August 15, 2014

Size: px
Start display at page:

Download "Dynamic Documents. David Allen University of Kentucky. August 15, 2014"

Transcription

1 Dynamic Documents David Allen University of Kentucky August 15, 2014 The term dynamic document covers a wide assortment of documents. See Wikipedia for a general definition. This discussion involves a situation where the document is a report including statistical analyses and graphical displays based on data. The data are continually being augmented or replaced. What is needed a way to automate the revision of the document when the data changes. The approach here is to use R to do the statistical calculations and graphics, tikzdevice (an R package) to output the graphics to L A TEX, and knitr (also an R package) to process an input file to a L A TEX file. 1 The Kentucky Senate Race On November 4, 2014, the Commonwealth of Kentucky will elect a United States Senator. The candidates are Alison Lundergan Grimes and Mitch McConnell. This race has high national impact and is closely watched. A poll yields the number of people in a sample, from a population of potential voters, favoring each candidate. The proportion of the sample favoring Alison (or Mitch) is reported. However, this provides no indication of the sampling variability. The parameter of interest is the population proportion favoring Alison. A credible interval is such that the parameter lies within the interval with high probability. A credible interval is a more informative mode of presentation, as it conveys the uncertainty of knowledge about the parameter. Calculation of the credible interval is the statistical analysis portion of the report. Denote the proportion of the population favoring Alison by p. The first step in calculating a credible interval is finding the posterior density function 1

2 of p given the sample results. One needs to select a level of credibility. The value 0.95 has a strong tradition and is used here. The 0.95 credibility interval is an interval (p 1, p 2 ) where P (p 1 < p < p 2 ) = There are multiple intervals satisfying the probability statement. The interval having minimal length is usually used. An example assuming a sample with 55 favoring Alison and 45 favoring Mitch is shown in the following graph. The 0.95 credible interval (0.4528, ) is the base of the shaded region. Posterior Density Proportion Favoring Alison A report might say something like A current poll produced 55 potential voters favoring Alison and 45 favoring Mitch. These results give a 0.95 credible interval for the proportion favoring Alison of (0.4528, ). The report might also include the above graph. The preceding graphic and the credible interval were produced with R. The output from R was then transcribed to a L A TEX file to produce the report. Polling will be a continuing activity from now until election day. Rerunning R and cutting and pasting output into a L A TEX document is tedious and error prone. Subsequent sections show how to automate the process. 2 TikZ graphics TikZ is a graphics package used in conjunction with TEX. It is included with most distributions of TEX, but may be downloaded at 2

3 net/projects/pgf/. A large selection of examples of TikZ graphics are posted at I think it likely that most TUGboat readers are familiar with TikZ. I give just two examples I have composed. This graphic was hand-coded in Sketch, and then processed into TikZ. This example was hand-coded directly in TikZ. θ 2 θ 1 GI tract Plasma Other θ 4 θ 3 3 An overview of R R is a language and environment for statistical computing and graphics. Its home page is R is a free software project. It compiles and runs on a wide variety of Unix platforms and similar systems (including FreeBSD and GNU/Linux), Windows and MacOSX. R is often the vehicle of choice for research in statistical methodology, and it provides an open source route to participation in that activity. R provides a wide variety of statistical techniques including linear and nonlinear modeling, classical statistical tests, time-series analysis, classification, and clustering. R is highly extensible and contains a rich collection of graphical techniques. One of R s strengths is the ease with which welldesigned publication-quality plots can be produced, including mathematical 3

4 symbols and formulas where needed. Defaults for the minor design choices in graphics have been carefully considered, but the user retains full control. 4 TikzDevice TikzDevice enables L A TEX-ready output from R graphics functions. This is done by producing code that can be understood by the TikZ graphics language. All text in a graphic output with the tikz() function will be typeset by L A TEX and therefore will match whatever fonts are currently used in the document. This also means that L A TEX mathematics can be typeset directly into labels and annotations! Graphics produced this way can also be annotated with custom TikZ commands. An example R Graphic output using tikzdevice is y y = 10 + (x 5) x The program that produced the preceding graph is setwd("~/tug2014") source("quadratic-data.r") source("quadratic-graph.r") The source command executes the statements in the named file. Here I group code into small parts to focus discussion. The file quadratic-data.r contains the data generation code x <- (0:100)/10 y < (x-5)^2 4

5 R formulas are different from standard mathematical formulas. A function call operates on every element of a vector. When vectors of different lengths are added or subtracted, the shorter one is recycled in an effort to make them the same length. The file quadratic-graph.r contains the graphics code require(tikzdevice) tikz("quadratic-graph.tex",standalone=false, width=4.5, height=2.5) par(mex=0.6, mar=c(4.5,5,0,0)+0.1) plot(x, y, type= l, xlab="$x$",ylab="$y$") text(5, 25, "$y = 10+(x-5)^2$") dev.off() Most of this code is understandable by comparing it to the resulting graph. An exception might be the par function used to change graphic parameters from their default values. The ones used in this example are Arg mex mar Description A character size expansion factor used to describe coordinates in the margins of plots. Vector of the form c(bottom, left, top, right) which gives the number of lines of margin to be specified on the four sides. 5 Implementation This section implements a dynamic document that facilitates reporting the current status of the race between Alison and Mitch. The document has a title, a graph of the posterior density, and a short statistical report. The data file for this senate race is senate.dat and contains just two numbers, the number in the sample that favor Alison, and the number that favor Mitch. This two number file can be updated with an editor. For more complicated situations there might be a program that updates a data file. Knitr is an R package containing a function knit. The function knit takes a file name with an extension.rnw as an argument. An.Rnw file is like a L A TEX file with inter-dispersed R chunks. The output is a pure L A TEX file containing the output from running the R chunks. Documentation for knitr is available online and in Yihui Xie s book, [1]. 5

6 R is an implemention of a language S. There is a function \Sexpr(), for S expression, that may be placed in the TEX portion of the file. \Sexpr() takes an R expression as an argument. The expression is evaluated, converted to text, and passed into the L A TEX output. The content of senate.rnw is \documentclass[12pt]{article} \begin{document} <<setup,echo=false>>= \title{alison Versus Mitch} \author{david Allen\\University of Kentucky} \maketitle \thispagestyle{empty} % <<params,echo=false>>= A poll released July 28, 2014 produced \Sexpr{a-2} potential voters favoring Alison and \Sexpr{b-2} favoring Mitch. These results give a \Sexpr{level} credible interval for the proportion favoring Alison of (\Sexpr{p1}, \Sexpr{p2}). % The posterior density function, with the area over the credible interval shaded, is <<label="density",dev= tikz,echo=false,fig.width=4,fig.height=2.75,fig.align= cen \end{document} The R chunks have a header of the form <<... >>= where commands are inside the double brackets. A chunk ends with The content of chunk1.r is setwd("~/tug2014") 6

7 interval.length <- function(p1, a, b, level=0.95) { q <- qbeta(1-level, a,b) if( p1 > q ) return(1 - q) if( p1 < 0 ) return(qbeta(level, a, b)) p2 <- qbeta(pbeta(p1, a, b) + level, a, b) return(p2-p1) } The function interval.length is a function I wrote that gives the length of an interval starting at p 1. The posterior distribution of p is the beta distribution. qbeta is a built-in function giving quantiles of the beta distribution. The content of chunk2 is vote <- vector(mode="numeric") vote <- scan(file="senate.dat") a <- vote[1] + 2 b <- vote[2] + 2 level < p1 <- optimize(f = interval.length, interval = c(0, qbeta(1-level, a,b)), a=a, b=b, level=level)$minimum p2 <- qbeta(pbeta(p1, a, b) + level, a, b) Here the built-in optimize function is used to find the value of p 1 associated with the shortest interval. Then the corresponding p 2 is computed. The content of chunk3.r is left <- (1:80)/80*p1 interval <- p1 + (1:80)/80*(p2-p1) right <- p2 + (1:80)/80*(1-p2) domain <- c(left, interval, right) range <- dbeta(domain, a, b) par(mex=0.6, mar=c(4.5,5,0,0)+0.1) plot(c(0,1), c(0, max(range)), type="n", xlab="proportion Favoring Alison", ylab="density",yaxt= n ) 7

8 polygon(c(interval, p2, p1), c(dbeta(interval, a, b), 0, 0), col=27) lines(domain, range); lines(c(0,1),c(0,0)) lines(c(0.5,0.5),c(0,max(range))) This code produces the graph. After each data update run the following command in a terminal. Rscript -e "library(knitr);knit( senate.rnw )" A L A TEX file is produced that is processed in the usual ways. Rscript is just the command line version of R. The -e option means the following are statements to be run, as opposed to a file containing statements. I conclude with an exercise. A poll was released on July 28, 2014 (the first full day of the conference in Portland) showing 321 for Alison and 336 for Mitch. I invite you to prepare a data file, senate.dat, containing Then knit the.rnw file and L A TEX the resulting senate.tex References [1] Yihui Xie. Dynamic Documents with R and knitr. Chapman & Hall/CRC Press, ISBN

Dynamic Documents. David Allen University of Kentucky. July 30, Presented at TUG 2014

Dynamic Documents. David Allen University of Kentucky. July 30, Presented at TUG 2014 Dynamic Documents David Allen University of Kentucky July 30, 2014 Presented at TUG 2014 1 Introduction A generic definition of a dynamic document from Wikipedia: A living document or dynamic document

More information

Using R. Liang Peng Georgia Institute of Technology January 2005

Using R. Liang Peng Georgia Institute of Technology January 2005 Using R Liang Peng Georgia Institute of Technology January 2005 1. Introduction Quote from http://www.r-project.org/about.html: R is a language and environment for statistical computing and graphics. It

More information

Dynamic Documents With R And Knitr (Chapman & Hall/CRC The R Series) By Yihui Xie READ ONLINE

Dynamic Documents With R And Knitr (Chapman & Hall/CRC The R Series) By Yihui Xie READ ONLINE Dynamic Documents With R And Knitr (Chapman & Hall/CRC The R Series) By Yihui Xie READ ONLINE If searched for the ebook by Yihui Xie Dynamic Documents with R and knitr (Chapman & Hall/CRC The R Series)

More information

Guidelines for reporting statistical results

Guidelines for reporting statistical results Guidelines for reporting statistical results Leonhard Held and Isaac Gravestock 7th September 2015 This document provides some guidelines for writing statistical consulting reports. A framework is needed

More information

L A T E X Week 11, Other Document Classes

L A T E X Week 11, Other Document Classes and L A T E X Week and Department of Mathematics University of California, Berkeley November 15, 2007 What we will cover and Overview What we will cover and Overview classes including What we will cover

More information

Math 235: Introduction to LaTeX

Math 235: Introduction to LaTeX Math 235: Introduction to LaTeX The LaTeX word processing system was built to do mathematical typesetting. It is different than word processors; in LaTeX you type in text and typesetting commands, then

More information

CSCM10 Research Methodology A Taster of L A TEX

CSCM10 Research Methodology A Taster of L A TEX CSCM10 Research Methodology A Taster of L A TEX Anton Setzer http://www.cs.swan.ac.uk/ csetzer/lectures/ computerscienceprojectresearchmethods/current/index.html November 10, 2016 CSCM10 Lecture 4, 11/2/16:

More information

Dynamic Documents. Using knitr. Benjamin Hofner

Dynamic Documents. Using knitr. Benjamin Hofner Dynamic Documents Using knitr Benjamin Hofner benjamin.hofnerfau.de Institut für Medizininformatik, Biometrie und Epidemiologie (IMBE) Friedrich-Alexander-Universität Erlangen-Nürnberg Biometrisches Kolloquium

More information

EPIB Four Lecture Overview of R

EPIB Four Lecture Overview of R EPIB-613 - Four Lecture Overview of R R is a package with enormous capacity for complex statistical analysis. We will see only a small proportion of what it can do. The R component of EPIB-613 is divided

More information

WYSIWYG Systems. CSCM10 Research Methodology Lecture 4 (11/2/16): A Taster of L A TEX WYSIWYM. Advantages/Disadvantages of WYSIWYG Systems

WYSIWYG Systems. CSCM10 Research Methodology Lecture 4 (11/2/16): A Taster of L A TEX WYSIWYM. Advantages/Disadvantages of WYSIWYG Systems WYSIWYG Systems CSCM10 Research Methodology Lecture 4 (11/2/16): A Taster of L A TEX Anton Setzer http://www.cs.swan.ac.uk/ csetzer/lectures/ computerscienceprojectresearchmethods/current/index.html WYSIWYG

More information

Use of GeoGebra in teaching about central tendency and spread variability

Use of GeoGebra in teaching about central tendency and spread variability CREAT. MATH. INFORM. 21 (2012), No. 1, 57-64 Online version at http://creative-mathematics.ubm.ro/ Print Edition: ISSN 1584-286X Online Edition: ISSN 1843-441X Use of GeoGebra in teaching about central

More information

LaTeX. Information Literacy II EN(IL2) Course

LaTeX. Information Literacy II EN(IL2) Course LaTeX Information Literacy II EN(IL2) Course Previous Lecture Saving plots to file Customizing plots Bar and pie charts Today Introduction to Latex - Basic commands - Structure of the document - Mathematical

More information

OHIO GRADUATION TESTS

OHIO GRADUATION TESTS Student Name OHIO GRADUATION TESTS Mathematics Practice Test 2006 by Ohio Department of Education The Ohio Department of Education does not discriminate on the basis of race, color, national origin, sex,

More information

CSCM10 Research Methodology A Taster of L A TEX

CSCM10 Research Methodology A Taster of L A TEX CSCM10 Research Methodology A Taster of L A TEX Anton Setzer http://www.cs.swan.ac.uk/ csetzer/lectures/ computerscienceprojectresearchmethods/current/index.html October 22, 2017 WYSIWYG Systems WYSIWYG

More information

WYSIWYG Systems. CSCM10 Research Methodology A Taster of L A TEX WYSIWYM. Advantages/Disadvantages of WYSIWYG Systems.

WYSIWYG Systems. CSCM10 Research Methodology A Taster of L A TEX WYSIWYM. Advantages/Disadvantages of WYSIWYG Systems. WYSIWYG Systems CSCM10 Research Methodology A Taster of L A TEX Anton Setzer http://www.cs.swan.ac.uk/ csetzer/lectures/ computerscienceprojectresearchmethods/current/index.html WYSIWYG = What You See

More information

practice: quadratic functions [102 marks]

practice: quadratic functions [102 marks] practice: quadratic functions [102 marks] A quadratic function, f(x) = a x 2 + bx, is represented by the mapping diagram below. 1a. Use the mapping diagram to write down two equations in terms of a and

More information

Getting Started in L A TEX

Getting Started in L A TEX Getting Started in L A TEX Roy Martinez Tuesday May 7th, 2008 1 Welcome to L A TEX Welcome to L A TEX, this guide is designed to give you a quick crash course introduction to this great formating tool.

More information

The Curse of Dimensionality

The Curse of Dimensionality The Curse of Dimensionality ACAS 2002 p1/66 Curse of Dimensionality The basic idea of the curse of dimensionality is that high dimensional data is difficult to work with for several reasons: Adding more

More information

f( x ), or a solution to the equation f( x) 0. You are already familiar with ways of solving

f( x ), or a solution to the equation f( x) 0. You are already familiar with ways of solving The Bisection Method and Newton s Method. If f( x ) a function, then a number r for which f( r) 0 is called a zero or a root of the function f( x ), or a solution to the equation f( x) 0. You are already

More information

Table 3: Midpoint estimate

Table 3: Midpoint estimate The function y gx.x x 1 is shown in Figure 1. Six midpoint rectangles have been drawn between the function and the x-axis over,1 ; the areas of these six rectangles are shown in Table 1. Figure 1 Let s

More information

PROBLEM SOLVING AND OFFICE AUTOMATION. A Program consists of a series of instruction that a computer processes to perform the required operation.

PROBLEM SOLVING AND OFFICE AUTOMATION. A Program consists of a series of instruction that a computer processes to perform the required operation. UNIT III PROBLEM SOLVING AND OFFICE AUTOMATION Planning the Computer Program Purpose Algorithm Flow Charts Pseudo code -Application Software Packages- Introduction to Office Packages (not detailed commands

More information

A Knitr Demo. Charles J. Geyer. February 8, 2017

A Knitr Demo. Charles J. Geyer. February 8, 2017 A Knitr Demo Charles J. Geyer February 8, 2017 1 Licence This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-sa/4.0/.

More information

LASER s Level 2 Maths Course - Summary

LASER s Level 2 Maths Course - Summary LASER s Level 2 Maths Course - Summary Unit Code Unit Title Credits Level Status SER945 Shape, Space and Measurement 3 2 Mandatory SER946 Collecting, Recording and Analysing Data 3 2 Mandatory SER947 Development

More information

Introduction to L A T E X

Introduction to L A T E X L A T E X R. M. Department of Mathematics University of Kentucky 6 October 2008 / L A T E X Outline T E X T E X was begun by Donald Knuth in 1977 because he did not like the appearance of his book The

More information

Parametric. Practices. Patrick Cunningham. CAE Associates Inc. and ANSYS Inc. Proprietary 2012 CAE Associates Inc. and ANSYS Inc. All rights reserved.

Parametric. Practices. Patrick Cunningham. CAE Associates Inc. and ANSYS Inc. Proprietary 2012 CAE Associates Inc. and ANSYS Inc. All rights reserved. Parametric Modeling Best Practices Patrick Cunningham July, 2012 CAE Associates Inc. and ANSYS Inc. Proprietary 2012 CAE Associates Inc. and ANSYS Inc. All rights reserved. E-Learning Webinar Series This

More information

Formatting with LaTeX

Formatting with LaTeX Formatting with LaTeX Zuyuan Wang School of Mechanical Engineering Purdue University wang1707@purdue.edu June 23, 2016 Seminar @ SURF 2016 About the SURF GAs Formatting with LaTeX (02/35) 06/23/2016 Purdue

More information

Introduction to Computer Applications CISY Zahoor Khan, Ph.D.

Introduction to Computer Applications CISY Zahoor Khan, Ph.D. Introduction to Computer Applications CISY 1225 Zahoor Khan, Ph.D. Last updated: May 2014 CISY 1225 Custom book Chapter 5 Introduction to Excel Copyright 2011 Pearson Education, Inc. Publishing as Prentice

More information

Package spikes. September 22, 2016

Package spikes. September 22, 2016 Type Package Package spikes September 22, 2016 Title Detecting Election Fraud from Irregularities in Vote-Share Distributions Version 1.1 Depends R (>= 3.2.2), emdbook Date 2016-09-21 Author Arturas Rozenas

More information

Pimp your thesis: a minimal introduction to L A T E X.

Pimp your thesis: a minimal introduction to L A T E X. 1 / 20 Pimp your thesis: a minimal introduction to L A T E X. Maarten Bransen IC/TC, U.S.S. Proton March 20, 2018 2 / 20 What is L A T E X? Most word processors you may be used to (i.e. Microsoft Word,

More information

A Triangular Fuzzy Model for Assessing Problem Solving Skills

A Triangular Fuzzy Model for Assessing Problem Solving Skills Annals of Pure and Applied Mathematics Vol. 7, No., 04, 3-8 ISSN: 79-087X (P), 79-0888(online) Published on 9 September 04 www.researchmathsci.org Annals of A Triangular Fuzzy Model for Assessing Problem

More information

Remembering the Past. Who Needs Documentation?

Remembering the Past. Who Needs Documentation? Remembering the Past Using SAS Keyboard Macros to Enhance Documentation Pete Lund Looking Glass Analytics Olympia, WA Who Needs Documentation? How many times have you looked at line after line of code

More information

Chapter 4. Clustering Core Atoms by Location

Chapter 4. Clustering Core Atoms by Location Chapter 4. Clustering Core Atoms by Location In this chapter, a process for sampling core atoms in space is developed, so that the analytic techniques in section 3C can be applied to local collections

More information

The Cantor Handbook. Alexander Rieder

The Cantor Handbook. Alexander Rieder Alexander Rieder 2 Contents 1 Introduction 5 2 Using Cantor 6 2.1 Cantor features....................................... 6 2.2 The Cantor backends.................................... 7 2.3 The Cantor Workspace...................................

More information

Guide to using L A TEX

Guide to using L A TEX Guide to using L A TEX Andrew Stevens, UC Berkeley 1 What is L A TEX, and why use it? L A TEX (pronounced LAH-tekh or LAY-tekh) is a language and document preparation system for typesetting. L A TEX is

More information

MAT 110 WORKSHOP. Updated Fall 2018

MAT 110 WORKSHOP. Updated Fall 2018 MAT 110 WORKSHOP Updated Fall 2018 UNIT 3: STATISTICS Introduction Choosing a Sample Simple Random Sample: a set of individuals from the population chosen in a way that every individual has an equal chance

More information

Lecture 31 Sections 9.4. Tue, Mar 17, 2009

Lecture 31 Sections 9.4. Tue, Mar 17, 2009 s for s for Lecture 31 Sections 9.4 Hampden-Sydney College Tue, Mar 17, 2009 Outline s for 1 2 3 4 5 6 7 s for Exercise 9.17, page 582. It is believed that 20% of all university faculty would be willing

More information

Document Preparation Using L A TEX

Document Preparation Using L A TEX Document Preparation Using L A TEX 1 1 Department of Mathematics Michigan State University East Lansing, MI USA weil@math.msu.edu October 28, 2008 Before L A TEX Was TEX TEX is a professional typesetting

More information

Excel 2016 Basics for Windows

Excel 2016 Basics for Windows Excel 2016 Basics for Windows Excel 2016 Basics for Windows Training Objective To learn the tools and features to get started using Excel 2016 more efficiently and effectively. What you can expect to learn

More information

The talk Document Class

The talk Document Class The talk Document Class Martin Wiebusch August 4, 2007 Abstract The talk document class allows you to create slides for screen presentations or printing on transparencies It also allows you to print personal

More information

Measures of Dispersion

Measures of Dispersion Lesson 7.6 Objectives Find the variance of a set of data. Calculate standard deviation for a set of data. Read data from a normal curve. Estimate the area under a curve. Variance Measures of Dispersion

More information

Using LaTex and BibTex to Write Academic Documents. Ligang He Department of Computer Science University of Warwick

Using LaTex and BibTex to Write Academic Documents. Ligang He Department of Computer Science University of Warwick Using LaTex and BibTex to Write Academic Documents Ligang He Department of Computer Science University of Warwick Introduction In LaTex, a markup language is used to describe document structure and format,

More information

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form.

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form. CHAPTER 2 Frequency Distributions and Graphs Objectives Organize data using frequency distributions. Represent data in frequency distributions graphically using histograms, frequency polygons, and ogives.

More information

L A TEX. COMPSCI 111 / 111G Mastering Cyberspace: An introduction to practical computing. LaTeX. Development. Why?

L A TEX. COMPSCI 111 / 111G Mastering Cyberspace: An introduction to practical computing. LaTeX. Development. Why? LaTeX A document preparation system Used to typeset a document COMPSCI 111 / 111G Mastering Cyberspace: An introduction to practical computing LaTeX Compiler L A TEX http://en.wikipedia.org/wiki/latex

More information

Business Data Analysis MA0123. Dr Gavin Shaddick Department of Mathematical Sciences 4W 5.7

Business Data Analysis MA0123. Dr Gavin Shaddick Department of Mathematical Sciences 4W 5.7 Business Data Analysis MA0123 Dr Gavin Shaddick Department of Mathematical Sciences g.shaddick@bath.ac.uk 4W 5.7 Lectures and computer labs Two lectures a week (Monday and Friday). One computing lab (time

More information

TUTORIAL 10: ARRAYS AND MATRICES. 1. Welcome. Hello. My name is Dr. Christopher Raridan (Dr. R). I want to welcome you to the L A TEX Tutorial Series.

TUTORIAL 10: ARRAYS AND MATRICES. 1. Welcome. Hello. My name is Dr. Christopher Raridan (Dr. R). I want to welcome you to the L A TEX Tutorial Series. TUTORIAL 10: ARRAYS AND MATRICES CHRISTOPHER RARIDAN Abstract. Upon completion of this tutorial, the author should be able to construct arrays and matrices. 1. Welcome Hello. My name is Dr. Christopher

More information

Package bayesdp. July 10, 2018

Package bayesdp. July 10, 2018 Type Package Package bayesdp July 10, 2018 Title Tools for the Bayesian Discount Prior Function Version 1.3.2 Date 2018-07-10 Depends R (>= 3.2.3), ggplot2, survival, methods Functions for data augmentation

More information

Data Science Center Eindhoven. The Mathematics Behind Big Data. Alessandro Di Bucchianico

Data Science Center Eindhoven. The Mathematics Behind Big Data. Alessandro Di Bucchianico Data Science Center Eindhoven The Mathematics Behind Big Data Alessandro Di Bucchianico 4TU AMI SRO Big Data Meeting Big Data: Mathematics in Action! November 24, 2017 Outline Big Data Some real-life examples

More information

What is T E X? T E X and L A T E X Document preparation tools. Setting and casting type. What Knuth was setting. Setting and casting type

What is T E X? T E X and L A T E X Document preparation tools. Setting and casting type. What Knuth was setting. Setting and casting type T E X and L A T E X Document preparation tools This lecture will introduce software necessary to produce documents using L A T E X in the School of Computer Science. It will also show the basics of producing

More information

Using L A TEX. A numbered list is just that a collection of items sorted and labeled by number.

Using L A TEX. A numbered list is just that a collection of items sorted and labeled by number. Using L A TEX About these notes These notes give some starting tips on using L A TEX to typeset mathematical documents. To learn the system at all fully you ll need a proper L A TEX manual, but you can

More information

Typesetting in wxmaxima

Typesetting in wxmaxima Typesetting in wxmaxima 1 Introduction To make your paper look as good as possible you should be familiar with the following: how to format text and create sections (and subsections, etc.), how to typeset

More information

An interesting related problem is Buffon s Needle which was first proposed in the mid-1700 s.

An interesting related problem is Buffon s Needle which was first proposed in the mid-1700 s. Using Monte Carlo to Estimate π using Buffon s Needle Problem An interesting related problem is Buffon s Needle which was first proposed in the mid-1700 s. Here s the problem (in a simplified form). Suppose

More information

Implementing an Algorithm for Boomerang Fraction Sequences in Python

Implementing an Algorithm for Boomerang Fraction Sequences in Python Introduction Implementing an Algorithm for Boomerang Fraction Sequences in Python We ve all encountered maze problems, where the challenge is to find a path through a labyrinth from a starting point to

More information

Excel 2016 Basics for Mac

Excel 2016 Basics for Mac Excel 2016 Basics for Mac Excel 2016 Basics for Mac Training Objective To learn the tools and features to get started using Excel 2016 more efficiently and effectively. What you can expect to learn from

More information

Table of Contents. 1. Creating a Microsoft Excel Workbook...1 EVALUATION COPY

Table of Contents. 1. Creating a Microsoft Excel Workbook...1 EVALUATION COPY Table of Contents Table of Contents 1. Creating a Microsoft Excel Workbook...1 Starting Microsoft Excel...1 Creating a Workbook...2 Saving a Workbook...3 The Status Bar...5 Adding and Deleting Worksheets...6

More information

GO! with Microsoft Excel 2016 Comprehensive

GO! with Microsoft Excel 2016 Comprehensive GO! with Microsoft Excel 2016 Comprehensive First Edition Chapter 2 Using Functions, Creating Tables, and Managing Large Workbooks Use SUM and Statistical Functions The SUM function is a predefined formula

More information

Unit 5: Estimating with Confidence

Unit 5: Estimating with Confidence Unit 5: Estimating with Confidence Section 8.3 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Unit 5 Estimating with Confidence 8.1 8.2 8.3 Confidence Intervals: The Basics Estimating

More information

L A TEX examples. a b c a b c b b a c b c c b a. This table was typeset with the following commands.

L A TEX examples. a b c a b c b b a c b c c b a. This table was typeset with the following commands. L A TEX examples This document is designed to help you see how to produce some mathematical typesetting. The best way to learn how to use L A TEX is to experiment with particular commands. After you have

More information

Fall 2017: Numerical Methods I Assignment 1 (due Sep. 21, 2017)

Fall 2017: Numerical Methods I Assignment 1 (due Sep. 21, 2017) MATH-GA 2010.001/CSCI-GA 2420.001, Georg Stadler (NYU Courant) Fall 2017: Numerical Methods I Assignment 1 (due Sep. 21, 2017) Objectives. This class is for you and you should try to get the most out of

More information

L A TEX for psychological researchers

L A TEX for psychological researchers Kahoot! L A TEX for psychological researchers Lecture 1: Introducton Sacha Epskamp University of Amsterdam Department of Psychological Methods 27-01-2015 Contact Details Workshop website: http://sachaepskamp.com/latex-workshop

More information

Lecture 2 Map design. Dr. Zhang Spring, 2017

Lecture 2 Map design. Dr. Zhang Spring, 2017 Lecture 2 Map design Dr. Zhang Spring, 2017 Model of the course Using and making maps Navigating GIS maps Map design Working with spatial data Geoprocessing Spatial data infrastructure Digitizing File

More information

WELCOME! Lecture 3 Thommy Perlinger

WELCOME! Lecture 3 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 3 Thommy Perlinger Program Lecture 3 Cleaning and transforming data Graphical examination of the data Missing Values Graphical examination of the data It is important

More information

An Introduction to. Rado Ivanov CIS400 Senior Design Tutorial September 18, 2014

An Introduction to. Rado Ivanov CIS400 Senior Design Tutorial September 18, 2014 An Introduction to Rado Ivanov CIS400 Senior Design Tutorial September 18, 2014 Today's Outline Introducing TeX/LaTeX Benefits and potential difficulties Installation and use on Unix/Mac/Windows Compiling

More information

Lab1: Communicating science

Lab1: Communicating science Lab1: Communicating science We would all like to be good citizens of the scientific community. An important part of being a good citizen is being able to communicate results, papers, and ideas. Since many

More information

Lecture 1. MA2730: Analysis I. Lecture slides for MA2730 Analysis I. Study and Assessment Components. MA L A TEX: topics for Lecture 1

Lecture 1. MA2730: Analysis I. Lecture slides for MA2730 Analysis I. Study and Assessment Components. MA L A TEX: topics for Lecture 1 Contents of the teaching and assessment blocks MA2730: Analysis I Lecture slides for MA2730 Analysis I Simon people.brunel.ac.uk/~icsrsss simon.shaw@brunel.ac.uk College of Engineering, Design and Physical

More information

Atelier B. Model Editor. User Manual. version 3.1

Atelier B. Model Editor. User Manual. version 3.1 Atelier B Model Editor User Manual version 3.1 ATELIER B Model Editor User Manual version 3.1 Document made by CLEARSY. This document is the property of CLEARSY and shall not be copied, duplicated or distributed,

More information

Latex Tutorial. CIS400 Senior Design 9/5/2013

Latex Tutorial. CIS400 Senior Design 9/5/2013 1 Latex Tutorial CIS400 Senior Design 9/5/2013 2 Outline Introducing TeX/LaTeX Benefits and potential difficulties Installation and use on Unix/Mac/Windows Compiling PDF documents from LaTeX Basic document

More information

INSTALLING AND USING L A TEX

INSTALLING AND USING L A TEX INSTALLING AND USING L A TEX DAVID MEREDITH Contents 1. Installing and Running L A TEX with Microsoft Windows 2 1.1. Installing the compiler MikTeX 2 1.2. Installing the editor TeXtudio 2 1.3. Running

More information

Minitab Notes for Activity 1

Minitab Notes for Activity 1 Minitab Notes for Activity 1 Creating the Worksheet 1. Label the columns as team, heat, and time. 2. Have Minitab automatically enter the team data for you. a. Choose Calc / Make Patterned Data / Simple

More information

Discussion on Bayesian Model Selection and Parameter Estimation in Extragalactic Astronomy by Martin Weinberg

Discussion on Bayesian Model Selection and Parameter Estimation in Extragalactic Astronomy by Martin Weinberg Discussion on Bayesian Model Selection and Parameter Estimation in Extragalactic Astronomy by Martin Weinberg Phil Gregory Physics and Astronomy Univ. of British Columbia Introduction Martin Weinberg reported

More information

Graphing Polynomial Functions: The Leading Coefficient Test and End Behavior: For an n th n. degree polynomial function a 0, 0, then

Graphing Polynomial Functions: The Leading Coefficient Test and End Behavior: For an n th n. degree polynomial function a 0, 0, then Graphing Polynomial Functions: The Leading Coefficient Test and End Behavior: For an n th n n 1 degree polynomial function a 0, n If n is even and an 0, then f x a x a x a x a with n n 1 1 0 x If n is

More information

Microsoft Excel Match Index Offset

Microsoft Excel Match Index Offset MATCH, INDEX and OFFSET The MATCH function returns the relative position of an item in an range that matches a specified value. If there is no match then the #N/A error value is returned. MATCH(lookup_value,lookup_array,match_type)

More information

A Beginner s guide to L A TEX for CSCA67/MATA67. Kohilan Mohanarajan

A Beginner s guide to L A TEX for CSCA67/MATA67. Kohilan Mohanarajan A Beginner s guide to L A TEX for CSCA67/MATA67 Kohilan Mohanarajan August 31, 2017 Contents 1 Foreword 2 2 Getting Started 3 3 Setting up your L A TEXDocument 4 4 Writing your L A TEXDocument 6 4.1 Environments...............................

More information

Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data

Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data Ryan Atallah, John Ryan, David Aeschlimann December 14, 2013 Abstract In this project, we study the problem of classifying

More information

Section 26: Associativity and Order of Operations

Section 26: Associativity and Order of Operations Section 26: Associativity and Order of Operations One of the most important properties of the matrix operations is called associativity To understand what this property is we need to discuss something

More information

Introduction to LATEX

Introduction to LATEX Introduction to L A TEX Department of Statistics, UC Berkeley January 28, 2011 1 Why L A TEX? 2 Basics 3 Typing Math 4 BibTeX 5 More Why L A TEX? Professional typesetting tool offering great control Why

More information

DOING MORE WITH EXCEL: MICROSOFT OFFICE 2013

DOING MORE WITH EXCEL: MICROSOFT OFFICE 2013 DOING MORE WITH EXCEL: MICROSOFT OFFICE 2013 GETTING STARTED PAGE 02 Prerequisites What You Will Learn MORE TASKS IN MICROSOFT EXCEL PAGE 03 Cutting, Copying, and Pasting Data Basic Formulas Filling Data

More information

Algebra II: Strand 3. Quadratic Functions; Topic 2. Digging Deeper; Task 3.2.1

Algebra II: Strand 3. Quadratic Functions; Topic 2. Digging Deeper; Task 3.2.1 1 TASK 3..1: PUTTING IT TOGETHER Solutions 1. Each of the following quadratic functions is given in standard form ( y = ax + bx + c ). For each function: Transform the function to the form y = a(x h) +

More information

Introduction to L A T E X

Introduction to L A T E X to L A T E X Ricky Patterson Big Library 21 Sep 2016 Ricky Patterson Intro to LAT E X 21 Sep 2016 1 / 18 Outline A Basic L A T E X Document \documentclass Packages Caveats Formatting Some L A T E X Examples

More information

N. Brownlee Independent Submissions Editor Expires: April 21, 2013 October 18, 2012

N. Brownlee Independent Submissions Editor Expires: April 21, 2013 October 18, 2012 INTERNET-DRAFT H. Flanagan Intended Status: Informational RFC Series Editor N. Brownlee Independent Submissions Editor Expires: April 21, 2013 October 18, 2012 RFC Series Format Development draft-rfc-format-flanagan-01

More information

COMPUTER SCIENCE LARGE PRACTICAL.

COMPUTER SCIENCE LARGE PRACTICAL. COMPUTER SCIENCE LARGE PRACTICAL Page 45 of 100 SURVEY RESULTS Approx. 1/5 of class responded; statistically significant? The majority of you have substantial experience in Java, and all have at least

More information

Overview. Dynamic documents with R and L A TEX (while citing properly) Outline. Software for your Statistical Analysis (1) knitr.

Overview. Dynamic documents with R and L A TEX (while citing properly) Outline. Software for your Statistical Analysis (1) knitr. Overview Dynamic documents with R and L A TEX (while citing properly) Michael Höhle 1 1 Department of Mathematics Stockholm University, Sweden Kandidatarbete i matematisk statistik VT-14 Seminarium, 25

More information

MAG Demographic Map Viewer Training

MAG Demographic Map Viewer Training Exercise 1 In this exercise you will create a map showing the percentage of Hispanic population of each block group, showing eight data breaks using equal intervals, an orange and purple color scheme,

More information

A noninformative Bayesian approach to small area estimation

A noninformative Bayesian approach to small area estimation A noninformative Bayesian approach to small area estimation Glen Meeden School of Statistics University of Minnesota Minneapolis, MN 55455 glen@stat.umn.edu September 2001 Revised May 2002 Research supported

More information

Guidelines for ETNA manuscripts 1

Guidelines for ETNA manuscripts 1 Guidelines for ETNA manuscripts 1 1 General formatting guidelines A manuscript for ETNA must be written in English. It may be in color provided it is equally readable when displayed in black and white.

More information

EXCEL BASICS: MICROSOFT OFFICE 2007

EXCEL BASICS: MICROSOFT OFFICE 2007 EXCEL BASICS: MICROSOFT OFFICE 2007 GETTING STARTED PAGE 02 Prerequisites What You Will Learn USING MICROSOFT EXCEL PAGE 03 Opening Microsoft Excel Microsoft Excel Features Keyboard Review Pointer Shapes

More information

Chapters 5-6: Statistical Inference Methods

Chapters 5-6: Statistical Inference Methods Chapters 5-6: Statistical Inference Methods Chapter 5: Estimation (of population parameters) Ex. Based on GSS data, we re 95% confident that the population mean of the variable LONELY (no. of days in past

More information

Managing GOTV Data on Election Day

Managing GOTV Data on Election Day Managing GOTV Data on Election Day This tutorial has been created to explain how we manage GOTV data on Election Day. We will be using calling services on Election Day in an attempt to get people out to

More information

Optimization in One Variable Using Solver

Optimization in One Variable Using Solver Chapter 11 Optimization in One Variable Using Solver This chapter will illustrate the use of an Excel tool called Solver to solve optimization problems from calculus. To check that your installation of

More information

UNIT 15 GRAPHICAL PRESENTATION OF DATA-I

UNIT 15 GRAPHICAL PRESENTATION OF DATA-I UNIT 15 GRAPHICAL PRESENTATION OF DATA-I Graphical Presentation of Data-I Structure 15.1 Introduction Objectives 15.2 Graphical Presentation 15.3 Types of Graphs Histogram Frequency Polygon Frequency Curve

More information

Carnegie LearningÒ Middle School Math Solution Correlations Course 3 NCSCoS: Grade 8

Carnegie LearningÒ Middle School Math Solution Correlations Course 3 NCSCoS: Grade 8 MATHEMATICAL PRACTICES - 1 - Make sense of problems and persevere in solving them. Explain the meaning of a problem and look for entry points to its solution. Analyze givens, constraints, relationships,

More information

Math 1311 Section 3.3 Modeling Data with Linear Functions

Math 1311 Section 3.3 Modeling Data with Linear Functions Math 1311 Section 3.3 Modeling Data with Linear Functions Most of the real world does not come with math equations attached, like price tags. You observe a particular phenomenon and obtain data points.

More information

Introduction to Geospatial Analysis

Introduction to Geospatial Analysis Introduction to Geospatial Analysis Introduction to Geospatial Analysis 1 Descriptive Statistics Descriptive statistics. 2 What and Why? Descriptive Statistics Quantitative description of data Why? Allow

More information

Workbook Also called a spreadsheet, the Workbook is a unique file created by Excel. Title bar

Workbook Also called a spreadsheet, the Workbook is a unique file created by Excel. Title bar Microsoft Excel 2007 is a spreadsheet application in the Microsoft Office Suite. A spreadsheet is an accounting program for the computer. Spreadsheets are primarily used to work with numbers and text.

More information

Introduction to Programming in C Department of Computer Science and Engineering. Lecture No. #03 The Programming Cycle

Introduction to Programming in C Department of Computer Science and Engineering. Lecture No. #03 The Programming Cycle Introduction to Programming in C Department of Computer Science and Engineering Lecture No. #03 The Programming Cycle (Refer Slide Time: 00:22) Once we are understood what algorithms are, we will start

More information

RANDOM TESTS COMBINING MATHEMATICA PACKAGE AND LATEX COMPILER

RANDOM TESTS COMBINING MATHEMATICA PACKAGE AND LATEX COMPILER RANDOM TESTS COMBINING MATHEMATICA PACKAGE AND LATEX COMPILER Emilo Gómez-Déniz 1,2 Nancy Dávila-Cárdenes 1,2 and Mª Dolores García-Artiles 1 1 Department of Quantitative Methods in Economics and Management,

More information

Landscape Ecology. Lab 2: Indices of Landscape Pattern

Landscape Ecology. Lab 2: Indices of Landscape Pattern Introduction In this lab exercise we explore some metrics commonly used to summarize landscape pattern. You will begin with a presettlement landscape entirely covered in forest. You will then develop this

More information

More Summer Program t-shirts

More Summer Program t-shirts ICPSR Blalock Lectures, 2003 Bootstrap Resampling Robert Stine Lecture 2 Exploring the Bootstrap Questions from Lecture 1 Review of ideas, notes from Lecture 1 - sample-to-sample variation - resampling

More information

Introduction to Social Explorer

Introduction to Social Explorer Introduction to Social Explorer Janine Billadello, Geospatial Data Lab, Baruch College October 1, 2015 Abstract This tutorial will introduce you to Social Explorer, an online mapping interface that allows

More information

Catalan Numbers. Table 1: Balanced Parentheses

Catalan Numbers. Table 1: Balanced Parentheses Catalan Numbers Tom Davis tomrdavis@earthlink.net http://www.geometer.org/mathcircles November, 00 We begin with a set of problems that will be shown to be completely equivalent. The solution to each problem

More information

Archbold Area Schools Math Curriculum Map

Archbold Area Schools Math Curriculum Map Math 8 August - May Mathematical Processes Formulate a problem or mathematical model in response to a specific need or situation, determine information required to solve the problem, choose method for

More information