An Introduction to R. Subhajit Dutta Stat-Math Unit. Indian Statistical Institute, Kolkata October 17, 2012

Size: px
Start display at page:

Download "An Introduction to R. Subhajit Dutta Stat-Math Unit. Indian Statistical Institute, Kolkata October 17, 2012"

Transcription

1 An Introduction to R Subhajit Dutta Stat-Math Unit Indian Statistical Institute, Kolkata October 17, 2012

2 Why R? It is FREE!! Basic as well as specialized data analysis technique at your fingertips. Highly competitive with existing expensive statistical packages (like Matlab, SAS, SPSS). Very small installer file ( 30 MB). Easy to write codes. Draw on the talents of data scientists worldwide.

3 Background S was developed by John Chambers and others at Bell Labs, 1976 as an internal statistical analysis environment. In 1993, Bell Labs gave Stat Sci (now Insightful Corp.) an exclusive license to develop, and sell the S language. R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, in 1991.

4 Background S was developed by John Chambers and others at Bell Labs, 1976 as an internal statistical analysis environment. In 1993, Bell Labs gave Stat Sci (now Insightful Corp.) an exclusive license to develop, and sell the S language. R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, in In 1997, the R-Core Group was formed. The first R version is released in 2000.

5 Background S was developed by John Chambers and others at Bell Labs, 1976 as an internal statistical analysis environment. In 1993, Bell Labs gave Stat Sci (now Insightful Corp.) an exclusive license to develop, and sell the S language. R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, in In 1997, the R-Core Group was formed. The first R version is released in Chambers is now a member of the R-Core Group.

6 Background S was developed by John Chambers and others at Bell Labs, 1976 as an internal statistical analysis environment. In 1993, Bell Labs gave Stat Sci (now Insightful Corp.) an exclusive license to develop, and sell the S language. R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, in In 1997, the R-Core Group was formed. The first R version is released in Chambers is now a member of the R-Core Group. R is named partly after the first names of the first two R authors, and partly as a play on the name of S.

7 Developer of S : John Chambers

8 The Developers of R : Ross Ihaka and Robert Gentleman

9 Design of R The R system is divided into 2 conceptual parts :

10 Design of R The R system is divided into 2 conceptual parts : The base R system.

11 Design of R The R system is divided into 2 conceptual parts : The base R system. User created packages : Allows specialized statistical techniques.

12 Design of R The R system is divided into 2 conceptual parts : The base R system. User created packages : Allows specialized statistical techniques. R functionality is divided into a number of packages, more than 4000 (as of August 2012) are available.

13 Design of R The R system is divided into 2 conceptual parts : The base R system. User created packages : Allows specialized statistical techniques. R functionality is divided into a number of packages, more than 4000 (as of August 2012) are available. Download from Comprehensive R Archive Network (CRAN), which contains mirrors throughout the world.

14 Design of R The R system is divided into 2 conceptual parts : The base R system. User created packages : Allows specialized statistical techniques. R functionality is divided into a number of packages, more than 4000 (as of August 2012) are available. Download from Comprehensive R Archive Network (CRAN), which contains mirrors throughout the world. India hosts a mirror at IIT, Madras.

15 Design of R The R system is divided into 2 conceptual parts : The base R system. User created packages : Allows specialized statistical techniques. R functionality is divided into a number of packages, more than 4000 (as of August 2012) are available. Download from Comprehensive R Archive Network (CRAN), which contains mirrors throughout the world. India hosts a mirror at IIT, Madras. Dr. Deepayan Sarkar, ISID (tomorrow s speaker) is among the core developers for R.

16 Areas of application Applications are listed at :

17 Areas of application Applications are listed at : Statistical Pattern Recognition, Spatial and Bayesian Statistics.

18 Areas of application Applications are listed at : Statistical Pattern Recognition, Spatial and Bayesian Statistics. Finance, Genetics, Machine Learning, Medical Imaging and Social Sciences.

19 Areas of application Applications are listed at : Statistical Pattern Recognition, Spatial and Bayesian Statistics. Finance, Genetics, Machine Learning, Medical Imaging and Social Sciences. The Bioconductor (hosted by Fred Hutchinson Cancer Research Center, USA) project provides 460 packages. Microarrays, High Throughput Assays, Sequence Data and Annotation.

20 Areas of application Applications are listed at : Statistical Pattern Recognition, Spatial and Bayesian Statistics. Finance, Genetics, Machine Learning, Medical Imaging and Social Sciences. The Bioconductor (hosted by Fred Hutchinson Cancer Research Center, USA) project provides 460 packages. Microarrays, High Throughput Assays, Sequence Data and Annotation. Google uses R to make online advertising more effective. Statistics : The Secret Weapon of Successful Web Giants (JSM, 2011).

21 Powerful and Excellent Visualizations

22 Powerful and Excellent Visualizations

23 Powerful and Excellent Visualizations

24 Advantages of using R A lot of packages are uploaded by the authors/their students.

25 Advantages of using R A lot of packages are uploaded by the authors/their students. It contains advanced statistical routines not yet available in other softwares.

26 Advantages of using R A lot of packages are uploaded by the authors/their students. It contains advanced statistical routines not yet available in other softwares. Workspace can be saved, and shifted to another some other computer.

27 Advantages of using R A lot of packages are uploaded by the authors/their students. It contains advanced statistical routines not yet available in other softwares. Workspace can be saved, and shifted to another some other computer. Lines of script are significantly less, and hence easier for debugging.

28 Advantages of using R A lot of packages are uploaded by the authors/their students. It contains advanced statistical routines not yet available in other softwares. Workspace can be saved, and shifted to another some other computer. Lines of script are significantly less, and hence easier for debugging. Codes in C can be interfaced with R, and vice versa.

29 Advantages of using R A lot of packages are uploaded by the authors/their students. It contains advanced statistical routines not yet available in other softwares. Workspace can be saved, and shifted to another some other computer. Lines of script are significantly less, and hence easier for debugging. Codes in C can be interfaced with R, and vice versa. R respects C, it retains the speed of the C code when run in an R environment.

30 Advantages of using R A lot of packages are uploaded by the authors/their students. It contains advanced statistical routines not yet available in other softwares. Workspace can be saved, and shifted to another some other computer. Lines of script are significantly less, and hence easier for debugging. Codes in C can be interfaced with R, and vice versa. R respects C, it retains the speed of the C code when run in an R environment. R code/data written by you can be shared with the rest of the statistics community as a package.

31 Drawbacks, and comments from fellow users Cannot handle huge data sets.

32 Drawbacks, and comments from fellow users Cannot handle huge data sets. Usual R can easily handle 400 MB data.

33 Drawbacks, and comments from fellow users Cannot handle huge data sets. Usual R can easily handle 400 MB data. With the library bigmemory, there is no limit on data size. But, using it is not as easy for beginners.

34 Drawbacks, and comments from fellow users Cannot handle huge data sets. Usual R can easily handle 400 MB data. With the library bigmemory, there is no limit on data size. But, using it is not as easy for beginners. Slow

35 Drawbacks, and comments from fellow users Cannot handle huge data sets. Usual R can easily handle 400 MB data. With the library bigmemory, there is no limit on data size. But, using it is not as easy for beginners. Slow Specific commands can improve speed a lot.

36 Drawbacks, and comments from fellow users Cannot handle huge data sets. Usual R can easily handle 400 MB data. With the library bigmemory, there is no limit on data size. But, using it is not as easy for beginners. Slow Specific commands can improve speed a lot. Comments

37 Drawbacks, and comments from fellow users Cannot handle huge data sets. Usual R can easily handle 400 MB data. With the library bigmemory, there is no limit on data size. But, using it is not as easy for beginners. Slow Specific commands can improve speed a lot. Comments Support : send an to "rhelp@r-project.org" and you will get very good/fast response.

38 Drawbacks, and comments from fellow users Cannot handle huge data sets. Usual R can easily handle 400 MB data. With the library bigmemory, there is no limit on data size. But, using it is not as easy for beginners. Slow Specific commands can improve speed a lot. Comments Support : send an to "rhelp@r-project.org" and you will get very good/fast response. Anything related to Math/Stat/Finance, you will get an existing package/function.

39 Drawbacks, and comments from fellow users Cannot handle huge data sets. Usual R can easily handle 400 MB data. With the library bigmemory, there is no limit on data size. But, using it is not as easy for beginners. Slow Specific commands can improve speed a lot. Comments Support : send an to "rhelp@r-project.org" and you will get very good/fast response. Anything related to Math/Stat/Finance, you will get an existing package/function. You have the scope to explore.

40 R installation

41 Data Types, Arrays, Loops (avoid!!).

42 Matrix Manipulations.

43 Sorting and Searching.

44 Graph Plotting (2D and 3D).

45 Probability Distributions.

46 File Reading and Writing.

47 Integrating C with R / R with C.

48 The best thing about R is that it was developed by statisticians. The worst thing about R is that...

49 The best thing about R is that it was developed by statisticians. The worst thing about R is that... it was developed by statisticians. Bo Cowgill, Google.

50 The best thing about R is that it was developed by statisticians. The worst thing about R is that... it was developed by statisticians. Bo Cowgill, Google. R you ready for R? Thanks to Palash Da, Buddha, Minerva and of course, Google.

On R for Statistics. Subhajit Dutta Stat-Math Unit. Indian Statistical Institute, Kolkata September 16, 2011

On R for Statistics. Subhajit Dutta Stat-Math Unit. Indian Statistical Institute, Kolkata September 16, 2011 On R for Statistics Subhajit Dutta Stat-Math Unit Indian Statistical Institute, Kolkata September 16, 2011 Why R? It is FREE!! Basic as well as specialized data analysis technique at your fingertips. Highly

More information

Overview of R. Biostatistics

Overview of R. Biostatistics Overview of R Biostatistics 140.776 Stroustrup s Law There are only two kinds of languages: the ones people complain about and the ones nobody uses. R is a dialect of S What is R? What is S? S is a language

More information

The History and Use of R. Joseph Kambourakis

The History and Use of R. Joseph Kambourakis The History and Use of R Joseph Kambourakis Ground Rules Interrupt me These are all my opinions and not of EMC or Big Data Analytics, Discovery & Visualization Meetup Slides will be available Joseph

More information

8.1 Come analizzare i dati: R

8.1 Come analizzare i dati: R 8.1 Come analizzare i dati: R Insegnamento di Informatica Elisabetta Ronchieri Corso di Laurea di Economia, Universitá di Ferrara I semestre, anno 2014-2015 Elisabetta Ronchieri (Universitá) Insegnamento

More information

RNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University

RNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University RNA-Seq Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University joshua.ainsley@tufts.edu Day four Quantifying expression Intro to R Differential expression

More information

Introduction to R and Bioconductor

Introduction to R and Bioconductor Introduction to R and Bioconductor RNA-Seq / ChIP-Seq Data Analysis Workshop 10 September 2012 CSC, Helsinki Nicolas Delhomme A bit of interaction? What is your R knowledge, on a 0 (beginner) to 2 (expert)

More information

SQL Server 2017: Data Science with Python or R?

SQL Server 2017: Data Science with Python or R? SQL Server 2017: Data Science with Python or R? Dejan Sarka Sponsor Introduction Dejan Sarka (dsarka@solidq.com, dsarka@siol.net, @DejanSarka) 30 years of experience SQL Server MVP, MCT, 16 books 20+ courses,

More information

Introduction to R: Part I

Introduction to R: Part I Introduction to R: Part I Jeffrey C. Miecznikowski March 26, 2015 R impact R is the 13th most popular language by IEEE Spectrum (2014) Google uses R for ROI calculations Ford uses R to improve vehicle

More information

A Survey of Statistical Modeling Tools

A Survey of Statistical Modeling Tools 1 of 6 A Survey of Statistical Modeling Tools Madhuri Kulkarni (A survey paper written under the guidance of Prof. Raj Jain) Abstract: A plethora of statistical modeling tools are available in the market

More information

Introduction to R programming a SciLife Lab course

Introduction to R programming a SciLife Lab course Introduction to R programming a SciLife Lab course 31 August 2016 What R is a programming language, a programming platform (=environment + interpreter), a software project driven by the core team and the

More information

Statistics Statistical Computing Software

Statistics Statistical Computing Software Statistics 135 - Statistical Computing Software Mark E. Irwin Department of Statistics Harvard University Autumn Term Monday, September 19, 2005 - January 2006 Copyright c 2005 by Mark E. Irwin Personnel

More information

Data Wrangling in the Tidyverse

Data Wrangling in the Tidyverse Data Wrangling in the Tidyverse 21 st Century R DS Portugal Meetup, at Farfetch, Porto, Portugal April 19, 2017 Jim Porzak Data Science for Customer Insights 4/27/2017 1 Outline 1. A very quick introduction

More information

SQL Server 2016 R Integration for database administrators

SQL Server 2016 R Integration for database administrators SQL Server 2016 R Integration for database administrators What can DBA gain by using R Integration for SQL Server 2016? Tomaž Kaštrun 20.Jänner, 2017 Our Sponsors About BI Developer and data analyst (SQL

More information

An Introduction to R- Programming

An Introduction to R- Programming An Introduction to R- Programming Hadeel Alkofide, Msc, PhD NOT a biostatistician or R expert just simply an R user Some slides were adapted from lectures by Angie Mae Rodday MSc, PhD at Tufts University

More information

Introduction to R programming a SciLife Lab course

Introduction to R programming a SciLife Lab course Introduction to R programming a SciLife Lab course 20 October 2017 What R really is? a programming language, a programming platform (= environment + interpreter), a software project driven by the core

More information

Data-informed collection decisions using R or, learning R using collection data

Data-informed collection decisions using R or, learning R using collection data Data-informed collection decisions using R or, learning R using collection data Heidi Tebbe Collections & Research Librarian for Engineering and Data Science NCSU Libraries Collections & Research Librarian

More information

STAT 540 Computing in Statistics

STAT 540 Computing in Statistics STAT 540 Computing in Statistics Introduces programming skills in two important statistical computer languages/packages. 30-40% R and 60-70% SAS Examples of Programming Skills: 1. Importing Data from External

More information

CREATING POWERFUL AND EFFECTIVE GRAPHICAL DISPLAYS: AN INTRODUCTION TO LATTICE GRAPHICS IN R

CREATING POWERFUL AND EFFECTIVE GRAPHICAL DISPLAYS: AN INTRODUCTION TO LATTICE GRAPHICS IN R APSA Short Course, SC 13 Chicago, Illinois August 29, 2007 Michigan State University CREATING POWERFUL AND EFFECTIVE GRAPHICAL DISPLAYS: AN INTRODUCTION TO LATTICE GRAPHICS IN R I. Some Basic R Concepts

More information

Why use R? Getting started. Why not use R? Introduction to R: Log into tak. Start R R or. It s hard to use at first

Why use R? Getting started. Why not use R? Introduction to R: Log into tak. Start R R or. It s hard to use at first Why use R? Introduction to R: Using R for statistics ti ti and data analysis BaRC Hot Topics October 2011 George Bell, Ph.D. http://iona.wi.mit.edu/bio/education/r2011/ To perform inferential statistics

More information

Using R for statistics and data analysis

Using R for statistics and data analysis Introduction ti to R: Using R for statistics and data analysis BaRC Hot Topics October 2011 George Bell, Ph.D. http://iona.wi.mit.edu/bio/education/r2011/ Why use R? To perform inferential statistics (e.g.,

More information

R in the City. Richard Saldanha Oxquant Consulting LondonR Group Meeting 3rd November 2009

R in the City. Richard Saldanha Oxquant Consulting LondonR Group Meeting 3rd November 2009 R in the City Richard Saldanha Oxquant Consulting richard@oxquant.com LondonR Group Meeting 3rd November 2009 S Language Development 1965 Bell Labs pre-s work on a statistical computing language 1977 Bell

More information

An Introduction To R For Spatial Analysis And Mapping

An Introduction To R For Spatial Analysis And Mapping We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with an introduction to r

More information

R-volution? The R statistical package and what it can do for you. Mike Babyak. March 12, Duke University Medical Center

R-volution? The R statistical package and what it can do for you. Mike Babyak. March 12, Duke University Medical Center The R statistical package and what it can do for you Duke University Medical Center March 12, 2010 http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html// http://bits.blogs.nytimes.com/2009/01/08/r-you-ready-for-r

More information

Introduction to R programming a SciLife Lab course

Introduction to R programming a SciLife Lab course Introduction to R programming a SciLife Lab course 22 March 2017 What R really is? a programming language, a programming platform (= environment + interpreter), a software project driven by the core team

More information

Introduction to RStudio

Introduction to RStudio Introduction to RStudio Ulrich Halekoh Epidemiology and Biostatistics, SDU May 4, 2018 R R is a language that started by Ross Ihaka and Robert Gentleman in 1991 as an open source alternative to S emphasizes

More information

IST Computational Tools for Statistics I. DEÜ, Department of Statistics

IST Computational Tools for Statistics I. DEÜ, Department of Statistics IST 1051 Computational Tools for Statistics I 1 DEÜ, Department of Statistics Course Objectives Computational Tools for Statistics-I course can increase the understanding of statistics and helps to learn

More information

Fuzzy Rogers Research Computing Administrator Materials Research Laboratory (MRL) Center for Scientific Computing (CSC)

Fuzzy Rogers Research Computing Administrator Materials Research Laboratory (MRL) Center for Scientific Computing (CSC) Intro to R Fuzzy Rogers Research Computing Administrator Materials Research Laboratory (MRL) Center for Scientific Computing (CSC) fuz@mrl.ucsb.edu MRL 2066B Sharon Solis Paul Weakliem Research Computing

More information

Intro Intro.3

Intro Intro.3 Intro.1 Intro.2 Introduction to R Much of the content here is from Appendix A of my Analysis of Categorical Data with R book (www.chrisbilder.com/ categorical). All R code is available in AppendixInitialExamples.R

More information

Extending R to the Enterprise

Extending R to the Enterprise Extending R to the Enterprise With TIBCO Spotfire and TERR Lou Bajuk-Yorgan, Sr. Dir., Product Management, TIBCO (Edit via Slide Master) Name Job Title youremail@yourdomain.com Extending R to the Enterprise

More information

Introduction to 3/15/2012. Poll Are You Sticking Around for Part 2? 1. Yes 2. No. Steve Berman, FCAS, MAAA Jim Guszcza, FCAS, MAAA

Introduction to 3/15/2012. Poll Are You Sticking Around for Part 2? 1. Yes 2. No. Steve Berman, FCAS, MAAA Jim Guszcza, FCAS, MAAA Introduction to CAS RPM Seminar March 19, 2012 Steve Berman, FCAS, MAAA Jim Guszcza, FCAS, MAAA Poll Are You Sticking Around for Part 2? 1. Yes 2. No 1 1 Poll How Much Do You Know About R? 1. Isn t that

More information

Introduction to R: Using R for statistics and data analysis

Introduction to R: Using R for statistics and data analysis Why use R? Introduction to R: Using R for statistics and data analysis George W Bell, Ph.D. BaRC Hot Topics November 2014 Bioinformatics and Research Computing Whitehead Institute http://barc.wi.mit.edu/hot_topics/

More information

Fraud Detection Using Random Forest Algorithm

Fraud Detection Using Random Forest Algorithm Fraud Detection Using Random Forest Algorithm Eesha Goel Computer Science Engineering and Technology, GZSCCET, Bhatinda, India eesha1992@rediffmail.com Abhilasha Computer Science Engineering and Technology,

More information

Introduction to R. base -> R win32.exe (this will change depending on the latest version)

Introduction to R. base -> R win32.exe (this will change depending on the latest version) Dr Raffaella Calabrese, Essex Business School 1. GETTING STARTED Introduction to R R is a powerful environment for statistical computing which runs on several platforms. R is available free of charge.

More information

Part 1: Getting Started

Part 1: Getting Started Part 1: Getting Started 140.776 Statistical Computing Ingo Ruczinski Thanks to Thomas Lumley and Robert Gentleman of the R-core group (http://www.r-project.org/) for providing some tex files that appear

More information

Intro to R. Some history. Some history

Intro to R. Some history. Some history Intro to R Héctor Corrada Bravo CMSC858B Spring 2012 University of Maryland Computer Science http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html?_r=2&pagewanted=1 http://www.forbes.com/forbes/2010/0524/opinions-software-norman-nie-spss-ideas-opinions.html

More information

Introduction to R: Using R for statistics and data analysis

Introduction to R: Using R for statistics and data analysis Why use R? Introduction to R: Using R for statistics and data analysis George W Bell, Ph.D. BaRC Hot Topics November 2015 Bioinformatics and Research Computing Whitehead Institute http://barc.wi.mit.edu/hot_topics/

More information

The R statistical computing environment

The R statistical computing environment The R statistical computing environment Luke Tierney Department of Statistics & Actuarial Science University of Iowa June 17, 2011 Luke Tierney (U. of Iowa) R June 17, 2011 1 / 27 Introduction R is a language

More information

Why use R? Getting started. Why not use R? Introduction to R: It s hard to use at first. To perform inferential statistics (e.g., use a statistical

Why use R? Getting started. Why not use R? Introduction to R: It s hard to use at first. To perform inferential statistics (e.g., use a statistical Why use R? Introduction to R: Using R for statistics ti ti and data analysis BaRC Hot Topics November 2013 George W. Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ To perform inferential

More information

Solving the Unsolvable Through Scientific Computing: Explorations in the Best Uses of Popular Mathematics Software

Solving the Unsolvable Through Scientific Computing: Explorations in the Best Uses of Popular Mathematics Software Solving the Unsolvable Through Scientific Computing: Explorations in the Best Uses of Popular Mathematics Software Talitha Washington, Howard University Edray Goins, Purdue University Luis Melara, Shippensburg

More information

Introduction to R Programming

Introduction to R Programming Course Overview Over the past few years, R has been steadily gaining popularity with business analysts, statisticians and data scientists as a tool of choice for conducting statistical analysis of data

More information

Introducing Oracle R Enterprise 1.4 -

Introducing Oracle R Enterprise 1.4 - Hello, and welcome to this online, self-paced lesson entitled Introducing Oracle R Enterprise. This session is part of an eight-lesson tutorial series on Oracle R Enterprise. My name is Brian Pottle. I

More information

Roblox Roblox is the world s largest social platform for play We help power the imaginations of people around the world. R Define R at Dictionary R

Roblox Roblox is the world s largest social platform for play We help power the imaginations of people around the world. R Define R at Dictionary R Roblox Roblox is the world s largest social platform for play We help power the imaginations of people around the world. R Define R at Dictionary R definition, the th letter of the English alphabet, a

More information

Instruction: Download and Install R and RStudio

Instruction: Download and Install R and RStudio 1 Instruction: Download and Install R and RStudio We will use a free statistical package R, and a free version of RStudio. Please refer to the following two steps to download both R and RStudio on your

More information

Computational statistics Jamie Griffin. Semester B 2018 Lecture 1

Computational statistics Jamie Griffin. Semester B 2018 Lecture 1 Computational statistics Jamie Griffin Semester B 2018 Lecture 1 Course overview This course is not: Statistical computing Programming This course is: Computational statistics Statistical methods that

More information

Gene Survey: FAQ. Gene Survey: FAQ Tod Casasent DRAFT

Gene Survey: FAQ. Gene Survey: FAQ Tod Casasent DRAFT Gene Survey: FAQ Tod Casasent 2016-02-22-1245 DRAFT 1 What is this document? This document is intended for use by internal and external users of the Gene Survey package, results, and output. This document

More information

Csima Judit február 6.

Csima Judit február 6. R Története, alapjellemzők Csima Judit BME, VIK, Számítástudományi és Információelméleti Tanszék 2017. február 6. Csima Judit R Története, alapjellemzők 1 / 8 Történet Az R gyökerei az R az S nyelv egy

More information

ICPSR Training Program McMaster University Summer, The R Statistical Computing Environment: The Basics and Beyond

ICPSR Training Program McMaster University Summer, The R Statistical Computing Environment: The Basics and Beyond John Fox ICPSR Training Program McMaster University Summer, 2012 The R Statistical Computing Environment: The Basics and Beyond The R statistical programming language and computing environment has become

More information

Getting Started. Slides R-Intro: R-Analytics: R-HPC:

Getting Started. Slides R-Intro:   R-Analytics:   R-HPC: Getting Started Download and install R + Rstudio http://www.r-project.org/ https://www.rstudio.com/products/rstudio/download2/ TACC ssh username@wrangler.tacc.utexas.edu % module load Rstats %R Slides

More information

Deploying, Managing and Reusing R Models in an Enterprise Environment

Deploying, Managing and Reusing R Models in an Enterprise Environment Deploying, Managing and Reusing R Models in an Enterprise Environment Making Data Science Accessible to a Wider Audience Lou Bajuk-Yorgan, Sr. Director, Product Management Streaming and Advanced Analytics

More information

Spatial Ecology Lab 2: Data Analysis with R

Spatial Ecology Lab 2: Data Analysis with R Spatial Ecology Lab 2: Data Analysis with R Damian Maddalena Spring 2015 1 Introduction This lab will get your started with basic data analysis in R. We will load a dataset, do some very basic data manipulations,

More information

Python, SageMath/Cloud, R and Open-Source

Python, SageMath/Cloud, R and Open-Source Python, SageMath/Cloud, R and Open-Source Harald Schilly 2016-10-14 TANCS Workshop Institute of Physics University Graz The big picture The Big Picture Software up to the end of 1979: Fortran: LINPACK

More information

STAT 20060: Statistics for Engineers. Statistical Programming with R

STAT 20060: Statistics for Engineers. Statistical Programming with R STAT 20060: Statistics for Engineers Statistical Programming with R Why R? Because it s free to download for everyone! Most statistical software is very, very expensive, so this is a big advantage. Statisticians

More information

Spotfire Template Automation with Iron Python and Statistical Modeling with TERR. Tom Bernens May 17, :15PM Room

Spotfire Template Automation with Iron Python and Statistical Modeling with TERR. Tom Bernens May 17, :15PM Room Spotfire Template Automation with Iron Python and Statistical Modeling with TERR Tom Bernens May 17, 2016 3:15PM Room 106-107 Agenda Repetitive analyses Standardizing data formats Automation of gui interactions

More information

An Introduction to the R Commander

An Introduction to the R Commander An Introduction to the R Commander BIO/MAT 460, Spring 2011 Christopher J. Mecklin Department of Mathematics & Statistics Biomathematics Research Group Murray State University Murray, KY 42071 christopher.mecklin@murraystate.edu

More information

Fuzzy Rogers Research Computing Administrator Materials Research Laboratory (MRL) Center for Scientific Computing (CSC)

Fuzzy Rogers Research Computing Administrator Materials Research Laboratory (MRL) Center for Scientific Computing (CSC) Intro to R Fuzzy Rogers Research Computing Administrator Materials Research Laboratory (MRL) Center for Scientific Computing (CSC) fuz@mrl.ucsb.edu MRL 2066B Sharon Solis Paul Weakliem Research Computing

More information

History and Ecology of R

History and Ecology of R History and Ecology of R Martyn Plummer International Agency for Research on Cancer ANF R avancé et performances Aussois 6 Oct 2015 Pre-history Before there was R, there was S. The S language Developed

More information

Short Introduction to R

Short Introduction to R Short Introduction to R Paulino Pérez 1 José Crossa 2 1 ColPos-México 2 CIMMyT-México June, 2015. CIMMYT, México-SAGPDB Short Introduction to R 1/51 Contents 1 Introduction 2 Simple objects 3 User defined

More information

Getting Started with R

Getting Started with R Getting Started with R STAT 133 Gaston Sanchez Department of Statistics, UC Berkeley gastonsanchez.com github.com/gastonstat/stat133 Course web: gastonsanchez.com/stat133 Tool Some of you may have used

More information

BGGN 213 Working with R packages Barry Grant

BGGN 213 Working with R packages Barry Grant BGGN 213 Working with R packages Barry Grant http://thegrantlab.org/bggn213 Recap From Last Time: Why it is important to visualize data during exploratory data analysis. Discussed data visualization best

More information

A/D Converter. Sampling. Figure 1.1: Block Diagram of a DSP System

A/D Converter. Sampling. Figure 1.1: Block Diagram of a DSP System CHAPTER 1 INTRODUCTION Digital signal processing (DSP) technology has expanded at a rapid rate to include such diverse applications as CDs, DVDs, MP3 players, ipods, digital cameras, digital light processing

More information

We deliver Global Engineering Solutions. Efficiently. This page contains no technical data Subject to the EAR or the ITAR

We deliver Global Engineering Solutions. Efficiently. This page contains no technical data Subject to the EAR or the ITAR Numerical Computation, Statistical analysis and Visualization Using MATLAB and Tools Authors: Jamuna Konda, Jyothi Bonthu, Harpitha Joginipally Infotech Enterprises Ltd, Hyderabad, India August 8, 2013

More information

Statistics 120 Statistical Computing With R. First Prev Next Last Go Back Full Screen Close Quit

Statistics 120 Statistical Computing With R. First Prev Next Last Go Back Full Screen Close Quit Statistics 120 Statistical Computing With R First Prev Next Last Go Back Full Screen Close Quit The R System This course uses the R computing environment for practical examples. R serves both as a statistical

More information

Embedded Technosolutions

Embedded Technosolutions Hadoop Big Data An Important technology in IT Sector Hadoop - Big Data Oerie 90% of the worlds data was generated in the last few years. Due to the advent of new technologies, devices, and communication

More information

STAT 571A Advanced Statistical Regression Analysis. Introduction to R NOTES

STAT 571A Advanced Statistical Regression Analysis. Introduction to R NOTES STAT 571A Advanced Statistical Regression Analysis Introduction to R NOTES 2015 University of Arizona Statistics GIDP. All rights reserved, except where previous rights exist. No part of this material

More information

Data Manipulation with JMP

Data Manipulation with JMP Data Manipulation with JMP Introduction JMP was introduced in 1989 by its Principle developer, John Sall at SAS. It is a richly graphic environment and very useful for both preliminary data exploration

More information

YouTube for Hosting Video Tutorials: The Hows, Whys and Why Nots

YouTube for Hosting Video Tutorials: The Hows, Whys and Why Nots University of Iowa Libraries Staff Publications 3-15-2012 YouTube for Hosting Video Tutorials: The Hows, Whys and Why Nots Amy Blevins University of Iowa Chris Childs University of Iowa Ericka A. Raber

More information

Introduction to R. Biostatistics 615/815 Lecture 23

Introduction to R. Biostatistics 615/815 Lecture 23 Introduction to R Biostatistics 615/815 Lecture 23 So far We have been working with C Strongly typed language Variable and function types set explicitly Functional language Programs are a collection of

More information

DATA SHEET. VEE Pro 9.33

DATA SHEET. VEE Pro 9.33 DATA SHEET VEE Pro 9.33 Keysight s software products make you more productive, so you can focus on solving engineering problems, not on programming. Keysight VEE is a powerful intuitive graphical language

More information

Matlab Advanced Programming. Matt Wyant University of Washington

Matlab Advanced Programming. Matt Wyant University of Washington Matlab Advanced Programming Matt Wyant University of Washington Matlab as a programming Language Strengths (as compared to C/C++/Fortran) Fast to write -no type declarations needed Memory allocation/deallocation

More information

The Top 10 New Features in KNIME 2.8. Rosaria Silipo KNIME.com AG, San Francisco

The Top 10 New Features in KNIME 2.8. Rosaria Silipo KNIME.com AG, San Francisco The Top 10 New Features in KNIME 2.8 Rosaria Silipo KNIME.com AG, San Francisco KNIME 2.8 KNIME 2.8 was out end of July 2013 Many New Features Documentation available at: http://tech.knime.org/whats-new-in-knime-28

More information

Outline. S: past, present and future Some thoughts. The 80s. Interfaces - 60s & 70s. Duncan Temple Lang Department of Statistics UC Davis

Outline. S: past, present and future Some thoughts. The 80s. Interfaces - 60s & 70s. Duncan Temple Lang Department of Statistics UC Davis S: past, present and future Some thoughts Duncan Temple Lang Department of Statistics UC Davis Outline Good idea that was either taken up or missed. Interfaces Thoughts on how S evolved and what implications

More information

Protegrity Vaultless Tokenization

Protegrity Vaultless Tokenization Protegrity Vaultless Tokenization Protegrity Vaultless Tokenization employs a patent-pending approach to tokenization that improves security and efficiency by eliminating the need for a token vault. By

More information

What You Will Learn. What You Will Learn. How to Get Started with Wistia & 5 Ways It Generates More Leads. with Josh White

What You Will Learn. What You Will Learn. How to Get Started with Wistia & 5 Ways It Generates More Leads. with Josh White How to Get Started with Wistia & 5 Ways It Generates More Leads with Josh White What You Will Learn 1. Why Video Marketing 2. Importance to Businesses 3. Video Marketing Requirements 4. Video Platforms

More information

An Introduction to R. Scott A. Handley, PhD.! September 18, 2014

An Introduction to R. Scott A. Handley, PhD.! September 18, 2014 An Introduction to R Scott A. Handley, PhD! September 18, 2014 What is R? A free software environment for statistical computing and graphics. Why is R useful? Data management and manipulation Well established

More information

A EOTECH. DAQFactory. DAQLab is now part of DAQFactory! Software for data acquisition + process control + analysis

A EOTECH. DAQFactory. DAQLab is now part of DAQFactory! Software for data acquisition + process control + analysis A EOTECH Z Software for data acquisition + process control + analysis TM DAQFactory TM DAQLab is now part of DAQFactory! DAQFactory Powerful, easy to use, and innovative, the DAQFactory suite of data acquisition

More information

USER S MANUAL. of Proposal Management System Scheme for Promotion of Academic Research Collaboration (SPARC)

USER S MANUAL. of Proposal Management System Scheme for Promotion of Academic Research Collaboration (SPARC) USER S MANUAL of Proposal Management System Scheme for Promotion of Academic Research Collaboration (SPARC) SPARC Team, IIT Kharagpur October 2018 TABLE OF CONTENTS Page # 1. ABOUT THE SCHEME... 1-1 2.

More information

Regression III: Advanced Methods

Regression III: Advanced Methods Lecture 2: Software Introduction Regression III: Advanced Methods William G. Jacoby Department of Political Science Michigan State University jacoby@msu.edu Getting Started with R What is R? A tiny R session

More information

Exploratory Data Analysis with R. Matthew Renze Iowa Code Camp Fall 2013

Exploratory Data Analysis with R. Matthew Renze Iowa Code Camp Fall 2013 Exploratory Data Analysis with R Matthew Renze Iowa Code Camp Fall 2013 Motivation The ability to take data to be able to understand it, to process it, to extract value from it, to visualize it, to communicate

More information

QPM Lab 1: Installing R and R Studio

QPM Lab 1: Installing R and R Studio QPM Lab 1: Installing R and R Studio Department of Political Science Washington University, St. Louis September 1-2, 2016 QPM Lab 1: Installing R and R Studio 1 Introductions About me Your turn: Name Year

More information

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V)

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Based on Industry Cases, Live Exercises, & Industry Executed Projects Module (I) Analytics Essentials 81 hrs 1. Statistics

More information

Enhanced Model Deployment in GAMS

Enhanced Model Deployment in GAMS Enhanced Model Deployment in GAMS Using R/Shiny to deploy and visualize GAMS models in a Web Interface Lutz Westermann Frederik Proske GAMS Software GmbH GAMS Development Corp. GAMS Software GmbH www.gams.com

More information

Use of Technology. Perform statistical computations on stored data or entered statistics.

Use of Technology. Perform statistical computations on stored data or entered statistics. Chapter 1 Introduction Use of Technology Statistics is a field that deals with sets of data. After the data is collected, it needs to be organized and interpreted. There is a limit to how much of the work

More information

Recap From Last Time:

Recap From Last Time: BIMM 143 More on R functions and packages Lecture 7 Barry Grant http://thegrantlab.org/bimm143 Office hour check-in! Recap From Last Time: Covered data input with the read.table() family of functions including

More information

IIT Madras, Polaris Software and IndusInd Bank launch INTELLECT PRIVACY Internet Banking Security Card

IIT Madras, Polaris Software and IndusInd Bank launch INTELLECT PRIVACY Internet Banking Security Card IIT Madras, Polaris Software and IndusInd Bank launch INTELLECT PRIVACY Internet Banking Security Card - Built on state-of-the-art technology, now live at IndusInd Bank Polaris Software Lab Limited (POLS.BO),

More information

WhatsApp Group Data Analysis with R

WhatsApp Group Data Analysis with R WhatsApp Group Data Analysis with R Sanchita Patil MCA Department Vivekanand Education Society's Institute of Technology Chembur, Mumbai 400074. ABSTRACT The means of communication has changed over time

More information

Automatic Differentiation in. Finlay Scott & Iago Mosqueira

Automatic Differentiation in. Finlay Scott & Iago Mosqueira Automatic Differentiation in Finlay Scott & Iago Mosqueira Structure What is R? history, strengths, limitations Current differentiation options in R How we have used AD with R Next steps AD in R What is

More information

Data Science Training

Data Science Training Data Science Training R, Predictive Modeling, Machine Learning, Python, Bigdata & Spark 9886760678 Introduction: This is a comprehensive course which builds on the knowledge and experience a business analyst

More information

Introduction to Scripting Languages. October 2017

Introduction to Scripting Languages. October 2017 Introduction to Scripting Languages damien.francois@uclouvain.be October 2017 1 Goal of this session: Advocate the use of scripting languages and help you choose the most suitable for your needs 2 Agenda

More information

GNU OCTAVE BEGINNER'S GUIDE BY JESPER SCHMIDT HANSEN DOWNLOAD EBOOK : GNU OCTAVE BEGINNER'S GUIDE BY JESPER SCHMIDT HANSEN PDF

GNU OCTAVE BEGINNER'S GUIDE BY JESPER SCHMIDT HANSEN DOWNLOAD EBOOK : GNU OCTAVE BEGINNER'S GUIDE BY JESPER SCHMIDT HANSEN PDF GNU OCTAVE BEGINNER'S GUIDE BY JESPER SCHMIDT HANSEN DOWNLOAD EBOOK : GNU OCTAVE BEGINNER'S GUIDE BY JESPER SCHMIDT HANSEN PDF Click link bellow and free register to download ebook: GNU OCTAVE BEGINNER'S

More information

INVENTORY MANAGEMENT SYSTEM

INVENTORY MANAGEMENT SYSTEM INVENTORY MANAGEMENT SYSTEM Neeraj Kumar Information Technology Dronacharya College of Engineering Gurgaon, Haryana ABSTRACT:- : This paper based on Inventory Management System, provided a lot of facility

More information

High Speed Pipelined Architecture for Adaptive Median Filter

High Speed Pipelined Architecture for Adaptive Median Filter Abstract High Speed Pipelined Architecture for Adaptive Median Filter D.Dhanasekaran, and **Dr.K.Boopathy Bagan *Assistant Professor, SVCE, Pennalur,Sriperumbudur-602105. **Professor, Madras Institute

More information

System Design S.CS301

System Design S.CS301 System Design S.CS301 (Autumn 2015/16) Page 1 Agenda Contents: Course overview Reading materials What is the MATLAB? MATLAB system History of MATLAB License of MATLAB Release history Syntax of MATLAB (Autumn

More information

Module 1: Introduction RStudio

Module 1: Introduction RStudio Module 1: Introduction RStudio Contents Page(s) Installing R and RStudio Software for Social Network Analysis 1-2 Introduction to R Language/ Syntax 3 Welcome to RStudio 4-14 A. The 4 Panes 5 B. Calculator

More information

To get started with Visual Basic 2005, I recommend that you jump right in

To get started with Visual Basic 2005, I recommend that you jump right in In This Chapter Chapter 1 Wading into Visual Basic Seeing where VB fits in with.net Writing your first Visual Basic 2005 program Exploiting the newfound power of VB To get started with Visual Basic 2005,

More information

R Short Course Session 1

R Short Course Session 1 R Short Course Session 1 Daniel Zhao, PhD Sixia Chen, PhD Department of Biostatistics and Epidemiology College of Public Health, OUHSC 10/23/2015 Outline Overview of the 5 sessions Pre-requisite requirements

More information

Data Science and Open Source Software. Iraklis Varlamis Assistant Professor Harokopio University of Athens

Data Science and Open Source Software. Iraklis Varlamis Assistant Professor Harokopio University of Athens Data Science and Open Source Software Iraklis Varlamis Assistant Professor Harokopio University of Athens varlamis@hua.gr What is data science? 2 Why data science is important? More data (volume, variety,...)

More information

GRAD6/8104; INES 8090 Spatial Statistic Spring 2017

GRAD6/8104; INES 8090 Spatial Statistic Spring 2017 Lab #1 Basics in Spatial Statistics (Due Date: 01/30/2017) PURPOSES 1. Get familiar with statistics and GIS 2. Learn to use open-source software R for statistical analysis Before starting your lab, create

More information

Intro to Stata for Political Scientists

Intro to Stata for Political Scientists Intro to Stata for Political Scientists Andrew S. Rosenberg Junior PRISM Fellow Department of Political Science Workshop Description This is an Introduction to Stata I will assume little/no prior knowledge

More information

Analytical Processing of Data of statistical genetics research in UNIX like Systems

Analytical Processing of Data of statistical genetics research in UNIX like Systems Survival Skills for Analytical Processing of Data of statistical genetics research in UNIX like Systems robert yu :: March 2011 anote UNIX like? Traditional/classical UNIX, e.g. System V (Solaris), BSD

More information

Engineering Tool Development

Engineering Tool Development Engineering Tool Development Codification of Legacy Three critical challenges for Indian engineering industry today Dr. R. S. Prabakar and Dr. M. Sathya Prasad Advanced Engineering 21 st August 2013 Three

More information