Using R to Make Sense of NMR Datasets

Size: px
Start display at page:

Download "Using R to Make Sense of NMR Datasets"

Transcription

1 Using R to Make Sense of NMR Datasets Prof. Bryan A. Hanson Dept. of Chemistry & Biochemistry DePauw University, Greencastle Indiana Presentation available at github.com/bryanhanson/panic2017 Additional references & resources on last slide

2 What Exactly is R? R is a free software environment for statistical computing and graphics The Ecosystem: Base R via R-Core Add-on packages from many authors Comprehensive R Archival Network (aka CRAN) (>10,000 packages) Bioconductor (>1,300 packages) Unofficial repositories: Github, Gitlab, SourceForge etc. Support forums User guides galore!

3 The Ecosystem: Support Resources Official Documentation Focused, Topical Task Views R-Bloggers: over 600 R-oriented bloggers Stack Overflow: over 160K questions on use of R Hundreds of Intro to R documents on the web Dozens of free R books on the web Many packages have a vignette or user guide. More resources on last slide.

4 Features of R Written by statisticians... a rather unlikely linguistic cocktail... 1 Cross-Platform: Windows, Linux, Mac OS Infrastructure: ready integration, interactive options Interfaces to many other languages, programs SAS, SPSS, python, JavaScript, MATLAB, C++ etc. Several ways of running in parallel, using multiple cores Command line, or several GUI options 1 Structure of the R Language

5 R is Open Source Free! Transparent: All code readily available for inspection Given enough eyeballs, all bugs are shallow Linus Torvalds Many parts of the ecosystem are community driven Open source means everyone can see my stupid mistakes. Version control means everyone can see every stupid mistake I ve ever made. Karl Broman

6 Do People Use R? 2 2 Bob Muenchen r4stats.com/articles/popularity/

7 Who Uses R? AirBnB Zillow 3 Etsy NYT Twitter Facebook 3 Data Science at Zillow

8 User Contributed Packages 4 4 Script by Gergely Daróczi

9 Reproducible Research with R Automation of Workflow: data analysis code + explanatory text figures + tables + text = report Many resources for reproducible research Several possible input formats Typical output formats are pdf files and web pages This presentation written with L A TEX and R via the knitr package.

10 What is ChemoSpec? ChemoSpec = Chemometrics + Spectroscopy Tools for exploratory data analysis No attempt to duplicate functions available on the spectrometer

11 ChemoSpec: Design Goals User friendly design Helpful error messages Reliable results High quality plots Consistent plot appearance Provide access to a wide range of chemometric operations Extensibility Developed with metabolomics and IR, NMR & Raman in mind

12 What Can ChemoSpec Do? Data Cleaning & Prep Import data Remove samples Drop frequency ranges Baseline correction Signal alignment Normalization Savitzky-Golay filters Exploratory Data Analysis Plotting & surveying Hierarchical cluster analysis (HCA) Principal component analysis (PCA) PCA diagnostics Score & loading plots ANOVA-PCA Empirical clustering

13 Demonstration Data Set: Saw Palmetto Caps Retail samples of Serenoa repens gel caps 500 MHz 1 H NMR in CDCl 3 4 samples were pure according to the label 10 samples have another oil present per label 2 outliers: olive oil, and evening primrose oil Serenoa repens extracts mainly fatty acids Outliers mainly triglycerides

14 Representative 1 H NMR Spectra peak height 0.0e e e+07 TJ_OO ET_pSrE ppm

15 Where is the Variation in the 1 H NMR Spectra? Centered Spectra 2.0e e e e ppm

16 Hierarchical Clustering 0e+00 4e+07 8e+07 SV_EPO TJ_OO NP_adSrE NR_pSrE ET_pSrE MDB_pSrE LF_adSrE SN_adSrE clustering method: complete NA_pSrE Nat_adSrE GNC_adSrE NW_adSrE NSI_adSrE distance method: euclidean TD_adSrE CVS_adSrE Sol_adSrE Key adsre EPO OO psre

17 Principal Component Analysis: Scree Plot 2e+07 2e+07 6e+07 component scores % 81% 91% 95% 97% 98% 99% 99% 100% 100% centered/noscale/classical cumulative percent variance shown to right of PC

18 Principal Component Analysis: Score Plot PC2 score (20%) 4e+07 0e+00 4e+07 centered/noscale/classical classic ellipses by group Key adsre EPO OO psre 4e+07 2e+07 0e+00 2e+07 4e+07 6e+07 PC1 score (62%)

19 Principal Component Analysis: Loadings Plot Reference Spectrum PC 1 Loadings ppm centered/noscale/classical

20 Principal Component Analysis: S Plot 1e+14 5e+13 0e covariance correlation centered/noscale/classical

21 Acknowledgements Thanks for your attention! Kristie Adams for the invite Sabbatical Support, DePauw University

22 Additional References & Resources R Project Home Page Selected Topical Task Views Chemometrics & Computational Physics Clinical Trials Experimental Design Pharmacokinetics Machine Learning Reproducible Research Bioconductor Home Page

ChemoSpec: An R Package for Chemometric Analysis of Spectroscopic Data (Package Version )

ChemoSpec: An R Package for Chemometric Analysis of Spectroscopic Data (Package Version ) ChemoSpec: An R Package for Chemometric Analysis of Spectroscopic Data (Package Version 4.4.97) Bryan A. Hanson e-mail: hanson@depauw.edu with contributions from Matt J. Keinsley DePauw University Department

More information

Raman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set

Raman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set Raman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set Claudia Beleites CENMAT and DI3, University of Trieste Spectroscopy Imaging, IPHT Jena e.v. February 13,

More information

A Survey of Statistical Modeling Tools

A Survey of Statistical Modeling Tools 1 of 6 A Survey of Statistical Modeling Tools Madhuri Kulkarni (A survey paper written under the guidance of Prof. Raj Jain) Abstract: A plethora of statistical modeling tools are available in the market

More information

JMP Book Descriptions

JMP Book Descriptions JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked

More information

The History and Use of R. Joseph Kambourakis

The History and Use of R. Joseph Kambourakis The History and Use of R Joseph Kambourakis Ground Rules Interrupt me These are all my opinions and not of EMC or Big Data Analytics, Discovery & Visualization Meetup Slides will be available Joseph

More information

Introduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core)

Introduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core) Introduction to Data Science What is Analytics and Data Science? Overview of Data Science and Analytics Why Analytics is is becoming popular now? Application of Analytics in business Analytics Vs Data

More information

PharmaSUG 2018 Paper #AD-05

PharmaSUG 2018 Paper #AD-05 Dr. Danni Yu is a Sr. Research Scientist in Oncology at Eli Lilly and Company. She received her doctoral degree in Statistics from Purdue University, where she worked with several multidisciplinary experts

More information

Introducing Oracle R Enterprise 1.4 -

Introducing Oracle R Enterprise 1.4 - Hello, and welcome to this online, self-paced lesson entitled Introducing Oracle R Enterprise. This session is part of an eight-lesson tutorial series on Oracle R Enterprise. My name is Brian Pottle. I

More information

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,

More information

Fourier Transformation Methods in the Field of Gamma Spectrometry

Fourier Transformation Methods in the Field of Gamma Spectrometry International Journal of Pure and Applied Physics ISSN 0973-1776 Volume 3 Number 1 (2007) pp. 132 141 Research India Publications http://www.ripublication.com/ijpap.htm Fourier Transformation Methods in

More information

Having Fun with Social Coding. Sean Handley. February 25, 2010

Having Fun with Social Coding. Sean Handley. February 25, 2010 Having Fun with Social Coding February 25, 2010 What is Github? GitHub is to collaborative coding, what Facebook is to social networking 1 It serves as a web front-end to open source projects by allowing

More information

Project selection Use-case collection Characteristics scoring. Principal Components Analysis (PCA) Biplot and numerical representation Clustering

Project selection Use-case collection Characteristics scoring. Principal Components Analysis (PCA) Biplot and numerical representation Clustering Authors Neil Caithness Michel Drescher Peter Deussen David Wallom University of Oxford European Grid Infrastructure Fraunhofer FOKUS University of Oxford Background We support the European Commission s

More information

A new shiny GUI for sdcmicro

A new shiny GUI for sdcmicro A new shiny GUI for sdcmicro Bernhard Meindl, Alexander Kowarik, Matthias Templ, Matthew Welch, Thijs Benschop Methods Division, Statistics Austria, Vienna, Austria ZHAW - Zurich University of Applied

More information

Data Analytics Training Program

Data Analytics Training Program Data Analytics Training Program In exclusive association with 1200+ Trainings 20,000+ Participants 10,000+ Brands 45+ Countries [Since 2009] Training partner for Who Is This Course For? Programers Willing

More information

NMR Users Guide Organic Chemistry Laboratory

NMR Users Guide Organic Chemistry Laboratory NMR Users Guide Organic Chemistry Laboratory Introduction The chemistry department is fortunate to have a high field (400 MHz) Nuclear Magnetic Resonance (NMR) spectrometer. You will be using this instrument

More information

Data Entry, and Manipulation. DataONE Community Engagement & Outreach Working Group

Data Entry, and Manipulation. DataONE Community Engagement & Outreach Working Group Data Entry, and Manipulation DataONE Community Engagement & Outreach Working Group Lesson Topics Best Practices for Creating Data Files Data Entry Options Data Integration Best Practices Data Manipulation

More information

Introduction to the Graphical User Interface of TREND

Introduction to the Graphical User Interface of TREND Introduction to the Graphical User Interface of TREND Description: Both GUI and command-line versions of TREND are available. The arguments and usages of the GUI panels are briefly introduced below. Detailed

More information

Package batman. Installation and Testing

Package batman. Installation and Testing Package batman Installation and Testing Table of Contents 1. INSTALLATION INSTRUCTIONS... 1 2. TESTING... 3 Test 1: Single spectrum from designed mixture data... 3 Test 2: Multiple spectra from designed

More information

Bio3D: Interactive Tools for Structural Bioinformatics.

Bio3D: Interactive Tools for Structural Bioinformatics. Bio3D: Interactive Tools for Structural Bioinformatics http://thegrantlab.org/bio3d/ What is Bio3D A freely distributed and widely used R package for structural bioinformatics. Provides a large number

More information

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract 19-1214 Chemometrics Technical Note Description of Pirouette Algorithms Abstract This discussion introduces the three analysis realms available in Pirouette and briefly describes each of the algorithms

More information

Fusion Registry 9 SDMX Data and Metadata Management System

Fusion Registry 9 SDMX Data and Metadata Management System Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance

More information

Customizable information fields (or entries) linked to each database level may be replicated and summarized to upstream and downstream levels.

Customizable information fields (or entries) linked to each database level may be replicated and summarized to upstream and downstream levels. Manage. Analyze. Discover. NEW FEATURES BioNumerics Seven comes with several fundamental improvements and a plethora of new analysis possibilities with a strong focus on user friendliness. Among the most

More information

Chemometric Analysis of Bio-Spectroscopic Data in : hyperspec

Chemometric Analysis of Bio-Spectroscopic Data in : hyperspec Chemometric Analysis of Bio-Spectroscopic Data in : hyperspec Claudia Beleites 1,2 (Claudia.Beleites@ipht-jena.de), Christoph Krafft 2, Jürgen Popp 2,3, and Valter Sergo 1 1 CENMAT and Dept. of Industrial

More information

Metabolomic Data Analysis with MetaboAnalyst

Metabolomic Data Analysis with MetaboAnalyst Metabolomic Data Analysis with MetaboAnalyst User ID: guest6522519400069885256 April 14, 2009 1 Data Processing and Normalization 1.1 Reading and Processing the Raw Data MetaboAnalyst accepts a variety

More information

Hal Varian, Google s Chief Economist The McKinsey Quarterly, Jan 2009

Hal Varian, Google s Chief Economist The McKinsey Quarterly, Jan 2009 The ability to take data to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it that s going to be a hugely important skill in the next decades, because

More information

IBM SPSS Statistics and open source: A powerful combination. Let s go

IBM SPSS Statistics and open source: A powerful combination. Let s go and open source: A powerful combination Let s go The purpose of this paper is to demonstrate the features and capabilities provided by the integration of IBM SPSS Statistics and open source programming

More information

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V)

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Based on Industry Cases, Live Exercises, & Industry Executed Projects Module (I) Analytics Essentials 81 hrs 1. Statistics

More information

An Introduction to R- Programming

An Introduction to R- Programming An Introduction to R- Programming Hadeel Alkofide, Msc, PhD NOT a biostatistician or R expert just simply an R user Some slides were adapted from lectures by Angie Mae Rodday MSc, PhD at Tufts University

More information

Barry Grant

Barry Grant Barry Grant bjgrant@umich.edu http://thegrantlab.org What is R? R is a freely distributed and widely used programing language and environment for statistical computing, data analysis and graphics. R provides

More information

Raman Images. Jeremy M. Shaver 1, Eunah Lee 2, Andrew Whitley 2, R. Scott Koch. 1. Eigenvector Research, Inc. 2. HORIBA Jobin Yvon, Inc.

Raman Images. Jeremy M. Shaver 1, Eunah Lee 2, Andrew Whitley 2, R. Scott Koch. 1. Eigenvector Research, Inc. 2. HORIBA Jobin Yvon, Inc. Analyzing and Visualizing Large Raman Images Jeremy M. Shaver 1, Eunah Lee 2, Andrew Whitley 2, R. Scott Koch 1 1. Eigenvector Research, Inc. 2. HORIBA Jobin Yvon, Inc. What is a Large Image? Gone from

More information

How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects?

How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects? How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects? Saraj Singh Manes School of Computer Science Carleton University Ottawa, Canada sarajmanes@cmail.carleton.ca Olga

More information

PowerShell-Module Documentation. Release docs

PowerShell-Module Documentation. Release docs PowerShell-Module Documentation Release docs December 29, 2016 User Documentation 1 Requirements 3 2 Installation 5 2.1 Option 1: Installer Script......................................... 5 2.2 Option

More information

Step-by-step user instructions to the hamlet-package

Step-by-step user instructions to the hamlet-package Step-by-step user instructions to the hamlet-package Teemu Daniel Laajala May 26, 2018 Contents 1 Analysis workflow 2 2 Loading data into R 2 2.1 Excel format data.......................... 4 2.2 CSV-files...............................

More information

MALDIquant: Quantitative Analysis of Mass Spectrometry Data

MALDIquant: Quantitative Analysis of Mass Spectrometry Data MALDIquant: Quantitative Analysis of Mass Spectrometry Data Sebastian Gibb November 12, 2017 Abstract MALDIquant provides a complete analysis pipeline for MALDI- TOF and other 2D mass spectrometry data.

More information

The latest trend of hybrid instrumentation

The latest trend of hybrid instrumentation Multivariate Data Processing of Spectral Images: The Ugly, the Bad, and the True The results of various multivariate data-processing methods of Raman maps recorded with a dispersive Raman microscope are

More information

An Introduction to the R Commander

An Introduction to the R Commander An Introduction to the R Commander BIO/MAT 460, Spring 2011 Christopher J. Mecklin Department of Mathematics & Statistics Biomathematics Research Group Murray State University Murray, KY 42071 christopher.mecklin@murraystate.edu

More information

Package mqtl. May 25, 2018

Package mqtl. May 25, 2018 Type Package Package mqtl May 25, 2018 Title Metabolomic Quantitative Trait Locus Mapping Version 1.0 Date 2013-09-18 Author Lyamine Hedjazi and Jean-Baptiste Cazier Maintainer Lyamine Hedjazi

More information

Package ASICS. January 23, 2018

Package ASICS. January 23, 2018 Type Package Package ASICS January 23, 2018 Title Automatic Statistical Identification in Complex Spectra Version 1.0.1 With a set of pure metabolite spectra, ASICS quantifies metabolites concentration

More information

Agilent MicroLab Quant Calibration Software: Measure Oil in Water using Method IP 426

Agilent MicroLab Quant Calibration Software: Measure Oil in Water using Method IP 426 Agilent MicroLab Quant Calibration Software: Measure Oil in Water using Method IP 426 Application Note Environmental Authors John Seelenbinder and Dipak Mainali Agilent Technologies, Inc. Introduction

More information

NMR Spectroscopy with VnmrJ. University of Toronto, Department of Chemistry

NMR Spectroscopy with VnmrJ. University of Toronto, Department of Chemistry NMR Spectroscopy with VnmrJ University of Toronto, Department of Chemistry Walk-up interface 1 Logging in 1 Starting VnmrJ 1 Inserting sample into the magnet or sample changer 1 Enter sample information

More information

Quick Start Guide for IS-EPOS

Quick Start Guide for IS-EPOS Quick Start Guide for IS-EPOS IS-EPOS is a prototype of Anthropogenic Hazard (AH) Thematic Core Service (TCS), in European Plate Observing System (EPOS) integration program. IS-EPOS integrates data describing

More information

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing

More information

Getting started with GitHub

Getting started with GitHub Getting started with GitHub A beginner s guide. (There s no code in this slide deck!) Presented by Quinn Supplee https://github.com/quinns What is GitHub? GitHub is a code hosting platform for version

More information

What is KNIME? workflows nodes standard data mining, data analysis data manipulation

What is KNIME? workflows nodes standard data mining, data analysis data manipulation KNIME TUTORIAL What is KNIME? KNIME = Konstanz Information Miner Developed at University of Konstanz in Germany Desktop version available free of charge (Open Source) Modular platform for building and

More information

We deliver Global Engineering Solutions. Efficiently. This page contains no technical data Subject to the EAR or the ITAR

We deliver Global Engineering Solutions. Efficiently. This page contains no technical data Subject to the EAR or the ITAR Numerical Computation, Statistical analysis and Visualization Using MATLAB and Tools Authors: Jamuna Konda, Jyothi Bonthu, Harpitha Joginipally Infotech Enterprises Ltd, Hyderabad, India August 8, 2013

More information

Visualization and PCA with Gene Expression Data

Visualization and PCA with Gene Expression Data References Visualization and PCA with Gene Expression Data Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 2.4 Chapter 10 of Bioconductor Monograph (course text) Ringner (2008).

More information

Multiple Sequence Alignment

Multiple Sequence Alignment Introduction to Bioinformatics online course: IBT Multiple Sequence Alignment Lec3: Navigation in Cursor mode By Ahmed Mansour Alzohairy Professor (Full) at Department of Genetics, Zagazig University,

More information

Scalable Machine Learning in R. with H2O

Scalable Machine Learning in R. with H2O Scalable Machine Learning in R with H2O Erin LeDell @ledell DSC July 2016 Introduction Statistician & Machine Learning Scientist at H2O.ai in Mountain View, California, USA Ph.D. in Biostatistics with

More information

Other useful tools. Eugeniy E. Mikhailov. Lecture 11. The College of William & Mary. Eugeniy Mikhailov (W&M) Practical Computing Lecture 11 1 / 9

Other useful tools. Eugeniy E. Mikhailov. Lecture 11. The College of William & Mary. Eugeniy Mikhailov (W&M) Practical Computing Lecture 11 1 / 9 Other useful tools Eugeniy E. Mikhailov The College of William & Mary Lecture 11 Eugeniy Mikhailov (W&M) Practical Computing Lecture 11 1 / 9 Specialization is... A human being should be able to change

More information

PROCESSING 2D SPECTRA USING VNMRJ JB Stothers NMR Facility Materials Science Addition 0216 Department of Chemistry Western University

PROCESSING 2D SPECTRA USING VNMRJ JB Stothers NMR Facility Materials Science Addition 0216 Department of Chemistry Western University PROCESSING 2D SPECTRA USING VNMRJ JB Stothers NMR Facility Materials Science Addition 0216 Department of Chemistry Western University 1. INTRODUCTION...1 1.1. About this Worksheet... 1 1.2. A Very Brief

More information

VERINT EFM 8.0 Release Overview

VERINT EFM 8.0 Release Overview VERINT EFM 8.0 Release Overview In January of 2015 Verint will release version 8.0 of the Enterprise Feedback Management (EFM) solution. Verint hosted SaaS customers will receive this update as part of

More information

Preprocessing: Smoothing and Derivatives

Preprocessing: Smoothing and Derivatives Preprocessing: Smoothing and Derivatives Jarmo Alander University of Vaasa November 1, 2010 Contents Preprocessing Baseline correction. Autoscaling. Savitzky-Golay Smoothing Filters Savitzky-Golay Filters

More information

Visual Analytics Tools for the Global Change Assessment Model. Ross Maciejewski Arizona State University

Visual Analytics Tools for the Global Change Assessment Model. Ross Maciejewski Arizona State University Visual Analytics Tools for the Global Change Assessment Model Ross Maciejewski Arizona State University GCAM Simulation After running thousands or even hundreds of simulations through GCAM this process

More information

Hyperspectral Chemical Imaging: principles and Chemometrics.

Hyperspectral Chemical Imaging: principles and Chemometrics. Hyperspectral Chemical Imaging: principles and Chemometrics aoife.gowen@ucd.ie University College Dublin University College Dublin 1,596 PhD students 6,17 international students 8,54 graduate students

More information

Nature Methods: doi: /nmeth Supplementary Figure 1

Nature Methods: doi: /nmeth Supplementary Figure 1 Supplementary Figure 1 Schematic representation of the Workflow window in Perseus All data matrices uploaded in the running session of Perseus and all processing steps are displayed in the order of execution.

More information

Clustering and Dimensionality Reduction

Clustering and Dimensionality Reduction Clustering and Dimensionality Reduction Some material on these is slides borrowed from Andrew Moore's excellent machine learning tutorials located at: Data Mining Automatically extracting meaning from

More information

Ricopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

Ricopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 Ricopili: Introdution WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 What will we offer? Practical: Sorry, no practical sessions today, please refer to the summer school, organized

More information

Hello! Python. Anthony Briggs MANNING SHELTER ISLAND

Hello! Python. Anthony Briggs MANNING SHELTER ISLAND Hello! Python Hello! Python Anthony Briggs MANNING SHELTER ISLAND For online information and ordering of this and other Manning books, please visit www.manning.com. The publisher offers discounts on this

More information

Data Science Bootcamp Curriculum. NYC Data Science Academy

Data Science Bootcamp Curriculum. NYC Data Science Academy Data Science Bootcamp Curriculum NYC Data Science Academy 100+ hours free, self-paced online course. Access to part-time in-person courses hosted at NYC campus Machine Learning with R and Python Foundations

More information

Virtuozzo Automator 6.1

Virtuozzo Automator 6.1 Virtuozzo Automator 6.1 Installation Guide for Windows April 18, 2016 Copyright 1999-2016 Parallels IP Holdings GmbH and its affiliates. All rights reserved. Parallels IP Holdings GmbH. c/o Parallels International

More information

RNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University

RNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University RNA-Seq Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University joshua.ainsley@tufts.edu Day four Quantifying expression Intro to R Differential expression

More information

2016 Stat-Ease, Inc. & CAMO Software

2016 Stat-Ease, Inc. & CAMO Software Multivariate Analysis and Design of Experiments in practice using The Unscrambler X Frank Westad CAMO Software fw@camo.com Pat Whitcomb Stat-Ease pat@statease.com Agenda Goal: Part 1: Part 2: Show how

More information

Introduction to Data Analytics. David Walling

Introduction to Data Analytics. David Walling Introduction to Data Analytics David Walling walling@tacc.utexas.edu Source: http://research.microsoft.com/en-us/collaboration/fourthparadigm/default.aspx Computational Simulation Model first, given initial

More information

Visual Analytics Tools for the Global Change Assessment Model. Michael Steptoe, Ross Maciejewski, & Robert Link Arizona State University

Visual Analytics Tools for the Global Change Assessment Model. Michael Steptoe, Ross Maciejewski, & Robert Link Arizona State University Visual Analytics Tools for the Global Change Assessment Model Michael Steptoe, Ross Maciejewski, & Robert Link Arizona State University GCAM Simulation When exploring the impact of various conditions or

More information

Release Notes. JMP Genomics. Version 4.0

Release Notes. JMP Genomics. Version 4.0 JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive

More information

Version control. with git and GitHub. Karl Broman. Biostatistics & Medical Informatics, UW Madison

Version control. with git and GitHub. Karl Broman. Biostatistics & Medical Informatics, UW Madison Version control with git and GitHub Karl Broman Biostatistics & Medical Informatics, UW Madison kbroman.org github.com/kbroman @kwbroman Course web: kbroman.org/tools4rr Slides prepared with Sam Younkin

More information

Monitoring and Improving Quality of Data Handling

Monitoring and Improving Quality of Data Handling Monitoring and Improving Quality of Data Handling The purpose of this document is to: (a) (b) (c) Maximise the quality of the research process once the question has been formulated and the study designed.

More information

Automatic Differentiation in. Finlay Scott & Iago Mosqueira

Automatic Differentiation in. Finlay Scott & Iago Mosqueira Automatic Differentiation in Finlay Scott & Iago Mosqueira Structure What is R? history, strengths, limitations Current differentiation options in R How we have used AD with R Next steps AD in R What is

More information

Tutorial on Machine Learning Tools

Tutorial on Machine Learning Tools Tutorial on Machine Learning Tools Yanbing Xue Milos Hauskrecht Why do we need these tools? Widely deployed classical models No need to code from scratch Easy-to-use GUI Outline Matlab Apps Weka 3 UI TensorFlow

More information

Date Lesson TOPIC HOMEWORK. Displaying Data WS 6.1. Measures of Central Tendency WS 6.2. Common Distributions WS 6.6. Outliers WS 6.

Date Lesson TOPIC HOMEWORK. Displaying Data WS 6.1. Measures of Central Tendency WS 6.2. Common Distributions WS 6.6. Outliers WS 6. UNIT 6 ONE VARIABLE STATISTICS Date Lesson TOPIC HOMEWORK 6.1 3.3 6.2 3.4 Displaying Data WS 6.1 Measures of Central Tendency WS 6.2 6.3 6.4 3.5 6.5 3.5 Grouped Data Central Tendency Measures of Spread

More information

Git - A brief overview

Git - A brief overview breno.leitao@gmail.com Linux Technology Center May 16, 2008 Agenda Introduction Concepts Commands (Tutorial) Front Ends History Started in 2005 to replace BitKeeper. Designed to support the kernel development.

More information

CSE 391 Lecture 9. Version control with Git

CSE 391 Lecture 9. Version control with Git CSE 391 Lecture 9 Version control with Git slides created by Ruth Anderson & Marty Stepp, images from http://git-scm.com/book/en/ http://www.cs.washington.edu/391/ 1 Problems Working Alone Ever done one

More information

CCD Report Radial Basis Function Modeling of CARS Data

CCD Report Radial Basis Function Modeling of CARS Data CCD Report 2011-5 Radial Basis Function Modeling of CARS Data March 28, 2011 Doyle Knight Center for Computational Design Dept Mechanical and Aerospace Engineering Rutgers University 98 Brett Road Piscataway,

More information

Data Mining. Introduction. Piotr Paszek. (Piotr Paszek) Data Mining DM KDD 1 / 44

Data Mining. Introduction. Piotr Paszek. (Piotr Paszek) Data Mining DM KDD 1 / 44 Data Mining Piotr Paszek piotr.paszek@us.edu.pl Introduction (Piotr Paszek) Data Mining DM KDD 1 / 44 Plan of the lecture 1 Data Mining (DM) 2 Knowledge Discovery in Databases (KDD) 3 CRISP-DM 4 DM software

More information

GRAD6/8104; INES 8090 Spatial Statistic Spring 2017

GRAD6/8104; INES 8090 Spatial Statistic Spring 2017 Lab #1 Basics in Spatial Statistics (Due Date: 01/30/2017) PURPOSES 1. Get familiar with statistics and GIS 2. Learn to use open-source software R for statistical analysis Before starting your lab, create

More information

Where Does R Fit Into Your SQL Server Stack? Stacia Varga blog.datainspirations.com

Where Does R Fit Into Your SQL Server Stack? Stacia Varga blog.datainspirations.com Where Does R Fit Into Your SQL Server Stack? Stacia Varga Stacia@datainspirations.com blog.datainspirations.com Twitter: @_StaciaV_ Stacia (Misner) Varga Over 30 years of IT experience, 17 years of BI

More information

PHARMACOKINETIC STATISTICAL ANALYSIS SYSTEM - - A SAS/AF AND SAS/FSP APPLICATION

PHARMACOKINETIC STATISTICAL ANALYSIS SYSTEM - - A SAS/AF AND SAS/FSP APPLICATION PHARMACOKINETIC STATISTICAL ANALYSIS SYSTEM - - A SAS/AF AND SAS/FSP APPLICATION Sharon M. Passe, Hoffmann-La Roche Inc. Andrea L Contino, Hoffmann-La Roche Inc. ABSTRACT The statistician responsible for

More information

Introduction to Multivariate Image Analysis (MIA) Table of Contents

Introduction to Multivariate Image Analysis (MIA) Table of Contents Introduction to Multivariate Image Analysis (MIA) Copyright 1996-2013 Eigenvector Research, Inc. No part of this material may be photocopied or reproduced in any form without prior written consent from

More information

Learn Use Linux Made Easy Home Office On The Go

Learn Use Linux Made Easy Home Office On The Go We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with learn use linux made

More information

INF160 IS Development Environments

INF160 IS Development Environments INF160 IS Development Environments AUBG, COS dept Reference books: Baltzan Paige, Business Driven Information Systems, McGraw- Hill/Irwin, 3e, 2012. Doar Matthew B., Practical Development Environments,

More information

Computational statistics Jamie Griffin. Semester B 2018 Lecture 1

Computational statistics Jamie Griffin. Semester B 2018 Lecture 1 Computational statistics Jamie Griffin Semester B 2018 Lecture 1 Course overview This course is not: Statistical computing Programming This course is: Computational statistics Statistical methods that

More information

Data Mining: Exploring Data. Lecture Notes for Chapter 3

Data Mining: Exploring Data. Lecture Notes for Chapter 3 Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Look for accompanying R code on the course web site. Topics Exploratory Data Analysis

More information

Diagnostics and Exploratory Analysis Infrastructure for ACME Workflow

Diagnostics and Exploratory Analysis Infrastructure for ACME Workflow Diagnostics and Exploratory Analysis Infrastructure for ACME Workflow ORNL: Brian Smith, John Harney, Brian Jewell LLNL: Jeffrey Painter, James McEnerney, ORNL is managed by UT-Battelle for the US Department

More information

Continuous Integration / Continuous Testing

Continuous Integration / Continuous Testing Bitte decken Sie die schraffierte Fläche mit einem Bild ab. Please cover the shaded area with a picture. (24,4 x 7,6 cm) Continuous Integration / Continuous Testing IIC What s SW Integration? Integration

More information

Part I. An Introduction to R

Part I. An Introduction to R Part I An Introduction to R 1 Chapter 1 Getting Started R is a programming language and comprehensive statistical platform for data exploration and analysis. It is free and open source, which means anyone

More information

Lecture Topic Projects

Lecture Topic Projects Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis techniques for non-spatial data Project #1 out 4 Data

More information

Module 1: Introduction RStudio

Module 1: Introduction RStudio Module 1: Introduction RStudio Contents Page(s) Installing R and RStudio Software for Social Network Analysis 1-2 Introduction to R Language/ Syntax 3 Welcome to RStudio 4-14 A. The 4 Panes 5 B. Calculator

More information

Package PRISMA. May 27, 2018

Package PRISMA. May 27, 2018 Type Package Package PRISMA May 27, 2018 Title Protocol Inspection and State Machine Analysis Version 0.2-7 Date 2018-05-26 Depends R (>= 2.10), Matrix, gplots, methods, ggplot2 Suggests tm (>= 0.6) Author

More information

Introduction to R. base -> R win32.exe (this will change depending on the latest version)

Introduction to R. base -> R win32.exe (this will change depending on the latest version) Dr Raffaella Calabrese, Essex Business School 1. GETTING STARTED Introduction to R R is a powerful environment for statistical computing which runs on several platforms. R is available free of charge.

More information

Lecture 25: Review I

Lecture 25: Review I Lecture 25: Review I Reading: Up to chapter 5 in ISLR. STATS 202: Data mining and analysis Jonathan Taylor 1 / 18 Unsupervised learning In unsupervised learning, all the variables are on equal standing,

More information

SQL Server 2017: Data Science with Python or R?

SQL Server 2017: Data Science with Python or R? SQL Server 2017: Data Science with Python or R? Dejan Sarka Sponsor Introduction Dejan Sarka (dsarka@solidq.com, dsarka@siol.net, @DejanSarka) 30 years of experience SQL Server MVP, MCT, 16 books 20+ courses,

More information

Pre-processing in vibrational spectroscopy, a when, why and how

Pre-processing in vibrational spectroscopy, a when, why and how Analytical Methods Pre-processing in vibrational spectroscopy, a when, why and how Journal: Analytical Methods Manuscript ID: AY-TRV--0-00.R Article Type: Tutorial Review Date Submitted by the Author:

More information

Raman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set

Raman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set Raman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set Claudia Beleites DIA Raman Spectroscopy Group, University of Trieste/Italy (2 28) Spectroscopy Imaging,

More information

Python ecosystem for scientific computing with ABINIT: challenges and opportunities. M. Giantomassi and the AbiPy group

Python ecosystem for scientific computing with ABINIT: challenges and opportunities. M. Giantomassi and the AbiPy group Python ecosystem for scientific computing with ABINIT: challenges and opportunities M. Giantomassi and the AbiPy group Frejus, May 9, 2017 Python package for: generating input files automatically post-processing

More information

Workshop MedSciNet - Building an Electronic Data Capture System for a Clinical Trial

Workshop MedSciNet - Building an Electronic Data Capture System for a Clinical Trial Note: for non-commercial purposes only Workshop MedSciNet - Building an Electronic Data Capture System for a Clinical Trial Presenters: Prof Marius Kublickas Laima Juodvirsiene Prof Magnus Westgren Friday,

More information

NMRProcFlow Macro-command Reference Guide

NMRProcFlow Macro-command Reference Guide NMRProcFlow Macro-command Reference Guide This document is the reference guide of the macro-commands Daniel Jacob UMR 1332 BFP, Metabolomics Facility CGFB Bordeaux, MetaboHUB - 2018 1 NMRProcFlow - Macro-command

More information

PharmaSUG China Big Insights in Small Data with RStudio Shiny Mina Chen, Roche Product Development in Asia Pacific, Shanghai, China

PharmaSUG China Big Insights in Small Data with RStudio Shiny Mina Chen, Roche Product Development in Asia Pacific, Shanghai, China PharmaSUG China 2016-74 Big Insights in Small Data with RStudio Shiny Mina Chen, Roche Product Development in Asia Pacific, Shanghai, China ABSTRACT Accelerating analysis and faster data interpretation

More information

Package covr. October 18, 2018

Package covr. October 18, 2018 Encoding UTF-8 Title Test Coverage for Packages Version 3.2.1 Package covr October 18, 2018 Track and report code coverage for your package and (optionally) upload the results to a coverage service like

More information

CSE 158. Web Mining and Recommender Systems. Midterm recap

CSE 158. Web Mining and Recommender Systems. Midterm recap CSE 158 Web Mining and Recommender Systems Midterm recap Midterm on Wednesday! 5:10 pm 6:10 pm Closed book but I ll provide a similar level of basic info as in the last page of previous midterms CSE 158

More information

Chemical Shift Perturbation Tutorial

Chemical Shift Perturbation Tutorial CcpNmr Analysis Version 3 Chemical Shift Perturbation Tutorial Written by: Luca Mureddu CCPN application developer, PhD Student MCB, University of Leicester, Lancaster Road, Leicester, LE1 9HN, UK 1 t:

More information