Using R to Make Sense of NMR Datasets
|
|
- Alexandrina Curtis
- 5 years ago
- Views:
Transcription
1 Using R to Make Sense of NMR Datasets Prof. Bryan A. Hanson Dept. of Chemistry & Biochemistry DePauw University, Greencastle Indiana Presentation available at github.com/bryanhanson/panic2017 Additional references & resources on last slide
2 What Exactly is R? R is a free software environment for statistical computing and graphics The Ecosystem: Base R via R-Core Add-on packages from many authors Comprehensive R Archival Network (aka CRAN) (>10,000 packages) Bioconductor (>1,300 packages) Unofficial repositories: Github, Gitlab, SourceForge etc. Support forums User guides galore!
3 The Ecosystem: Support Resources Official Documentation Focused, Topical Task Views R-Bloggers: over 600 R-oriented bloggers Stack Overflow: over 160K questions on use of R Hundreds of Intro to R documents on the web Dozens of free R books on the web Many packages have a vignette or user guide. More resources on last slide.
4 Features of R Written by statisticians... a rather unlikely linguistic cocktail... 1 Cross-Platform: Windows, Linux, Mac OS Infrastructure: ready integration, interactive options Interfaces to many other languages, programs SAS, SPSS, python, JavaScript, MATLAB, C++ etc. Several ways of running in parallel, using multiple cores Command line, or several GUI options 1 Structure of the R Language
5 R is Open Source Free! Transparent: All code readily available for inspection Given enough eyeballs, all bugs are shallow Linus Torvalds Many parts of the ecosystem are community driven Open source means everyone can see my stupid mistakes. Version control means everyone can see every stupid mistake I ve ever made. Karl Broman
6 Do People Use R? 2 2 Bob Muenchen r4stats.com/articles/popularity/
7 Who Uses R? AirBnB Zillow 3 Etsy NYT Twitter Facebook 3 Data Science at Zillow
8 User Contributed Packages 4 4 Script by Gergely Daróczi
9 Reproducible Research with R Automation of Workflow: data analysis code + explanatory text figures + tables + text = report Many resources for reproducible research Several possible input formats Typical output formats are pdf files and web pages This presentation written with L A TEX and R via the knitr package.
10 What is ChemoSpec? ChemoSpec = Chemometrics + Spectroscopy Tools for exploratory data analysis No attempt to duplicate functions available on the spectrometer
11 ChemoSpec: Design Goals User friendly design Helpful error messages Reliable results High quality plots Consistent plot appearance Provide access to a wide range of chemometric operations Extensibility Developed with metabolomics and IR, NMR & Raman in mind
12 What Can ChemoSpec Do? Data Cleaning & Prep Import data Remove samples Drop frequency ranges Baseline correction Signal alignment Normalization Savitzky-Golay filters Exploratory Data Analysis Plotting & surveying Hierarchical cluster analysis (HCA) Principal component analysis (PCA) PCA diagnostics Score & loading plots ANOVA-PCA Empirical clustering
13 Demonstration Data Set: Saw Palmetto Caps Retail samples of Serenoa repens gel caps 500 MHz 1 H NMR in CDCl 3 4 samples were pure according to the label 10 samples have another oil present per label 2 outliers: olive oil, and evening primrose oil Serenoa repens extracts mainly fatty acids Outliers mainly triglycerides
14 Representative 1 H NMR Spectra peak height 0.0e e e+07 TJ_OO ET_pSrE ppm
15 Where is the Variation in the 1 H NMR Spectra? Centered Spectra 2.0e e e e ppm
16 Hierarchical Clustering 0e+00 4e+07 8e+07 SV_EPO TJ_OO NP_adSrE NR_pSrE ET_pSrE MDB_pSrE LF_adSrE SN_adSrE clustering method: complete NA_pSrE Nat_adSrE GNC_adSrE NW_adSrE NSI_adSrE distance method: euclidean TD_adSrE CVS_adSrE Sol_adSrE Key adsre EPO OO psre
17 Principal Component Analysis: Scree Plot 2e+07 2e+07 6e+07 component scores % 81% 91% 95% 97% 98% 99% 99% 100% 100% centered/noscale/classical cumulative percent variance shown to right of PC
18 Principal Component Analysis: Score Plot PC2 score (20%) 4e+07 0e+00 4e+07 centered/noscale/classical classic ellipses by group Key adsre EPO OO psre 4e+07 2e+07 0e+00 2e+07 4e+07 6e+07 PC1 score (62%)
19 Principal Component Analysis: Loadings Plot Reference Spectrum PC 1 Loadings ppm centered/noscale/classical
20 Principal Component Analysis: S Plot 1e+14 5e+13 0e covariance correlation centered/noscale/classical
21 Acknowledgements Thanks for your attention! Kristie Adams for the invite Sabbatical Support, DePauw University
22 Additional References & Resources R Project Home Page Selected Topical Task Views Chemometrics & Computational Physics Clinical Trials Experimental Design Pharmacokinetics Machine Learning Reproducible Research Bioconductor Home Page
ChemoSpec: An R Package for Chemometric Analysis of Spectroscopic Data (Package Version )
ChemoSpec: An R Package for Chemometric Analysis of Spectroscopic Data (Package Version 4.4.97) Bryan A. Hanson e-mail: hanson@depauw.edu with contributions from Matt J. Keinsley DePauw University Department
More informationRaman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set
Raman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set Claudia Beleites CENMAT and DI3, University of Trieste Spectroscopy Imaging, IPHT Jena e.v. February 13,
More informationA Survey of Statistical Modeling Tools
1 of 6 A Survey of Statistical Modeling Tools Madhuri Kulkarni (A survey paper written under the guidance of Prof. Raj Jain) Abstract: A plethora of statistical modeling tools are available in the market
More informationJMP Book Descriptions
JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked
More informationThe History and Use of R. Joseph Kambourakis
The History and Use of R Joseph Kambourakis Ground Rules Interrupt me These are all my opinions and not of EMC or Big Data Analytics, Discovery & Visualization Meetup Slides will be available Joseph
More informationIntroduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core)
Introduction to Data Science What is Analytics and Data Science? Overview of Data Science and Analytics Why Analytics is is becoming popular now? Application of Analytics in business Analytics Vs Data
More informationPharmaSUG 2018 Paper #AD-05
Dr. Danni Yu is a Sr. Research Scientist in Oncology at Eli Lilly and Company. She received her doctoral degree in Statistics from Purdue University, where she worked with several multidisciplinary experts
More informationIntroducing Oracle R Enterprise 1.4 -
Hello, and welcome to this online, self-paced lesson entitled Introducing Oracle R Enterprise. This session is part of an eight-lesson tutorial series on Oracle R Enterprise. My name is Brian Pottle. I
More informationStatistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte
Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,
More informationFourier Transformation Methods in the Field of Gamma Spectrometry
International Journal of Pure and Applied Physics ISSN 0973-1776 Volume 3 Number 1 (2007) pp. 132 141 Research India Publications http://www.ripublication.com/ijpap.htm Fourier Transformation Methods in
More informationHaving Fun with Social Coding. Sean Handley. February 25, 2010
Having Fun with Social Coding February 25, 2010 What is Github? GitHub is to collaborative coding, what Facebook is to social networking 1 It serves as a web front-end to open source projects by allowing
More informationProject selection Use-case collection Characteristics scoring. Principal Components Analysis (PCA) Biplot and numerical representation Clustering
Authors Neil Caithness Michel Drescher Peter Deussen David Wallom University of Oxford European Grid Infrastructure Fraunhofer FOKUS University of Oxford Background We support the European Commission s
More informationA new shiny GUI for sdcmicro
A new shiny GUI for sdcmicro Bernhard Meindl, Alexander Kowarik, Matthias Templ, Matthew Welch, Thijs Benschop Methods Division, Statistics Austria, Vienna, Austria ZHAW - Zurich University of Applied
More informationData Analytics Training Program
Data Analytics Training Program In exclusive association with 1200+ Trainings 20,000+ Participants 10,000+ Brands 45+ Countries [Since 2009] Training partner for Who Is This Course For? Programers Willing
More informationNMR Users Guide Organic Chemistry Laboratory
NMR Users Guide Organic Chemistry Laboratory Introduction The chemistry department is fortunate to have a high field (400 MHz) Nuclear Magnetic Resonance (NMR) spectrometer. You will be using this instrument
More informationData Entry, and Manipulation. DataONE Community Engagement & Outreach Working Group
Data Entry, and Manipulation DataONE Community Engagement & Outreach Working Group Lesson Topics Best Practices for Creating Data Files Data Entry Options Data Integration Best Practices Data Manipulation
More informationIntroduction to the Graphical User Interface of TREND
Introduction to the Graphical User Interface of TREND Description: Both GUI and command-line versions of TREND are available. The arguments and usages of the GUI panels are briefly introduced below. Detailed
More informationPackage batman. Installation and Testing
Package batman Installation and Testing Table of Contents 1. INSTALLATION INSTRUCTIONS... 1 2. TESTING... 3 Test 1: Single spectrum from designed mixture data... 3 Test 2: Multiple spectra from designed
More informationBio3D: Interactive Tools for Structural Bioinformatics.
Bio3D: Interactive Tools for Structural Bioinformatics http://thegrantlab.org/bio3d/ What is Bio3D A freely distributed and widely used R package for structural bioinformatics. Provides a large number
More informationChemometrics. Description of Pirouette Algorithms. Technical Note. Abstract
19-1214 Chemometrics Technical Note Description of Pirouette Algorithms Abstract This discussion introduces the three analysis realms available in Pirouette and briefly describes each of the algorithms
More informationFusion Registry 9 SDMX Data and Metadata Management System
Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance
More informationCustomizable information fields (or entries) linked to each database level may be replicated and summarized to upstream and downstream levels.
Manage. Analyze. Discover. NEW FEATURES BioNumerics Seven comes with several fundamental improvements and a plethora of new analysis possibilities with a strong focus on user friendliness. Among the most
More informationChemometric Analysis of Bio-Spectroscopic Data in : hyperspec
Chemometric Analysis of Bio-Spectroscopic Data in : hyperspec Claudia Beleites 1,2 (Claudia.Beleites@ipht-jena.de), Christoph Krafft 2, Jürgen Popp 2,3, and Valter Sergo 1 1 CENMAT and Dept. of Industrial
More informationMetabolomic Data Analysis with MetaboAnalyst
Metabolomic Data Analysis with MetaboAnalyst User ID: guest6522519400069885256 April 14, 2009 1 Data Processing and Normalization 1.1 Reading and Processing the Raw Data MetaboAnalyst accepts a variety
More informationHal Varian, Google s Chief Economist The McKinsey Quarterly, Jan 2009
The ability to take data to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it that s going to be a hugely important skill in the next decades, because
More informationIBM SPSS Statistics and open source: A powerful combination. Let s go
and open source: A powerful combination Let s go The purpose of this paper is to demonstrate the features and capabilities provided by the integration of IBM SPSS Statistics and open source programming
More informationIvy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V)
Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Based on Industry Cases, Live Exercises, & Industry Executed Projects Module (I) Analytics Essentials 81 hrs 1. Statistics
More informationAn Introduction to R- Programming
An Introduction to R- Programming Hadeel Alkofide, Msc, PhD NOT a biostatistician or R expert just simply an R user Some slides were adapted from lectures by Angie Mae Rodday MSc, PhD at Tufts University
More informationBarry Grant
Barry Grant bjgrant@umich.edu http://thegrantlab.org What is R? R is a freely distributed and widely used programing language and environment for statistical computing, data analysis and graphics. R provides
More informationRaman Images. Jeremy M. Shaver 1, Eunah Lee 2, Andrew Whitley 2, R. Scott Koch. 1. Eigenvector Research, Inc. 2. HORIBA Jobin Yvon, Inc.
Analyzing and Visualizing Large Raman Images Jeremy M. Shaver 1, Eunah Lee 2, Andrew Whitley 2, R. Scott Koch 1 1. Eigenvector Research, Inc. 2. HORIBA Jobin Yvon, Inc. What is a Large Image? Gone from
More informationHow Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects?
How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects? Saraj Singh Manes School of Computer Science Carleton University Ottawa, Canada sarajmanes@cmail.carleton.ca Olga
More informationPowerShell-Module Documentation. Release docs
PowerShell-Module Documentation Release docs December 29, 2016 User Documentation 1 Requirements 3 2 Installation 5 2.1 Option 1: Installer Script......................................... 5 2.2 Option
More informationStep-by-step user instructions to the hamlet-package
Step-by-step user instructions to the hamlet-package Teemu Daniel Laajala May 26, 2018 Contents 1 Analysis workflow 2 2 Loading data into R 2 2.1 Excel format data.......................... 4 2.2 CSV-files...............................
More informationMALDIquant: Quantitative Analysis of Mass Spectrometry Data
MALDIquant: Quantitative Analysis of Mass Spectrometry Data Sebastian Gibb November 12, 2017 Abstract MALDIquant provides a complete analysis pipeline for MALDI- TOF and other 2D mass spectrometry data.
More informationThe latest trend of hybrid instrumentation
Multivariate Data Processing of Spectral Images: The Ugly, the Bad, and the True The results of various multivariate data-processing methods of Raman maps recorded with a dispersive Raman microscope are
More informationAn Introduction to the R Commander
An Introduction to the R Commander BIO/MAT 460, Spring 2011 Christopher J. Mecklin Department of Mathematics & Statistics Biomathematics Research Group Murray State University Murray, KY 42071 christopher.mecklin@murraystate.edu
More informationPackage mqtl. May 25, 2018
Type Package Package mqtl May 25, 2018 Title Metabolomic Quantitative Trait Locus Mapping Version 1.0 Date 2013-09-18 Author Lyamine Hedjazi and Jean-Baptiste Cazier Maintainer Lyamine Hedjazi
More informationPackage ASICS. January 23, 2018
Type Package Package ASICS January 23, 2018 Title Automatic Statistical Identification in Complex Spectra Version 1.0.1 With a set of pure metabolite spectra, ASICS quantifies metabolites concentration
More informationAgilent MicroLab Quant Calibration Software: Measure Oil in Water using Method IP 426
Agilent MicroLab Quant Calibration Software: Measure Oil in Water using Method IP 426 Application Note Environmental Authors John Seelenbinder and Dipak Mainali Agilent Technologies, Inc. Introduction
More informationNMR Spectroscopy with VnmrJ. University of Toronto, Department of Chemistry
NMR Spectroscopy with VnmrJ University of Toronto, Department of Chemistry Walk-up interface 1 Logging in 1 Starting VnmrJ 1 Inserting sample into the magnet or sample changer 1 Enter sample information
More informationQuick Start Guide for IS-EPOS
Quick Start Guide for IS-EPOS IS-EPOS is a prototype of Anthropogenic Hazard (AH) Thematic Core Service (TCS), in European Plate Observing System (EPOS) integration program. IS-EPOS integrates data describing
More informationSOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie
SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing
More informationGetting started with GitHub
Getting started with GitHub A beginner s guide. (There s no code in this slide deck!) Presented by Quinn Supplee https://github.com/quinns What is GitHub? GitHub is a code hosting platform for version
More informationWhat is KNIME? workflows nodes standard data mining, data analysis data manipulation
KNIME TUTORIAL What is KNIME? KNIME = Konstanz Information Miner Developed at University of Konstanz in Germany Desktop version available free of charge (Open Source) Modular platform for building and
More informationWe deliver Global Engineering Solutions. Efficiently. This page contains no technical data Subject to the EAR or the ITAR
Numerical Computation, Statistical analysis and Visualization Using MATLAB and Tools Authors: Jamuna Konda, Jyothi Bonthu, Harpitha Joginipally Infotech Enterprises Ltd, Hyderabad, India August 8, 2013
More informationVisualization and PCA with Gene Expression Data
References Visualization and PCA with Gene Expression Data Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 2.4 Chapter 10 of Bioconductor Monograph (course text) Ringner (2008).
More informationMultiple Sequence Alignment
Introduction to Bioinformatics online course: IBT Multiple Sequence Alignment Lec3: Navigation in Cursor mode By Ahmed Mansour Alzohairy Professor (Full) at Department of Genetics, Zagazig University,
More informationScalable Machine Learning in R. with H2O
Scalable Machine Learning in R with H2O Erin LeDell @ledell DSC July 2016 Introduction Statistician & Machine Learning Scientist at H2O.ai in Mountain View, California, USA Ph.D. in Biostatistics with
More informationOther useful tools. Eugeniy E. Mikhailov. Lecture 11. The College of William & Mary. Eugeniy Mikhailov (W&M) Practical Computing Lecture 11 1 / 9
Other useful tools Eugeniy E. Mikhailov The College of William & Mary Lecture 11 Eugeniy Mikhailov (W&M) Practical Computing Lecture 11 1 / 9 Specialization is... A human being should be able to change
More informationPROCESSING 2D SPECTRA USING VNMRJ JB Stothers NMR Facility Materials Science Addition 0216 Department of Chemistry Western University
PROCESSING 2D SPECTRA USING VNMRJ JB Stothers NMR Facility Materials Science Addition 0216 Department of Chemistry Western University 1. INTRODUCTION...1 1.1. About this Worksheet... 1 1.2. A Very Brief
More informationVERINT EFM 8.0 Release Overview
VERINT EFM 8.0 Release Overview In January of 2015 Verint will release version 8.0 of the Enterprise Feedback Management (EFM) solution. Verint hosted SaaS customers will receive this update as part of
More informationPreprocessing: Smoothing and Derivatives
Preprocessing: Smoothing and Derivatives Jarmo Alander University of Vaasa November 1, 2010 Contents Preprocessing Baseline correction. Autoscaling. Savitzky-Golay Smoothing Filters Savitzky-Golay Filters
More informationVisual Analytics Tools for the Global Change Assessment Model. Ross Maciejewski Arizona State University
Visual Analytics Tools for the Global Change Assessment Model Ross Maciejewski Arizona State University GCAM Simulation After running thousands or even hundreds of simulations through GCAM this process
More informationHyperspectral Chemical Imaging: principles and Chemometrics.
Hyperspectral Chemical Imaging: principles and Chemometrics aoife.gowen@ucd.ie University College Dublin University College Dublin 1,596 PhD students 6,17 international students 8,54 graduate students
More informationNature Methods: doi: /nmeth Supplementary Figure 1
Supplementary Figure 1 Schematic representation of the Workflow window in Perseus All data matrices uploaded in the running session of Perseus and all processing steps are displayed in the order of execution.
More informationClustering and Dimensionality Reduction
Clustering and Dimensionality Reduction Some material on these is slides borrowed from Andrew Moore's excellent machine learning tutorials located at: Data Mining Automatically extracting meaning from
More informationRicopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015
Ricopili: Introdution WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 What will we offer? Practical: Sorry, no practical sessions today, please refer to the summer school, organized
More informationHello! Python. Anthony Briggs MANNING SHELTER ISLAND
Hello! Python Hello! Python Anthony Briggs MANNING SHELTER ISLAND For online information and ordering of this and other Manning books, please visit www.manning.com. The publisher offers discounts on this
More informationData Science Bootcamp Curriculum. NYC Data Science Academy
Data Science Bootcamp Curriculum NYC Data Science Academy 100+ hours free, self-paced online course. Access to part-time in-person courses hosted at NYC campus Machine Learning with R and Python Foundations
More informationVirtuozzo Automator 6.1
Virtuozzo Automator 6.1 Installation Guide for Windows April 18, 2016 Copyright 1999-2016 Parallels IP Holdings GmbH and its affiliates. All rights reserved. Parallels IP Holdings GmbH. c/o Parallels International
More informationRNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University
RNA-Seq Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University joshua.ainsley@tufts.edu Day four Quantifying expression Intro to R Differential expression
More information2016 Stat-Ease, Inc. & CAMO Software
Multivariate Analysis and Design of Experiments in practice using The Unscrambler X Frank Westad CAMO Software fw@camo.com Pat Whitcomb Stat-Ease pat@statease.com Agenda Goal: Part 1: Part 2: Show how
More informationIntroduction to Data Analytics. David Walling
Introduction to Data Analytics David Walling walling@tacc.utexas.edu Source: http://research.microsoft.com/en-us/collaboration/fourthparadigm/default.aspx Computational Simulation Model first, given initial
More informationVisual Analytics Tools for the Global Change Assessment Model. Michael Steptoe, Ross Maciejewski, & Robert Link Arizona State University
Visual Analytics Tools for the Global Change Assessment Model Michael Steptoe, Ross Maciejewski, & Robert Link Arizona State University GCAM Simulation When exploring the impact of various conditions or
More informationRelease Notes. JMP Genomics. Version 4.0
JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive
More informationVersion control. with git and GitHub. Karl Broman. Biostatistics & Medical Informatics, UW Madison
Version control with git and GitHub Karl Broman Biostatistics & Medical Informatics, UW Madison kbroman.org github.com/kbroman @kwbroman Course web: kbroman.org/tools4rr Slides prepared with Sam Younkin
More informationMonitoring and Improving Quality of Data Handling
Monitoring and Improving Quality of Data Handling The purpose of this document is to: (a) (b) (c) Maximise the quality of the research process once the question has been formulated and the study designed.
More informationAutomatic Differentiation in. Finlay Scott & Iago Mosqueira
Automatic Differentiation in Finlay Scott & Iago Mosqueira Structure What is R? history, strengths, limitations Current differentiation options in R How we have used AD with R Next steps AD in R What is
More informationTutorial on Machine Learning Tools
Tutorial on Machine Learning Tools Yanbing Xue Milos Hauskrecht Why do we need these tools? Widely deployed classical models No need to code from scratch Easy-to-use GUI Outline Matlab Apps Weka 3 UI TensorFlow
More informationDate Lesson TOPIC HOMEWORK. Displaying Data WS 6.1. Measures of Central Tendency WS 6.2. Common Distributions WS 6.6. Outliers WS 6.
UNIT 6 ONE VARIABLE STATISTICS Date Lesson TOPIC HOMEWORK 6.1 3.3 6.2 3.4 Displaying Data WS 6.1 Measures of Central Tendency WS 6.2 6.3 6.4 3.5 6.5 3.5 Grouped Data Central Tendency Measures of Spread
More informationGit - A brief overview
breno.leitao@gmail.com Linux Technology Center May 16, 2008 Agenda Introduction Concepts Commands (Tutorial) Front Ends History Started in 2005 to replace BitKeeper. Designed to support the kernel development.
More informationCSE 391 Lecture 9. Version control with Git
CSE 391 Lecture 9 Version control with Git slides created by Ruth Anderson & Marty Stepp, images from http://git-scm.com/book/en/ http://www.cs.washington.edu/391/ 1 Problems Working Alone Ever done one
More informationCCD Report Radial Basis Function Modeling of CARS Data
CCD Report 2011-5 Radial Basis Function Modeling of CARS Data March 28, 2011 Doyle Knight Center for Computational Design Dept Mechanical and Aerospace Engineering Rutgers University 98 Brett Road Piscataway,
More informationData Mining. Introduction. Piotr Paszek. (Piotr Paszek) Data Mining DM KDD 1 / 44
Data Mining Piotr Paszek piotr.paszek@us.edu.pl Introduction (Piotr Paszek) Data Mining DM KDD 1 / 44 Plan of the lecture 1 Data Mining (DM) 2 Knowledge Discovery in Databases (KDD) 3 CRISP-DM 4 DM software
More informationGRAD6/8104; INES 8090 Spatial Statistic Spring 2017
Lab #1 Basics in Spatial Statistics (Due Date: 01/30/2017) PURPOSES 1. Get familiar with statistics and GIS 2. Learn to use open-source software R for statistical analysis Before starting your lab, create
More informationWhere Does R Fit Into Your SQL Server Stack? Stacia Varga blog.datainspirations.com
Where Does R Fit Into Your SQL Server Stack? Stacia Varga Stacia@datainspirations.com blog.datainspirations.com Twitter: @_StaciaV_ Stacia (Misner) Varga Over 30 years of IT experience, 17 years of BI
More informationPHARMACOKINETIC STATISTICAL ANALYSIS SYSTEM - - A SAS/AF AND SAS/FSP APPLICATION
PHARMACOKINETIC STATISTICAL ANALYSIS SYSTEM - - A SAS/AF AND SAS/FSP APPLICATION Sharon M. Passe, Hoffmann-La Roche Inc. Andrea L Contino, Hoffmann-La Roche Inc. ABSTRACT The statistician responsible for
More informationIntroduction to Multivariate Image Analysis (MIA) Table of Contents
Introduction to Multivariate Image Analysis (MIA) Copyright 1996-2013 Eigenvector Research, Inc. No part of this material may be photocopied or reproduced in any form without prior written consent from
More informationLearn Use Linux Made Easy Home Office On The Go
We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with learn use linux made
More informationINF160 IS Development Environments
INF160 IS Development Environments AUBG, COS dept Reference books: Baltzan Paige, Business Driven Information Systems, McGraw- Hill/Irwin, 3e, 2012. Doar Matthew B., Practical Development Environments,
More informationComputational statistics Jamie Griffin. Semester B 2018 Lecture 1
Computational statistics Jamie Griffin Semester B 2018 Lecture 1 Course overview This course is not: Statistical computing Programming This course is: Computational statistics Statistical methods that
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Look for accompanying R code on the course web site. Topics Exploratory Data Analysis
More informationDiagnostics and Exploratory Analysis Infrastructure for ACME Workflow
Diagnostics and Exploratory Analysis Infrastructure for ACME Workflow ORNL: Brian Smith, John Harney, Brian Jewell LLNL: Jeffrey Painter, James McEnerney, ORNL is managed by UT-Battelle for the US Department
More informationContinuous Integration / Continuous Testing
Bitte decken Sie die schraffierte Fläche mit einem Bild ab. Please cover the shaded area with a picture. (24,4 x 7,6 cm) Continuous Integration / Continuous Testing IIC What s SW Integration? Integration
More informationPart I. An Introduction to R
Part I An Introduction to R 1 Chapter 1 Getting Started R is a programming language and comprehensive statistical platform for data exploration and analysis. It is free and open source, which means anyone
More informationLecture Topic Projects
Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis techniques for non-spatial data Project #1 out 4 Data
More informationModule 1: Introduction RStudio
Module 1: Introduction RStudio Contents Page(s) Installing R and RStudio Software for Social Network Analysis 1-2 Introduction to R Language/ Syntax 3 Welcome to RStudio 4-14 A. The 4 Panes 5 B. Calculator
More informationPackage PRISMA. May 27, 2018
Type Package Package PRISMA May 27, 2018 Title Protocol Inspection and State Machine Analysis Version 0.2-7 Date 2018-05-26 Depends R (>= 2.10), Matrix, gplots, methods, ggplot2 Suggests tm (>= 0.6) Author
More informationIntroduction to R. base -> R win32.exe (this will change depending on the latest version)
Dr Raffaella Calabrese, Essex Business School 1. GETTING STARTED Introduction to R R is a powerful environment for statistical computing which runs on several platforms. R is available free of charge.
More informationLecture 25: Review I
Lecture 25: Review I Reading: Up to chapter 5 in ISLR. STATS 202: Data mining and analysis Jonathan Taylor 1 / 18 Unsupervised learning In unsupervised learning, all the variables are on equal standing,
More informationSQL Server 2017: Data Science with Python or R?
SQL Server 2017: Data Science with Python or R? Dejan Sarka Sponsor Introduction Dejan Sarka (dsarka@solidq.com, dsarka@siol.net, @DejanSarka) 30 years of experience SQL Server MVP, MCT, 16 books 20+ courses,
More informationPre-processing in vibrational spectroscopy, a when, why and how
Analytical Methods Pre-processing in vibrational spectroscopy, a when, why and how Journal: Analytical Methods Manuscript ID: AY-TRV--0-00.R Article Type: Tutorial Review Date Submitted by the Author:
More informationRaman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set
Raman Spectra of Chondrocytes in Cartilage: hyperspec s chondro data set Claudia Beleites DIA Raman Spectroscopy Group, University of Trieste/Italy (2 28) Spectroscopy Imaging,
More informationPython ecosystem for scientific computing with ABINIT: challenges and opportunities. M. Giantomassi and the AbiPy group
Python ecosystem for scientific computing with ABINIT: challenges and opportunities M. Giantomassi and the AbiPy group Frejus, May 9, 2017 Python package for: generating input files automatically post-processing
More informationWorkshop MedSciNet - Building an Electronic Data Capture System for a Clinical Trial
Note: for non-commercial purposes only Workshop MedSciNet - Building an Electronic Data Capture System for a Clinical Trial Presenters: Prof Marius Kublickas Laima Juodvirsiene Prof Magnus Westgren Friday,
More informationNMRProcFlow Macro-command Reference Guide
NMRProcFlow Macro-command Reference Guide This document is the reference guide of the macro-commands Daniel Jacob UMR 1332 BFP, Metabolomics Facility CGFB Bordeaux, MetaboHUB - 2018 1 NMRProcFlow - Macro-command
More informationPharmaSUG China Big Insights in Small Data with RStudio Shiny Mina Chen, Roche Product Development in Asia Pacific, Shanghai, China
PharmaSUG China 2016-74 Big Insights in Small Data with RStudio Shiny Mina Chen, Roche Product Development in Asia Pacific, Shanghai, China ABSTRACT Accelerating analysis and faster data interpretation
More informationPackage covr. October 18, 2018
Encoding UTF-8 Title Test Coverage for Packages Version 3.2.1 Package covr October 18, 2018 Track and report code coverage for your package and (optionally) upload the results to a coverage service like
More informationCSE 158. Web Mining and Recommender Systems. Midterm recap
CSE 158 Web Mining and Recommender Systems Midterm recap Midterm on Wednesday! 5:10 pm 6:10 pm Closed book but I ll provide a similar level of basic info as in the last page of previous midterms CSE 158
More informationChemical Shift Perturbation Tutorial
CcpNmr Analysis Version 3 Chemical Shift Perturbation Tutorial Written by: Luca Mureddu CCPN application developer, PhD Student MCB, University of Leicester, Lancaster Road, Leicester, LE1 9HN, UK 1 t:
More information