MATLAB COMPUTATIONAL FINANCE CONFERENCE Quantitative Sports Analytics using MATLAB

1 MATLAB COMPUTATIONAL FINANCE CONFERENCE 2017 Quantitative Sports Analytics using MATLAB Robert Kissell, PhD September 28, 2017

2 Important Emails and Web Addresses. AlgoSports23/MATLAB Competition: Are you smarter than the Algo? Website: AlgoSports23.com. Please check the website for data updates, and contact us for further information.

4 Quantitative Sports Modeling. Modeling Techniques from: Optimal Sports Math, Statistics, and Fantasy. Presentation Outline: Probability Models; Rank Sports Teams; Estimate Winning Probability; Calculate Winning Margin; Compute the Probability of Beating a Spread; AlgoSports23/MATLAB Competition. Are you smarter than the Algo?

5 Transaction Cost Analysis and Algorithmic Trading. A suite of TCA models and optimizers has been fully integrated into MATLAB's Trading Toolbox. These tools are being used for algorithmic trading and portfolio management. They include: Market Impact Estimation; Pre-Trade; Post-Trade; Trade Schedule Optimization; Liquidation Cost Analysis; Portfolio Optimization with TCA. Various libraries are available: access to a full suite of TCA libraries and MI data is available upon request. Contact: info@kissellresearch.com or Robert.Kissell@KissellResearch.com

6 Optimal Sports Math, Statistics, and Fantasy. Key items addressed include: accurately rank sports teams; compute winning probability; demystify the black-box world of computer models; provide insight into the BCS and RPI selection process; select the optimal mix of players for a fantasy league competition; evaluate player skill and forecast future player performance; select team rosters; assist in salary negotiation; determine Hall of Fame eligibility. Sabermetrics on Steroids!

7 What is Quantitative Finance? Quantitative Finance is the application of methods and analyses from the different sciences to solve financial problems. These include: Math, Statistics, Physics, Engineering, Economics, Computer Science, Biology, Psychology, Business, etc. Quantitative Finance is all about proper utilization of the Scientific Method and drawing statistically significant conclusions.

9 Scientist or Engineer. A Scientist is someone who loves surprises. This is an opportunity to learn and make further advancements. The goal is to learn, improve, and progress. An Engineer is someone who hates surprises. Surprises are usually an indication that something has failed or gone wrong, and they often result in a loss or a slowing of progress.

10 What about a Quant? A Quant is someone who learns from a proper application of the scientific method by finding Scientific surprises and profit opportunities. Quants go to great lengths to learn the cause of these surprises and to ensure that these relationships are statistically significant. Quants then seek to implement these scientific surprises without suffering any Engineering surprises and losses.

13 The Scientific Method in Practice. Scientist: Data, Data, Data; Statistically Significant Conclusion. Attorney: Data, Data, Data; Desired Outcome; Find Supporting Data; Data Mining. Doctor: Data? Data? Data?; Educated Guess; Test Data; Worst-Case Scenario?

15 Moral of the Story: Be a Scientist! Don't be that Anti-Scientist!

16 Quantitative Sports Modeling

17 What is Quantitative Sports Modeling? The application of quantitative tools and analytics, and sound scientific methods, to sports-related problems and questions. Quantitative sports modeling uses the same tools as quantitative finance and draws on mathematics, statistics, engineering, machine learning, economics, business, etc. Sports Modeling is based on the same framework as Quantitative Finance, but solves a different set of problems.

18 What do we want to solve? Expected Winning Team; Probability of Winning; Expected Winning Margin; Probability of Beating a Specified Margin; Future Player Performance; Roster of Players (Best Set of Complementary Players); Best Mix of Players given Opponent; Salaries & Salary Negotiation.

19 Sports Modeling Data: What we want to Predict (LHS). Win/Loss; Win Margin; Probability of winning by more than X points; Player Statistics (Fantasy Sports); Evaluating Player Ability; Roster Selection; Salary and Salary Negotiations; Line-ups and Match-ups; Player Trades; Hall of Fame Selection.

20 Sports Modeling Data: Explanatory Factors Data (RHS). Win/Loss Result; Game Scores; Game Data; Team Statistics (AVG, OBP, ERA, HR, Comp. Ratio); Venue Location (Home Field Advantage); Momentum; Players, Injuries; Career Statistics; Salary; Age; Teammates & Roster; Principal Component Analysis.

21 Different Sports Prediction Models. Probability Models; Non-Linear Regression; Non-Parametric Statistics; Neural Networks / Machine Learning. Sabermetrics on Steroids!

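22 Head-to-Head Competitions: How do we Rank Teams? [Diagram: head-to-head results among teams A, B, C, D, E, F.] Ranking: A; B & C; D & E; F.

23 Head-to-Head Competitions: How do we Rank Teams? [Diagram: head-to-head results among teams A, B, C.] Ranking: A, B, C.

24 Head-to-Head Competitions: How do we Rank Teams? [Diagram: head-to-head results among teams A, B, C, D, E, F, G.] Ranking: A & G; B & C; D & E; F. Ranking: A; B & C & G; D & E; F.

25 Head-to-Head Competitions: How do we Rank Teams? [Diagram: head-to-head results among teams A, B, C, D, E, F, H.] Ranking: A; B & C; D & E; F & H. Ranking: A; B & C; D & E & H; F.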

26 Sports Models To Discuss Today

27 Probability Models: Probability (X > Y). Power Function: $\text{Prob}(X > Y) = \dfrac{\lambda_x}{\lambda_x + \lambda_y}$. Logit Regression: $b_0 + b_h - b_a = \ln\!\left(\dfrac{F^{-1}(z)}{1 - F^{-1}(z)}\right)$. In probability models, the LHS variable lies in (0,1)!

28 Power Function

29 Power Function. The power function is derived from the exponential distribution. Let $f_x \sim \lambda_x e^{-\lambda_x t}$ and $f_y \sim \lambda_y e^{-\lambda_y t}$. Then $\text{Prob}(x > y) = \dfrac{\lambda_x}{\lambda_x + \lambda_y}$, where $\lambda_k$ = Team $k$ Rating.
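
The slide does not show the intermediate step. One reading under which the ratio follows, assuming the two exponential variables $T_x$ and $T_y$ are independent and that "x beats y" means the event for x arrives first (both are assumptions, not statements from the talk):

```latex
\begin{aligned}
\Pr(T_x < T_y)
  &= \int_0^\infty \lambda_x e^{-\lambda_x t}\,\Pr(T_y > t)\,dt
   = \int_0^\infty \lambda_x e^{-\lambda_x t}\, e^{-\lambda_y t}\,dt \\
  &= \lambda_x \int_0^\infty e^{-(\lambda_x + \lambda_y)\,t}\,dt
   = \frac{\lambda_x}{\lambda_x + \lambda_y}.
\end{aligned}
```

Under this reading a larger rating $\lambda_k$ means a higher rate, so the higher-rated team is more likely to "arrive first" and win.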

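30 Power Function with Home Field Advantage. Let X be the Home Team: $\text{Prob}(X > Y) = \dfrac{\lambda_x + \lambda_0}{\lambda_x + \lambda_y + \lambda_0}$. Let Y be the Away Team: $\text{Prob}(Y > X) = \dfrac{\lambda_y}{\lambda_x + \lambda_y + \lambda_0}$. Here $\lambda_k$ = Team $k$ Rating and $\lambda_0$ = Home Field Advantage (HFA) rating.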
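
As a quick numerical illustration of the two formulas, the sketch below plugs in made-up ratings. The variable names and all three values are hypothetical and are not taken from the talk.

```matlab
% Hypothetical ratings chosen only for illustration (not values from the talk)
lambdaHome = 0.06;   % rating of home team X (assumed)
lambdaAway = 0.04;   % rating of away team Y (assumed)
lambda0    = 0.01;   % home-field advantage rating (assumed)

% Power function with home-field advantage
total    = lambdaHome + lambdaAway + lambda0;
probHome = (lambdaHome + lambda0) / total;   % Prob(X > Y)
probAway = lambdaAway / total;               % Prob(Y > X)

fprintf('Home win: %.4f, away win: %.4f, sum: %.4f\n', ...
        probHome, probAway, probHome + probAway);
```

Because the two numerators add up to the shared denominator, the home and away probabilities always sum to one.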

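31 Power Function: Solving Parameters. Function: $G_i = \dfrac{\lambda_x + \lambda_0}{\lambda_x + \lambda_y + \lambda_0}$ if the home team wins game $i$, and $G_i = \dfrac{\lambda_y}{\lambda_x + \lambda_y + \lambda_0}$ if the away team wins game $i$. Maximize $L = \prod_i G_i$, or equivalently $\log L = \sum_i \log G_i$. Solve using Maximum Likelihood Estimates (MLE).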
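
A minimal sketch of how the likelihood terms might be assembled for a handful of games, using one numerator row and one denominator row per game. The schedule, the team indices, and the trial ratings are all made up for illustration.

```matlab
% Toy schedule (made up): columns are [homeTeam awayTeam homeWon]
games = [1 2 1;
         2 3 0;
         3 1 1;
         1 3 1];
nTeams = 3;
nGames = size(games, 1);

% Parameter vector b = [lambda_1; lambda_2; lambda_3; lambda_0],
% with the home-field advantage lambda_0 stored last.
num   = zeros(nGames, nTeams + 1);
denom = zeros(nGames, nTeams + 1);
for i = 1:nGames
    h = games(i, 1);  a = games(i, 2);
    denom(i, [h a nTeams + 1]) = 1;      % lambda_h + lambda_a + lambda_0
    if games(i, 3) == 1
        num(i, [h nTeams + 1]) = 1;      % home win: lambda_h + lambda_0
    else
        num(i, a) = 1;                   % away win: lambda_a
    end
end

% Negative log-likelihood, -sum(log(G_i)), as a function of b
negLogL = @(b) -sum(log((num * b) ./ (denom * b)));

b = [1; 1; 1; 0.5];                      % arbitrary trial ratings
G = (num * b) ./ (denom * b);            % per-game likelihood terms
disp(G');
disp(negLogL(b));
```

Minimizing the negative log-likelihood is equivalent to maximizing $\log L$, which is the form most MATLAB optimizers expect.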

32 Power Function: Estimate Spread. Run a second regression: $\text{Spread} = d_0 + d_1 \cdot \text{Probability}$. Results: $d_0$, $d_1$, sey (the standard error of the regression).

33 MATLAB: Solving Power Function Parameters

    % Power Function Model
    % num   = matrix of winning team and location (HFA if at home)
    % denom = matrix of all teams including HFA
    % b0 = starting ratings, lb/ub = lower/upper bounds on the ratings
    [b,fval,exitflag,output] = fmincon(@(b) mypower(b,num,denom), ...
        b0,[],[],[],[],lb,ub,[],options);

    % Negative log-likelihood of the power function model
    function f = mypower(b,num,denom)
        z = (num*b)./(denom*b);   % likelihood term G for each game
        f = -sum(log(z));         % minimize -log L to maximize L
    end

34 Steps to Solve Power Function
1) Set up the objective function and estimate team ratings using MLE.
2) Compute winning probabilities using the power function formula.
3) Run a regression of home team win margin (Spread) as a function of predicted home team winning probability (Prob): $\text{Spread} = d_0 + d_1 \cdot \text{Prob}$
This provides: 1) the probability that the home team wins the game; 2) the expected home team win margin; 3) a ranking of teams based on the model parameter (from highest to lowest).
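
The steps above can be sketched end to end on a toy data set. This is an illustration under stated assumptions, not the presenter's implementation: the game results are made up, `fminsearch` is used in place of `fmincon` so that only base MATLAB is needed, and the first team's rating is fixed at 1 because the power function is unchanged when every rating is multiplied by the same constant.

```matlab
% Toy results (made up): columns are [homeTeam awayTeam homeMargin]
games = [1 2  7;  2 1  3;  1 3 10;  3 1 -4;
         1 4 14;  4 1 -7;  2 3  3;  3 2  6;
         2 4 10;  4 2 -3;  3 4  7;  4 3  3];
nTeams = 4;
nGames = size(games, 1);
hfa    = nTeams + 1;                       % column index of lambda_0
home = games(:, 1);  away = games(:, 2);  margin = games(:, 3);

% Step 1: objective function (num/denom layout as in the slide's code)
num   = zeros(nGames, hfa);
denom = zeros(nGames, hfa);
for i = 1:nGames
    denom(i, [home(i) away(i) hfa]) = 1;
    if margin(i) > 0
        num(i, [home(i) hfa]) = 1;         % home team won
    else
        num(i, away(i)) = 1;               % away team won
    end
end
% Assumption: lambda_1 is fixed at 1 and the rest are exp(theta), which
% keeps ratings positive and removes the arbitrary overall scale.
toB     = @(theta) [1; exp(theta(:))];
negLogL = @(theta) -sum(log((num * toB(theta)) ./ (denom * toB(theta))));

% Maximum likelihood estimates (fminsearch swapped in for fmincon)
theta0 = zeros(nTeams, 1);
[thetaHat, fval] = fminsearch(negLogL, theta0);
b = toB(thetaHat);

% Step 2: predicted home-team win probability for every game
prob = (b(home) + b(hfa)) ./ (b(home) + b(away) + b(hfa));

% Step 3: regress home margin on predicted probability
X     = [ones(nGames, 1) prob];
d     = X \ margin;                        % d(1) = d_0, d(2) = d_1
resid = margin - X * d;
sey   = sqrt(sum(resid .^ 2) / (nGames - 2));

% Rank teams by rating, highest first
[~, ranking] = sort(b(1:nTeams), 'descend');
disp(ranking');
```

With real data, the bounds `lb` and `ub` passed to `fmincon` on the earlier slide would play the role that the `exp` reparameterization plays here.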

35 Logit Regression

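36 Logit Regression Model. Start with the logistic distribution function: $\dfrac{1}{1 + \exp\{-(b_0 + b_h - b_a)\}} = z_1$, where $s$ = Home Pts − Away Pts = Home Team Spread, in $(-\infty, +\infty)$; $z = \dfrac{s - \text{avg}(s)}{\text{stdev}(s)}$, in $(-\infty, +\infty)$; $z_1 = F^{-1}(z) = \text{normcdf}(z)$, in (0,1).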

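37 Logit Regression Model. We transform the logistic function into the logit regression: $b_0 + b_h - b_a = \ln\!\left(\dfrac{z_1}{1 - z_1}\right)$, where $s$ = Home Team Spread, in $(-\infty, +\infty)$; $z = \dfrac{s - \text{avg}(s)}{\text{stdev}(s)}$, in $(-\infty, +\infty)$; $z_1 = F^{-1}(z) = \text{normcdf}(z)$, in (0,1).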

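38 Steps to Solve Logit Spread Regression (Part 1)
1) Calculate the LHS spread value: $s$ = Home Team Spread, in $(-\infty, +\infty)$; $z = \dfrac{s - \text{avg}(s)}{\text{stdev}(s)}$, in $(-\infty, +\infty)$; $z_1 = F^{-1}(z) = \text{normcdf}(z)$, in (0,1).
2) Solve the parameters from OLS: $b_0 + b_h - b_a = \ln\!\left(\dfrac{z_1}{1 - z_1}\right)$.
3) Estimate the home team win margin: $z_1 = \dfrac{1}{1 + \exp\{-(b_0 + b_h - b_a)\}}$; $z = \text{norminv}(z_1)$; $\hat{s} = z \cdot \text{stdev}(s) + \text{avg}(s)$, where $\hat{s}$ is the estimated spread.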

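39 Steps to Solve Logit Spread Regression (Part 2)
1) Run a second regression: Actual Spread $= d_0 + d_1 \cdot$ Estimated Spread, that is, $Y = d_0 + d_1 \hat{s}$, where $\hat{s}$ is the estimated spread. Results: $d_0$, $d_1$, sey.
2) Compute the home team win probability: $\text{Prob}(\text{Spread} > 0) = \text{Prob}(Y > 0)$, with $Y \sim N(\hat{s}, \text{sey})$.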
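
Parts 1 and 2 can be sketched together on a toy data set. Several choices here are assumptions rather than details from the talk: the game results are made up, `erfc` and `erfcinv` stand in for `normcdf` and `norminv` so that no toolbox is needed, the last team's rating is fixed at 0 to avoid collinear columns, and the win probability is centred on the fitted value $d_0 + d_1 \hat{s}$, which is one reading of the slide's $Y \sim N(\hat{s}, \text{sey})$.

```matlab
% Toy results (made up): columns are [homeTeam awayTeam homeMargin]
games = [1 2  7;  2 1  3;  1 3 10;  3 1 -4;
         1 4 14;  4 1 -7;  2 3  3;  3 2  6;
         2 4 10;  4 2 -3;  3 4  7;  4 3  3];
nTeams = 4;
nGames = size(games, 1);
home = games(:, 1);  away = games(:, 2);  s = games(:, 3);

% Base-MATLAB stand-ins for normcdf / norminv (standard normal)
ncdf = @(z) 0.5 * erfc(-z ./ sqrt(2));
ninv = @(p) -sqrt(2) * erfcinv(2 * p);

% Part 1: build the LHS and fit the team ratings by OLS
mu = mean(s);  sd = std(s);
z  = (s - mu) ./ sd;
z1 = ncdf(z);
y  = log(z1 ./ (1 - z1));

% Design matrix: intercept, +1 for home team, -1 for away team.
% Assumption: the last team is dropped (rating fixed at 0) so the
% columns are not collinear.
T = zeros(nGames, nTeams);
T(sub2ind(size(T), (1:nGames)', home)) =  1;
T(sub2ind(size(T), (1:nGames)', away)) = -1;
X    = [ones(nGames, 1) T(:, 1:nTeams - 1)];
beta = X \ y;                              % [b_0; b_1; ...; b_(nTeams-1)]

% Map the fitted logit back to an estimated spread
z1Hat = 1 ./ (1 + exp(-(X * beta)));
sHat  = ninv(z1Hat) .* sd + mu;

% Part 2: second regression and home-team win probability
D     = [ones(nGames, 1) sHat];
d     = D \ s;                             % d(1) = d_0, d(2) = d_1
yFit  = D * d;
sey   = sqrt(sum((s - yFit) .^ 2) / (nGames - 2));
pHome = 1 - ncdf((0 - yFit) ./ sey);       % Prob(Y > 0)
disp([sHat pHome]);
```

Note that the probability of a home win is the upper tail of the normal distribution, so it is one minus the cumulative probability at zero.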

40 MATLAB: Logit Regression

    % Logit Regression
    % s = home team win margin
    %   s > 0: home team won game by s
    %   s < 0: home team lost game by |s|
    % z = zscore(s), mu = mean(s), stdev = std(s)
    % Finv = normcdf(z)
    % y = log(Finv./(1-Finv))
    % x = matrix of games, home team = +1, away team = -1
    whichstats = {'beta','tstat','r','yhat','mse','rsquare'};
    mystats = regstats(y,x,'linear',whichstats);
    beta = mystats.tstat.beta;        % coefficient estimates, intercept first
    beta = [beta(2:end); beta(1)];    % move the intercept to the end
    TeamRating = beta;

41 NFL

42 NFL Data: Only Three Weeks of Games (47 Games)

45 Power Function: Estimating Spreads. $\text{prob} = \dfrac{\lambda_x + \lambda_0}{\lambda_x + \lambda_y + \lambda_0}$; $\text{spread} = d_0 + d_1 \cdot \text{prob}$.

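46 NFL - Power Function. Estimating Home Team Win Probability: $\text{prob} = \dfrac{\lambda_x + \lambda_0}{\lambda_x + \lambda_y + \lambda_0}$. Estimating Home Team Spread: $s = d_0 + d_1 \cdot \text{prob}$ = [value missing] + [value missing] $\cdot$ prob.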

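47 Example: Power Function. New England (Home) vs. Carolina (Away). New England = [rating missing], Carolina = [rating missing], HFA = 0.01. prob = [expression missing] = 85%. Estimating Home Team Spread: s = [expression missing] = [value missing] (need to adjust).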

48 Logit Regression: Estimating Spreads. Est. Spread $= b_0 + b_h - b_a$; Act. Spread $= d_0 + d_1 \cdot$ Est. Spread.

49 NFL Logit Regression. Estimating Home Team Win Probability: $\ln\!\left(\dfrac{z_1}{1 - z_1}\right) = b_0 + b_h - b_a$. Estimating Home Team Spread: $Y$ (Actual Spread) $= d_0 + d_1 \cdot$ Estimated Spread $\hat{s}$. Results: $d_0$, $d_1$, sey. $\text{Prob}(Y > 0) = 1 - \text{normcdf}(0, \hat{s}, \text{sey})$.

50 NFL Data: Only Three Weeks of Games

51 Example: Power Function. New England (Home) vs. Carolina (Away). New England = [rating missing], Carolina = [rating missing], HFA = [value missing]. Estimating Home Team Spread: s = [expression garbled] = +6.7. Estimating Home Team Win Probability: p = f(6.7) = 74%.

52 NFL - Predictions

53 NCAA College Football

54 College Football: Only Four Weeks of Games (286 Games). Games with Div 1-FBS Teams Only.

55 NCAA Football: Only Four Weeks of Games

56 NCAA Football - FBS: Model Results

57 NCAA Football - FBS: Algorithmic Rankings (after 4 weeks)

58 NCAA Football - FBS: Week 5 Predictions (Part 1)

59 NCAA Football - FBS: Week 5 Predictions (Part 2)

60 AlgoSports23/MATLAB Competition

62 AlgoSports23 / MATLAB Competition. Are you Smarter than the Algo? Can you Beat the Algo?

63 AlgoSports23 / MATLAB Competition. Two Important Emails: Robert.Kissell@KissellResearch.com and AlgoSports23@gmail.com

64 AlgoSports23 / MATLAB Competition. Rules of the Competition: all analysis and programming in MATLAB; game results data will be posted weekly; a game prediction file will be posted weekly; return model predictions by the specified date. The top 23 performing algorithms each week will be included in the AlgoSports23 Computer Rankings and Predictions. National Media Attention! Are you smarter than the Algo?

65 AlgoSports23 / MATLAB Competition. Your program and submission need to include the following: 1) Ranking of Teams; 2) Prediction of Home Team Winning Margin for all games in a week. Models are measured based on: 1) RMSE; 2) Avg Difference; 3) Number of Wins.
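
The three measures could be computed along the following lines. The slide does not define them precisely, so this sketch assumes that "Avg Difference" means the mean absolute difference between predicted and actual home margins and that "Number of Wins" counts games where the predicted winner matches the actual winner. The margins are made up.

```matlab
% Made-up example: actual and predicted home-team winning margins
actual    = [ 7; -3; 10;  4; -6];
predicted = [ 3;  2;  6; -1; -4];

err     = predicted - actual;
rmse    = sqrt(mean(err .^ 2));                    % root mean squared error
avgDiff = mean(abs(err));                          % assumed: mean absolute difference
nWins   = sum(sign(predicted) == sign(actual));    % assumed: winners called correctly

fprintf('RMSE: %.4f, Avg Difference: %.4f, Wins: %d of %d\n', ...
        rmse, avgDiff, nWins, numel(actual));
```

RMSE penalizes large misses more heavily than the average difference does, which is why a model can rank differently on the two measures.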

66 AlgoSports23 / MATLAB Competition. The top 23 performing algorithms each week will be included in the AlgoSports23 Computer Rankings and Predictions! National Media Attention! Bragging Rights!


Knowledge Discovery and Data Mining. Neural Nets. A simple NN as a Mathematical Formula. Notes. Lecture 13 - Neural Nets. Tom Kelsey. Knowledge Discovery and Data Mining Lecture 13 - Neural Nets Tom Kelsey School of Computer Science University of St Andrews http://tom.home.cs.st-andrews.ac.uk twk@st-andrews.ac.uk Tom Kelsey ID5059-13-NN

More information

Panel Data 4: Fixed Effects vs Random Effects Models

Panel Data 4: Fixed Effects vs Random Effects Models Panel Data 4: Fixed Effects vs Random Effects Models Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 4, 2017 These notes borrow very heavily, sometimes verbatim,

More information

Sport performance analysis Project Report

Sport performance analysis Project Report Sport performance analysis Project Report Name: Branko Chomic Date: 14/04/2016 Table of Contents Introduction GUI Problem encountered Project features What have I learned? What was not achieved? Recommendations

More information

Knowledge Discovery and Data Mining

Knowledge Discovery and Data Mining Knowledge Discovery and Data Mining Lecture 13 - Neural Nets Tom Kelsey School of Computer Science University of St Andrews http://tom.home.cs.st-andrews.ac.uk twk@st-andrews.ac.uk Tom Kelsey ID5059-13-NN

More information

CATALOG 2018/2019 BINUS UNIVERSITY. Computer Science. Introduction. Vision. Mission

CATALOG 2018/2019 BINUS UNIVERSITY. Computer Science. Introduction. Vision. Mission Computer Science Introduction In the new millennium and the global era, the role of information and automation in the various domains and activities of the business industry are becoming more important.

More information

Agenda. Why choose our specification The GCSE reforms Changes to all GCSEs in Business Our new GCSE (9-1) qualification

Agenda. Why choose our specification The GCSE reforms Changes to all GCSEs in Business Our new GCSE (9-1) qualification Launch event Agenda Why choose our specification The GCSE reforms Changes to all GCSEs in Business Our new GCSE (9-1) qualification Overview Content Assessment Support Contact details and next steps 2

More information

Step-by-step data transformation

Step-by-step data transformation Step-by-step data transformation Explanation of what BI4Dynamics does in a process of delivering business intelligence Contents 1. Introduction... 3 Before we start... 3 1 st. STEP: CREATING A STAGING

More information

Portland Timbers ODP (Pre-Academy) Invite and Information 2001/02 and 2003/04 Age Groups Tryouts: August 12-13/Salem, Oregon

Portland Timbers ODP (Pre-Academy) Invite and Information 2001/02 and 2003/04 Age Groups Tryouts: August 12-13/Salem, Oregon Portland Timbers ODP (Pre-Academy) Invite and Information 2001/02 and 2003/04 Age Groups Tryouts: August 12-13/Salem, Oregon What is Portland Timbers Pre-Academy/Olympic Development Program (ODP)? All

More information

DakStats Web-Sync Setup Guide 1 of 7

DakStats Web-Sync Setup Guide 1 of 7 DakStats Web-Sync Setup Guide 1 of 7 This guide will help you set up your DakStats software to report scores and statistics using the Web-Sync service. If you have any questions about registering or setting

More information

Business Analytics. Admission and Degree Requirements. Courses. Business Analytics 1

Business Analytics. Admission and Degree Requirements. Courses. Business Analytics 1 Business Analytics 1 Business Analytics Mailing Address: UIC Liautaud Graduate School of Business University Hall, 11th Floor (MC 077) 601 South Morgan Street Chicago, IL 60607 Contact Information: Campus

More information

Section 2.1: Intro to Simple Linear Regression & Least Squares

Section 2.1: Intro to Simple Linear Regression & Least Squares Section 2.1: Intro to Simple Linear Regression & Least Squares Jared S. Murray The University of Texas at Austin McCombs School of Business Suggested reading: OpenIntro Statistics, Chapter 7.1, 7.2 1 Regression:

More information

Calculating Call Blocking and Utilization for Communication Satellites that Use Dynamic Resource Allocation

Calculating Call Blocking and Utilization for Communication Satellites that Use Dynamic Resource Allocation Calculating Call Blocking and Utilization for Communication Satellites that Use Dynamic Resource Allocation Leah Rosenbaum Mohit Agrawal Leah Birch Yacoub Kureh Nam Lee UCLA Institute for Pure and Applied

More information

Why Use Graphs? Test Grade. Time Sleeping (Hrs) Time Sleeping (Hrs) Test Grade

Why Use Graphs? Test Grade. Time Sleeping (Hrs) Time Sleeping (Hrs) Test Grade Analyzing Graphs Why Use Graphs? It has once been said that a picture is worth a thousand words. This is very true in science. In science we deal with numbers, some times a great many numbers. These numbers,

More information

The Finer Things In Alteryx Ken Black 10/2/17

The Finer Things In Alteryx Ken Black 10/2/17 The Finer Things In Alteryx Ken Black 10/2/17 Topic 4: Regex and Date Operations (Multiple weekly examples) From Week 4 of the Weekly challenges.*(\d\d-[[:alpha:]][[:alpha:]][[:alpha:]]-\d+).*.*(\u\l\l\s\d+,*\s\d\d+).*.*(\d+-\u\l\l+-\d\d+).*.*(\d-

More information

A Brief Introduction to Data Mining

A Brief Introduction to Data Mining A Brief Introduction to Data Mining L. Torgo ltorgo@dcc.fc.up.pt Departamento de Ciência de Computadores Faculdade de Ciências / Universidade do Porto Feb, 2017 What is Data Mining? Introduction A possible

More information

1 of 5 1/28/2015 12:27 PM BDA Program Program Mission/Purpose The mission of the Bachelor of Science in Business Data Analytics (BDA) program is to prepare students to understand the foundation of business

More information

Multicollinearity and Validation CIVL 7012/8012

Multicollinearity and Validation CIVL 7012/8012 Multicollinearity and Validation CIVL 7012/8012 2 In Today s Class Recap Multicollinearity Model Validation MULTICOLLINEARITY 1. Perfect Multicollinearity 2. Consequences of Perfect Multicollinearity 3.

More information

Data Science Tutorial

Data Science Tutorial Eliezer Kanal Technical Manager, CERT Daniel DeCapria Data Scientist, ETC Software Engineering Institute Carnegie Mellon University Pittsburgh, PA 15213 2017 SEI SEI Data Science in in Cybersecurity Symposium

More information

CONTENT OUTLINE AND FUNCTIONAL SPECIFICATION

CONTENT OUTLINE AND FUNCTIONAL SPECIFICATION 1 Hunter Davis CONTENT OUTLINE AND FUNCTIONAL SPECIFICATION 1. Executive Summary This document contains the content outline combined with the functional specification of the redesigned website for the

More information

GCS Training Extension of your team Solutions Quality teaching Understanding your needs Funding Range of options

GCS Training Extension of your team Solutions Quality teaching Understanding your needs Funding Range of options At GCS Training, we care about your business and want to create a positive IMPACT! An impact on your productivity, on your staff, on the quality of your product or service and, ultimately, an impact on

More information

Statistical Analysis of List Experiments

Statistical Analysis of List Experiments Statistical Analysis of List Experiments Kosuke Imai Princeton University Joint work with Graeme Blair October 29, 2010 Blair and Imai (Princeton) List Experiments NJIT (Mathematics) 1 / 26 Motivation

More information

Week 10: Heteroskedasticity II

Week 10: Heteroskedasticity II Week 10: Heteroskedasticity II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Dealing with heteroskedasticy

More information

Math 340 Fall 2014, Victor Matveev. Binary system, round-off errors, loss of significance, and double precision accuracy.

Math 340 Fall 2014, Victor Matveev. Binary system, round-off errors, loss of significance, and double precision accuracy. Math 340 Fall 2014, Victor Matveev Binary system, round-off errors, loss of significance, and double precision accuracy. 1. Bits and the binary number system A bit is one digit in a binary representation

More information

SIDStats Volleyball User Documentation

SIDStats Volleyball User Documentation Volleyball Statistics Collected INDIVIDUAL STATS: Games Played, Total Attack Attempts, Kills, Kill Errors, Total Serves, Serve Aces, Serve Errors, Assists, Blocks, Digs, Aced, Bad Pass, Good Pass and Perfect

More information

Forward-Looking and Cautionary Statements

Forward-Looking and Cautionary Statements Peter F. Volanakis President and Chief Operating Officer February 8, 28 Forward-Looking and Cautionary Statements Certain statements in this presentation constitute forwardlooking statements within the

More information

Excel and Tableau. A Beautiful Partnership. Faye Satta, Senior Technical Writer Eriel Ross, Technical Writer

Excel and Tableau. A Beautiful Partnership. Faye Satta, Senior Technical Writer Eriel Ross, Technical Writer Excel and Tableau A Beautiful Partnership Faye Satta, Senior Technical Writer Eriel Ross, Technical Writer Microsoft Excel is used by millions of people to track and sort data, and to perform various financial,

More information

R Language for the SQL Server DBA

R Language for the SQL Server DBA R Language for the SQL Server DBA Beginning with R Ing. Eduardo Castro, PhD, Principal Data Analyst Architect, LP Consulting Moderated By: Jose Rolando Guay Paz Thank You microsoft.com idera.com attunity.com

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

Pearson BTEC Level 5 Higher National Diploma in Engineering (Electrical and Electronic Engineering)

Pearson BTEC Level 5 Higher National Diploma in Engineering (Electrical and Electronic Engineering) Pearson BTEC Programme Pearson BTEC Level 5 Higher National Diploma in Engineering (Electrical and Electronic Engineering) Code: BHNDE5 Guided Learning Hours: 960 Hours Programme Structure: The Higher

More information

DE LA SALLE SCHOOL LEARNING PROGRAMME. YEAR 7 PI Pathway. Half Term 1a

DE LA SALLE SCHOOL LEARNING PROGRAMME. YEAR 7 PI Pathway. Half Term 1a PI Pathway Half Term 1a Weeks 1 and 2: Pictograms Bar charts Bar line charts How can graphs be used to mislead readers in the media? frequency tally chart modal class consistency grouped data Grouped bar

More information

Graded Unit Title: Mechanical Engineering: Graded Unit 1

Graded Unit Title: Mechanical Engineering: Graded Unit 1 Higher National Graded Unit specification General information for centres This Graded Unit has been validated as part of the HNC and HND Mechanical Engineering awards. Centres are required to develop the

More information

European Risk Management Certification. Candidate Information Guide

European Risk Management Certification. Candidate Information Guide European Risk Management Certification Candidate Information Guide Presentation of FERMA Certification 3 Benefits 4 Eligibility criteria 5 Application and fees Examination details Syllabus: FERMA rimap

More information

SIMULTANEOUS EQUATIONS

SIMULTANEOUS EQUATIONS Mathematics Revision Guides Simultaneous Equations Page 1 of 6 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Foundation Tier SIMULTNEOUS EQUTIONS Version: 3.2 Date: 08-02-2015 Mathematics Revision

More information

Diploma in GRAPHIC DESIGN. - Part 1 LESSON 1

Diploma in GRAPHIC DESIGN. - Part 1 LESSON 1 Diploma in GRAPHIC DESIGN - Part 1 LESSON 1 Can you hear me? Type Yes or No ? WEBINAR How it Works Questions are posted in the Question Box. Student Microphones are Muted. Q&A s are left until the end

More information

Lie. Cheat. Deceive. How to Practice the Art of Deception at Machine Speed

Lie. Cheat. Deceive. How to Practice the Art of Deception at Machine Speed SESSION ID: ECO-R03 Lie. Cheat. Deceive. How to Practice the Art of Deception at Machine Speed Jason Bird Head of EMEA Technical Solutions CSG Invotas @securedsensibly Why continue to do things the way

More information

1.1 Defining Functions

1.1 Defining Functions 1.1 Defining Functions Functions govern many interactions in our society today. Whether buying a cup of coffee at the local coffee shop or playing a video game, we are using a function in some fashion.

More information

Data Mining Concepts

Data Mining Concepts Data Mining Concepts Outline Data Mining Data Warehousing Knowledge Discovery in Databases (KDD) Goals of Data Mining and Knowledge Discovery Association Rules Additional Data Mining Algorithms Sequential

More information