13. Geospatio-temporal Data Analytics. Jacobs University Visualization and Computer Graphics Lab
|
|
- Claude Jordan
- 5 years ago
- Views:
Transcription
1 13. Geospatio-temporal Data Analytics
2 Recall: Twitter Data Analytics 573
3 Recall: Twitter Data Analytics 574
4 13.1 Time Series Data Analytics
5 Introduction to Time Series Analysis A time-series is a set of observations on a quantitative variable collected over time. Examples: Dow Jones Industrial Averages Historical data on sales, inventory, customer counts, interest rates, costs, etc. Businesses are often very interested in forecasting time series variables. In time series analysis, we analyze the past behavior of a variable in order to predict its future behavior. Data Analytics 576
6 Methods used in forecasting Regression Analysis Time Series Analysis (TSA) A statistical technique that uses time-series data for explaining the past or forecasting future events. The prediction is a function of time (days, months, years, etc.) No causal variable; examine past behavior of a variable and and attempt to predict future behavior Data Analytics 577
7 Components of TSA Time Frame (How far can we predict?) short-term (1-2 periods) medium-term (5-10 periods) long-term (12+ periods) No line of demarcation Trend Gradual, long-term movement (up or down) of demand. Easiest to detect Data Analytics 578
8 Components of TSA Cycle An up-and-down repetitive movement in demand. repeats itself over a long period of time Seasonal Variation An up-and-down repetitive movement within a trend occurring periodically. Often weather related but could be daily or weekly occurrence Random Variations Erratic movements that are not predictable because they do not follow a pattern Data Analytics 579
9 Time Series Plot Actual Sales $3,000 $2,500 Sales (in $1,000s) $2,000 $1,500 $1,000 $500 $ Time Period Data Analytics 580
10 Components of TSA Difficult to forecast demand because... There are no causal variables The components (trend, seasonality, cycles, and random variation) cannot always be easily or accurately identified Data Analytics 581
11 Some Time Series Terms Stationary Data - a time series variable exhibiting no significant upward or downward trend over time. Nonstationary Data - a time series variable exhibiting a significant upward or downward trend over time. Seasonal Data - a time series variable exhibiting a repeating patterns at regular intervals over time. Data Analytics 582
12 Approaching Time Series Analysis There are many different time series techniques. It is usually impossible to know which technique will be best for a particular data set. It is customary to try out several different techniques and select the one that seems to work best. To be an effective time series modeler, you need to keep several time series techniques in your tool box. Data Analytics 583
13 Measuring Accuracy We need a way to compare different time series techniques for a given data set. Four common techniques are the: mean absolute deviation: mean absolute percent error: the mean square error: root mean square error: MAD = RMSE = n i= 1 n i= 1 Y i MSE Y$ n 100 n Yi Ŷi MAPE = n i= 1 Yi ( Y Y) i $ i MSE = n i 2 Data Analytics 584
14 Extrapolation Models Extrapolation models try to account for the past behavior of a time series variable in an effort to predict the future behavior of the variable. Y$ = f Y, Y, Y, ( ) t+ 1 t t 1 t 2 K We ll first talk about several extrapolation techniques that are appropriate for stationary data. Data Analytics 585
15 An Example Electra-City is a retail store that sells audio and video equipment for the home and car. Each month the manager of the store must order merchandise from a distant warehouse. Currently, the manager is trying to estimate how many VCRs the store is likely to sell in the next month. He has collected 24 months of data. Data Analytics 586
16 Moving averages 1 $Y = 1 1 t Y Y Y k + + k L k t t-1 t-k-1 No general method exists for determining k. We must try out several k values to see what works best. Data Analytics 587
17 Implementing the Model Data Analytics 588
18 Comment on Comparing MSE Values Care should be taken when comparing MSE values of two different forecasting techniques. The lowest MSE may result from a technique that fits older values very well but fits recent values poorly. It is sometimes wise to compute the MSE using only the most recent values. Data Analytics 589
19 Forecasting With The Moving Average Model Forecasts for time periods 25 and 26 at time period 24: Y$ Y24 + Y = = = $ $ Y25 + Y Y26 = = = Data Analytics 590
20 Weighted Moving Average The moving average technique assigns equal weight to all previous observations 1 $Y = 1 1 t Y Y Y k + + k L k t t-1 t-k-1 The weighted moving average technique allows for different weights to be assigned to previous observations. We must determine values for k and the weights w i $Y = wy + + w Y L w Y t t t-1 k t-k -1 where 0 w and i 1 wi = 1 Data Analytics 591
21 Implementing the Model Data Analytics 592
22 Optimal weights that minimize MSE Data Analytics 593
23 Forecasting with Weighted Moving Averaging Forecasts for time periods 25 and 26 at time period 24: Y$ 25 = w1y24 + w2y23 = = Ŷ = w1ŷ 25 + w2y = = Data Analytics 594
24 Exponential Smoothing Y $ $ ( $ t + 1 = Y t + α Y t Y t ) where 0 α 1 It can be shown that the above equation is equivalent to: $ 2 n Y = αy + α( 1 α) Y + α( 1 α) Y + L+ α( 1 α) Y + L t + 1 t t 1 t 2 t n Data Analytics 595
25 Examples of Exponential Smoothing Functions Units Sold Time Period Number of VCRs Sold Exp. Smoothing alpha=0.1 Exp. Smoothing alpha=0.9 Data Analytics 596
26 Implementing the Model Data Analytics 597
27 Forecasting With Exponential Smoothing Forecasts for time periods 25 and 26 at time period 24: Y$ Y$ ( Y Y$ 25 = 24 + α 24 24) = ( ) = Y$ = Y$ + α( Y Y$ ) Y$ + α( Y $ Y $ ) = Y$ = Note that, Y $ = 3581., for t = 25, 26, 27, K t Data Analytics 598
28 Seasonality Seasonality is a regular, repeating pattern in time series data. May be additive or multiplicative in nature... Data Analytics 599
29 Stationary Seasonal Effects Additive Seasonal Effects Time Period Multiplicative Seasonal Effects Time Period Data Analytics 600
30 Ŷ Stationary Data with Additive Seasonal Effects t+ n Et + where E S t t = S t+ n p = α(yt -St-p) + (1-α)Et 1 = β (Yt - Et ) + (1- β )S 0 α 1 0 β 1 t p p represents the number of seasonal periods E t is the expected level at time period t. S t is the seasonal factor for time period t. Data Analytics 601
31 Implementing the Model Data Analytics 602
32 Forecasting with Additive Seasonal Effects Model Forecasts for time periods at time period 24: Ŷ Ŷ n = E24 + S24+ n 4 = E + S = = 25 Ŷ = E + S = = 26 Ŷ = E + S = = 27 Ŷ = E + S = = Data Analytics 603
33 Stationary Data with Multiplicative Seasonal Effects Ŷ t+ n Et where E S 0 0 t t = S t+ n p = α(y t/st-p) + (1-α)Et 1 = β (Y /E ) + (1- β )S α 1 β 1 t t t p p represents the number of seasonal periods E t is the expected level at time period t. S t is the seasonal factor for time period t. Data Analytics 604
34 Implementing the Model Data Analytics 605
35 Forecasting With Multiplicative Seasonal Effects Forecasts for time periods at time period 24: Ŷ 24+ n = E24 S24+ n 4 Ŷ = E S = = 25 Ŷ = E S = = 26 Ŷ = E S = = 27 Ŷ = E S = = Data Analytics 606
36 Trend Models Trend is the long-term sweep or general direction of movement in a time series. We ll now consider some nonstationary time series techniques that are appropriate for data exhibiting upward or downward trends. Data Analytics 607
37 Example WaterCraft Inc. is a manufacturer of personal water crafts (also known as jet skis). The company has enjoyed a fairly steady growth in sales of its products. The officers of the company are preparing sales and manufacturing plans for the coming year. Forecasts are needed of the level of sales that the company expects to achieve each quarter. Data Analytics 608
38 Double Exponential Smoothing (Holt s Method) Ŷ = E + t+ n where t nt t E t = αy t + (1-α)(E t-1 + T t-1 ) T t = β(e t E t-1 ) + (1-β) T t-1 0 α 1and 0 β 1 E t is the expected base level at time period t. T t is the expected trend at time period t. n is the period-ahead forecast made in period t. Data Analytics 609
39 Implementing the Model Data Analytics 610
40 Optimal values for α and ß that minimizes MSE Data Analytics 611
41 Forecasting With Holt s Model Forecasts for time periods 21 through 24 at time period 20: Ŷ 20+ n = E20 + nt20 Y$ 21 = E20 + 1T20 = = Y$ 22 = E20 + 2T20 = = Y$ 23 = E20 + 3T20 = = Y$ = E + 4T = = Data Analytics 612
42 Ŷ Holt-Winter s Method for Additive Seasonal Effects t+ n = Et + ntt + St+ n p S t where α ( Y S ) + (1- α )(E T ) E t = t t p t 1 + t 1 T ( ) t = β Et Et 1 + (1- β )Tt 1 ( Y ) t Et + (1-γ t p = γ )S 0 α 1 0 β 1 0 γ 1 Data Analytics 613
43 Implementing the Model Data Analytics 614
44 The Linear Trend Model $Y = b + bx t 0 1 1t where X 1t = t For example: X = 1, X = 2, X = 3, K Data Analytics 615
45 Implementing the Model Data Analytics 616
46 Forecasting With The Linear Trend Model Forecasts for time periods 21 through 24 at time period 20: Y$ 21 = b0 + b1x1 21 = = Y $ X = b0 + b = = Y$ 23 = b0 + b1x1 23 = = Y $ X = b0 + b = = Data Analytics 617
47 The Quadratic Trend Model $Y = b + bx + b X t where X = tand X = t 1 2 t t t 2 t Data Analytics 618
48 Implementing the Model Data Analytics 619
49 Forecasting With Quadratic Trend Model Forecasts for time periods 21 through 24 at time period 20: Ŷ Ŷ 2 = b0 + b1x 1 + b2x 2 = = = b0 + b1x 1 + b2x 2 = = Ŷ Ŷ 2 = b0 + b1x 1 + b2x 2 = = = b0 + b1x 1 + b2x 2 = = Data Analytics 620
50 Computing Multiplicative Seasonal Indices We can compute multiplicative seasonal adjustment indices for period p as follows: S p = Y i Y $ i, for all i occuring in season i n p p The final forecast for period i is then Y $ adjusted = Y$ i i S p, for any ioccuring in season p Data Analytics 621
51 Implementing the Model Data Analytics 622
52 Implementing the Model Data Analytics 623
53 Forecasting with Seasonal Adjustments Applied to Quadratic Trend Model Forecasts for time periods 21 through 24 at time period 20: Ŷ 1 = ( b0 + b1x 1 + b2x 2 ) S = % = Ŷ 2 = ( b0 + b1x 1 + b2x 2 ) S = % = Ŷ 3 = ( b0 + b1x 1 + b2x 2 ) S = % = Ŷ 4 = ( b0 + b1x 1 + b2x 2 ) S = % = Data Analytics 624
54 Summary of the Calculation and Use of Seasonal Indices 1. Create a trend model and calculate the estimated value for each observation in the sample. 2. For each observation, calculate the ratio of the actual value to the predicted trend value: Yt / Ŷ t. (For additive effects, compute the difference: Yt Ŷ t ). 3. For each season, compute the average of the ratios calculated in Step 2. These are the seasonal indices. 4. Multiply any forecast produced by the trend model by the appropriate seasonal index calculated in Step 3. (For additive seasonal effects, add the appropriate factor to the forecast.) Ŷ t Data Analytics 625
55 Seasonal Regression Models Indicator variables may also be used in regression models to represent seasonal effects. If there are p seasons, we need p-1 indicator variables. Our example problem involves quarterly data, so p=4 and we define the following 3 indicator variables: X X 3 4 t t 1 =, 0, 1 =, 0, if Y is an observation from quarter 1 t otherwise if Y is an observation from quarter 2 t otherwise X 5 t 1 if Y t is an observation from quarter 3 =, 0, otherwise Data Analytics 626
56 Implementing the Model The regression function is: $Yt = b + bx + b X + b X + b X + b X where X = tand X = t 1 2 t t t t t t t 2 Data Analytics 627
57 Implementing the Model Data Analytics 628
58 Implementing the Model Data Analytics 629
59 Implementing the Model Data Analytics 630
60 Forecasting With Seasonal Regression Model Forecasts for time periods 21 through 24 at time period 20: 2 Ŷ21 = (21) (21) (1) (0) (0) = 2 Ŷ22 = (22) (22) (0) (1) (0) = 2 Ŷ23 = (23) (23) (0) (0) (1) = Ŷ24 = (24) (24) (0) (0) (0) = Data Analytics 631
61 Combining Forecasts It is also possible to combine forecasts to create a composite forecast. Suppose we used three different forecasting methods on a given data set. Denote the predicted value of time period t using each method as follows: F, F, and F t t t We could create a composite forecast as follows: $Yt = b + bf + b F + b F t t t Data Analytics 632
62 Visual representation The shown plots are one way to visualize a time series. There are other options. In particular, when multiple time series are to be considered simultaneously. Data Analytics 633
63 Bar charts Data Analytics 634
64 Recall: Visualization examples Data Analytics 635
65 Time lines Data Analytics 636
66 Recall: Playfair s plot of trade between England and Denmark/Norway from 1786 Data Analytics 637
67 Multiresolution time lines Data Analytics 638
68 Recall: Visualization examples Data Analytics 639
69 Stacked graphs Data Analytics 640
70 Recall: Visualization examples Data Analytics 641
71 Sparklines Data Analytics 642
72 Radial layout: spiral Data Analytics 643
73 Multivariate time series data Data Analytics 644
74 Time series in documents Data Analytics 645
75 13.2 Geospatial Data Analytics
76 Recall: Snow s map of cholera in London 1663 Data Analytics 647
77 Recall: Edge bundling Geospatial: Data Analytics 648
78 Recall: Edge bundling Geospatial: Data Analytics 649
79 The geospatial perspective A distinct perspective on the world Domain of geospatial analysis Dealing with what happens where Human computer interface Data Analytics 650
80 Basic primitives: Place Terrestrial scale - shape and size Naming of places Dynamics Need for rigor coordinate systems and data 2D vs 3D perspectives on geospatial issues Data Analytics 651
81 Basic primitives: Attributes Characteristics associated with place Nominal, ordinal, interval, etc. Attribute tables Data Analytics 652
82 Basic primitives: Objects Points Lines Areas Representative points Polylines Polygons Data Analytics 653
83 Basic primitives: Maps Communicating with maps Traditional maps Digital mapping Alternative visualisations 3D and dynamic Data Analytics 654
84 Basic primitives: Multiple properties Multiple properties of place Concept of layers Combining layers Resolving modelling and geocoding differences Data Analytics 655
85 Basic primitives: Fields Discrete-object view Continuous-field view Data modelling as a form of representation Conversion between model views Data Analytics 656
86 Basic primitives: Density Density as object count (n) divided by area (a), so d=n/a Density as link between discrete and continuous model views Sensitivity to definition of both n and a Data Analytics 657
87 Basic primitives: Spatial weights Adjacency models Proximity distance based Spatial weights matrices, W Data Analytics 658
88 Basic primitives: Networks Basic components of a network Forms of network Concept of network topology Applications of network analysis Data Analytics 659
89 Basic primitives: Topology General concepts of topology Topological dimension Adjacency Connectivity Containment Data Analytics 660
90 Basic primitives: detail, resolution, scale Intrinsic complexity of the real world Spatial resolution Representative fraction Resolution and digital datasets Temporal resolution Data Analytics 661
91 Spatial relationships: Distance & direction Location and distance/direction calculations Representative points Direction and circular measure Spatial weights matrices Determination of weights: binary and real-valued Reconstruction and MDS Data Analytics 662
92 Spatial relationships: Context Spatial context and proximity Neighbourhoods Uniform distance Zonal Weighted Heterogeneity Data Analytics 663
93 Spatial relationships: Dependence Spatial dependence Spatial autocorrelation Correlograms Geostatistics Data Analytics 664
94 Spatial relationships: Sampling & interpolation Point sampling Systematic Random Stratified Zonal sampling and statistics Fields and interpolation Data Analytics 665
95 Spatial relationships: Smoothing & sharpening Convolution Pattern and process Filtering Data Analytics 666
96 Spatial statistics: Probability & uncertainty Probability in a spatial context Marginal vs Joint probability Probability fields Probability and uncertainty Probability density Uncertainty and error propagation Uncertainty and accuracy Data Analytics 667
97 Spatial statistics: Inference Statistical inference and significance: Deduce properties of underlying distribution Confidence Controlled experiments Natural experiments Sampling & independence Samples vs populations Data Analytics 668
98 Visual representations Visualizations are often based on maps Attributes are visually encoded within such a map Data Analytics 669
99 Encoding by bars Data Analytics 670
100 Encoding by color Data Analytics 671
101 Encoding by size Data Analytics 672
102 Encoding by glyphs Data Analytics 673
103 Recall: Visualization examples Data Analytics 674
104 13.3 Spatio-temporal Patterns
105 Combination of time and space Data Analytics 676
106 Clustering Data Analytics 677
107 Similarity measure Considered points: Distance between any two points on trajectories Closest distance between points at same/similar time Symmetry: One-sided distance Two-sided distance Accumulated distance Average over trajectory Minimum over trajectory Maximum over trajectory Data Analytics 678
108 Visual analytics Visual analytics approaches may combine spatial and temporal representations in one encoding. However, typically multiple views are used. Interactions are used for correlating. Data Analytics 679
109 Recall: Minard s map of Napoleon s march on Moscow drawn in 1869 Data Analytics 680
110 Embedding time series in geospatial layout Data Analytics 681
111 Recall: Visualization examples Data Analytics 682
112 Recall: Visualization examples Data Analytics 683
113 Recall: Visualization examples Data Analytics 684
114 Recall: Visualization examples Data Analytics 685
115 Recall: Visualization examples Data Analytics 686
116 13.4 Assignment
117 Assignment 11 Consider the taxi service trajectory data set provided here: +Prediction+Challenge,+ECML+PKDD+2015 Aggregate the time stamps by the hour and by the day and create two time series for all taxi trips and for trips of types A, B, and C, respectively. You should have created 8 time series. Visualize them. For each time series, try the different TSA models presented in class for prediction (consider linear, quadratic, or no trend, additive, multiplicative, or no seasons, regression, and combinations thereof). Compute the MSE to check, which predictions worked best for which time series. What can you conclude from your findings? p.t.o. Data Analytics 688
118 Assignment 11 Check whether you are able to detect day types A and B with your analysis (of course, without using that information). How would you do that? Bonus: Visualize a density map for each of the time series that shows the spatial distribution of the taxi trajectories. Allow for filtering the density maps by the day or the hour and its repetitions, i.e., only showing the trajectories that started on day x, x+7, x+14, for x = 1,,7, or only showing the trajectories that started at hour x, x+24, x+48, for x = 1,,24. Data Analytics 689
3. Visual Analytics (Part 1: Visual Encoding) Jacobs University Visualization and Computer Graphics Lab
3. Visual Analytics (Part 1: Visual Encoding) 3.1 Introduction Motivation Big Data cannot be analyzed anymore without the help of computers. Computers are good in quickly processing large amounts of data.
More informationData Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski
Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...
More informationSection 4.1: Time Series I. Jared S. Murray The University of Texas at Austin McCombs School of Business
Section 4.1: Time Series I Jared S. Murray The University of Texas at Austin McCombs School of Business 1 Time Series Data and Dependence Time-series data are simply a collection of observations gathered
More informationMultivariate Data More Overview
Multivariate Data More Overview CS 4460 - Information Visualization Jim Foley Last Revision August 2016 Some Key Concepts Quick Review Data Types Data Marks Basic Data Types N-Nominal (categorical) Equal
More informationReaders will be provided a link to download the software and Excel files that are used in the book after payment. Please visit
Readers will be provided a link to download the software and Excel files that are used in the book after payment. Please visit http://www.xlpert.com for more information on the book. The Excel files are
More informationPrentice Hall Pre-Algebra 2004 Correlated to: Hawaii Mathematics Content and Performance Standards (HCPS) II (Grades 9-12)
Hawaii Mathematics Content and Performance Standards (HCPS) II (Grades 9-12) NUMBER AND OPERATIONS STANDARD 1: Students understand numbers, ways of representing numbers, relationships among numbers, and
More informationUNIT 1: NUMBER LINES, INTERVALS, AND SETS
ALGEBRA II CURRICULUM OUTLINE 2011-2012 OVERVIEW: 1. Numbers, Lines, Intervals and Sets 2. Algebraic Manipulation: Rational Expressions and Exponents 3. Radicals and Radical Equations 4. Function Basics
More informationOnline Pattern Recognition in Multivariate Data Streams using Unsupervised Learning
Online Pattern Recognition in Multivariate Data Streams using Unsupervised Learning Devina Desai ddevina1@csee.umbc.edu Tim Oates oates@csee.umbc.edu Vishal Shanbhag vshan1@csee.umbc.edu Machine Learning
More informationEffects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex
Effects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex Keiko I. Powers, Ph.D., J. D. Power and Associates, Westlake Village, CA ABSTRACT Discrete time series
More informationBUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)
SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon
More informationQuadratic Equations. Learning Objectives. Quadratic Function 2. where a, b, and c are real numbers and a 0
Quadratic Equations Learning Objectives 1. Graph a quadratic function using transformations. Identify the vertex and axis of symmetry of a quadratic function 3. Graph a quadratic function using its vertex,
More informationCurve fitting. Lab. Formulation. Truncation Error Round-off. Measurement. Good data. Not as good data. Least squares polynomials.
Formulating models We can use information from data to formulate mathematical models These models rely on assumptions about the data or data not collected Different assumptions will lead to different models.
More informationAnalytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.
Glossary of data mining terms: Accuracy Accuracy is an important factor in assessing the success of data mining. When applied to data, accuracy refers to the rate of correct values in the data. When applied
More informationIntegrated Math I. IM1.1.3 Understand and use the distributive, associative, and commutative properties.
Standard 1: Number Sense and Computation Students simplify and compare expressions. They use rational exponents and simplify square roots. IM1.1.1 Compare real number expressions. IM1.1.2 Simplify square
More informationEvaluation Measures. Sebastian Pölsterl. April 28, Computer Aided Medical Procedures Technische Universität München
Evaluation Measures Sebastian Pölsterl Computer Aided Medical Procedures Technische Universität München April 28, 2015 Outline 1 Classification 1. Confusion Matrix 2. Receiver operating characteristics
More informationChapter 1 Polynomials and Modeling
Chapter 1 Polynomials and Modeling 1.1 Linear Functions Recall that a line is a function of the form y = mx+ b, where m is the slope of the line (how steep the line is) and b gives the y-intercept (where
More informationFrequency Distributions
Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,
More informationThings to Know for the Algebra I Regents
Types of Numbers: Real Number: any number you can think of (integers, rational, irrational) Imaginary Number: square root of a negative number Integers: whole numbers (positive, negative, zero) Things
More informationSpatial Interpolation & Geostatistics
(Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 1 Tobler s Law All places are related, but nearby places are related more than distant places Corollary:
More informationAlgebra II Quadratic Functions
1 Algebra II Quadratic Functions 2014-10-14 www.njctl.org 2 Ta b le o f C o n te n t Key Terms click on the topic to go to that section Explain Characteristics of Quadratic Functions Combining Transformations
More informationSpatial Interpolation - Geostatistics 4/3/2018
Spatial Interpolation - Geostatistics 4/3/201 (Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Distance between pairs of points Lag Mean Tobler s Law All places are related, but nearby places
More informationThe Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC
The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC INTRODUCTION The Time Series Forecasting System (TSFS) is a component of SAS/ETS that provides a menu-based
More informationSTAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.
STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,
More informationTableau Think Tank. Tips, Tricks and Overview for the Ohio Tableau Community. COPYRIGHT 2014 RESULT DATA - All Rights Reserved SLIDE 1
Tableau Think Tank Tips, Tricks and Overview for the Ohio Tableau Community SLIDE 1 and Forecasting Outline Quick Table Calculations Computations Under the hood: Partitioning and Addressing Secondary Table
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More informationAnalysing Search Trends
Data Mining in Business Intelligence 7 March 2013, Ben-Gurion University Analysing Search Trends Yair Shimshoni, Google R&D center, Tel-Aviv. shimsh@google.com Outline What are search trends? The Google
More informationChapter 2: Frequency Distributions
Chapter 2: Frequency Distributions Chapter Outline 2.1 Introduction to Frequency Distributions 2.2 Frequency Distribution Tables Obtaining ΣX from a Frequency Distribution Table Proportions and Percentages
More informationRelease Notes for April StatCrunch Updates
Release Notes for April 2018 - StatCrunch Updates Major additions Introducing accessibility features that support full keyboard functionality including new keyboard shortcuts. [Go to page 2] Measures to
More information7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability
7 Fractions GRADE 7 FRACTIONS continue to develop proficiency by using fractions in mental strategies and in selecting and justifying use; develop proficiency in adding and subtracting simple fractions;
More informationCPSC 695. Methods for interpolation and analysis of continuing surfaces in GIS Dr. M. Gavrilova
CPSC 695 Methods for interpolation and analysis of continuing surfaces in GIS Dr. M. Gavrilova Overview Data sampling for continuous surfaces Interpolation methods Global interpolation Local interpolation
More informationWatershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS
HOUSEKEEPING Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS Quizzes Lab 8? WEEK EIGHT Lecture INTERPOLATION & SPATIAL ESTIMATION Joe Wheaton READING FOR TODAY WHAT CAN WE COLLECT AT POINTS?
More informationNESHAMINY SCHOOL DISTRICT LANGHORNE, PA CURRICULUM MAP
NESHAMINY SCHOOL DISTRICT LANGHORNE, PA CURRICULUM MAP 7th Grade Math - Course 2 Month: September Order of Operations Algebraic Expressions One step equations and inequalities Two step equations and inequalities
More informationALGEBRA II A CURRICULUM OUTLINE
ALGEBRA II A CURRICULUM OUTLINE 2013-2014 OVERVIEW: 1. Linear Equations and Inequalities 2. Polynomial Expressions and Equations 3. Rational Expressions and Equations 4. Radical Expressions and Equations
More informationData Mining and Analytics. Introduction
Data Mining and Analytics Introduction Data Mining Data mining refers to extracting or mining knowledge from large amounts of data It is also termed as Knowledge Discovery from Data (KDD) Mostly, data
More informationWKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems
Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring
More informationApplying Supervised Learning
Applying Supervised Learning When to Consider Supervised Learning A supervised learning algorithm takes a known set of input data (the training set) and known responses to the data (output), and trains
More informationAaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling
Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Abstract Forecasting future events and statistics is problematic because the data set is a stochastic, rather
More informationError Analysis, Statistics and Graphing
Error Analysis, Statistics and Graphing This semester, most of labs we require us to calculate a numerical answer based on the data we obtain. A hard question to answer in most cases is how good is your
More informationThe Consequences of Poor Data Quality on Model Accuracy
The Consequences of Poor Data Quality on Model Accuracy Dr. Gerhard Svolba SAS Austria Cologne, June 14th, 2012 From this talk you can expect The analytical viewpoint on data quality Answers to the questions
More informationSpatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University 1 Outline of This Week Last topic, we learned: Spatial autocorrelation of areal data Spatial regression
More informationSpatial Interpolation & Geostatistics
(Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 11/3/2016 GEO327G/386G, UT Austin 1 Tobler s Law All places are related, but nearby places are related
More informationData Preprocessing. Why Data Preprocessing? MIT-652 Data Mining Applications. Chapter 3: Data Preprocessing. Multi-Dimensional Measure of Data Quality
Why Data Preprocessing? Data in the real world is dirty incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate data e.g., occupation = noisy: containing
More informationTable of Contents (As covered from textbook)
Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression
More informationCourse of study- Algebra Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by
Course of study- Algebra 1-2 1. Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by students in Grades 9 and 10, but since all students must
More informationCHAPTER 5 PROPAGATION DELAY
98 CHAPTER 5 PROPAGATION DELAY Underwater wireless sensor networks deployed of sensor nodes with sensing, forwarding and processing abilities that operate in underwater. In this environment brought challenges,
More informationECLT 5810 Data Preprocessing. Prof. Wai Lam
ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate
More informationAcknowledgments. Acronyms
Acknowledgments Preface Acronyms xi xiii xv 1 Basic Tools 1 1.1 Goals of inference 1 1.1.1 Population or process? 1 1.1.2 Probability samples 2 1.1.3 Sampling weights 3 1.1.4 Design effects. 5 1.2 An introduction
More informationIntroduction to Objective Analysis
Chapter 4 Introduction to Objective Analysis Atmospheric data are routinely collected around the world but observation sites are located rather randomly from a spatial perspective. On the other hand, most
More informationStatistics: Normal Distribution, Sampling, Function Fitting & Regression Analysis (Grade 12) *
OpenStax-CNX module: m39305 1 Statistics: Normal Distribution, Sampling, Function Fitting & Regression Analysis (Grade 12) * Free High School Science Texts Project This work is produced by OpenStax-CNX
More information1-2 9 Measures and accuracy
Year Term Week Chapter Ref Lesson 9.1 Estimation and approximation Year 2 m Autumn Term 1-2 9 Measures and accuracy 3-4 (Number) 9.2 Calculator methods 9.3 Measures and accuracy Assessment 9 10.1 Solving
More informationFew s Design Guidance
Few s Design Guidance CS 4460 Intro. to Information Visualization September 9, 2014 John Stasko Today s Agenda Stephen Few & Perceptual Edge Fall 2014 CS 4460 2 1 Stephen Few s Guidance Excellent advice
More informationHow to use FSBForecast Excel add-in for regression analysis (July 2012 version)
How to use FSBForecast Excel add-in for regression analysis (July 2012 version) FSBForecast is an Excel add-in for data analysis and regression that was developed at the Fuqua School of Business over the
More informationApplications of the k-nearest neighbor method for regression and resampling
Applications of the k-nearest neighbor method for regression and resampling Objectives Provide a structured approach to exploring a regression data set. Introduce and demonstrate the k-nearest neighbor
More informationGeometric Rectification of Remote Sensing Images
Geometric Rectification of Remote Sensing Images Airborne TerrestriaL Applications Sensor (ATLAS) Nine flight paths were recorded over the city of Providence. 1 True color ATLAS image (bands 4, 2, 1 in
More informationLab 3: From Data to Models
Lab 3: From Data to Models One of the goals of mathematics is to explain phenomena represented by data. In the business world, there is an increasing dependence on models. We may want to represent sales
More informationCHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY
23 CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 3.1 DESIGN OF EXPERIMENTS Design of experiments is a systematic approach for investigation of a system or process. A series
More informationSpatio-Temporal Stereo Disparity Integration
Spatio-Temporal Stereo Disparity Integration Sandino Morales and Reinhard Klette The.enpeda.. Project, The University of Auckland Tamaki Innovation Campus, Auckland, New Zealand pmor085@aucklanduni.ac.nz
More informationGEOP 505/MATH 587 Fall 02 Homework 6
GEOP 55/MATH 587 Fall 2 Homework 6 In grading these homeworks, I found that the major problem that students had was a misunderstanding of the difference between the original time series z, the differenced
More informationPredict Outcomes and Reveal Relationships in Categorical Data
PASW Categories 18 Specifications Predict Outcomes and Reveal Relationships in Categorical Data Unleash the full potential of your data through predictive analysis, statistical learning, perceptual mapping,
More informationGlyphs. Presentation Overview. What is a Glyph!? Cont. What is a Glyph!? Glyph Fundamentals. Goal of Paper. Presented by Bertrand Low
Presentation Overview Glyphs Presented by Bertrand Low A Taxonomy of Glyph Placement Strategies for Multidimensional Data Visualization Matthew O. Ward, Information Visualization Journal, Palmgrave,, Volume
More informationFMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu
FMA901F: Machine Learning Lecture 3: Linear Models for Regression Cristian Sminchisescu Machine Learning: Frequentist vs. Bayesian In the frequentist setting, we seek a fixed parameter (vector), with value(s)
More information9-1 GCSE Maths. GCSE Mathematics has a Foundation tier (Grades 1 5) and a Higher tier (Grades 4 9).
9-1 GCSE Maths GCSE Mathematics has a Foundation tier (Grades 1 5) and a Higher tier (Grades 4 9). In each tier, there are three exams taken at the end of Year 11. Any topic may be assessed on each of
More informationLet s review some things we learned earlier about the information we can gather from the graph of a quadratic.
Section 6: Quadratic Equations and Functions Part 2 Section 6 Topic 1 Observations from a Graph of a Quadratic Function Let s review some things we learned earlier about the information we can gather from
More informationWeek 7 Picturing Network. Vahe and Bethany
Week 7 Picturing Network Vahe and Bethany Freeman (2005) - Graphic Techniques for Exploring Social Network Data The two main goals of analyzing social network data are identification of cohesive groups
More informationActive Network Tomography: Design and Estimation Issues George Michailidis Department of Statistics The University of Michigan
Active Network Tomography: Design and Estimation Issues George Michailidis Department of Statistics The University of Michigan November 18, 2003 Collaborators on Network Tomography Problems: Vijay Nair,
More informationAcquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.
Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting
More information( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value.
Calibration OVERVIEW... 2 INTRODUCTION... 2 CALIBRATION... 3 ANOTHER REASON FOR CALIBRATION... 4 CHECKING THE CALIBRATION OF A REGRESSION... 5 CALIBRATION IN SIMPLE REGRESSION (DISPLAY.JMP)... 5 TESTING
More informationST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.
ST512 Fall Quarter, 2005 Exam 1 Name: Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. 1. (42 points) A random sample of n = 30 NBA basketball
More informationLesson 2: Analyzing a Data Set
Student Outcomes Students recognize linear, quadratic, and exponential functions when presented as a data set or sequence, and formulate a model based on the data. Lesson Notes This lesson asks students
More informationYear 10 Mathematics Scheme of Work. Higher and Foundation
Year 10 Mathematics Scheme of Work Higher and Foundation Tiers Sets 1 and 2 will do the Higher specification. Sets 3 and 4 will do the Higher specification but with a focus on the topics that overlap Higher
More informationSAS (Statistical Analysis Software/System)
SAS (Statistical Analysis Software/System) SAS Adv. Analytics or Predictive Modelling:- Class Room: Training Fee & Duration : 30K & 3 Months Online Training Fee & Duration : 33K & 3 Months Learning SAS:
More informationRelations and Functions 2.1
Relations and Functions 2.1 4 A 2 B D -5 5 E -2 C F -4 Relation a set of ordered pairs (Domain, Range). Mapping shows how each number of the domain is paired with each member of the range. Example 1 (2,
More informationTopic 6 Representation and Description
Topic 6 Representation and Description Background Segmentation divides the image into regions Each region should be represented and described in a form suitable for further processing/decision-making Representation
More informationMath 1314 Test 2 Review Material covered is from Lessons 7 15
Math 1314 Test 2 Review Material covered is from Lessons 7 15 1. The total weekly cost of manufacturing x cameras is given by the cost function: 3 2 C( x) 0.0001x 0.4x 800x 3,000. Use the marginal cost
More informationTDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.
Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide
More informationAM205: lecture 2. 1 These have been shifted to MD 323 for the rest of the semester.
AM205: lecture 2 Luna and Gary will hold a Python tutorial on Wednesday in 60 Oxford Street, Room 330 Assignment 1 will be posted this week Chris will hold office hours on Thursday (1:30pm 3:30pm, Pierce
More informationInteractive Math Glossary Terms and Definitions
Terms and Definitions Absolute Value the magnitude of a number, or the distance from 0 on a real number line Addend any number or quantity being added addend + addend = sum Additive Property of Area the
More information9.1. How do the nerve cells in your brain communicate with each other? Signals have. Like a Glove. Least Squares Regression
Like a Glove Least Squares Regression.1 Learning Goals In this lesson, you will: Determine and interpret the least squares regression equation for a data set using a formula. Use interpolation to make
More informationLearner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display
CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &
More informationCMP Book: Investigation Number Objective: PASS: 1.1 Describe data distributions and display in line and bar graphs
Data About Us (6th Grade) (Statistics) 1.1 Describe data distributions and display in line and bar graphs. 6.5.1 1.2, 1.3, 1.4 Analyze data using range, mode, and median. 6.5.3 Display data in tables,
More informationarxiv: v1 [cond-mat.dis-nn] 30 Dec 2018
A General Deep Learning Framework for Structure and Dynamics Reconstruction from Time Series Data arxiv:1812.11482v1 [cond-mat.dis-nn] 30 Dec 2018 Zhang Zhang, Jing Liu, Shuo Wang, Ruyue Xin, Jiang Zhang
More informationQstatLab: software for statistical process control and robust engineering
QstatLab: software for statistical process control and robust engineering I.N.Vuchkov Iniversity of Chemical Technology and Metallurgy 1756 Sofia, Bulgaria qstat@dir.bg Abstract A software for quality
More informationLesson 5 overview. Concepts. Interpolators. Assessing accuracy Exercise 5
Interpolation Tools Lesson 5 overview Concepts Sampling methods Creating continuous surfaces Interpolation Density surfaces in GIS Interpolators IDW, Spline,Trend, Kriging,Natural neighbors TopoToRaster
More informationWind energy production forecasting
Wind energy production forecasting Floris Ouwendijk a Henk Koppelaar a Rutger ter Borg b Thijs van den Berg b a Delft University of Technology, PO box 5031, 2600 GA Delft, the Netherlands b NUON Energy
More informationMath 1314 Test 3 Review Material covered is from Lessons 9 15
Math 1314 Test 3 Review Material covered is from Lessons 9 15 1. The total weekly cost of manufacturing x cameras is given by the cost function: 3 2 Cx ( ) 0.0001x 0.4x 800x 3, 000. Use the marginal cost
More informationPreprocessing Short Lecture Notes cse352. Professor Anita Wasilewska
Preprocessing Short Lecture Notes cse352 Professor Anita Wasilewska Data Preprocessing Why preprocess the data? Data cleaning Data integration and transformation Data reduction Discretization and concept
More information08 An Introduction to Dense Continuous Robotic Mapping
NAVARCH/EECS 568, ROB 530 - Winter 2018 08 An Introduction to Dense Continuous Robotic Mapping Maani Ghaffari March 14, 2018 Previously: Occupancy Grid Maps Pose SLAM graph and its associated dense occupancy
More informationYEAR 12 Trial Exam Paper FURTHER MATHEMATICS. Written examination 1. Worked solutions
YEAR 12 Trial Exam Paper 2016 FURTHER MATHEMATICS Written examination 1 s This book presents: worked solutions, giving you a series of points to show you how to work through the questions mark allocations
More informationYear 9 Key Performance Indicators Maths (Number)
Key Performance Indicators Maths (Number) M9.1 N1: I can apply the four operations to negative numbers. Raise negative numbers to a power Add and subtract negative numbers and know when the answer should
More informationIBM SPSS Forecasting 24 IBM
IBM SPSS Forecasting 24 IBM Note Before using this information and the product it supports, read the information in Notices on page 59. Product Information This edition applies to ersion 24, release 0,
More informationdemonstrate an understanding of the exponent rules of multiplication and division, and apply them to simplify expressions Number Sense and Algebra
MPM 1D - Grade Nine Academic Mathematics This guide has been organized in alignment with the 2005 Ontario Mathematics Curriculum. Each of the specific curriculum expectations are cross-referenced to the
More informationTIPS4Math Grades 4 to 6 Overview Grade 4 Grade 5 Grade 6 Collect, Organize, and Display Primary Data (4+ days)
Collect, Organize, and Display Primary Data (4+ days) Collect, Organize, Display and Interpret Categorical Data (5+ days) 4m88 Collect data by conducting a survey or an experiment to do with the 4m89 Collect
More informationSTATISTICS (STAT) Statistics (STAT) 1
Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).
More informationround decimals to the nearest decimal place and order negative numbers in context
6 Numbers and the number system understand and use proportionality use the equivalence of fractions, decimals and percentages to compare proportions use understanding of place value to multiply and divide
More informationHow to use FSBforecast Excel add in for regression analysis
How to use FSBforecast Excel add in for regression analysis FSBforecast is an Excel add in for data analysis and regression that was developed here at the Fuqua School of Business over the last 3 years
More informationCS130 Regression. Winter Winter 2014 CS130 - Regression Analysis 1
CS130 Regression Winter 2014 Winter 2014 CS130 - Regression Analysis 1 Regression Analysis Regression analysis: usually falls under statistics and mathematical modeling is a form of statistical analysis
More informationRegression Analysis and Linear Regression Models
Regression Analysis and Linear Regression Models University of Trento - FBK 2 March, 2015 (UNITN-FBK) Regression Analysis and Linear Regression Models 2 March, 2015 1 / 33 Relationship between numerical
More informationModeling Plant Succession with Markov Matrices
Modeling Plant Succession with Markov Matrices 1 Modeling Plant Succession with Markov Matrices Concluding Paper Undergraduate Biology and Math Training Program New Jersey Institute of Technology Catherine
More informationPoints Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked
Plotting Menu: QCExpert Plotting Module graphs offers various tools for visualization of uni- and multivariate data. Settings and options in different types of graphs allow for modifications and customizations
More informationNetwork Environments in AnyLogic. Nathaniel Osgood Agent-Based Modeling Bootcamp for Health Researchers
Network Environments in AnyLogic Nathaniel Osgood Agent-Based Modeling Bootcamp for Health Researchers August 23, 2011 Hands on Model Use Ahead Load model: Network Modification of SIR AB The Environment
More information