Spatial Interpolation & Geostatistics

Similar documents
Spatial Interpolation - Geostatistics 4/3/2018

Spatial Interpolation & Geostatistics

Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS

Geo372 Vertiefung GIScience. Spatial Interpolation

Lesson 5 overview. Concepts. Interpolators. Assessing accuracy Exercise 5

Creating Surfaces. Steve Kopp Steve Lynch

Esri International User Conference. San Diego, California. Technical Workshops. July Creating Surfaces. Steve Kopp and Steve Lynch

Geostatistics 2D GMS 7.0 TUTORIALS. 1 Introduction. 1.1 Contents

What can we represent as a Surface?

GEO 580 Lab 4 Geostatistical Analysis

Gridding and Contouring in Geochemistry for ArcGIS

Dijkstra's Algorithm

You will begin by exploring the locations of the long term care facilities in Massachusetts using descriptive statistics.

The Problem. Georeferencing & Spatial Adjustment. Nature Of The Problem: For Example: Georeferencing & Spatial Adjustment 9/20/2016

Georeferencing & Spatial Adjustment 2/13/2018

The Problem. Georeferencing & Spatial Adjustment. Nature of the problem: For Example: Georeferencing & Spatial Adjustment 2/4/2014

Using Kriging Methods to Estimate Damage Distribution

Surface Analysis with 3D Analyst

Software Tutorial Session Universal Kriging

Spatial Analysis (Vector) II

Petrel TIPS&TRICKS from SCM

ASSIGNMENT 3 Cobalt data T:\sys502\arcview\projects\Cobalt_2 T:\sys502\arcview\ projects\cobalt_2\cobalt_2.mxd T:\sys502\arcview\projects\Cobalt_2

Surface Creation & Analysis with 3D Analyst

Non Stationary Variograms Based on Continuously Varying Weights

GIS Tools - Geometry. A GIS stores data as different layers of information Different feature types are stored in individual files.

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

Geostatistics 3D GMS 7.0 TUTORIALS. 1 Introduction. 1.1 Contents

Mapping Distance and Density

Introduction to ANSYS DesignXplorer

IMPROVING THE ACCURACY OF DIGITAL TERRAIN MODELS

Learn the various 3D interpolation methods available in GMS

DISTRIBUTION STATEMENT A Approved for public release: distribution unlimited.

GEOGRAPHIC INFORMATION SYSTEMS Lecture 24: Spatial Analyst Continued

Georeferencing & Spatial Adjustment

MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data

Surface Smoothing Using Kriging

Statistical surfaces and interpolation. This is lecture ten

Analysis of different interpolation methods for uphole data using Surfer software

ASSIGNMENT 4. (a) First you will construct a visual representation of the data a follows:

Generalized Additive Model

Applied Statistics : Practical 11

Modeling Uncertainty in the Earth Sciences Jef Caers Stanford University

Edge and local feature detection - 2. Importance of edge detection in computer vision

Geol 588. GIS for Geoscientists II. Zonal functions. Feb 22, Zonal statistics. Interpolation. Zonal statistics Sp. Analyst Tools - Zonal.

Region-based Segmentation

Geostatistics Predictions with Deterministic Procedures

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Lab 12: Sampling and Interpolation

Clustering Color/Intensity. Group together pixels of similar color/intensity.

Unsupervised Learning and Clustering

Spatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data

Modeling Multiple Rock Types with Distance Functions: Methodology and Software

Performance Estimation and Regularization. Kasthuri Kannan, PhD. Machine Learning, Spring 2018

Generating a New Shapefile

The Proportional Effect: What it is and how do we model it?

Lab 12: Sampling and Interpolation

USE OF DIRECT AND INDIRECT DISTRIBUTIONS OF SELECTIVE MINING UNITS FOR ESTIMATION OF RECOVERABLE RESOURCE/RESERVES FOR NEW MINING PROJECTS

A Geostatistical and Flow Simulation Study on a Real Training Image

Lecture 9. Raster Data Analysis. Tomislav Sapic GIS Technologist Faculty of Natural Resources Management Lakehead University

Local Features: Detection, Description & Matching

Lab 12: Sampling and Interpolation

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

ENGG1811: Data Analysis using Spreadsheets Part 1 1

Nonparametric Risk Attribution for Factor Models of Portfolios. October 3, 2017 Kellie Ottoboni

Unsupervised Learning and Clustering

Chapter 3 Image Registration. Chapter 3 Image Registration

Slides for Data Mining by I. H. Witten and E. Frank

Approximation Methods in Optimization

Flexible Lag Definition for Experimental Variogram Calculation

Making Science Graphs and Interpreting Data

Practical 11: Interpolating Point Data in R

Oasis montaj Advanced Mapping

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

University of Florida CISE department Gator Engineering. Clustering Part 4

GLM II. Basic Modeling Strategy CAS Ratemaking and Product Management Seminar by Paul Bailey. March 10, 2015

8 th Grade Pre Algebra Pacing Guide 1 st Nine Weeks

Supplementary Figure 1. Decoding results broken down for different ROIs

Nonparametric Approaches to Regression

EE795: Computer Vision and Intelligent Systems

Section 4 Matching Estimator

COMPUTER AND ROBOT VISION

Clustering Part 4 DBSCAN

Recent advances in Metamodel of Optimal Prognosis. Lectures. Thomas Most & Johannes Will

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

I can solve simultaneous equations algebraically, where one is quadratic and one is linear.

Moving Beyond Linearity

1. Estimation equations for strip transect sampling, using notation consistent with that used to

Markov Bayes Simulation for Structural Uncertainty Estimation

DIGITAL TERRAIN MODELLING. Endre Katona University of Szeged Department of Informatics

Methods to define confidence intervals for kriged values: Application on Precision Viticulture data.

Tensor Based Approaches for LVA Field Inference

Getting Started with montaj GridKnit

Processing of binary images

Variogram Inversion and Uncertainty Using Dynamic Data. Simultaneouos Inversion with Variogram Updating

Geometric Rectification of Remote Sensing Images

A Comparison of Spatial Prediction Techniques Using Both Hard and Soft Data

Surface Reconstruction with MLS

Geographic Information Systems (GIS) Spatial Analyst [10] Dr. Mohammad N. Almasri. [10] Spring 2018 GIS Dr. Mohammad N. Almasri Spatial Analyst

Applying Supervised Learning

Transcription:

(Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 11/3/2016 GEO327G/386G, UT Austin 1

Tobler s Law All places are related, but nearby places are related more than distant places o Corollary: fields vary smoothly, slowly and show strong spatial autocorrelation attribute(s) and location are strongly correlated z i = f (x i, y i ) 11/3/2016 GEO327G/386G, UT Austin 2

Spatial Interpolation Determination of unknown values or attributes on the basis of values nearby o Used for data that define continuous fields E.g. temperature, rainfall, elevation, concentrations Contouring, raster resampling are applications already discussed Spatial Interpolation = Spatial Prediction 11/3/2016 GEO327G/386G, UT Austin 3

Spatial Interpolation E.g. Interpolate between variably spaced data to create uniform grid of values 9 5 8 6 8 7 8 9 9 8 7 7 5 6 8 8 7 8 8 9 8 9 8 8 8 9 9 8 8 11/3/2016 GEO327G/386G, UT Austin 4

Interpolation Methods All address the meaning of near in Tobler s law differently o How does space make a difference? o Statistical mean not best predictor if Tobler s law is true 11/3/2016 GEO327G/386G, UT Austin 5

Interpolation methods Inverse Distance Weighting (IDW) o Assumes influence of adjacent points decreases with distance Where: z 0 = E n i =1 E n i =1 z 0 = value of estimation point w i z i w i z i = value of neighboring point w i = weighting factor; e.g. = 1/(distance from neighbor) 2 11/3/2016 GEO327G/386G, UT Austin 6

Inverse Distance Weighting 9 9 7 5 6 8 8? 8 8 z 0 5 8 6 8 1 unit On basis of four nearest neighbors: z 0 = (8/(1) 2 8/(2) 2 6/(2.5) 2 5/(2) 2 )/(1.66) z 0 = (8.0 2.0 0.96 1.25)/(1.66) = 7.36 11/3/2016 GEO327G/386G, UT Austin 7

I.D.W. Unknown value is the average of the observed values, weighted by inverse of distance, squared o Distance to point doubles, weight decreases by factor of 4 Can alter IDW by: o Alter number of closest points o Choose points by distance/search radius o Weight be directional sectors o Alter distance weighting; e.g. cube instead of square 11/3/2016 GEO327G/386G, UT Austin 8

I.D.W. Characteristics Is an Exact Method of interpolation will return a measured value when applied to measured point. o Will not generate smoothness or account for trends, unlike methods that are inexact o Answer (a surface) passes through data points Weights never negative > interpolated values can never be less than smallest z or greater than largest z. Peaks and pits will never be represented. 11/3/2016 GEO327G/386G, UT Austin 9

Z-value I.D.W. Characteristics No peaks or pits possible; interpolated values must lie within range of known values Trend (actual) surface I.D.W. surface Data Point Regions of Misfit Distance 11/3/2016 GEO327G/386G, UT Austin 10

Interpolation Methods IDW is inappropriate for values that don t decrease as a function of distance (e.g. topography) Other deterministic techniques: o Spline o Trend 11/3/2016 GEO327G/386G, UT Austin 11

Exact Methods - Spline o Fit minimum curvature surface through observation points; interpolate value from surface o Good for gently varying surfaces E.g. topography, water table heights o Not good for fitting large changes over short distances o Surface is allowed to exceed highest and be less than lowest measured values 11/3/2016 GEO327G/386G, UT Austin 12

Exact Methods: IDW vs. Spline IDW: No predicted highs or lows above max. or min. values No smoothing; surface can be rough (images from ArcGIS 9.2 Help files) Spline: Minimum curvature result good for producing smooth surfaces Can t predict large changes over short distances 11/3/2016 GEO327G/386G, UT Austin 13

Comparisons-IDW vs. Spline Note smoothing of Spline less spikey IDW contours less continuous, fewer inferred maxima and minima IDW, 6 nearest, contoured for 6 classes Spline, contoured for same 6 classes 11/3/2016 GEO327G/386G, UT Austin 14

Inexact (Approximate) Methods Inexact: Answer (surface) need not pass through input data points Trend surface curve fitting by least squares regression Deterministic one output, no randomness allowed Kriging weight by distance, consider trends in data Stochastic incorporates randomness 11/3/2016 GEO327G/386G, UT Austin 15

Approximate Methods - Trend Fits a polynomial to input points using least squares regression. Resulting surfaces minimize variance w.r.t. input values, i.e. sum of difference between actual and estimated values for all inputs is minimized. Surface rarely goes through actual points Surface may be based on all data ( Global fit) or small neighborhoods of data ( Local fits). 11/3/2016 GEO327G/386G, UT Austin 16

Trend Surfaces Equations are either: o Linear 1 st Order: fit a plane Z = a bx cy o Quadratic 2 nd Order: fit a plane with one bend (parabolic) Z = (1 st Order) dx 2 exy fy 2 o Cubic 3 rd Order: fit a plane with 2 bends (hyperbolic) Z = (2 nd Order) gx 3 hx 2 Y ixy 2 Y 3 Where: a, b, c, d, etc. = constants derived from solution of simultaneous equations X, Y = geographic coordinates 11/3/2016 GEO327G/386G, UT Austin 17

Contour maps of trend surfaces for Z Y axis Trend Surfaces Global Fitting Input data (x, y, z) Linear (Plane) Quadratic (Parabolic surface) X axis Source: Burrough, 1986 Cubic (Hyperbolic surface) 11/3/2016 GEO327G/386G, UT Austin 18

Trend Surfaces Local fitting Local Polynomial Interpolation fits many polynomials, each within specified, overlapping neighborhoods. Neighborhood surface fitting is iterative; final solution is based on minimizing RMS error Final surface is composed of best fits to all neighborhoods Can be accomplished with tool in ESRI Geostatistical Analyst extension 11/3/2016 GEO327G/386G, UT Austin 19

Trend Surfaces Local fitting Excluded data Neighborhood 1 data points Interpolated point Model New interpolated point 1 st iteration, local fitted surface 2 nd iteration, local fitted surface Step 1 (images from ArcGIS 9.2 Help files) Step 1; 2 nd Iteration 2-D profile view of a model surface Neighborhood 1 points (red) are being fit to a plane by iteration (2 steps are shown) and an interpolated point is being created 11/3/2016 GEO327G/386G, UT Austin 20

Trend Surfaces Local fitting 2 Excluded data Interpolated point 2 Neighborhood 2 data points Neighborhood 3 data points Excluded data Interpolated point 3 local fitted surface 2 local fitted surface 3 Step 2 (images from ArcGIS 9.2 Help files) Step 3 Model surface generated by many local fits Note that several neighborhoods share some of the same data points: neighborhoods overlap 11/3/2016 GEO327G/386G, UT Austin 21

Trend Surfaces Local fitting 3 Interpolated point 4 Excluded data Interpolated point 5 Excluded data Neighborhood 4 data points local fitted surface 4 Neighborhood 5 data points local fitted surface 5 Step 4 (images from ArcGIS 9.2 Help files) Step 5 Five different polynomials generate five local fits; in this example all are 1 st Order. 11/3/2016 GEO327G/386G, UT Austin 22

Trend Surfaces Local fitting 4 Result: Original data points are black Interpolated points are in colors (image from ArcGIS 9.2 Help files) Note that model surface (purple) passes through interpolated points, not measured data points. 11/3/2016 GEO327G/386G, UT Austin 23

Why Trend, Spline or IDW Surfaces? No strong reason to assume that z correlated with x, y in these simple ways Fitted surface doesn t pass through all points in Trend Data aren t used to help select model Exploratory, deterministic techniques, but theoretically weak 11/3/2016 GEO327G/386G, UT Austin 24

Deterministic vs. Geostatistical Models Deterministic: purely a function of distance No associated uncertainties are used or derived E.g. IDW, Trend, Spline Geostatistical: based on statistical properties Uncertainties incorporated and provided as a result Kriging 11/3/2016 GEO327G/386G, UT Austin 25

Approximate Methods - Kriging Kriging o Another inverse distance method o Considers distance, cluster and spatial covariance (autocorrelation) look for patterns in data o Fit function to selected points; look at correlation, covariance and/or other statistical parameters to arrive at weights interactive process o Good for data that are spatially or directionally correlated (e.g. element concentrations) 11/3/2016 GEO327G/386G, UT Austin 26

Kriging Look for patterns over distances, then apply weights accordingly. Steps: 1) Make a description of the spatial variation of the data - variogram 2) Summarize variation by a function 3) Use this model to determine interpolation weights 11/3/2016 GEO327G/386G, UT Austin 27

y Kriging Step 1 Describe spatial variation with Semivariogram (Z i Z j ) 2 / 2 Point cloud 3 2 6 6 x 7 4 5 5 7 5 Distance between pairs of points 11/3/2016 GEO327G/386G, UT Austin 28

Kriging Step 1 Divide range into series of lags ( buckets, bins ) Find mean values of lags (Z i Z j ) 2 / 2 Lag Lag Mean Distance between pairs of points 11/3/2016 GEO327G/386G, UT Austin 29

Kriging Step 2 Summarize spatial variation with a function (Z i Z j ) 2 / 2 o Several choices possible; curve fitting defines different types of Kriging (circular, spherical, exponential, gaussian, etc.) Distance between pairs of points 11/3/2016 GEO327G/386G, UT Austin 30

Semivariance Kriging Step 2 Key features of fitted variogram: Range Nugget Distance (d) Sill Nugget: semivariance at d = 0 Range: d at which semivariance is constant Sill: constant semivariance beyond the range 11/3/2016 GEO327G/386G, UT Austin 31

Kriging Step 2 Key features of fitted variogram: o Nugget Measure of uncertainty of z values; precision of measurements o Range No structure to data beyond the range; no correlation between distance and z beyond this value o Sill Measure of the approximate total variance of z 11/3/2016 GEO327G/386G, UT Austin 32

Kriging Step 2 Model surface profiles and their variograms: As local variation in surface increases, range decreases, nugget increases Source: O Sullivan and Unwin, 2003 11/3/2016 GEO327G/386G, UT Austin 33

Kriging Step 3 Determine Interpolated weights o Use fitted curve to arrive at weights not explained here; see O Sullivan and Unwin, 2003 for explanation o In general, nearby values are given greater weight (like IDW), but direction can be important (e.g. shielding can be considered) 11/3/2016 GEO327G/386G, UT Austin 34

Review: Deterministic vs. Geostatistical Models Deterministic: interpolation purely a function of distance No associated uncertainties are used or derived E.g. IDW, Trend, Spline Geostatistical: interpolation is statistically based Uncertainties incorporated and provided as a result Kriging 11/3/2016 GEO327G/386G, UT Austin 35

Kriging Part II Goal: predict values where no data have been collected Relies on first establishing: DEPENDENCY z is, in fact, correlated with distance STATIONARITY z values are stochastic (except for spatial dependency they are randomly distributed) and have no other dependence use detrending or transformation tools if not Gaussian DISTRIBUTION works best if data are Gaussian. If not they have to first be made close to Gaussian. 11/3/2016 GEO327G/386G, UT Austin 36

ESRI Geostatistical Analyst Products Map types: Prediction contours of interpolated values Prediction Standard Errors show error distribution, as quantified by minimized RMS error (see below) Probability show where values exceed a specified threshold Quantile show where thresholds overestimate or underestimated predictions 11/3/2016 GEO327G/386G, UT Austin 37

ESRI Geostatistical Analyst Products Maps: (e.g. max. ozone concentration, 1999) o Prediction o Prediction Standard Errors o Probability o Quantile Figure from ESRI Intro. to Modeling Spatial Processes Using Geostatistical Analyst 11/3/2016 GEO327G/386G, UT Austin 38

Some Kriging Products Prediction map interpolated values Probability map- showing where critical values exceeded Prediction Map Probability Map Prediction map of radioceasium soil contamination levels in the country of Belarus after the Chernobyl nuclear power plant accident. Dark orange and red indicate a probability greater than 62.5% that radiocesium contamination exceeds the critical threshold in forest berries. Figures from ESRI Using Geostatistical Analyst 11/3/2016 GEO327G/386G, UT Austin 39

Kriging Part II Goal: predict values where no data have been collected Relies on first establishing: DEPENDENCY z is, in fact, correlated with distance STATIONARITY z values are stochastic (except for spatial dependency they are randomly distributed) and have no other dependence use detrending or transformation tools if not Gaussian DISTRIBUTION works best if data are Gaussian. If not they have to first be made close to Gaussian. 11/3/2016 GEO327G/386G, UT Austin 40

1. SPATIAL DEPENDENCY o Test with semivariogram & cross-validation plots Violates Stationarity detrend first Semivariogram and temperature distribution map of winter temperature data for the USA. Figure from ESRI Using Geostatistical Analyst 11/3/2016 GEO327G/386G, UT Austin 41

Spatial Dependence: Semivariogram Strong Spatial Dependence Weak Spatial Dependence Figures from ESRI Intro. to Modeling Spatial Processes Using Geostatistical Analyst 11/3/2016 GEO327G/386G, UT Austin 42

Spatial Dependence: Cross-Validation Diagnostic Use a subset of the data to test measured vs. predicted values Strong Spatial Dependence Weak Spatial Dependence Figures from ESRI Intro. to Modeling Spatial Processes Using Geostatistical Analyst 11/3/2016 GEO327G/386G, UT Austin 43

Kriging Part II Goal: predict values where no data have been collected Relies on first establishing: DEPENDENCY z is, in fact, correlated with distance STATIONARITY z values are stochastic (except for spatial dependency they are randomly distributed) and have no other dependence use detrending or transformation tools if not Gaussian DISTRIBUTION works best if data are Gaussian. If not they have to first be made close to Gaussian. 11/3/2016 GEO327G/386G, UT Austin 44

2. STATIONARITY - Randomness Data variance and mean is the same at all localities (or within a neighborhood of nearest points); data variance is constant in the neighborhood of investigation Correlation (covariance) depends only on the vector that separates localities, not exact locations, number of measurement or direction Figures from ESRI Intro. to Modeling Spatial Processes Using Geostatistical Analyst 11/3/2016 GEO327G/386G, UT Austin 45

California Ozone Demo. Data in Geostat_demo folder 11/3/2016 GEO327G/386G, UT Austin 47

ArcGIS Kriging Processing Steps 1. Add and display the data 2. Explore the data s statistical properties 3. Select a model to create a surface make a prediction map! 4. Assess the result 5. Compare to other models 11/3/2016 GEO327G/386G, UT Austin 48

Data Exploration 1. Examine the distribution normal (Gaussian)? Transformation to normal required? o Histograms and QQPlots 2. Identify trends, if any o Trend Analysis 3. Understand spatial autocorrelation and directional influences Semivariogram analysis 11/3/2016 GEO327G/386G, UT Austin 49

Data Exploration: Examine the Distribution Normal (Gaussian) distribution? Transformation to normal required? Histogram tool, QQPlot tool (compare real and standard normal distributions) unimodal, near-symmetric straight line = nearly Normally Distributed data 11/3/2016 GEO327G/386G, UT Austin 50

Data Exploration: Identify Trends, If Any Underlying trends affect Kriging assumption of randomness remove and work with residuals o Trend Analysis tool Strong ~NE-SW Ushaped Trend Weaker ~NW-SE linear Trend 11/3/2016 GEO327G/386G, UT Austin 51

Data Exploration: Spatial Autocorrelation & Directional Influences Variogram Analysis: Look for correlation with distance Look for directional trends among pairs of points Semivariogram/ Covariance Cloud tool Exhibits Spatial Autocorrelation Directional Influence shown here pairs in NNW-SSE direction show largest covariance over shortest distances 11/3/2016 GEO327G/386G, UT Austin 52

ArcGISKriging Processing Steps 1. Add and display the data 2. Explore the data s statistical properties 3. Select a model to create a surface make a prediction map! 4. Assess the result 5. Compare to other models 11/3/2016 GEO327G/386G, UT Austin 53

Mapping Ozone Concentration 1. Incorporate results of Data Exploration into Model selection This example: remove underlying trends discovered during data exploration that have a rational explanation. (Analysis is then performed on residuals and trend surface is added back into final surface) = Detrending Remove directional trends between pairs of points in certain directions closer points are more alike than in other directions = anisotropy removal 11/3/2016 GEO327G/386G, UT Austin 54

Mapping Ozone Concentration Interpolation & Cross Validation 2. Define search neighborhood for interpolation (c.f. I.D.W.) Use a search ellipse (or circle) to find nearest neighbors; specify radii of ellipse, min. & max. number of points per sectors 3. Examine Cross Validation plot Predicted vs. Measured for subset(s) of the data Mean error should be close to zero RMS error and mean standardized error should be small RMS standardized error should be close to one. 11/3/2016 GEO327G/386G, UT Austin 55

ArcGISKriging Processing Steps 1. Add and display the data 2. Explore the data s statistical properties 3. Select a model to create a surface make a prediction map! 4. Assess the result Cross Validation Plots 5. Compare to other models 11/3/2016 GEO327G/386G, UT Austin 56

Comparing Model Results Cross validation comparisons: Mean error should be close to zero RMS error and mean standardized error should be small RMS standardized error should be close to one. 11/3/2016 GEO327G/386G, UT Austin 57

Probality Mapping with Indicator Kriging Task: Make a map that show the probability of exceeding a critical threshold, e.g. 0.12 ppm ozone for an 8 hr. period Technique: Transform data to a series of 0s and 1s according to whether they are above or below the threshold Use a semivariogram on transformed data; interpret indicator prediction values as probabilities 11/3/2016 GEO327G/386G, UT Austin 58