THE THEORY OF REGIONALIZED VARIABLES

Similar documents
y and the total sum of

Modeling Local Uncertainty accounting for Uncertainty in the Data

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

Support Vector Machines

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

S1 Note. Basis functions.

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Parallelism for Nested Loops with Non-uniform and Flow Dependences

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

CS 534: Computer Vision Model Fitting

Page 0 of 0 SPATIAL INTERPOLATION METHODS

Mathematics 256 a course in differential equations for engineering students

3D vector computer graphics

Machine Learning 9. week

Exercises (Part 4) Introduction to R UCLA/CCPR. John Fox, February 2005

UNIT 2 : INEQUALITIES AND CONVEX SETS

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

X- Chart Using ANOM Approach

Lecture 5: Multilayer Perceptrons

Performance Evaluation of Information Retrieval Systems

CMPS 10 Introduction to Computer Science Lecture Notes

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Problem Set 3 Solutions

Solutions to Programming Assignment Five Interpolation and Numerical Differentiation

Hermite Splines in Lie Groups as Products of Geodesics

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Wishing you all a Total Quality New Year!

Classifier Selection Based on Data Complexity Measures *

A Semi-parametric Regression Model to Estimate Variability of NO 2

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

Lecture 5: Probability Distributions. Random Variables


Available online at ScienceDirect. Procedia Environmental Sciences 26 (2015 )

APPLICATION OF A COMPUTATIONALLY EFFICIENT GEOSTATISTICAL APPROACH TO CHARACTERIZING VARIABLY SPACED WATER-TABLE DATA

TN348: Openlab Module - Colocalization

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

Analysis of Continuous Beams in General

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning

Adaptive Transfer Learning

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap

Feature Reduction and Selection

Reducing Frame Rate for Object Tracking

THE MAP MATCHING ALGORITHM OF GPS DATA WITH RELATIVELY LONG POLLING TIME INTERVALS

A New Approach For the Ranking of Fuzzy Sets With Different Heights

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Help for Time-Resolved Analysis TRI2 version 2.4 P Barber,

Lecture #15 Lecture Notes

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Solitary and Traveling Wave Solutions to a Model. of Long Range Diffusion Involving Flux with. Stability Analysis

Mobility-dependent small-scale propagation model for applied simulation studies

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

The Codesign Challenge

Improved Methods for Lithography Model Calibration

Cell Count Method on a Network with SANET

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A Similarity-Based Prognostics Approach for Remaining Useful Life Estimation of Engineered Systems

Parameter estimation for incomplete bivariate longitudinal data in clinical trials

Interpolation and assimilation methods for European scale air quality assessment and mapping. Part I: Review and recommendations

A DATA ANALYSIS CODE FOR MCNP MESH AND STANDARD TALLIES

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning

SVM-based Learning for Multiple Model Estimation

Learning-Based Top-N Selection Query Evaluation over Relational Databases

K-means and Hierarchical Clustering

A Statistical Model Selection Strategy Applied to Neural Networks

Intro. Iterators. 1. Access

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach

A CLASS OF TRANSFORMED EFFICIENT RATIO ESTIMATORS OF FINITE POPULATION MEAN. Department of Statistics, Islamia College, Peshawar, Pakistan 2

Automatic selection of reference velocities for recursive depth migration

Stitching of off-axis sub-aperture null measurements of an aspheric surface

Lecture 4: Principal components

Understanding K-Means Non-hierarchical Clustering

Programming in Fortran 90 : 2017/2018

A Simple and Efficient Goal Programming Model for Computing of Fuzzy Linear Regression Parameters with Considering Outliers

MOTION BLUR ESTIMATION AT CORNERS

Cluster Analysis of Electrical Behavior

Math Homotopy Theory Additional notes

Review of approximation techniques

A Robust Method for Estimating the Fundamental Matrix

Outlier Detection based on Robust Parameter Estimates

Fusion Performance Model for Distributed Tracking and Classification

Multi-view 3D Position Estimation of Sports Players

A New MPS Simulation Algorithm Based on Gibbs Sampling

Module Management Tool in Software Development Organizations

User Authentication Based On Behavioral Mouse Dynamics Biometrics

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

CHAPTER 2 DECOMPOSITION OF GRAPHS

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT

b * -Open Sets in Bispaces

The Man-hour Estimation Models & Its Comparison of Interim Products Assembly for Shipbuilding

GENETIC ALGORITHMS APPLIED FOR PATTERN GENERATION FOR DOWNHOLE DYNAMOMETER CARDS

Transcription:

CHAPTER 4 THE THEORY OF REGIONALIZED VARIABLES 4.1 Introducton It s ponted out by Armstrong (1998 : 16) that Matheron (1963b), realzng the sgnfcance of the spatal aspect of geostatstcal data, coned the term regonalzed varable to represent varables dstrbuted n space. Such varables possess both a random aspect, whch accounts for local rregulartes, and a structured aspect, whch reflects large scale tendences (Armstrong, 1998 : 16) and are deally suted to descrbe almost all varables encountered n the earth scences (Journel & Hujbregts, 1978 : 27). Consder, for example, a gold-ore depost. The grade of the gold-ore can be thought of as a regonalzed varable, snce values of the grade are dstrbuted n space and dsplay both a structured aspect (areas of rch and poor grade) and a random aspect (local varablty n grade measurements). From a mathematcal pont of vew, a [regonalzed varable] s smply a functon f(x) whch takes a value at every pont x [wthn a studed spatal regon] (Journel & Hujbregts, 1978 : 27). So, a random varable can be seen as varyng randomly accordng to some probablty dstrbuton, whle a regonalzed varable can be seen as a varable varyng regonally (.e. accordng to ts spatal locaton). 4.2 The Probablstc Model of Geostatstcs Geostatstcs utlzes a probablstc model whch s based on random functons and regonalzed varables (Journel & Hujbregts, 1978 : 29 and Wackernagel, 51

1995 : 25). A random functon s merely a set of random varables and s also commonly referred to as a stochastc process or random feld (Cresse, 1991 : 30 and Armstrong, 1998 : 17). Herewth follows a descrpton of the probablstc model of geostatstcs. Consder a spatal attrbute (such as gold-ore grade) defned over some spatal regon Ω, and suppose further that n values of ths attrbute have been measured at varous locatons wthn Ω. In order to perform any geostatstcal calculatons on the attrbute, the probablstc model of geostatstcs assumes the exstence of the random functon (.e. the set of random varables) { ( x), x Ω} Z (4.1) where Z(x) s the random varable defnng the value of the spatal attrbute at spatal locaton x, and Ω s a fxed regon whose spatal ndex x vares contnuously throughout Ω (Cresse, 1991 : 8, 30). Note 4.1 In geostatstcs, the random functon n Equaton 4.1 s often smply referred to as the random functon Z(x). Thus, Z(x) can refer to ether a random functon as n Equaton 4.1, or t can refer to a random varable stuated at spatal locaton x. It s generally always understood whch meanng of Z(x) s ntended, and throughout ths dssertaton t wll be made clear whch s ntended. Note 4.2 It s generally accepted n geostatstcs that Ω 2 or Ω 3 (.e. the studed spatal regon s generally of ether two- or three-dmensons). Thus, although spatal locatons n ths dssertaton wll generally be referred to smply by a sngle letter, say x (as has been done thus far n Chapter 4), ths notaton actually mples a spatal locaton vector n two- or three-dmensons. In other words, x wll ether mply the spatal locaton (x 1, x 2 ) n two-dmensons, or the spatal locaton (x 1, x 2, x 3 ) n three-dmensons. 52

A realzaton of the random functon Z(x) n Equaton 4.1 s classfed as the regonalzed varable z(x), and therefore the avalable spatal data set { ( x ) 1,..., n} z : = (4.2) s smply a number of observatons selected from the regonalzed varable z(x) (Journel & Hujbregts, 1978 : 30 and Wackernagel, 1995 : 26-27). Cresse (1991: 8-11, 29-30) provdes an excellent explanaton of ths probablstc model. Note 4.3 As wth Note 4.1, the notaton z(x) can refer to both a realzaton of the random functon Z(x) (.e. a regonalzed varable), or a realzaton of the random varable Z(x) defnng an attrbute value at spatal locaton x. Ths dssertaton wll agan nsure that t s clear as to whch meanng of z(x) s ntended. Note 4.4 Journel & Hujbregts (1978 : 29-30) pont out that the random functon construct n Equaton 4.1 s deally suted to the defnton of regonalzed varables, snce the value at each locaton wthn the spatal area s theoretcally the outcome of a random varable, whch need not be ndependent of the random varables at the other locatons. Snce geostatstcal data s vewed as a partal realzaton of a random functon, t s not possble to deduce the probablstc laws central to ths random functon solely from the avalable spatal data. Journel & Hujbregts (1978 : 30) state: Obvously, t s not rgorously possble to nfer the probablty law of a [random functon] Z(x) from a sngle realzaton z(x) whch s, n addton, lmted to a fnte number of sample ponts x. Therefore, n order to proceed wth the probablstc model of geostatstcs, certan assumptons need to be made. Some of the most mportant of these assumptons are dscussed n Secton 4.3. 53

4.3 Assumptons The followng assumptons permt estmaton of the probablstc laws governng the random functon Z(x) from a sngle partal realzaton. 4.3.1 Second-order statonarty The random functon defned n Equaton 4.1 s deemed second-order statonary f: and { ( x) } = m, x Ω E Z (4.3) ( ( x h), Z( x) ) = C( h), cov Z + x, x + h Ω (4.4) Ths assumpton mples that the frst two moments of all the random varables contaned n the random functon exst and are constant, and that the covarance between any two random varables depends only on the dstance vector between them and not on ther actual postons n Ω (Journel & Hujbregts, 1978 : 32 and Armstrong, 1998 : 18). In fact, accordng to Journel & Hujbregts (1978 : 33), n many stuatons n geostatstcs the assumpton of second-order statonarty can be weakened such that a covarance between the random varables as n Equaton 4.4 need not even exst at all. Ths weaker assumpton s called ntrnsc statonarty. 4.3.2 Intrnsc statonarty The random functon defned n Equaton 4.1 s sad to be ntrnscally statonary f: and ( ( x + h) Z( x) ) = 0 x, x + h Ω E Z (4.5) ( ( x + h) Z( x) ) = 2 ( h), x, x + h Ω var Z γ (4.6) Accordng to Armstrong (1998 : 19) the ntrnsc assumpton smply assumes that the ncrements of the random varables from the random functon Z(x) are 54

second-order statonary. Intrnsc statonarty assumes that the varablty of the dfference between two random varables from Z(x) depends only on the dstance vector between them, and not on ther actual postons wthn Ω. Note 4.5 A second-order statonary random functon s also ntrnscally statonary, but the converse s not necessarly true (Journel & Hujbregts, 1978 : 33). Note 4.6 The prevous two assumptons are commonly assocated wth lnear statonary geostatstcs (see Secton 1.4). Some authors, such as Rendu (1981), express Equaton 4.5 more generally as ( Z ( x + h) Z( x) ) = m( h) E (4.7) where m(h) s a lnear drft functon, whch mples the absence of drft, or the presence of a lnear drft (Rendu, 1981 : 17). Although n such cases the defnton of ntrnsc statonarty s slghtly dfferent to that gven n ths dssertaton, these authors all concede that when applyng the assumpton of ntrnsc statonarty to lnear statonary geostatstcs, m(h) n Equaton 4.7 must be equal to zero. Note 4.7 The functons C(h) n Equaton 4.4 and 2γ(h) n Equaton 4.6 are commonly referred to as the covarance functon and varogram respectvely. The functon γ(h) (.e. half of the varogram) s called the sem-varogram. 4.3.3 Ergodcty: An assumpton that s rarely stated s that of ergodcty. Ergodc random functons are second-order statonary (Cresse, 1991 : 53), and the assumpton of ergodcty n geostatstcs s dscussed n Matheron (1989 : 81- Certan texts carelessly defne γ(h) as the varogram, whle other napproprately use the term varogram and sem-varogram nterchangeably. 55

83), Cresse (1991 : 53-58) and Chlès & Delfner (1999 : 19-22). The dscusson n Chlès & Delfner (1999) s partcularly good. Accordng to Chlès & Delfner (1999 : 19-20), a spatal random functon Z(x) s sad to be ergodc n the mean f, for all realzatons z(x), lm 1 V V V z( x) dx = m where V denotes the spatal doman over whch Z(x) s defned, V denotes the volume of V, V s assumed to tend towards nfnty n all drectons, and m s a constant. In other words, ergodcty n the mean allows the determnaton of the mean from a sngle realzaton of the statonary random functon (Chlès & Delfner, 1999 : 20). Ergodcty n the covarance can also be defned (see Chlès & Delfner, 1999 : 21), as can ergodcty n any other moment of the random functon. Thus the assumpton of ergodcty n spatal statstcs mples that the moments of all realzatons z(x) of a random functon Z(x) converge to the moments of Z(x) as the spatal area n whch z(x) occurs tends towards nfnty (Chlès & Delfner, 1999: 19-22 and Chlès, J.-P. 2004, pers. comm., 21 Aprl). However, n geostatstcs, the spatal regon studed does not tend towards nfnty, and only one realzaton of Z(x) s consdered thus, there s no way of knowng f the [moments] would have converged to a dfferent value on another realzaton (Chlès & Delfner, 1999 : 21). Therefore, for the probablstc model of geostatstcs, ergodcty s smply assumed as a property of the random functon n Equaton 4.1. In fact, accordng to Cresse (1991 : 58), the assumpton of ergodcty n geostatstcs can be Accordng to Chlès & Delfner (1999 : 22), Matheron (1978) ntroduced the concept of mcroergodcty to deal wth cases of spatal domans that do not tend towards nfnty. 56

weakened to smply assume that the mean and covarance (or sem-varogram) of the sngle realzaton z(x) converges to the true mean and covarance (or semvarogram) of the random functon Z(x). 4.4 Alternatve Approaches to Spatal Estmaton Up to now the probablstc model of geostatstcs and the assumptons whch allow for geostatstcal estmaton have been dscussed. Varous other approaches to spatal estmaton also exst, each wth ther own model and assumptons. Besdes geostatstcs, three of the more common spatal estmaton technques nclude the nverse dstance weghtng method, the method of least squares regresson and the polygonal method. 4.4.1 Inverse dstance weghtng Ths s an uncomplcated and appealng technque that assumes that the relatonshp between values depends [only] on the dstance between ther locatons [and sometmes, also the drecton] (Clark and Harper, 2000 : 190). Suppose that the value of a spatal attrbute has been measured at n spatal locatons, resultng n the spatal data set { ( x ) 1,..., n} z : =. If the unknown value of the attrbute at spatal locaton x 0 s estmated by Q, the nverse dstance weghtng technque assumes that Q n w z = 1 = n = 1 ( x ) w (4.8) where the weghts w are based on some nverse functon of the dstance d between the locatons x and x 0 (Isaaks & Srvastava, 1989 : 257-258). Accordng to Clark & Harper (2000 : 190), examples of weght functons regularly used are 1 d, 1 d 2 and e d. 57

To llustrate the nverse dstance weghtng method, consder the followng example taken from Isaaks & Srvastava (1989 : 257): Example 4.1 Suppose that the value of seven pont support samples has been measured and that an estmate of the value z(x 0 ) at the locaton (65E, 137N) s desred (see Fgure 4.1). Suppose further that t s assumed that the relatonshp between sample values depends only on the nverse of ther Eucldean dstance d apart. Table 4.1 dsplays the requred calculatons for estmatng the value z(x 0 ) usng the method of nverse dstance weghtng. Fgure 4.1 The locaton of the seven measured pont support samples, as well as the locaton of the pont to be estmated (whch s ndcated by the arrow). Fgure 4.1 has been taken drectly from Isaaks & Srvastava (1989 : 251). Usng the calculatons dsplayed n Table 4.1 together wth Equaton 4.8, the nverse dstance weghtng estmate of the value z(x 0 ) s Q = 594. Regrettably, ths straghtforward method possesses many shortcomngs whch are hghlghted by a large amount of geostatstcal lterature (e.g. Clark and Harper 2000, Journel & Hujbrechts 1978, Wackernagel 1995, and Wellmer 1998). Accordng to these authors, nverse dstance weghtng methods: 58

59

Fal to defne measures of dealng wth samples that are not all of the same support (consder the mplcatons by revewng Secton 2.4.1 and Note 3.1). Often break down n tryng to provde estmates at the locaton of samples used n the calculaton. Ths problem s hghlghted by the queston of Clark & Harper (2000 : 192): What happens when d becomes zero? None of the 1 d type functons can be calculated f d s zero. May supply based solutons. These are but a few of the many problems assocated wth the method of nverse dstance weghtng most of whch are not problems for geostatstcal estmaton. 4.4.2 Least squares regresson Suppose the value of a spatal attrbute has been measured at n locatons. The theory of least squares regresson can be used to produce a predcton surface over the study area. A key assumpton of ths technque s the ndependence of resduals, whch s unfortunately not generally the case for geologcal varables (Armstrong, 1998 : 16). Fttng a surface whch does result n uncorrelated resduals s both tedous and results n complex regresson equatons (see Note 4.8). Unlke krgng, least squares regresson s not an exact nterpolator (Koch & Lnk, 1971 : 43-44) and, accordng to Journel & Hujbregts (1978 : 343), least squares regresson estmates are not as good (n terms of mnmum estmaton varance) as krgng estmates. Note 4.8 Accordng to Armstrong (1998), Gomez & Hazen (1970) ftted a regresson equaton to descrbe the proporton of pyrtc sulphur n a certan type of coal, whch resulted n uncorrelated resduals. Ths regresson equaton s coped straght from Armstrong (1998 : 17) and dsplayed n Table 4.2. Armstrong (1998 : 17) wrtes: Ths [equaton n Table 4.2] assumes uncorrelated errors, whch forces the [predcton] surface to twst and turn 60

rapdly, hence the trgonometrc and exponental terms. The dffcultes assocated wth usng such a complcated equaton (e.g., the estmaton of the equaton parameters, the concept of parsmony, etc.) are obvous. Table 4.2 The regresson equaton ftted to descrbe the proporton of pyrtc sulphur n a certan type of coal (taken from Armstrong, 1998 : 17). 4.4.3 Polygonal estmaton Polygonal estmaton s nothng more than choos[ng] as an estmate the sample value that s closest to the pont [or block beng estmated]... (Isaaks & Srvastava, 1989 : 250). In the case where two or more samples are equally close to the pont or block beng estmated, the arthmetc average of these sample values s used as the estmate. Example 4.2 Consder the estmaton of z(x 0 ), the pont located at (65E, 137N) n Example 4.1. In ths example, z(x 0 ) was estmated to be equal to 594. If polygonal estmaton was used nstead to estmate the value of z(x 0 ), t s seen n Table 4.1 that sample 61

two s located the closest to (65E, 137N), and hence z(x 0 ) s estmated as 696 (see Isaaks & Srvastava, 1989 : 251). The beneft of polygonal estmaton s that t s straght-forward to apply. Unfortunately, ths estmaton technque also possesses numerous shortcomngs some of whch are mentoned n Isaaks & Srvastava (1989 : 251-252), Goovaerts (1997 : 356) and Armstrong (1998 : 9). These shortcomngs nclude the fact that polygonal estmaton does not take nto account the spatal structure and contnuty of the attrbute beng studed and estmated. polygonal estmaton does not take nto account the support effect (see Secton 2.4.1 and Note 3.1). Suppose, for nstance, that pont support samples are used to estmate a block support area/volume va the polygonal estmaton method. Armstrong (1998 : 9), descrbng ths scenaro n a mnng context, rghtly warns that,...the polygonal method equates the grades of the samples (.e. a small support) wth those of the blocks [and hence does not] take account of the dfference between the supports of the samples and the blocks to be estmated; that s, of the support effect. the predcton surface of the polygonal estmaton method s dscontnuous. Note 4.9 Polygonal estmaton s also known by the names Thessen polygons and Vorono polygons (Olea, 1990). Note 4.10 Polygonal estmaton can be seen as an extreme case of nverse dstance weghtng, where the relatonshp between samples s assumed to be adequately explaned by p 1 d where p s assumed to be extremely large (n fact, tendng towards nfnty). 62

4.4.4 Addtonal comments Although geostatstcs would appear superor to nverse dstance weghtng, least squares regresson and the polygonal method (snce geostatstcs s not affected by the varous shortcomng lsted for each of the methods dscussed n Secton 4.4), these three methods stll retan mportant roles n spatal estmaton. Not only are the mathematcal calculatons of these methods undemandng, but regresson, for nstance, also plays a sgnfcant role n dentfyng and elmnatng trend from non-statonary data (see Secton 7.12.2). Note 4.11 An experment usng a smulated spatal data set s presented n Secton 8.3, and n ths experment the estmaton results of krgng, nverse dstance weghtng and polygonal estmaton are compared. 63