Generalized Random Tessellation Stratified (GRTS) Spatially-Balanced Survey Designs for Aquatic Resources

Similar documents
Package spsurvey. May 22, 2015

Welcome to NR402 GIS Applications in Natural Resources. This course consists of 9 lessons, including Power point presentations, demonstrations,

Compilation of GIS data for the Lower Brazos River basin

Package spsurvey. August 19, 2016

Spatial properties of design-based versus model-based approaches to environmental sampling

Lab 2. Vector and raster data.

Spatial Hydrologic Modeling Using NEXRAD Rainfall Data in an HEC-HMS (MODClark) Model

Geographic Surfaces. David Tenenbaum EEOS 383 UMass Boston

Topic 5: Raster and Vector Data Models

GRTS Survey Designs for a Linear Resource

HOW TO USE THE VERMONT DAM SCREENING TOOL

BAT Quick Guide 1 / 20

Longley Chapter 3. Representations

Stream Network and Watershed Delineation using Spatial Analyst Hydrology Tools

AK Hydro Data Editing Standards

Minnesota Department of Natural Resources ArcView Utilities Extension User s Guide

St. Johns River Water Management District. Tim Cera, P.E.

Creating raster DEMs and DSMs from large lidar point collections. Summary. Coming up with a plan. Using the Point To Raster geoprocessing tool

Using GIS to Site Minimal Excavation Helicopter Landings

Comparison of multiple point and single point calibration performance for the Saginaw River Watershed

SPATIAL DATA MODELS Introduction to GIS Winter 2015

GIS LAB 1. Basic GIS Operations with ArcGIS. Calculating Stream Lengths and Watershed Areas.

Computation of Slope

Terrain Modeling with ArcView GIS from ArcUser magazine

GIS LAB 8. Raster Data Applications Watershed Delineation

MODFLOW STR Package The MODFLOW Stream (STR) Package Interface in GMS

THE FISH HABITAT DECISION SUPPORT TOOL: COASTAL DATA TUTORIAL

Coastal Protection and Restoration Division

WMS 9.1 Tutorial GSSHA WMS Basics Watershed Delineation using DEMs and 2D Grid Generation Delineate a watershed and create a GSSHA model from a DEM

Welcome to the Surface Water Data Viewer!

GLOBAL GRIDS FROM RECURSIVE DIAMOND SUBDIVISIONS OF THE SURFACE OF AN OCTAHEDRON OR ICOSAHEDRON

Contents of Lecture. Surface (Terrain) Data Models. Terrain Surface Representation. Sampling in Surface Model DEM

Introducing ArcScan for ArcGIS

Correctly Compute Complex Samples Statistics

WMS 10.1 Tutorial GSSHA WMS Basics Watershed Delineation using DEMs and 2D Grid Generation Delineate a watershed and create a GSSHA model from a DEM

Data Assembly, Part II. GIS Cyberinfrastructure Module Day 4

Spatial Data Models. Raster uses individual cells in a matrix, or grid, format to represent real world entities

Understanding Geospatial Data Models

Finding and Using Spatial Data

Scientific Computing: An Introductory Survey

M. Andrea Rodríguez-Tastets. I Semester 2008

Channel Conditions in the Onion Creek Watershed. Integrating High Resolution Elevation Data in Flood Forecasting

A GIS SAMPLING ASSISTANT PROGRAM FOR FOREST INVENTORY POINT/PLOT SCHEMES

Spatial Data Management

FOR Save the Poudre: Poudre Waterkeeper BY PetersonGIS. Inputs, Methods, Results. June 15, 2012

WMS 9.1 Tutorial Hydraulics and Floodplain Modeling Floodplain Delineation Learn how to us the WMS floodplain delineation tools

The GIS Spatial Data Model

Class #2. Data Models: maps as models of reality, geographical and attribute measurement & vector and raster (and other) data structures

Exercise 5. Height above Nearest Drainage Flood Inundation Analysis

Automated Enforcement of High Resolution Terrain Models April 21, Brian K. Gelder, PhD Associate Scientist Iowa State University

v Introduction to WMS WMS 11.0 Tutorial Become familiar with the WMS interface Prerequisite Tutorials None Required Components Data Map

Spatial Data Management

Estimation of Design Flow in Ungauged Basins by Regionalization

Registering new custom watershed models or other polygonal themes

Notes: Notes: Notes: Notes:

Curve Fit: a pixel level raster regression tool

v MODFLOW SUB Package The MODFLOW SUB Package Interface in GMS GMS Tutorials Time minutes

4.0 DIGITIZATION, EDITING AND STRUCTURING OF MAP DATA

Statistical surfaces and interpolation. This is lecture ten

v MODPATH GMS 10.3 Tutorial The MODPATH Interface in GMS Prerequisite Tutorials MODFLOW Conceptual Model Approach I

Terrain Processing for Efficient 1D and 2D H&H Modeling. Dean Djokic and Zichuan Ye, Esri Inc.

Using GIS for hierarchial refinement of MODFLOW models

GIS IN ECOLOGY: CREATING RESEARCH MAPS

Local Pivotal Methods for Large Surveys

v Prerequisite Tutorials GSSHA Modeling Basics Stream Flow GSSHA WMS Basics Creating Feature Objects and Mapping their Attributes to the 2D Grid

Spatial Density Distribution

WMS 8.4 Tutorial Watershed Modeling MODRAT Interface (GISbased) Delineate a watershed and build a MODRAT model

Lecture 6: GIS Spatial Analysis. GE 118: INTRODUCTION TO GIS Engr. Meriam M. Santillan Caraga State University

GIS Data Models. 4/9/ GIS Data Models

INTRODUCTION TO GIS WORKSHOP EXERCISE

layers in a raster model

v GMS 10.0 Tutorial MODFLOW SUB Package The MODFLOW SUB Package Interface in GMS Prerequisite Tutorials MODFLOW Conceptual Model Approach I

APPENDIX E2. Vernal Pool Watershed Mapping

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

The Global River Width Algorithm

Principles of Data Management. Lecture #14 (Spatial Data Management)

Lecture 06. Raster and Vector Data Models. Part (1) Common Data Models. Raster. Vector. Points. Points. ( x,y ) Area. Area Line.

An Example of Regional Web Resources: Massachusetts, USA Digital Data

Esri Best Practices: QA/QC For Your Geodata. Michelle Johnson & Chandan Banerjee

Exercise 1: Getting to know ArcGIS

The Geographic Names Register of the National Land Survey of Finland

Delineating Watersheds from a Digital Elevation Model (DEM)

MODFLOW-USG Complex Stratigraphy Create a MODFLOW-USG model of a site with complex 3D stratigraphy using GMS

GEOGRAPHIC INFORMATION SYSTEMS Lecture 18: Spatial Modeling

Uncertainty Analysis: Parameter Estimation. Jackie P. Hallberg Coastal and Hydraulics Laboratory Engineer Research and Development Center

Representing Geography

In this exercise we will:

WMS 9.0 Tutorial GSSHA WMS Basics Watershed Delineation using DEMs and 2D Grid Generation Delineate a watershed and create a GSSHA model from a DEM

CPSC 340: Machine Learning and Data Mining. Density-Based Clustering Fall 2016

Using ArcGIS 9.x: Quickstart Tutorial

Space Filling Curves and Hierarchical Basis. Klaus Speer

Neighbourhood Operations Specific Theory

v. 9.0 GMS 9.0 Tutorial MODPATH The MODPATH Interface in GMS Prerequisite Tutorials None Time minutes

MODFLOW UZF Package The MODFLOW Unsaturated-Zone Flow (UZF) Package Interface in GMS

Sedimentation in Aquilla Lake Hill County, Texas

RASTER ANALYSIS S H A W N L. P E N M A N E A R T H D A T A A N A LY S I S C E N T E R U N I V E R S I T Y O F N E W M E X I C O

GIS EXTENSION FOR BATHTUB

Questions: a User-friendly Interface to ANSWERS-2000

CSC Computer Graphics

University of Florida CISE department Gator Engineering. Clustering Part 4

Transcription:

Generalized Random Tessellation Stratified (GRTS) Spatially-Balanced Survey Designs for Aquatic Resources Anthony (Tony) R. Olsen USEPA NHEERL Western Ecology Division Corvallis, Oregon Voice: (541) 754-4790 Email: olsen.tony@epa.gov

Co-Developers Don Stevens, Oregon State U Denis White, EPA WED Richard Remington, EPA WED Barbara Rosenbaum, INDUS Corporation David Cassell, CSC EMAP Surface Waters Research Group State monitoring staff

Overview Aquatic resource characteristics Sample frame GIS coverages Imperfect representation of target population GRTS theory GRTS implementation Old: ArcInfo, SAS, C-program New: R program with GIS coverage preparation

Aquatic Resource Characteristics Types of aquatic resources Area polygons: large lakes and reservoirs, estuaries, coastal waters, everglades Linear networks: streams and rivers Discrete points: small lakes, stream reaches, prairie pothole wetlands, hydrologic units ( watersheds ) Target population Finite in a bounded geographic region: collection of points Continuous in a bounded geographic region As linear network As collection of polygonal areas Generalizations Geographic region may be 1-dimensional (p-dimensional) Space may be defined by other auxiliary variables

Typical Aquatic Sample Frames GIS coverages do exist for aquatic resources National Hydrography Dataset (NHD) Based on 1:100,000 USGS maps Combination of USGS Digital Line Graph (DLG) data set and USEPA River Reach File Version 3 (RF3) Includes lakes, ponds, streams, rivers Sample frames derived from NHD Use GIS to extract frame to match target population Enhance NHD with other attributes used in survey design Issues with NHD Known to include features not of interest (over-coverage) Known to exclude some aquatic resources (under-coverage)

Generalized Random Tessellation Stratified (GRTS) Survey Designs Probability sample producing design-based estimators and variance estimators Give another option to simple random sample and systematic sample designs Simple random samples tend to clump Systematic samples difficult to implement for aquatic resources and do not have design-based variance estimator Emphasize spatial-balance Every replication of the sample exhibits a spatial density pattern that closely mimics the spatial density pattern of the resource

GRTS Implementation Steps Concept of selecting a probability sample from a sampling line for the resource Create a hierarchical grid with hierarchical addressing Randomize hierarchical addresses Construct sampling line using randomized hierarchical addresses Select a systematic sample with a random start from sampling line Place sample in reverse hierarchical address order

Selecting a Probability Sample from a Sampling Line: Linear Network Case Place all stream segments in frame on a linear line Preserve segment length Identify segments by ID In what order do place segments on line? Randomly Systematically (minimal spanning tree) Randomized hierarchical grid Systematic sample with random start k=l/n, L=length of line, n=sample size Random start d between [0,k) Sample: d (i-1)*k for i=1,,n

Selecting a Probability Sample from a Sampling Line: Point and Area Cases Point Case: Identify all points in frame Assign each point unit length Place on sample line Area Case: Create grid covering region of interest Generate random points within each grid cell Keep random points within resource (A) Assign each point unit length Place on sample line

Randomized Hierarchical Grid Step 1 Step 2 Step 3 Step 4 Step 1: Frame: Large lakes: blue; Small lakes: pink; Randomly place grid over the region Step 2: Sub-divide region and randomly assign numbers to sub-regions Step 3: Sub-divide sub-regions; randomly assign numbers independently to each new sub-region; create hierarchical address. Continue sub-dividing until only one lake per cell. Step 4:Identify each lake with cell address; assign each lake length 1; place lakes on line in numerical cell address order.

Hierarchical Grid Addressing 213: hierarchical address

Population of 120 points Hierarchical Order Hiearchical Randomized Order y 0.0 0.2 0.4 0.6 0.8 1.0 y 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 x 0.0 0.2 0.4 0.6 0.8 1.0 x

RHO Reverse Base4 Base4 Original Order 1 00 00 1 2 01 10 5 3 02 20 9 4 03 30 13 5 10 01 2 6 11 11 6 7 12 21 10 8 13 31 14 9 20 02 3 10 21 12 7 11 22 22 11 12 23 32 15 13 30 03 4 14 31 13 8 15 32 23 12 Reverse Hierarchical Order Construct reverse hierarchical order Order the sites from 1 to n Create base 4 address for numbers Reverse base 4 address Sort by reverse base 4 address Renumber sites in RHO Why use reverse hierarchical order? Results in any contiguous set of sample sites being spatially-balanced Consequence: can begin at the beginning of list and continue using sites until have required number of sites sampled in field 16 33 33 16

Unequal Probability of Selection Assume want large lakes to be twice as likely to be selected as small lakes Instead of giving all lakes same unit length, give large lakes twice unit length of small lakes To select 5 sites divide line length by 5 (11/5 units); randomly select a starting point within first interval; select 4 additional sites at intervals of 11/5 units Same process is used for points and areas (using random points in area)

Complex Survey Designs based on GRTS Stratified GRTS: apply GRTS to each stratum Unequal probability GRTS: adjust unit length based on auxiliary information (eg lake area, strahler order, basin, ecoregion) Oversample GRTS: Design calls for n sites; some expected non-target, landowner denial, etc; select additional sites to guarantee n field sampled Apply GRTS for sample size 2n; place sites in RHO; use sites in RHO Panels for surveys over time Nested subsampling Two-stage sampling using GRTS at each stage Example: Select USGS 4 th field Hucs; then stream sites within Hucs

Two GRTS samples: Size 30 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 GRTS Sample of 30 x y 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 GRTS Sample of 30 x y

Spatial Balance: 256 points

Spatial Balance: With oversample

Ratio of GRTS to SRS Voronoi polygon size variance polygon area variance ratio 0.0 0.2 0.4 0.6 0.8 1.0 Continuous domain with no voids Constant polygon size, total perimeter = 88.4 Linearly increasing polygon size, total perimeter = 84.9 Exponentially increasing polygon size, total perimeter = 43.1 0 50 100 150 200 250 point density

Impact on Variance Estimators of Totals

RF3 Stream Length: EMAP West Strahle Total 1st 2nd 3rd 4th Rivers 0 500 1,000 1,500 2,000 Length (1,000 km) Non Perennial Perennial

Perennial Streams GRTS sample

RF3 Sample Frame: Lakes Lake Area (ha) Number of Lakes Percent Cumulative Number of Lakes Cumulative Percent 1 5 172,747 63.8 172,747 63.8 5 10 44,996 16.6 217,743 80.4 10 50 40,016 14.8 257,759 95.2 50 500 11,228 4.1 268,987 99.3 500 5000 1,500 0.6 270,387 99.9 >5000 274 0.1 270,761 100.0

US EPA NHEERL-WED EMAP Stat&Design j199.ow.lakes/plots/owsampall.ai 5/5/1999 National Fish Tissue Contaminant Lake Survey

Sample Selected: Lakes Lake Area (ha) 1999 2000 2001 2002 All Years Expected Weight 1-5 39 41 47 47 174 938.84 5-10 44 40 47 46 177 261.61 10-50 32 47 46 25 150 256.51 50-500 34 37 29 34 134 85.06 500-5000 36 30 31 41 138 11.36 >5000 40 30 25 32 127 2.21 Total 225 225 225 225 900

GRTS Sample of Streams All Streams and All Sites Category 1 Category 2 Category 3

Initial Software Implementation Hierarchical grid creation: C-program Extract sample frame from RF3/NHD: Arc/Info Intersect frame with hierarchical grid: Arc/Info Export data and summarize frame to determine inclusion densities for unequal probability sampling: SAS Complete hierarchical randomization, systematic sample of line, reverse hierarchical ordering: SAS Import sample to create design file with geographic coordinates: Arc/Info

New Implementation Extract sample frame from RF3/NHD Arc/Info, Arcview,. Required data format for points, lines, polygons Select sample: R program Input: Survey design specification Sample frame data Output: Data frame with sample

Comments GRTS using R can be applied in one dimension GRTS conceptually extends to sampling 3-d or greater dimensions X,Y coordinates can be any continuous variables Software implementation GIS preparation followed by R program Incorporate simple GIS operations in R operation Add extension to Arcview

Over-sample: use in implementation

GRTS Implementation Process Randomly place a square over the sample frame geographic region Construct hierarchical grid (e.g. quadtree ) with hierarchical addressing in the square Construct Peano mapping of twospace to one-space using hierarchical addressing Complete hierarchical randomization of Peano map Place sample frame elements on line in one-space using hierarchical randomization order, assigning length to element based on frame and inclusion density (unequal probability) Select systematic sample with random start from line Place sample in reverse hierarchical order