Response to API 1163 and Its Impact on Pipeline Integrity Management

Similar documents
ChristoHouston Energy Inc. (CHE INC.) Pipeline Anomaly Analysis By Liquid Green Technologies Corporation

Box-Cox Transformation for Simple Linear Regression

Ultrasonic Multi-Skip Tomography for Pipe Inspection

Impact of 3D Laser Data Resolution and Accuracy on Pipeline Dents Strain Analysis

Box-Cox Transformation

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Predictive Analysis: Evaluation and Experimentation. Heejun Kim

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

MONITORING THE REPEATABILITY AND REPRODUCIBILTY OF A NATURAL GAS CALIBRATION FACILITY

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

An Experiment in Visual Clustering Using Star Glyph Displays

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242

This research aims to present a new way of visualizing multi-dimensional data using generalized scatterplots by sensitivity coefficients to highlight

Overview. Frequency Distributions. Chapter 2 Summarizing & Graphing Data. Descriptive Statistics. Inferential Statistics. Frequency Distribution

Machine Learning Techniques for Data Mining

Slides for Data Mining by I. H. Witten and E. Frank

MFL-COMBO Tools Technology and Experiences

IAT 355 Visual Analytics. Data and Statistical Models. Lyn Bartram

Data Analyst Nanodegree Syllabus

Data Preprocessing. S1 Teknik Informatika Fakultas Teknologi Informasi Universitas Kristen Maranatha

Data Analyst Nanodegree Syllabus

Business Club. Decision Trees

1. Estimation equations for strip transect sampling, using notation consistent with that used to

Table of Contents (As covered from textbook)

10/14/2017. Dejan Sarka. Anomaly Detection. Sponsors

DESIGN AND EVALUATION OF MACHINE LEARNING MODELS WITH STATISTICAL FEATURES

Chapter 12: Statistics

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.

Gene Clustering & Classification

Introduction to Geospatial Analysis

Use of GeoGebra in teaching about central tendency and spread variability

Training Manual for LRB Calculator (Leak Rupture Boundary Determination Project)

Evaluation Measures. Sebastian Pölsterl. April 28, Computer Aided Medical Procedures Technische Universität München

3D Finite Element Software for Cracks. Version 3.2. Benchmarks and Validation

CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12

Cpk: What is its Capability? By: Rick Haynes, Master Black Belt Smarter Solutions, Inc.

CNC Milling Machines Advanced Cutting Strategies for Forging Die Manufacturing

Using Statistical Techniques to Improve the QC Process of Swell Noise Filtering

Building Better Parametric Cost Models

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

Chapter 2 Describing, Exploring, and Comparing Data

Multiple Regression White paper

Application of a 3D Laser Inspection Method for Surface Corrosion on a Spherical Pressure Vessel

A Multiple Constraint Approach for Finite Element Analysis of Moment Frames with Radius-cut RBS Connections

Data Preprocessing. Why Data Preprocessing? MIT-652 Data Mining Applications. Chapter 3: Data Preprocessing. Multi-Dimensional Measure of Data Quality

ANALYSIS OF BOX CULVERT - COST OPTIMIZATION FOR DIFFERENT ASPECT RATIOS OF CELL

OVERVIEW & RECAP COLE OTT MILESTONE WRITEUP GENERALIZABLE IMAGE ANALOGIES FOCUS

SDG ATARI 8 BITS REFERENCE CARD

Lecture on Modeling Tools for Clustering & Regression

Comparison of Methods for Analyzing and Interpreting Censored Exposure Data

Simulation Supported POD Methodology and Validation for Automated Eddy Current Procedures

Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018

DARWIN 8.1 Release Notes

Common Mistakes And Errors In Modelling

TRACK MAINTENANCE STRATEGIES OPTIMISATION PROBLEM

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model

What s New in Spotfire DXP 1.1. Spotfire Product Management January 2007

Statistics: Normal Distribution, Sampling, Function Fitting & Regression Analysis (Grade 12) *

COMPUTATIONAL FLUID DYNAMICS ANALYSIS OF ORIFICE PLATE METERING SITUATIONS UNDER ABNORMAL CONFIGURATIONS

Long Term Analysis for the BAM device Donata Bonino and Daniele Gardiol INAF Osservatorio Astronomico di Torino

CHAPTER 3. Preprocessing and Feature Extraction. Techniques

White Paper. Abstract

Probabilistic Analysis Tutorial

Introduction to ANSYS DesignXplorer

3D 3D SCANNING SOLUTION FOR FOR PIPELINE INTEGRITY ASSESSMENT

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student

Optimal Clustering and Statistical Identification of Defective ICs using I DDQ Testing

WELCOME! Lecture 3 Thommy Perlinger

Establishing Virtual Private Network Bandwidth Requirement at the University of Wisconsin Foundation

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

STA 570 Spring Lecture 5 Tuesday, Feb 1

OPTIMISATION OF PIN FIN HEAT SINK USING TAGUCHI METHOD

Implementing Operational Analytics Using Big Data Technologies to Detect and Predict Sensor Anomalies

SYDE Winter 2011 Introduction to Pattern Recognition. Clustering

Performance Estimation and Regularization. Kasthuri Kannan, PhD. Machine Learning, Spring 2018

Week 7 Picturing Network. Vahe and Bethany

Themes in the Texas CCRS - Mathematics

Bland-Altman Plot and Analysis

Summarising Data. Mark Lunt 09/10/2018. Arthritis Research UK Epidemiology Unit University of Manchester

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- #

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Getting to Know Your Data

New Opportunities for 3D SPI

Lecture Notes 3: Data summarization

v SMS 12.2 Tutorial Observation Prerequisites Requirements Time minutes

Using Excel for Graphical Analysis of Data

Machine Learning for Pre-emptive Identification of Performance Problems in UNIX Servers Helen Cunningham

Applied Regression Modeling: A Business Approach

Guidelines for proper use of Plate elements

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Multicollinearity and Validation CIVL 7012/8012

The Elimination of Correlation Errors in PIV Processing

Bootstrapping Method for 14 June 2016 R. Russell Rhinehart. Bootstrapping

SYS 6021 Linear Statistical Models

Robust Linear Regression (Passing- Bablok Median-Slope)

ANALYSIS OF THE CORRODED PIPELINE SEGMENTS USING IN-LINE INSPECTION DATA

Averages and Variation

DATA SCIENCE INTRODUCTION QSHORE TECHNOLOGIES. About the Course:

Parametric. Practices. Patrick Cunningham. CAE Associates Inc. and ANSYS Inc. Proprietary 2012 CAE Associates Inc. and ANSYS Inc. All rights reserved.

Transcription:

ECNDT 2 - Tu.2.7.1 Response to API 3 and Its Impact on Pipeline Integrity Management Munendra S TOMAR, Martin FINGERHUT; RTD Quality Services, USA Abstract. Knowing the accuracy and reliability of ILI measurements is important for determining optimum integrity rehabilitation project scope, re-inspection interval and the risk associated with the pipeline section examined. The recent (September 2) introduction of API 3 (title) highlights the importance of these metrics. Qualifying these metrics requires verification measurements with accuracies a level of magnitude higher than in-line-inspection. Lasersure SM provides a process which uses in-the-ditch Laser Profilometry measurements which have an order of magnitude greater accuracy than the original ILI-MFL measurements, therefore are well suited to qualifying and potentially improve ILI results. The following paper outlines the Lasersure SM process and the potential impact of this process on pipeline integrity management process. Introduction In-Line Inspection (ILI) ILI (MFL) is commonly used by operators to determine the location and extent of metal loss (primarily) on their pipeline systems. MFL technology and its understanding have improved by leaps and bounds in the last decade or so, and has proven to be a valuable tool for operators. Despite the advances, significant uncertainty still exists regarding the accuracy of the data obtained from the MFL tools. Various factors such as the cleanliness of the pipe, chemical and physical properties of the steel, presence of localized residual stresses, variation of wall thickness, etc. all have a detrimental effect on the accuracy that can be expected from ILI. ILI Tool vendors typically specify the accuracy and certainty associated with their data. In some cases, such metrics are defined for different geometrical categories as by the Pipeline Operator s Forum and API 3. In other cases, it is provided as an aggregate metric. The accuracy, thus defined, is typically of the order of +/- % with a certainty of % with respect to the depth measurement. In other words, the depth measurements from the MFL data would typically have an error, no greater than % of the wall thickness in % of the data points. This still leaves a huge amount of uncertainty that an operator has to account for while making critical decisions. This uncertainty in the data has direct impact on the risk level that an operator associates with a pipeline system or the inspected section there-of. The economic impact on the integrity management program(s) can be huge, as operators have to make conservative 1

estimates of the associated risk, subsequently driving up the number of digs in a rehabilitation project and/or shortening the re-assessment interval. Laserscan SM Laserscan SM is a service provided by RTD wherein the LPIT TM tool uses laser profilometry for mapping the external surface of the pipe and thereby measuring with millimeter level resolution the extent of external corrosion as present on the pipe. As of now, it is a benchmark technology and has been proven to be very accurate and reliable in the field. It is capable of providing a high definition 3-dimensional map of the pipe surface in a digital format. Due to the nature of the dataset, the data from Laserscan SM is ideally suited for detailed analysis and comparison. API 3 API 3 (In-Line Inspection systems qualification standard) was introduced in August 2 as an umbrella document defining the manner in which the ILI data was to be reported and qualified. While it does define the specifications for anomalies belonging to different geometrical categories, it does not offer a robust process for comparing the ILI data with the field measurements. A metal loss defect, as measured in the field, usually comprises of multiple indications (each of the same or different geometrical shape) combined together using a pre-defined interaction criteria. Similarly, ILI data also undergoes a clustering process. To classify the indications into different geometrical categories, one has to use the unclustered data. Previous work Several efforts have been made in the past to do correlation analysis between MFL data and field measurements. While considerable insight was gained in the process, most of these efforts were marred by a few common drawbacks. Some of these are as listed below: The field data was usually collected using manual methods, and therefore, was limited in definition and accuracy Manual measurements typically are prone to measurement and indexing error, making it difficult to match with the corresponding ILI anomalies Field data resolution is generally low, limiting the ability to match data points by geometrical shape. Presented study The present study is based on the premise that the MFL tool has a different level of measuring accuracy for anomalies belonging to different geometries. For the purpose of this study, the anomalies were classified into the following categories as defined by the Pipeline Operators Forum: Extended/General Pitting 2

Pinholes Axial Grooves Axial Slots Circumferential Grooves Circumferential Slots There are a few key characteristics that differentiate the study performed from the previous works in this direction. They are summarized as follows: The data points were matched on an individual pit basis The ILI data used was boxed data (prior to the application of interaction rules) The analysis was performed on data sets belonging to each geometrical category individually All data points were taken into consideration, including corrosion with very small depth measurements (>=.3mm) Also, some assumptions were made to apply the process: It was assumed that the error in the LPIT TM measurements was negligible It was assumed that the measurements from the ILI data were reliable enough to determine the geometrical category a particular indication belonged to. Process The process used in this study comprised the following steps: 1. Data Assimilation 2. Classification 3. Matching the data points 4. Data Extraction. Analysis. Validation Data Assimilation: The boxed data from a previous ILI (MFL) inspection was provided for this study by a major North American pipeline operator. The inspection was performed over three valve sections. In addition, Laserscan SM service had been utilized during the rehabilitation project previously carried out by the operator. The data from all the locations inspected using the Laserscan SM and the MFL data from the corresponding pipe segments was located. Classification Each MFL data point (indication) was then classified into one of the seven categories based on the ILI reported length and width using the criteria as defined in the POF document. 3

Matching the data points The LPIT TM data and the MFL data was then overlaid using a graphical tool designed for the purpose. Some key indications were used to determine if there was any location error in the data sets and data points that overlaid well were documented. Following is a snapshot of the utility used to match the data points (Figure 1). The blue indications are LPIT TM indications and the red ones are MFL indications. LPIT ILI Figure 1: Data matching utility used to overlay the ILI and LPIT datasets In cases where on indication in either datasets corresponded to multiple indications on the other, the boundary conditions were used to aggregate the multiple indications so as to get a one to one correlation. Data Extraction The data from the matching data points was then accumulated and verified to ensure consistency. The parameters for consideration (Length and Depths) for these indications were extracted from the pool of data and arranged in a comparable format. Some of these matching data points were set aside as a control group at random for validation purposes. Analysis Statistical analysis was performed on these data sets. To streamline the process, and due to the limited amount of data available, the analysis was performed for two of the geometrical categories, viz. General corrosion and pitting. Within these categories, both parameters of interest (Depth and Length) were evaluated to determine the error and any evident bias in the positive or negative direction. Figure 2 shows the potential impact an error in length measurement can have. 4

Measured Interacting Actual length Non-interacting Figure 2: Anomalies clustered due to error in length measurement Validation Once the bias (or relation) was determined, a correction factor was calculated and applied to the MFL data points from the control data set. These corrected measurements were then compared against the LPIT TM measurements to determine if there was any improvement in the accuracy. The results are as provided in the following sections. Data Analysis The data considered for the purpose of this study was taken from different excavation locations. Data extraction provided us with 13 indications belonging to the pitting corrosion category and 9 indications belonging to the general corrosion category. 3 indications from the pitting corrosion data set and 2 indications from the general corrosion data set were randomly set aside as the control group for validation purposes. The Laserscan SM measurements were used as the reference data set to calculate the error in depth as well as length measurements for each data point. Different standard statistical techniques were utilized to measure the bias, if any, in the ILI data using the Depth and Length as the parameters of interest. They are: 1. Paired t-test [1] 2. Wilcoxon signed ranks test [2] 3. Passing-Bablok method [3] The two data sets, ILI and Laserscan SM, were treated as paired data, as it was assumed that since both measurements are for the same anomaly, values in one data set would change with the other [4][][]. As is shown in Figure 3, the bias plots [4] for both the depth and length measurements show a clear bias. The depth measurements reported by ILI seem to be almost consistently less than the corresponding LPIT TM measurements for both pitting and general corrosion.

a) Difference between metho 2 2 - Zero bias - 2 4 Mean of Depth 2 4 b) Difference between metho - - - -2-2 Zero bias c) -3 2 3 Mean of Length 3 2 4 Difference between metho 2 2 - - 2 3 Mean of depth Zero bias

d) Difference between metho 2 2 Zero bias - - - 2 3 Mean of Length 2 3 Figure 3: Bias plots showing that a bias is present in both length and depth measurements. (a) and (b) are Bias plots for Depth and Length respectively for Pitting Corrosion. (c) and (d) are bias plots for Depth and Length for General corrosion category The calculations for Depth The error in depth measurement (LPIT TM ILI) seemed to follow the normal distribution. This was verified using the kolmogorov-smirnov test at a significance level of. (Figure 4). (a) 14 12 Frequenc 4 2 Coefficient P Kolmogorov- Smirnov.21 >. Skewness.337.34 Kurtosis -.37.77 7

(b) 4 4 3 3 Frequenc 2 2 Coefficient P Kolmogorov- Smirnov. >. Skewness.23.23 Kurtosis.79.79 Figure 4: Results from Kolmogorov-Smirnov test demonstrating that the error does follow a normal distribution. (a) and (b) are histograms for Error in Depth for Pitting and General Corrosion respectively. Now, assuming normality, an estimation of the bias (assuming a constant bias) was performed at a significance level of.. Since the mean error was found to be positive, the estimated bias was added to the depth measurements in the control group for validation and the results compared to the uncorrected error. The results are as provided in Table 1. Similar calculations, when carried out on the control data for general corrosion, revealed a similar phenomenon. The SSE for uncorrected measurements was found to be 134, while the SSE for corrected measurements was found to be 744. The data, when subjected to the non-parametric wilcoxon signed ranks test revealed results almost identical to the results presented. Further investigation of the data seemed to indicate that the magnitude of the error seemed to be a function of the reported depth measurement itself. This trend is quite evident in the following charts (Figure )

Error 2 2 7 13 Error VS ILI Depth (General) ILI- depth 13 14 1 19 22 23 24 Error 12 4 2-2 -4 Error VS ILI depth (Pitting) 12 13 13 14 14 17 1 1 19 21 22 24 2 33 ILI-Depth Figure : Plots showing that the magnitude of error could be a function of the measured value This indicated that the bias in the data might be proportionate to the depth. Therefore, the data was then subjected to the Passing-Bablok comparison test. The test was carried out at a confidence level of 9%.The results from the tests are as follows (Figure ): 4 Pitting Identity line Y = X 3 3 General 4 2 Depth - ILI Dep 3 3 2 ILI - D 2 2 y =.733x -.39 y =.7x - 2.19 2 3 4 2 3 LPIT - D Depth - LPIT Depth Figure : Results from Passing-Bablok regression reflecting the proportionate bias Subsequently, the control data was corrected using the respective correction model. The results showed marked improvement over the uncorrected data. The results are as provided in Table 1. One thing to be noted here is that while the sum squared error seems to be lower for the non-parametric wilcoxon-bablok method, the absolute maximum error was found to be more in the parametric case. In other words, using a constant bias would cause the outliers to be even more pronounced. The calculations for Length The MFL tool reports depth as a relative measurement as a percentage of the wall thickness. This normalizes the data and therefore, the scatter doesn t seem as pronounced. Since the length measurements are absolute (mm), the effect of outliers on the analysis is 9

quite pronounced. To reduce this, the analysis for length was carried out using percentage length, defined as the ratio of the lengths reported by LPIT TM versus that by ILI times. The same calculations, when carried out for length, yielded results as shown in Table 2. As is obvious from these results, the improvement in length is not nearly as significant as for depth, although some improvement can be achieved. The reduced improvement can partly be attributed to the fact that the deviation from true measurement (error) is much larger for length than for depth. Conclusion The following key conclusions can be derived from this exercise: There is statistically significant bias in both depth and length measurements in the MFL data. The bias is different for different geometrical categories. While the process, as defined in this paper was completed on pitting and general corrosion, it can also be applied to calculate the ILI tool performance for different geometries of anomalies. While conventional (parametric) statistics does provide means for calculating these biases, these techniques typically assume that the data belongs to a particular distribution and therefore, when a sample does not conform to the given distribution, the error in the outliers is magnified causing larger maximum absolute errors. Non-parametric techniques seem to be more adequate as they are more resilient to outliers in the data set. These correction factors can be applied to boxed data and the data clustered again by applying the respective interaction criteria to provide more accurate data to the operators, thereby reducing the overall uncertainty and subsequently, risk. Development and subsequent application of such a process can assist operators in establishing qualified re-assessment intervals owing to higher certainty in the data. Methods such as bootstrapping and/or Bayesian techniques should be explored to determine if they can provide further improvement in results. References [1] Statistical methods for assessing agreements between two methods of clinical measurement. Bland J.M., Altman D.G. [2] Individual comparisons by ranking methods. Wilcoxon F. (194) [3] A general regression procedure for method transformation. Passing H., Bablok W [4] Principles and procedures of exploratory data analysis. Behrens, J. T. (1997). [] NIST/SEMATECH e-handbook of Statistical Methods, http://www.itl.nist.gov/div9/handbook. [] Essentials of Statistical Inference (Cambridge Series in Statistical and Probabilistic Mathematics) 2.

Uncorrecte d depth constant bias Table 1: Pitting corrosion Corrected Depth Passing Bablock Uncorrecte d Length constant bias Corrected Length Passing Bablock Mean Error.7 1.93.22-37.34 -. -37.2 Standard Deviation...4.3. 4.7 Max Error 19.37.2 13.4 233.33 144. 22.22 Sum squared error 2193.77.7 14.2 229.27 929. 21.21 Uncorrecte d depth constant bias Table 2: General corrosion Corrected Depth Passing Bablock Uncorrecte d Length constant bias Corrected Length Passing Bablock Mean Error.2 1.39-1.27-7.7 -.1 4.74 Standard Deviation 9.4 9.4.72 41.9 4.23 23.43 Max Error 27.7 2.9 2.9 2. 4. 39.44 Sum squared error 134.7 74.1 934.2 14. 1319.9 49.1