Nonparametric Risk Assessment of Gas Turbine Engines

Similar documents
Application of Parallel Processing to Probabilistic Fracture Mechanics Analysis of Gas Turbine Disks

Fracture Mechanics and Nondestructive Evaluation Modeling to Support Rapid Qualification of Additively Manufactured Parts

Summary of New DARWIN TM Capabilities for Surface Damage Tolerance Assessment and Related Enhancements

DARWIN 8.1 Release Notes

Frequency Distributions

Methodology for Application of 3-D Stresses to Surface Damage-Based Risk Assessment

DARWIN 8.0 Release Notes

An Introduction to PDF Estimation and Clustering

Kernel Density Estimation (KDE)

DARWIN 9.1 Release Notes

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures:

Inclusion of Aleatory and Epistemic Uncertainty in Design Optimization

Data Mining. CS57300 Purdue University. Bruno Ribeiro. February 1st, 2018

Kernel Density Estimation

Dynamic Thresholding for Image Analysis

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

CREATING THE DISTRIBUTION ANALYSIS

DARWIN 7.0 Release Notes

Adaptive Risk Refinement Methodology for Gas Turbine Engine Rotor Disks

Regression III: Advanced Methods

Section 4 Matching Estimator

On Kernel Density Estimation with Univariate Application. SILOKO, Israel Uzuazor

COMPUTATIONAL STATISTICS UNSUPERVISED LEARNING

Descriptive Statistics, Standard Deviation and Standard Error

DARWIN 9.0 Release Notes

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display

Bandwidth Selection for Kernel Density Estimation Using Total Variation with Fourier Domain Constraints

Improving the Post-Smoothing of Test Norms with Kernel Smoothing

Advanced Applied Multivariate Analysis

Available online at Fatigue Automated fatigue crack growth analysis of components

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked

Fitting Fragility Functions to Structural Analysis Data Using Maximum Likelihood Estimation

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

ACCURACY AND EFFICIENCY OF MONTE CARLO METHOD. Julius Goodman. Bechtel Power Corporation E. Imperial Hwy. Norwalk, CA 90650, U.S.A.

Modelling Bivariate Distributions Using Kernel Density Estimation

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

Instance-based Learning CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2015

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.

Robustness analysis of metal forming simulation state of the art in practice. Lectures. S. Wolff

Generalized Additive Model

Data Preprocessing. S1 Teknik Informatika Fakultas Teknologi Informasi Universitas Kristen Maranatha

ECLT 5810 Data Preprocessing. Prof. Wai Lam

Analyzing 'Noisy' Structural Problems with LS-OPT: Probabilistic and Deterministic Fundamentals

Chapter 1. Using the Cluster Analysis. Background Information

Chapter 1. Looking at Data-Distribution

Multivariate Analysis

University of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques

Chapter 2: The Normal Distribution

Measures of Central Tendency

Machine Learning in the Wild. Dealing with Messy Data. Rajmonda S. Caceres. SDS 293 Smith College October 30, 2017

Fatigue Reliability Analysis of Dynamic Components with Variable Loadings without Monte Carlo Simulation 1

CALCULATION OF OPERATIONAL LOSSES WITH NON- PARAMETRIC APPROACH: DERAILMENT LOSSES

Nonparametric Regression

Data Statistics Population. Census Sample Correlation... Statistical & Practical Significance. Qualitative Data Discrete Data Continuous Data

University of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques

Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels

3 Nonlinear Regression

Density estimation. In density estimation problems, we are given a random from an unknown density. Our objective is to estimate

Visualizing Data: Freq. Tables, Histograms

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.

STA 570 Spring Lecture 5 Tuesday, Feb 1

Perceptron as a graph

DARWIN 4.0 Release Notes August 2002 Southwest Research Institute

A COMPUTATIONAL METHOD FOR CALCULATING UNCERTAINTY IN PROJECT NETWORKS WITH DEPENDENCIES

COPULA MODELS FOR BIG DATA USING DATA SHUFFLING

Non-Parametric Modeling

Using the DATAMINE Program

CHAPTER-13. Mining Class Comparisons: Discrimination between DifferentClasses: 13.4 Class Description: Presentation of Both Characterization and

Machine Learning Lecture 3

University of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques.

Multivariate Standard Normal Transformation

Nonparametric Estimation of Distribution Function using Bezier Curve

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Response to API 1163 and Its Impact on Pipeline Integrity Management

Supplementary Figure 1. Decoding results broken down for different ROIs

Introduction to Pattern Recognition Part II. Selim Aksoy Bilkent University Department of Computer Engineering

CHAPTER 3: Data Description

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016

MATLAB Routines for Kernel Density Estimation and the Graphical Representation of Archaeological Data

Chapter 16. Microscopic Traffic Simulation Overview Traffic Simulation Models

SYDE Winter 2011 Introduction to Pattern Recognition. Clustering

Analog input to digital output correlation using piecewise regression on a Multi Chip Module

8. MINITAB COMMANDS WEEK-BY-WEEK

CS 195-5: Machine Learning Problem Set 5

NONPARAMETRIC REGRESSION TECHNIQUES

We deliver Global Engineering Solutions. Efficiently. This page contains no technical data Subject to the EAR or the ITAR

Simulation Supported POD Methodology and Validation for Automated Eddy Current Procedures

Nonparametric Survey Regression Estimation in Two-Stage Spatial Sampling

Simulating multivariate distributions with sparse data: a kernel density smoothing procedure

Chapter 12: Statistics

Density estimation. In density estimation problems, we are given a random from an unknown density. Our objective is to estimate

Chapter 4. Clustering Core Atoms by Location

Tutorial: Using Tina Vision s Quantitative Pattern Recognition Tool.

Performance of Hybrid Adaptive Optics Systems. Michael Werth, Brandoch Calef, Daniel Thompson The Boeing Company

Statistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.

Graphical Techniques for Displaying Multivariate Data

Chapter 7: Dual Modeling in the Presence of Constant Variance

Ms Nurazrin Jupri. Frequency Distributions

Robust PDF Table Locator

Transcription:

Nonparametric Risk Assessment of Gas Turbine Engines Michael P. Enright *, R. Craig McClung, and Stephen J. Hudak Southwest Research Institute, San Antonio, TX, 78238, USA The accuracy associated with risk assessment of gas turbine engine components is highly dependent on the statistical models used to represent the fundamental random variables. Although parametric models are commonly used to represent the random variables, they are often a poor fit near the tails of the distributions. Also, they may not address some of the general relationships among multiple random variables (e.g., multimodality, skewed discontinuities, among others). In this paper, a nonparametric approach is used to model the probability densities associated with rainflow stress pair data for actual military aircraft engine usages. The data are represented using univariate and multivariate Epanechnikov kernels where the window width is adaptively adjusted to address the number of data points in all regions of the distributions. A brief summary of the nonparametric adaptive kernel density estimator is presented including details regarding a specific algorithm. Algorithm performance is illustrated for standard parametric probability density functions and for rainflow stress pairs associated with military aircraft engine usage histories. The results can be applied to fracture risk prediction of gas turbine engine components subjected to complex usages. I. Introduction he crack growth life associated with a gas turbine engine disk is influenced by a number of random variables T (e.g., material properties, inherent/induced anomalies, applied stress, among others). The dispersion associated with each of these variables influences the probability of fracture. The accuracy of predicted risk is highly dependent on the statistical models used to represent each of the individual random variables. Parametric models (i.e., probability density functions such as normal, lognormal, Wiebull) are commonly used to represent the random variables in which the accuracy of each variable is dependent on a regression fit of the data. Since experimental data are typically most plentiful near the mean value, the regression curve can be biased towards the mean and is often a poor fit near the tails of the distribution. Also, parametric models contain values that are often physically impossible to achieve in physical systems (e.g., ± ). In addition, although the regression approach can be used to estimate the degree of correlation among two or more random variables, it may not adequately address general relationships among multiple random variables (e.g., multimodality, skewed discontinuities, among others). In this paper, a comparison of parametric and nonparametric methods is presented for density estimation of rainflow stress profiles associated with aircraft gas turbine engine usages. A brief summary of the nonparametric adaptive kernel density estimator is presented including details regarding a specific algorithm. Algorithm performance is illustrated using data generated from parametric uni- and multi-variate probability density functions and is compared to parametric estimates based on the same data. The adaptive kernel density estimator is applied to rainflow stress pairs associated with F-16/F engine usages and compared to parametric estimates, including a hybrid approach based on separate treatment of maximum stress pairs associated with engine startup and aircraft takeoff. II. Stochastic Modeling of Aircraft Engine Usages Usage histories for commercial aircraft are often well established with known stress pair magnitudes. A stress scatter factor can be used to account for the bias between predicted and measured stress values. This factor can be modeled as a random variable to address the variability associated with the measured values. As shown in Fig. 1a, * Principal Engineer, Reliability and Materials Integrity, 622 Culebra Road, Member AIAA. Senior Program Manager, Materials Engineering, 622 Culebra Road, Member AIAA. Institute Scientist and Program Director, Materials Engineering, 622 Culebra Road, Member AIAA. 1

Stress Bias Model Stress Stress Cycles Flight 1 Flight 2 Usage Variability Model Stress Stress Flight 1 Flight 2 Figure 1. Approaches for modeling variability associated with applied loads: stress bias model, and cycle-by-cycle usage variability model. the influence of stress scatter on the probability of fracture is commonly estimated by applying it repeatedly to the deterministic stress values over the entire life of a disk. For military usages, the number and magnitude of stress pair values are not always well defined, and may have greater variability compared to commercial usages. Since the stress pair values could vary significantly from flight to flight, a more accurate model (Fig. 1b) is needed to estimate the influence of stress variability on the probability of fracture. For gas turbine engines, the applied stress is a stochastic variable that is often difficult to measure and predict. Two primary approaches have been proposed for density estimation of rainflow stress values: parametric and nonparametric. The parametric approach consists of a regression fit of the data plotted on a cumulative probability 2

scale 1-3. The primary drawback to this approach is that it is often difficult to apply to multimodal densities. On the other hand, nonparametric methods provide a direct fit to the data and are routinely used to estimate multimodal densities. The kernel density estimator, the most commonly used nonparametric method, has been recently applied to rainflow stress data 4,5. III. Nonparametric Density Estimation Techniques A number of nonparametric methods can be used to estimate the probability density associated with a data set, such as the nearest neighbor method 6, convex intensity 7, and orthogonal series estimators 8, among many others. Many of the nonparametric methods are related to the histogram, which is commonly used to provide a measure of the relative occurrences of discrete data values. However, as shown in Fig. 2, histograms are highly dependent on the bin width selected for sampling. Furthermore, a histogram is not a true probability density function, and therefore cannot be used for probabilistic computations. The kernel density estimator is one of the most commonly used nonparametric methods to estimate probability densities. As illustrated in Fig. 3a, each data point is assumed to have an associated probability density within a specified range of values. If data points are relatively close together, their probability densities can overlap (Fig. 3b). The probability density at a given location can be estimated as the sum of the contributions of the kernels associated with each data point 6 : n 1 t1 X1 i t2 X2i f ( t1, t2) = K, 2 nh i= 1 h h (1) where t 1, t 2 = coordinates of evaluation point, n = number of data points, h = window width (smoothing parameter), K = kernel estimator, and X 1,i, X 2,i = coordinates of data point i. Eqn. (1) provides an initial density estimate associated with a fixed window width h. The window width can be adaptively adjusted to account for the number of data points within a region using the following equation 8 : 1 1 t X t X f t, t, ( ) 1 2 n 1 1i 2 2i = K d n i= 1 ( hλi ) hλi hλi (2) where λ i is a bandwidth parameter that is identified adaptively based on previous PDF estimates: Figure 2. Bin-width dependency associated with histogram density estimates: bin width = 5 units, and bin width = 1 units. 3

Figure 3. Nonparametric approach to probability density estimation: Epanechnikov Kernal density at individual data points, and combined density of all data points. λ = i n i= 1 f f ( X ) i ( X ) i 1 2n (3) An Epanechnikov kernel function can be used to describe the probability density associated with rainflow stress values 5,8 : t1 X1 i t2 X2i 2 1 2 2 K, 1 2 ( t1 X1 i) ( 2 2i) h h = + t X π h (4) An adaptive kernel numerical algorithm based on Eqns. (1)-(4) was developed and verified using a number of established probability density functions. For example, consider the univariate lognormal distribution (LN(3,2)) shown in Fig. 4a. The initial kernel PDF estimate based on Eqn. 1 (and 2 Monte Carlo samples from the parent distribution) is very similar to the parent PDF near the median, but differs near the right tail of the distribution. However, when the window width is adaptively adjusted using Eqn. (2), it converges to the parent distribution over most of the right tail. The parametric density estimate (LN(2.89, 1.81) is very similar to the parent both at the mode and tails of the distribution. The algorithm was applied to the bivariate lognormal distribution (LN(3,1.5) + LN(9,1.5)) shown in Fig. 4b. Both the initial and adaptive kernel density estimates predict the location of the modes and provide reasonable treatment at the tails of the distribution. On the other hand, the parametric estimate is a poor fit both at the modes and tails of the distribution. The algorithm was also verified for the bivariate normal distribution shown in Fig. 5 (X=N(,1), Y=N(,1), uncorrelated X and Y) based on 2 samples from the parent distribution. The parametric fit, shown in Fig 5a, is nearly identical to the parent distribution. The kernel density estimate (Fig. 5b) has an overall shape that is similar to the parent, but is quite different in the tails of the distribution. Standard algorithms are available to assess the accuracy of the kernel estimate 6 which should be strongly considered before applying the density estimate for risk predictions. 4

.3 Parent distribution - LN(3,2) Kernal density - initial estimate Kernal density - 5 adaptations Monte Carlo Samples (from parent) Parametric Fit - LN(2.89,1.81) f X (x).2.1 2 4 6 8 1 12 14 x.25.2 Parent distribution - LN(3,1.5) + LN(9,1.5) Kernal density - initial estimate Kernal density - 5 adaptations Monte Carlo Samples (from parent) Parametric Fit - LN(5.92,3.36).15 f X (x).1 5 5 1 15 2 25 3 Figure 4. Comparison of parametric and nonparametric density estimation methods for univariate unimodal lognormal parent distribution, and univariate bimodal parent distribution. x 5

13 13 12 12 11 11 X 2 X 2 9 8 2.e-4 4.e-4 6.e-4 8.e-4 1.e-3 1.2e-3 9 8 2.e-4 4.e-4 6.e-4 8.e-4 1.e-3 1.2e-3 1.4e-3 1.6e-3 7 7 8 9 11 12 13 7 7 8 9 11 12 13 X 1 X 1 Figure 5. Comparison of density estimation methods for a multivariate normal distribution: parametric density estimate, and Kernel density estimate. IV. Application to Density Estimation of Rainflow Stress Data The adaptive kernel algorithm was applied to the density estimation of rainflow stress pairs associated with actual aircraft gas turbine engine usages. Raw RPM values were obtained from engine flight data recorder data 9 for a variety of F-16/F missions. RPM values are shown in Fig. 6 for representative flights of missions classified as Live Fire and Instruments & Navigation. The RPM values were converted to stress values using an empirical algorithm including a shakedown procedure to address stress values that fall above the material yield stress 1. The stress values were sorted into min-max pairs using an established rainflow cycle counting algorithm 11. As shown in Fig. 6, a typical engine usage history can consist of thousands of RPM data points. However, the data can often be reduced to a few hundred values following application of the rainflow cycle counting algorithm. For example, the rainflow stress pairs associated with each of the individual flights shown in Fig. 7 are based on usages similar to the ones shown in Fig. 6. Most of the stress pairs are clustered in a region of minimum and maximum stress values ranging from 35 to 75 MPa. However, a few stress pairs associated with the aircraft major stress cycle are located in the region where the minimum stress is zero (associated with engine startup and aircraft takeoff). If the stress pairs can be completely described by a probability density function, then the influence of usage variability on the probability of fracture can be quantified by sampling directly from the rainflow stress PDF. The desired PDF must provide a probability density for each region of the population that is consistent with the clustering of data points shown in Fig. 7. It must attempt to describe the population from which the data points were drawn to reflect the influence of data points not included in the current sample. The stress pairs associated with startup/takeoff can be considered separately from the remaining data because they must be applied only once per flight (this cannot be guaranteed if they are included in the sampling population). In addition, since the minimum stress values associated with these stress pairs are always zero (to reflect the zero RPM value prior to engine startup), they can be modeled as a univariate (and probably unimodal) distribution. A parametric probability density estimate of the sub-population of Live Fire rainflow stress pairs that does not include values associated with startup/takeoff is shown in Figs. 8a&c. The parametric PDF appears to address the correlation structure of minimum and maximum stress values presented in Fig. 7. A kernel density estimate of the Live Fire mission rainflow stress pairs is shown in Figs. 8b&d. It provides explicit treatment of the stress pairs associated with engine startup, as indicated by the stress values that are separated from the primary cluster that appear near zero on the min stress axis. It also appears to capture the correlation structure of the minimum and maximum stress values. It is interesting to note that the parametric method predicts a unimodal density for the rainflow data, whereas the kernel method predicts a bimodal density. Since it is 6

1 12 9 RPM Live Fire Mission Instruments & Navigation Mission 2 8 12 Recorded Data Values Figure 6. F-16/F aircraft engine usage histories associated with representative flights of the Live Fire and Instruments & Navigation missions. 8 Live Fire Mission Max Stress, MPa Flight 1 Flight 2 Flight 3 Flight 4 Flight 5 Flight 6 Flight 7 Flight 8 Flight 9 Flight 1 Flight 11 Flight 12 Flight 13 Flight 14 Flight 15 Flight 16 Flight 17 Min = Max 2 8 Figure 7. Rainflow stress pairs associated with representative flights for the F-16/F aircraft Live Fire usage classification. 7

1.e-5 2.e-5 3.e-5 3.5e-5 3.5e-5 1e-5 2e-5 3e-5 4e-5 3.e-5 3e-5 2.5e-5 2.e-5 Probability 1.5e-5 Density 2e-5 Probability Density 1.e-5 1e-5 5.e-6 Max -5.e-6 2 2 Max Stress, MPa Stress, MPa Max Stress, MPa Max Stress, MPa Probability Density 5.e-6 1.e-5 1.5e-5 2.e-5 2.5e-5 Probability Density 5.e-6 1.e-5 1.5e-5 2.e-5 2.5e-5 3.e-5 3.5e-5 Usage: Live Fire Parametric Density No Engine Startup Stress Pairs Usage: Live Fire Initial Kernel Density 2 2 (c) (d) Figure 8. Comparison of density estimation methods for rainflow stress pairs associated with Live Fire usage histories. and (c): parametric density estimate (no engine startup stress pairs); and (d): kernel density estimate. expected that extreme stress values will have more influence on the probability of fracture, it may be more appropriate to assess the accuracy of the methods in the tails of the distributions. V. Conclusions A comparison of parametric and nonparametric methods was presented for density estimation of rainflow stress profiles associated with military aircraft gas turbine engine usages. Both methods appear to address the correlation relationship among the minimum and maximum stress values. However, the parametric method does not provide treatment of the multi-modal behavior predicted using the adaptive kernel approach. The results can be used to develop improved stress estimates for probabilistic life predictions. 8

Acknowledgments This work was performed with funding from the U.S. Air Force Research Laboratory and Southwest Research Institute under AFRL Dual Use Agreement No. F33615-3-2-523. The support and encouragement of Jay Jira, Jim Larsen, and Patrick Golden (technical monitor) of AFRL is gratefully acknowledged. References 1 Ang, A. H-S., and Tang, W.H. (1975). Probabilistic Concepts in Engineering Planning and Design. John Wiley, New York. 2 Haldar, A. and Mahadevan, S., (2). Probability, Reliability, and Statistical Methods in Engineering Design. John Wiley, New York. 3 Ayyub, B.M., and McCuen, R.H. (23). Probability, Statistics, and Reliability for Engineers and Scientists. Chapman & Hall, New York. 4 Dressler, K., Hack, M., and Kruger, W. (1997). Stochastic reconstruction of loading histories from a rainflow matrix, Zeitschrift Fur Angewandte Mathematik und Mechanik, Vol. 77, No. 3, pp. 217-226. 5 Socie, D.F., and Pompetski, M.A. (24). Modeling variability in service loading spectra, Probabilistic Aspects of Life Prediction, ASTM STP 145, pp. 46-57. 6 Scott, D.W. (1992). Multivariate Density Estimation. John Wiley, New York. 7 Rigdon, S.E., and Basu, A.P. (2). Statistical Methods for the Reliability of Repairable Systems. John Wiley, New York. 8 Silverman, B.W. (1986). Density Estimation for Statistics and Data Analysis. Chapman & Hall, New York. 9 Enright, M.P., Hudak, S.J., McClung, R.C., and Millwater, H.R. (25). Application of probabilistic fracture mechanics to prognosis of aircraft engine components, AIAA Journal, AIAA, Vol. 44, No. 2, pp. 311-316. 1 Southwest Research Institute (24). DARWIN User s Guide. Southwest Research Institute, San Antonio, TX. 11 Bannantine, J.A., Comer, J.J., and Handrock, J.L. (199). Fundamentals of Metal Fatigue Analysis. Prentice Hall, NJ, pp. 193-195. 9