Accelerating the Hessian-free Gauss-Newton Full-waveform Inversion via Preconditioned Conjugate Gradient Method

Similar documents
Model parametrization strategies for Newton-based acoustic full waveform

A projected Hessian matrix for full waveform inversion Yong Ma and Dave Hale, Center for Wave Phenomena, Colorado School of Mines

Full waveform inversion by deconvolution gradient method

Source Estimation for Wavefield Reconstruction Inversion

Inexact Full Newton Method for Full Waveform Inversion using Simultaneous encoded sources

Well-log derived step lengths in Full Waveform Inversion Tunde Arenrin Gary Margrave

Downloaded 09/16/13 to Redistribution subject to SEG license or copyright; see Terms of Use at

Improvements in time domain FWI and its applications Kwangjin Yoon*, Sang Suh, James Cai and Bin Wang, TGS

Applying Gauss-Newton and Exact Newton method to Full Waveform Inversion

Full-waveform inversion using seislet regularization a

We N Converted-phase Seismic Imaging - Amplitudebalancing Source-independent Imaging Conditions

Multicomponent land data pre-processing for FWI: a benchmark dataset

Stochastic conjugate gradient method for least-square seismic inversion problems Wei Huang*, Hua-Wei Zhou, University of Houston

Post-stack iterative modeling migration and inversion (IMMI)

SUMMARY INTRODUCTION THEORY. Objective function of ERWI

A new take on FWI: Wavefield Reconstruction Inversion

A hybrid tomography method for crosswell seismic inversion

Implicit and explicit preconditioning for least squares nonstationary phase shift

Extending the search space of time-domain adjoint-state FWI with randomized implicit time shifts

F031 Application of Time Domain and Single Frequency Waveform Inversion to Real Data

SUMMARY. min. m f (m) s.t. m C 1

Adaptive Waveform Inversion: Theory Mike Warner*, Imperial College London, and Lluís Guasch, Sub Salt Solutions Limited

Downloaded 10/26/15 to Redistribution subject to SEG license or copyright; see Terms of Use at

U043 3D Prestack Time Domain Full Waveform Inversion

Comments on wavefield propagation using Reverse-time and Downward continuation

Building accurate initial models using gain functions for. waveform inversion in the Laplace domain

Least squares Kirchhoff depth migration: important details

Inversion after depth imaging

G004 Robust Frequency-domain Multi-parameter Elastic Full Waveform Inversion

WS10-A03 Alternative Multicomponent Observables for Robust Full-waveform Inversion - Application to a Near-surface Example

1.5D internal multiple prediction on physical modeling data

SUMMARY INTRODUCTION THEORY

Least-squares migration/inversion of blended data

Strategies for elastic full waveform inversion Espen Birger Raknes and Børge Arntsen, Norwegian University of Science and Technology

WS 09 Velocity Model Building

SUMMARY. earth is characterized by strong (poro)elasticity.

A least-squares shot-profile application of time-lapse inverse scattering theory

Multi-source Least-squares Migration of Gulf of Mexico Data

Full waveform inversion of physical model data Jian Cai*, Jie Zhang, University of Science and Technology of China (USTC)

Fast Gabor Imaging with Spatial Resampling

Full-waveform inversion for reservoir characterization: A synthetic study

Geophysical Journal International

Temporal windowing and inverse transform of the wavefield in the Laplace-Fourier domain

Target-oriented wave-equation inversion

Downloaded 11/13/12 to Redistribution subject to SEG license or copyright; see Terms of Use at

Building starting model for full waveform inversion from wide-aperture data by stereotomography

Downloaded 01/20/16 to Redistribution subject to SEG license or copyright; see Terms of Use at

Full-waveform inversion with randomized L1 recovery for the model updates

Elastic full waveform Inversion for land walkaway VSP data

Approximate Constant Density Acoustic Inverse Scattering Using Dip-Dependent Scaling Rami Nammour and William Symes, Rice University

SUMMARY LEAST-SQUARES MIGRATION THEORY

Amplitude and kinematic corrections of migrated images for non-unitary imaging operators

Seismic Modeling, Migration and Velocity Inversion

(x, y, z) m 2. (x, y, z) ...] T. m 2. m = [m 1. m 3. Φ = r T V 1 r + λ 1. m T Wm. m T L T Lm + λ 2. m T Hm + λ 3. t(x, y, z) = m 1

A hybrid stochastic-deterministic optimization method for waveform inversion

Refraction Full-waveform Inversion in a Shallow Water Environment

Total variation tomographic inversion via the Alternating Direction Method of Multipliers

Application of Weighted Early-Arrival Waveform Inversion to Shallow Land Data

Automatic wave equation migration velocity analysis by differential semblance optimization

Amplitude and kinematic corrections of migrated images for nonunitary imaging operators

Least squares Kirchhoff depth migration with. preconditioning

A Framework for Online Inversion-Based 3D Site Characterization

Target-oriented wave-equation inversion with regularization in the subsurface-offset domain

Model-space vs. data-space normalization for recursive depth migration

Data dependent parameterization and covariance calculation for inversion of focusing operators

1D internal multiple prediction in a multidimensional world: errors and recommendations

Imaging and Full Waveform Inversion of seismic data from the CO 2 gas cloud at Sleipner

Tu P13 08 A MATLAB Package for Frequency Domain Modeling of Elastic Waves

Frequency Domain Wave Equation Inversion and Its Application on the Heterogeneous Reservoir Model Data

SEG/San Antonio 2007 Annual Meeting

We B3 12 Full Waveform Inversion for Reservoir Characterization - A Synthetic Study

Least Squares Kirchhoff Depth Migration: potentials, challenges and its relation to interpolation

Target-oriented wavefield tomography: A field data example

Electromagnetic migration of marine CSEM data in areas with rough bathymetry Michael S. Zhdanov and Martin Čuma*, University of Utah

H003 Deriving 3D Q Models from Surface Seismic Data Using Attenuated Traveltime Tomography

Full waveform inversion guided migration velocity analysis Thibaut Allemand* and Gilles Lambaré (CGG)

A MAP Algorithm for AVO Seismic Inversion Based on the Mixed (L 2, non-l 2 ) Norms to Separate Primary and Multiple Signals in Slowness Space

We G Time and Frequency-domain FWI Implementations Based on Time Solver - Analysis of Computational Complexities

Target-oriented computation of the wave-equation imaging Hessian

Wave equation least square imaging using the local angular Hessian for amplitude correction

k are within the range 0 k < k < π dx,

Efficient Beam Velocity Model Building with Tomography Designed to Accept 3d Residuals Aligning Depth Offset Gathers

An Efficient 3D Elastic Full Waveform Inversion of Time-Lapse seismic data using Grid Injection Method

Iterative resolution estimation in Kirchhoff imaging

Multi Layer Perceptron trained by Quasi Newton learning rule

SUMMARY THEORY INTRODUCTION

HTI anisotropy in heterogeneous elastic model and homogeneous equivalent model

Least-squares RTM: Reality and possibilities for subsalt imaging Ping Wang*, Adriano Gomes, Zhigang Zhang, and Ming Wang, CGG

Target-oriented wavefield tomography using demigrated Born data

3D image-domain wavefield tomography using time-lag extended images

G012 Scattered Ground-roll Attenuation for 2D Land Data Using Seismic Interferometry

Wave-equation inversion prestack Hessian

Which parameterization for acoustic vertical transverse. isotropic full waveform inversion? - Part 2: synthetic and

Bi-objective optimization for seismic survey design

SUMMARY. method to synthetic datasets is discussed in the present paper.

SUMMARY. Pursuit De-Noise (BPDN) problem, Chen et al., 2001; van den Berg and Friedlander, 2008):

Seismic data interpolation beyond aliasing using regularized nonstationary autoregression a

Lecture 2: Seismic Inversion and Imaging

Downloaded 10/23/13 to Redistribution subject to SEG license or copyright; see Terms of Use at

An imaging technique for subsurface faults using Teleseismic-Wave Records II Improvement in the detectability of subsurface faults

Transcription:

Accelerating the Hessian-free Gauss-Newton Full-waveform Inversion via Preconditioned Conjugate Gradient Method Wenyong Pan 1, Kris Innanen 1 and Wenyuan Liao 2 1. CREWES Project, Department of Geoscience, University of Calgary 2. Department of Mathematics and Statistics, University of Calgary Summary Full-waveform inversion (FWI) has emerged as a powerful strategy for estimating the subsurface model parameter. The gradient-based methods promise to converge globally but suffer from slow convergence rate. The Newton-type methods provide a quadratic convergence, but the computation, storage and inversion of the Hessian are beyond the current computation ability for large-scale inverse problem. The Hessian-free (HF) optimization method represents an attractive alternative to these above-mentioned optimization methods. At each iteration, it obtains the search direction by approximately solving the Newton linear system using a conjugate-gradient (CG) algorithm. The main goal of this paper is to accelerate the HF FWI by preconditioning the CG algorithm. In this research, different preconditioning schemes for the HF Gauss-Newton optimization method are developed. It is concluded that the quasi-newton L-BFGS preconditioning scheme with the pseudo diagonal Gauss-Newton Hessian as initial guess can speed up the HF Gauss-Newton FWI most efficiently. Introduction Full-waveform inversion (FWI) promises to provide high-resolution estimates of the subsurface parameter. FWI iteratively reconstructs the model parameters by minimizing a L-2 norm misfit function, which measures the difference between the observed data and synthetic data (Lailly, 1983; Tarantola, 1984; Virieux and Operto, 2009). The traditional optimization methods for FWI in exploration geophysics are gradient-based methods. The gradient-based methods are computationally attractive for large-scale inverse problem but also suffer from slow convergence rate. The search direction can be significantly enhanced by multiplying the gradient with the inverse Hessian matrix, which serves as a deconvolution operator for compensating the geometrical spreading effects and de-blurring the gradient. However, explicit calculation, storage and inversion of the Hessian at each iteration are beyond the current computation capability for large-scale inverse problem. In Gauss-Newton method, an approximate Hessian is introduced by only considering the first-order term (Pratt et al., 1998; Pan et al., 2015b). Pan et al. (2015a) used phase-encoding technology to construct the diagonal Gauss- Newton Hessian efficiently. Preconditioning the gradient with the diagonal pseudo-hessian resembles a deconvolution imaging condition (Pan et al., 2014b, 2015a). Instead of constructing the Hessian explicitly, the quasi-newton L-BFGS methods approximate the inverse Hessian iteratively by storing the model and gradient changes from previous iteration. The Hessian-free optimization method represents an attractive alternative to the above-described optimization methods (Nash, 1985). At each iteration, the search direction is computed by approximately solving the Newton equations through a matrix-free fashion of the conjugate-gradient (CG) algorithm, which is an optimal method for solving a positive definite system. In this paper, the full Hessian is replaced with the Gauss-Newton Hessian, which is always symmetric positive definite. Our main goal in this paper is to GeoConvention 2016: Optimizing Resources 1

precondition the CG algorithm for reducing the CG iterations and accelerating the HF Gauss-Newton fullwaveform inversion. Different preconditioning schemes are developed for comparison in this paper. The Hessian-free Optimization Methods Instead of constructing Hessian approximations or approximating inverse Hessian, the Hessian-free (HF) optimization method obtains the search direction by solving the Newton linear system approximately using a conjugate-gradient (CG) method. The CG method is an optimal algorithm for solving a symmetric positive definite system and it only requires computing the Hessian-vector products. In the context of HF optimization method, the Hessian is always replaced with Gauss-Newton Hessian, which is always symetric and positive definite: (H k+ εa) mk = gk, (1) where m k is the search direction in k th iteration, H k is the Gauss-Newton Hessian, g k is the gradient, ε A is the stabilization term, ε is a small constant value and A=max(H ). Preconditioning One problem of the CG iterative algorithm is that it requires many iterations when obtaining the approximate solution of a linear system. The convergence rate of the CG method depends on the spectral properties (e.g., its eigenvalues) of the coefficient matrix. It is often convenient to transform the equation system into a system that has the same solution but has more favorable spectral properties. 1 This can be achieved by applying a suitable preconditioner M on the linear system. Thus, the preconditioned Newton system for the HF Gauss-Newton FWI is given by: 1 1 M (H + εa) m = M g. (2) k k k k k The solution of equation (2) can be obtained by the preconditioned conjugate-gradient (PCG) method. In this paper, we developed efficient preconditioning schemes for the inner iterative algorithm. k The preconditioner for the CG method is always devised to approximate the Hessian or the inverse Hessian. We first consider the traditional Hessian approximations (e.g., diagonal pseudo-hessian and diagonal Gauss-Newton Hessian) as the preconditioners for the CG inner iteration. The pseudo-hessian is constructed by replacing the partial derivative wavefield with the virtual source in the correlation process. We also propose a pseudo diagonal Gauss-Newton Hessian approximation as the preconditioner for the CG algorithm in the inner loop, which can be expressed as: 4 2 4 H ( x) = R( ω f ( ω) G( xx,, ω) ). (3) diag s s xg xs ω Furthermore, we develop an L-BFGS preconditioning scheme for the HF optimization method, namely L- BFGS-GN method for H 0 = I. The approximated inverse Hessian H can also be used as a preconditioner for the CG iterative method: 1 1 H (H + εa) m = H g. k k k k k An identity matrix is usually set as the initial guess for the L-BFGS preconditioner, which is actually important for the L-BFGS method. To improve the performance of the L-BFGS preconditioning scheme, we consider using the stabilized diagonal pseudo-hessian, diagonal Gauss-Newton Hessian and pseudo diagonal Gauss-Newton Hessian as the initial guess for constructing the L-BFGS preconditioner, which are named as L-BFGS-GN-DPH, L-BFGS-GN-DGH and L-BFGS-GN-PDGH methods respectively. Numerical Examples A more complex modified Marmousi model is used to examine the efficiency of different preconditioning schemes for the HF Gauss-Newton full-waveform inversion. The truncated Marmousi model has 100 X 100 GeoConvention 2016: Optimizing Resources 2

grid cells with a grid interval of 10 m in both horizontal and vertical directions. Figures 1a and 1b show the true P-wave velocity model and initial P-wave velocity model. The initial velocity model is obtained by smoothing the true model with a Gaussian function. The inversion process is carried out with a multi-scale approach for mitigating the cycle-skipping problem. The frequency is increased from 5 Hz to 30 Hz with a partial overlap-frequency selection strategy, in which a group of 3 frequencies are used for inversion simultaneously. The frequency group increases from low to high with 2 frequencies overlapped and for each frequency band, a number of 5 outer iterations are performed. The stopping criteria for the inner iteration is k max = 10 or γ min = 2e 1. The stabilization parameters are σ = 1.0e 2 and λ = 1.0e 2. Figure 1. (a) True P-wave velocity model; (b) Initial P-wave velocity model. Figures 2a and 2b show the inversion results by steepest-descent (SD) and L-BFGS methods. Figures 2c and 2d show the well log data comparison at 0.1 km and 0.6 km respectively. As we can see, the deep parts of the velocity model can not be recovered very well. Figures 3a, 3b, 3c and 3d show the inverted models by CG-GN, L- BFGS-GN, DPH-GN and L-BFGS-GN-DPH methods respectively. We notice that compared to SD and L-BFGS methods, the HF GN methods can recover the deep parts of the velocity model much better. Furthermore, we also see that the inversion results by preconditioned HF GN methods are enhanced compared to non-preconditioned HF GN method (GN-CG). Figure 2. (a) SD method; (b) L-BFGS method. (c) and (d) show the well log data comparison at 0.1 km and 0.6 km respectively. Figures 4a and 4b show the inversion results by DGH-GN and L-BFGS-GN-DGH methods with λ = 1.0e 2. The inversion results are contaminated by artifacts. Figures 4c and 4d are the inversion results by DGH-GN and L-BFGS-GN-DGH methods with λ = 5.0e 2. As we can see, the inversion results become much better. Figure 5a and 5b are the inversion results by PDGH-GN and L-BFGS-GN- PDGH methods with λ = 5.0e 2. Figure 5c and 5d show the well log data comparison at 0.1 km and 0.6 km respectively. It can be seen that the velocity model has been reconstructed well. Figure 6 shows the convergence rate of differen methods. Figures 7a and 7b show the normalized misfit vs. number of forward problems solved and normalized misfit vs. computation time. The preconditioned HF Gauss-Newton methods can reconstruct the model more efficiently than the non-preconditioned one (CG- GN method). The L-BFGS preconditioning schemes can reduce the computation burden better than the preconditioning methods based on diagonal Hessian approximations. Proper stabilization parameter should be determined to ensure the effectiveness and stability of the DGH-GN and $l$-bfgs-gn-dgh methods. Although, L-BFGS-GN-DGH method ( λ = 5.0e 2 ) provides fast convergence rate as shown in Figure 6. More computation cost is required for calculating the receiver-side Green's functions. The L-BFGS-GN- PDGH method ( λ = 5.0e 2 ) based on the proposed pseudo diagonal Gauss-Newton does not need additional computational cost compared CG-GN method and it shows the best performance in reducing the computation burden and accelerating the HF Gauss-Newton FWI. GeoConvention 2016: Optimizing Resources 3

Figure 3. (a) CG-GN method; (b) L-BFGS-GN method; (c) DPH-GN method; (d) L-BFGS-GN-DPH method. (e) and (f) show the well log data comparison at 0.1 km and 0.6 km respectively. Figure 4. (a) and (b) are the inversion results by DGH-GN and L- BFGS-GN-DGH methods with λ = 1.0e 2. (c) and (d) are the inversion results by DGH-GN and L-BFGS-GN-DGH methods with λ = 5.0e 2. (e) and (f) show the well log data comparison at 0.1 km and 0.6 km respectively. Figure 5. (a) and (b) are the inversion results by PDGH-GN and L-BFGS-GN-PDGH methods with λ = 5.0e 2. (c) and (d) show the well log data comparison at 0.1 km and 0.6 km respectively. Figure 6. The convergence history of different methods. Conclusions In this paper, a Hessian-free Gauss-Newton method is carried out for full-waveform inversion. To accelerate the Hessian-free Gauss-Newton method, we develop different preconditioning schemes for the inner CG algorithm at each outer iteration. A pseudo diagonal Gauss-Newton Hessian is also proposed as preconditioner based on the reciprocal property of the Green's function. We present numerical examples to show that the preconditioning schemes can improve the convergence rate of Hessian-free Gauss-Newton FWI and reduce the computation cost. Acknowledgements Figure 7. (a) Normalized misfit vs. Number of forward problems solved; (b) Normalized misfit vs. Computation time (s). This research was supported by the Consortium for Research in Elastic Wave Exploration Seismology (CREWES) and National Science and Engineering Research Council of Canada (NSERC, CRDPJ 379744-08). GeoConvention 2016: Optimizing Resources 4

References Lailly, P., 1983, The seismic inverse problem as a sequence of before stack migration: Conference on Inverse Scattering, Theory and Applications, SIAM, Expanded Abstracts, 206-220. Nash, S. G., 1985, Preconditioning of truncated-newton methods: SIAM J. Sci. Statist. Comput, 6, 599-616. Pratt, R. G., C. Shin, and G. J. Hicks, 1998, Gauss-Newton and full Newton methods in frequency-space seismic waveform inversion: Geophysical Journal International, 133, 341-362. Pan, W., K. A. Innanen, and G. F. Margrave, 2014a, A comparison of different scaling methods for least-squares migration/inversion: EAGE Expanded Abstracts, We G103 14. Pan, W., K. A. Innanen, G. F. Margrave, and D. Cao, 2015a, Efficient pseudo-gauss-newton full-waveform inversion in the t-p domain: Geophysics, 80, no. 5, R225-R14. Pan, W., K. A. Innanen, G. F. Margrave, M. C. Fehler, X. Fang, and J. Li, 2015b, Estimation of elastic constants in HTI media using Gauss-Newton and Full-Newton multi-parameter full waveform inversion: SEG Technical Program Expanded Abstracts, 1177-1182. Pan, W., G. F. Margrave, and K. A. Innanen, 2014b, Iterative modeling migration and inversion (IMMI): Combining full waveform inversion with standard inversion methodology: SEG Technical Program Expanded Abstracts, 938-943. Tarantola, A., 1984, Inversion of seismic reflection data in the acoustic approximation: Geophysics, 49, 1259-1266. Virieux, A. and S. Operto, 2009, An overview of full-waveform inversion in exploration geophysics: Geophysics, 74, no. 6, WCC1- WCC26. GeoConvention 2016: Optimizing Resources 5