A Provably Convergent Multifidelity Optimization Algorithm not Requiring High-Fidelity Derivatives

Similar documents
Multidisciplinary System Design Optimization (MSDO)

Multifidelity and Adaptive Optimization Methods

APPLICATION OF VARIABLE-FIDELITY MODELS TO AERODYNAMIC OPTIMIZATION

Model Reduction for Variable-Fidelity Optimization Frameworks

Introduction to optimization methods and line search

Delaunay-based Derivative-free Optimization via Global Surrogate. Pooriya Beyhaghi, Daniele Cavaglieri and Thomas Bewley

Comparison of Interior Point Filter Line Search Strategies for Constrained Optimization by Performance Profiles

Reduced Order Models for Oxycombustion Boiler Optimization

Today. Golden section, discussion of error Newton s method. Newton s method, steepest descent, conjugate gradient

A penalty based filters method in direct search optimization

Tree-GP: A Scalable Bayesian Global Numerical Optimization algorithm

Modern Methods of Data Analysis - WS 07/08

Recent Developments in Model-based Derivative-free Optimization

Convex Optimization MLSS 2015

Note Set 4: Finite Mixture Models and the EM Algorithm

RESPONSE SURFACE BASED OPTIMIZATION WITH A CARTESIAN CFD METHOD

4.12 Generalization. In back-propagation learning, as many training examples as possible are typically used.

Lecture 12: Feasible direction methods

Constrained and Unconstrained Optimization

State of the art at DLR in solving aerodynamic shape optimization problems using the discrete viscous adjoint method

Performance Evaluation of an Interior Point Filter Line Search Method for Constrained Optimization

Introduction to ANSYS DesignXplorer

COMPARISON OF INFILL SAMPLING CRITERIA IN KRIGING-BASED AERODYNAMIC OPTIMIZATION

A penalty based filters method in direct search optimization

Design optimization and design exploration using an open source framework on HPC facilities Presented by: Joel GUERRERO

Model Fitting. CS6240 Multimedia Analysis. Leow Wee Kheng. Department of Computer Science School of Computing National University of Singapore

Introduction to Aerodynamic Shape Optimization

Lift Superposition and Aerodynamic Twist Optimization for Achieving Desired Lift Distributions

Scattered Data Problems on (Sub)Manifolds

A derivative-free trust-region algorithm for reliability-based optimization

Parameterization of Airfoils and Its Application in Aerodynamic Optimization

Topology Optimization of Two Linear Elastic Bodies in Unilateral Contact

Analysis of the Adjoint Euler Equations as used for Gradient-based Aerodynamic Shape Optimization

Lecture 11: Randomized Least-squares Approximation in Practice. 11 Randomized Least-squares Approximation in Practice

SURVEY OF MULTIFIDELITY METHODS IN UNCERTAINTY PROPAGATION, INFERENCE, AND OPTIMIZATION

INTRODUCTION AN INTRODUCTION TO DIRECT OPTIMIZATION METHODS. SIMPLE EXAMPLE: COMPASS SEARCH (Con t) SIMPLE EXAMPLE: COMPASS SEARCH

Studies of the Continuous and Discrete Adjoint Approaches to Viscous Automatic Aerodynamic Shape Optimization

CHAOTIC ANT SYSTEM OPTIMIZATION FOR PATH PLANNING OF THE MOBILE ROBOTS

Constrained Aero-elastic Multi-Point Optimization Using the Coupled Adjoint Approach

Simulation Calibration with Correlated Knowledge-Gradients

Improving network robustness

1. Introduction. performance of numerical methods. complexity bounds. structural convex optimization. course goals and topics

Design parameterization for topology optimization by intersection of an implicit function

A NURBS-BASED APPROACH FOR SHAPE AND TOPOLOGY OPTIMIZATION OF FLOW DOMAINS

Conceptual Design and CFD

Computational Fluid Dynamics - Incompressible Flows

Non-Stationary Covariance Models for Discontinuous Functions as Applied to Aircraft Design Problems

A Comparative Study of LOWESS and RBF Approximations for Visualization

Comparing different interpolation methods on two-dimensional test functions

The Mesh Adaptive Direct Search Algorithm for Discrete Blackbox Optimization

Numerical and theoretical analysis of shock waves interaction and reflection

A fast algorithm for sparse reconstruction based on shrinkage, subspace optimization and continuation [Wen,Yin,Goldfarb,Zhang 2009]

Aerospace Applications of Optimization under Uncertainty

Efficient Multi-point Aerodynamic Design Optimization Via Co-Kriging

Convexization in Markov Chain Monte Carlo

CPSC 340: Machine Learning and Data Mining. Robust Regression Fall 2015

CS281 Section 3: Practical Optimization

A Multi-scale Approach to 3D Scattered Data Interpolation with Compactly Supported Basis Functions

A Deterministic Global Optimization Method for Variational Inference

Introduction to unconstrained optimization - derivative-free methods

DS Machine Learning and Data Mining I. Alina Oprea Associate Professor, CCIS Northeastern University

Approximation of a Fuzzy Function by Using Radial Basis Functions Interpolation

Vehicle Sketch Pad Applied To Propulsion-Airframe Integration

Simulation Calibration with Correlated Knowledge-Gradients

Single and multi-point aerodynamic optimizations of a supersonic transport aircraft using strategies involving adjoint equations and genetic algorithm

Exploring Econometric Model Selection Using Sensitivity Analysis

Introduction to Computer Graphics. Modeling (3) April 27, 2017 Kenshi Takayama

ATI Material Do Not Duplicate ATI Material. www. ATIcourses.com. www. ATIcourses.com

Progressive Validity Metamodel Trust Region Optimization. Quinn Parker Thomson

CS321 Introduction To Numerical Methods

Experimental Data and Training

A Data-based Approach for Fast Airfoil Analysis and Optimization

Airfoil shape optimization using adjoint method and automatic differentiation. Praveen. C

Iteratively Re-weighted Least Squares for Sums of Convex Functions

A family of Non-Oscillatory, 6 points, Interpolatory subdivision scheme

ADAPTIVE FINITE ELEMENT

Edge and local feature detection - 2. Importance of edge detection in computer vision

An augmented Lagrangian method for equality constrained optimization with fast infeasibility detection

DS Machine Learning and Data Mining I. Alina Oprea Associate Professor, CCIS Northeastern University

Recent advances in Metamodel of Optimal Prognosis. Lectures. Thomas Most & Johannes Will

CONFORMAL MAPPING POTENTIAL FLOW AROUND A WINGSECTION USED AS A TEST CASE FOR THE INVISCID PART OF RANS SOLVERS

Introduction. Optimization

LS-OPT Current development: A perspective on multi-level optimization, MOO and classification methods

UNIVERSITY OF OSLO. Please make sure that your copy of the problem set is complete before you attempt to answer anything. k=1

Tools & Applications 1. Introduction 2. Design of Matrix Turbines

A progressive barrier derivative-free trust-region algorithm for constrained optimization

Mixture Models and EM

An Optimization Method Based On B-spline Shape Functions & the Knot Insertion Algorithm

Reflector profile optimisation using Radiance

Algorithms for convex optimization

IE598 Big Data Optimization Summary Nonconvex Optimization

An Efficient Aerodynamic Optimization Method using a Genetic Algorithm and a Surrogate Model

A Multilevel Proximal Gradient Algorithm for a Class of Composite Optimization Problems

Randomized Algorithms 2017A - Lecture 10 Metric Embeddings into Random Trees

Quick Start with CASSY Lab. Bi-05-05

A Brief Look at Optimization

Nelder-Mead Enhanced Extreme Learning Machine

Fast oriented bounding box optimization on the rotation group SO(3, R)

THE EFFECTS OF THE PLANFORM SHAPE ON DRAG POLAR CURVES OF WINGS: FLUID-STRUCTURE INTERACTION ANALYSES RESULTS

Truss structural configuration optimization using the linear extended interior penalty function method

Transcription:

A Provably Convergent Multifidelity Optimization Algorithm not Requiring High-Fidelity Derivatives Multidisciplinary Design Optimization Specialist Conference April 4, 00 Andrew March & Karen Willcox

Motivation BCFD viscous solution: 90 CPU hours Pre/post-processing time not included Can this configuration be optimized? Winler et al. 4 th CFD Drag Prediction Worshop, San Antonio TX, June 009

Multifidelity Surrogates Definition: High-Fidelity The best model of reality that is available and affordable, the analysis that is used to validate the design. Definition: Low(er)-Fidelity A method with unnown accuracy that estimates metrics of interest but requires lesser resources than the high-fidelity analysis. Reduced Physics Coarsened Mesh Hierarchical Models Reduced Order Model Regression Model Approximation Models x f(x) x f(x) f(x ) x x 3

Main Messages Bayesian model calibration offers an efficient framewor for multifidelity optimization. Can reduce the number of high-fidelity function evaluations compared to other multifidelity methods. Does not require high-fidelity gradient estimates. Provides a flexible and robust alternative to nesting when there are multiple low-fidelity models. 4

Motivation-Calibration Methods First-order trust-region methods: Efficient for multifidelity optimization when derivatives are available or can be approximated efficiently Calibrated surrogate models are only used for one iteration Pattern-search methods: High-fidelity information can be reused Can be slow to converge Bayesian calibration methods (e.g., Efficient Global Optimization) Reuse high-fidelity information from iteration to iteration Can be quite efficient in practice Heuristic, no guarantee they converge to an optimum Goal: Develop a multifidelity optimization algorithm that combines Bayesian calibration and reuse of high-fidelity information in a manner provably convergent to an optimum of the high-fidelity function 5

Bayesian Model Calibration Define a surrogate model of the high-fidelity function: m ( x) f ( x) + e ( x) f ( x) low high The error model, e(x): Is a radial basis function model Interpolates f high (x)- f low (x) exactly at all selected calibration points Convergence can be proven if surrogate model is fully linear within a trust region Define trust region at iteration : B n { x R : x } = x 6

Definition: Fully Linear Model Definition: For all x within a trust region of size (0, max ), a fully linear model, m (x), satisfies f ( x) ( x) high m for a Lipschitz constant κ g, and with a Lipschitz constant κ f. f κ high( x) m ( x) κ f Conn et al. (009) shows that in a trust region setting, fully linear models are sufficient to prove convergence to a stationary point of f high (x). Requires: f high (x) is continuously differentiable, has Lipschitz continuous first derivative, and is bounded from below Multifidelity method also requires that f low (x) is continuously differentiable and has Lipschitz continuous first derivative g 7

Fully Linear RBF Models Standard radial basis function model: e y ( x) = λiφ( x x yi ; ξ ) + i= Radial basis function (RBF) model requirements: RBF, φ, is twice continuously differentiable φ(r) has zero derivative at r=0 φ = e Polynomial basis, π, is linear Wild et al. (008) showed that an RBF model can be made fully linear by construction Places conditions on the sample points used to construct the RBF model n+ i= v π ( x x i ) ξ r 8

Function Evaluation Points RBF model has sufficient local behavior to guarantee convergence It also captures some global behavior First-order trust region approaches only loo at the center of the current trust region RBF model will liely require fewer high-fidelity evaluations x 9 x Initial Trust-Region Interpolation Points x Radial Basis Function Calibration Approach x Initial Trust-Region Finite Difference Points x x x nd Iteration Interpolation Points First-Order Trust Region Approach x nd Iteration Finite Difference Points

Unconstrained Algorithm Summary Solve the trust-region subproblem to determine a candidate step, s : min m ( x + s ) Evaluate f high at the candidate point and compute the ratio of actual to predicted reduction: fhigh( x ) fhigh( x + s ) ρ = m ( x ) m ( x + s ) Accept/reject iterate: x + s ρ > 0 Update trust region size: + Form new fully linear model = Perform convergence chec: s R s. t. x n s + otherwise min{, = 0.5 ρ η ρ < η and reduce size of trust region until convergence proved [called the criticality chec in Conn et al. (009)] m x m max } ( x ) ε and ε { x x x } + ( x), on : + + 0

Supersonic Airfoil Test Problem Linear Panel Method Shoc-Expansion Theory Biconvex airfoil in supersonic flow - α= o,m =.5 - (t/c) = 5% Linear Panels Shoc Expansion Cart3D C L 0.44 0.78 0.498 % Difference 0.46%.6% 0.00% Cart3D C D 0.064 0.067 0.0666 % Difference.56% 0.4% 0.00%

Airfoil Parameterization Panel method and shoc-expansion theory require sharp leading and trailing edges Parameterization has equal number of spline points on upper and lower surface and angle of attac Figure shows minimum drag solutions for design variables

Airfoil Optimization Results Models: Low-Fidelity: Panel Method High-Fidelity: S-E Theory Airfoil parameterization: Angle of Attac Equal # of upper/lower surface spline points Optimization tolerance: step 5x0-6 or dm/dx 5x0-4 Criteria: Fewest highfidelity function evaluations Average of 5 runs with random ICs 3 # of Variables: 7 5 Quasi-Newton 667 048 408 73 β-correlation 83 46 503 546 Add-Correction 68 3 49 RBF α= 6 76 7 8 RBF α* 38 6 46 High-Fidelity Function Calls

Combining Multiple Fidelity Levels Nesting Maximum Lielihood Nested Approach Classic approach Possible exponential scaling in function evaluations, e.g. 50 high-fidelity evaluations, 500 medium-fidelity evaluations, 5,000 low-fidelity evaluations Maximum Lielihood Approach Flexibility in selecting low/medium-fidelity function calls Robust to poor models 4

Combining Multiple Lower-Fidelities The RBF error interpolation can be treated as a Kriging model that predicts the high-fidelity function with a normally distributed error: f f Using Kriging models for each of the lower-fidelity functions, a maximum lielihood estimate for the high-fidelity function is: f high high high σ high ( x) f ( x) + Ν e ( x) σ ( x) ( x) = σ med ( ) + low f + ( + ) + + low elow f med emed σ low σ med σ low σ med low low + σ med ( emed x, σ med x ) ( ) ( x) f ( x) + Ν ( ) ( ) med low σ, low To fit in the original multifidelity algorithm, only one of the two lowerfidelity functions needs to be sampled at the required calibration points. This allows substantial flexibility in selecting when each lower-fidelity function is used. σ 5

3-Fidelity Supersonic Airfoil Results Maximum lielihood approach reduced high-fidelity function calls for all cases. Results use the same calibration points for all lower-fidelity functions Fancier sampling methods can be used Nested approach failed to converge with a non-smooth high-fidelity function (Cart3D): Cart3D Shoc-Expansion Theory Panel Method Two-Fidelities 88 0 47679 Max. Lielihood 66 397 397 Nested 66* 790* 67644 Function Calls The maximum lielihood approach is robust to the poor information. A camberline model estimates drag poorly (thicness is ignored) The best result of the nested approach is shown, average result otherwise Shoc-Expansion Theory Panel Method Camberline Two-Fidelities 6 43665 0 Max. Lielihood 84 30057 30057 Nested ** 597** 3496** Function Calls 6

Conclusion Explained the need for convergent high-fidelity derivativefree methods Demonstrated convergence of an unconstrained multifidelity optimization algorithm using Bayesian model calibration Through numerical experiments, showed that the method wors for nonsmooth functions Has performance comparable to other state-of-the-art design methods Developed a maximum lielihood method to combine multiple lower fidelities into a single estimate of the highfidelity function. Showed that this technique converges faster than nesting, is robust to poor information, and allows flexible sampling. 7

Acnowledgements The authors gratefully acnowledge support from NASA Langley Research Center contract NNL07AA33C technical monitor Natalia Alexandrov. A National Science Foundation graduate research fellowship. Michael Aftosmis and Marian Nemec for support with Cart3D. 8

Questions? 9