ALAMO: Automatic Learning of Algebraic Models for Optimization

Similar documents
DERIVATIVE-FREE OPTIMIZATION ENHANCED-SURROGATE MODEL DEVELOPMENT FOR OPTIMIZATION. Alison Cozad, Nick Sahinidis, David Miller

The ALAMO approach to machine learning: Best subset selection, adaptive sampling, and constrained regression

Surrogate models and the optimization of systems in the absence of algebraic models

Reduced Order Models for Oxycombustion Boiler Optimization

Learning surrogate models for simulation-based optimization

Bridging The Gap Between Industry And Academia

GA A22637 REAL TIME EQUILIBRIUM RECONSTRUCTION FOR CONTROL OF THE DISCHARGE IN THE DIII D TOKAMAK

Go SOLAR Online Permitting System A Guide for Applicants November 2012

Washington DC October Consumer Engagement. October 4, Gail Allen, Sr. Manager, Customer Solutions

OPTIMIZING CHEMICAL SENSOR ARRAY SIZES

SmartSacramento Distribution Automation

In-Field Programming of Smart Meter and Meter Firmware Upgrade

NIF ICCS Test Controller for Automated & Manual Testing

Portable Data Acquisition System

Intelligent Grid and Lessons Learned. April 26, 2011 SWEDE Conference

PJM Interconnection Smart Grid Investment Grant Update

On Demand Meter Reading from CIS

Adding a System Call to Plan 9

Southern Company Smart Grid

Information to Insight

The NMT-5 Criticality Database

TUCKER WIRELINE OPEN HOLE WIRELINE LOGGING

MULTIPLE HIGH VOLTAGE MODULATORS OPERATING INDEPENDENTLY FROM A SINGLE COMMON 100 kv dc POWER SUPPLY

FY97 ICCS Prototype Specification

Big Data Computing for GIS Data Discovery

Advanced Synchrophasor Protocol DE-OE-859. Project Overview. Russell Robertson March 22, 2017

LosAlamos National Laboratory LosAlamos New Mexico HEXAHEDRON, WEDGE, TETRAHEDRON, AND PYRAMID DIFFUSION OPERATOR DISCRETIZATION

Testing PL/SQL with Ounit UCRL-PRES

Large Scale Test Simulations using the Virtual Environment for Test Optimization

Real-Time Particulate Filter Soot and Ash Measurements via Radio Frequency Sensing

Protecting Control Systems from Cyber Attack: A Primer on How to Safeguard Your Utility May 15, 2012

Entergy Phasor Project Phasor Gateway Implementation

Centrally Managed. Ding Jun JLAB-ACC This process of name resolution and control point location

PJM Interconnection Smart Grid Investment Grant Update

ACCELERATOR OPERATION MANAGEMENT USING OBJECTS*

METADATA REGISTRY, ISO/IEC 11179

GA A22720 THE DIII D ECH MULTIPLE GYROTRON CONTROL SYSTEM

Electronic Weight-and-Dimensional-Data Entry in a Computer Database

Integrated Volt VAR Control Centralized

REAL-TIME MULTIPLE NETWORKED VIEWER CAPABILITY OF THE DIII D EC DATA ACQUISITION SYSTEM

CHANGING THE WAY WE LOOK AT NUCLEAR

R. P. Evans (INEEL) J. G. Keller (INEEL) A. G. Stephens (ISU) J. Blotter (ISU) May 17, 1999 May 20, 1999

Parallel Execution of Functional Mock-up Units in Buildings Modeling

and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.

NATIONAL GEOSCIENCE DATA REPOSITORY SYSTEM

ENDF/B-VII.1 versus ENDFB/-VII.0: What s Different?

Alignment and Micro-Inspection System

Real Time Price HAN Device Provisioning

Systems Integration Tony Giroti, CEO Bridge Energy Group

5A&-qg-oOL6c AN INTERNET ENABLED IMPACT LIMITER MATERIAL DATABASE

Stereo Vision Based Automated Grasp Planning

Optimizing Bandwidth Utilization in Packet Based Telemetry Systems. Jeffrey R Kalibjian

@ST1. JUt EVALUATION OF A PROTOTYPE INFRASOUND SYSTEM ABSTRACT. Tom Sandoval (Contractor) Los Alamos National Laboratory Contract # W7405-ENG-36

COMPUTATIONAL FLUID DYNAMICS (CFD) ANALYSIS AND DEVELOPMENT OF HALON- REPLACEMENT FIRE EXTINGUISHING SYSTEMS (PHASE II)

Development of Web Applications for Savannah River Site

Graphical Programming of Telerobotic Tasks

Cross-Track Coherent Stereo Collections

ESNET Requirements for Physics Reseirch at the SSCL

Contributors: Surabhi Jain, Gengbin Zheng, Maria Garzaran, Jim Cownie, Taru Doodi, and Terry L. Wilmarth

High Scalability Resource Management with SLURM Supercomputing 2008 November 2008

FSEC Procedure for Testing Stand-Alone Photovoltaic Systems

Data Management Technology Survey and Recommendation

Saiidia National Laboratories. work completed under DOE ST485D sponsored by DOE

DOE EM Web Refresh Project and LLNL Building 280

Final Report for LDRD Project Learning Efficient Hypermedia N avi g a ti o n

ALAMO user manual and installation guide v

Resource Management at LLNL SLURM Version 1.2

ABSTRACT. W. T. Urban', L. A. Crotzerl, K. B. Spinney', L. S. Waters', D. K. Parsons', R. J. Cacciapouti2, and R. E. Alcouffel. 1.

KCP&L SmartGrid Demonstration

MAS. &lliedsignal. Design of Intertransition Digitizing Decomutator KCP Federal Manufacturing & Technologies. K. L.

Charles L. Bennett, USA. Livermore, CA 94550

Tim Draelos, Mark Harris, Pres Herrington, and Dick Kromer Monitoring Technologies Department Sandia National Laboratories

Waste Treatment Plant (WTP) Pre-Engineering Platform (PEP) Automated Operation Control via Programmable Logic Controllers (PLC)

zorder-lib: Library API for Z-Order Memory Layout

NFRC Spectral Data Library #4 for use with the WINDOW 4.1 Computer Program

Cyber Threat Prioritization

Software Integration Plan for Dynamic Co-Simulation and Advanced Process Control

BWXT Y-12 Y-12. A BWXT/Bechtel Enterprise COMPUTER GENERATED INPUTS FOR NMIS PROCESSOR VERIFICATION. Y-12 National Security Complex

IN-SITU PROPERTY MEASUREMENTS ON LASER-DRAWN STRANDS OF SL 5170 EPOXY AND SL 5149 ACRYLATE

A VERSATILE DIGITAL VIDEO ENGINE FOR SAFEGUARDS AND SECURITY APPLICATIONS

Leonard E. Duda Sandia National Laboratories. PO Box 5800, MS 0665 Albuquerque, NM

REAL-TIME CONTROL OF DIII D PLASMA DISCHARGES USING A LINUX ALPHA COMPUTING CLUSTER

v /4 Quick Short Test Report Technical Publication Transfer Test Hughes Tucson Support Systems Operation MIL-D-28001A (SGML) CTN Test Report AFTB-ID

- Q807/ J.p 7y qj7 7 w SfiAJ D--q8-0?dSC. CSNf. Interferometric S A R Coherence ClassificationUtility Assessment

The APS Control System-Network

A METHOD FOR EFFICIENT FRACTIONAL SAMPLE DELAY GENERATION FOR REAL-TIME FREQUENCY-DOMAIN BEAMFORMERS

Site Impact Policies for Website Use

The ANLABM SP Scheduling System

Collaborating with Human Factors when Designing an Electronic Textbook

Productivity and Injectivity of Horizohtal Wells

EPICS Add On Products SourceRelease Control

Implementation of the AES as a Hash Function for Confirming the Identity of Software on a Computer System

ADVANCED ENDPOINT PROTECTION COMPARATIVE REPORT

Testing of PVODE, a Parallel ODE Solver

Biological Material Transfer Agreement. between (PROVIDER) and. Date: A. Specific Terms of Agreement (Implementing Section)

Causal Modeling of Observational Cost Data: A Ground-Breaking use of Directed Acyclic Graphs

ADAPTIVE MESH REFINEMENT IN CTH

INCLUDING MEDICAL ADVICE DISCLAIMER

Fermi National Accelerator Laboratory

Los Alamos National Laboratory

Transcription:

ALAMO: Automatic Learning of Algebraic Models for Optimization Alison Cozad 1,2, Nick Sahinidis 1,2, David Miller 2 1 National Energy Technology Laboratory, Pittsburgh, PA,USA 2 Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.

CARBON CAPTURE CHALLENGE The traditional pathway from discovery to commercialization of energy technologies can be quite long, i.e., ~ 2-3 decades President s plan requires that barriers to the widespread, safe, and cost-effective deployment of CCS be overcome within 10 years To help realize the President s objectives, new approaches are needed for taking carbon capture concepts from lab to power plant, quickly, and at low cost and risk CCSI will accelerate the development of carbon capture technology, from discovery through deployment, with the help of science-based simulations Bench Research ~ 1 kwe Small pilot < 1 MWe Medium pilot 1 5 MWe Semi-works pilot 20-35 MWe First commercial plant, 100 MWe Deployment, >500 MWe, >300 plants Carnegie Mellon University 2

CARBON CAPTURE SIMULATION INITIATIVE Identify promising concepts Reduce the time for design & troubleshooting Quantify the technical risk, to enable reaching larger scales, earlier Stabilize the cost during commercial deployment National Labs Academia Industry Carnegie Mellon University 3

CENTRAL ACTIVITY: OPTIMIZATION Carnegie Mellon University 4

PROCESS DISAGGREGATION Block 1: Simulator Model generation Block 2: Simulator Model generation Block 3: Simulator Model generation Process Simulation Disaggregate process into process blocks Surrogate Models Build simple and accurate models with a functional form tailored for an optimization framework Optimization Model Add algebraic constraints h(x)=0: design specs, heat/mass balances, and logic constraints Carnegie Mellon University 5

LEARNING PROBLEM STATEMENT Build a model of output variables z as a function of input variables x over a specified interval Process simulation Independent variables: Operating conditions, inlet flow properties, unit geometry Dependent variables: Efficiency, outlet flow conditions, conversions, heat flow, etc. Carnegie Mellon University 6

ALAMO Automated Learning of Algebraic Models for Optimization Start Initial sampling Build surrogate model Update training data set false Adaptive sampling Model converged? true Stop Black-box function Carnegie Mellon University 7

ALAMO Automated Learning of Algebraic Models for Optimization Start Initial sampling Build surrogate model Update training data set false Adaptive sampling Model converged? true Stop Training data Black-box function Carnegie Mellon University 8

ALAMO Automated Learning of Algebraic Models for Optimization Start Initial sampling Build surrogate model Update training data set false Adaptive sampling Model converged? true Stop Current model Training data Black-box function Carnegie Mellon University 9

ALAMO Automated Learning of Algebraic Models for Optimization Start Initial sampling Build surrogate model Update training data set Adaptive sampling Model error false Model converged? true Stop Current model Training data Black-box function Carnegie Mellon University 10

ALAMO Automated Learning of Algebraic Models for Optimization Start Initial sampling Build surrogate model Update training data set false Adaptive sampling Model converged? true Stop Training data Black-box function Carnegie Mellon University 11

ALAMO Automated Learning of Algebraic Models for Optimization Start Initial sampling Build surrogate model Update training data set false Adaptive sampling Model converged? true Stop New model Training data Black-box function Carnegie Mellon University 12

HOW TO BUILD THE SURROGATES We aim to build surrogate models that are Accurate We want to reflect the true nature of the simulation Tailored for algebraic optimization Generated from a minimal data set Carnegie Mellon University 13

MODEL IDENTIFICATION Goal: Identify the functional form and complexity of the surrogate models Functional form: General functional form is unknown: Our method will identify models with combinations of simple basis functions Carnegie Mellon University 14

OVERFITTING AND TRUE ERROR Error Ideal Model Complexity True error Empirical error Underfitting Overfitting Step 1: Define a large set of potential basis functions Step 2: Model reduction Carnegie Mellon University 15

OVERFITTING AND TRUE ERROR Error Ideal Model True error To identify the simple Complexity functional form we need to solve two problems: Underfitting Step 1: Define a large set of potential basis functions Overfitting 1. Model Sizing 2. Basis function selection Empirical error Step 2: Model reduction Carnegie Mellon University 16

BASIS FUNCTION SELECTION Carnegie Mellon University 17

BASIS FUNCTION SELECTION Find the model with the least error Carnegie Mellon University 18

BASIS FUNCTION SELECTION We will solve this model for increasing T until we determine a model Carnegie Mellon University 19

BASIS FUNCTION SELECTION Carnegie Mellon University 20

BASIS FUNCTION SELECTION Carnegie Mellon University 21

ALAMO Automated Learning of Algebraic Models for Optimization Start Initial sampling Build surrogate model Update training data set Adaptive sampling Model error false Model converged? true Stop Current model Training data Black-box function Carnegie Mellon University 22

ERROR MAXIMIZATION SAMPLING New goal: Search the problem space for areas of model inconsistency or model mismatch More succinctly, we are trying to find points that maximizes the model error with respect to the independent variables Surrogate model Optimized using a black-box or derivative-free solver (SNOBFIT) [Huyer and Neumaier, 08] Carnegie Mellon University 23

COMPUTATIONAL TESTING Modeling methods compared MIP Proposed methodology EBS Exhaustive best subset method Note: due to high CPU times this was only tested on smaller problems LASSO The lasso regularization OLR Ordinary least-squares regression Sampling methods compared DFO Proposed error maximization technique SLH Single Latin hypercube (no feedback) Carnegie Mellon University 24

COMPUTATIONAL EXPERIMENTS Model accuracy Modeling efficiency 1.0 1.0 Fraction of problems solved 0.8 0.6 0.4 0.2 Fraction of problems solved 0.8 0.6 0.4 0.2 0.0 0 0.002 0.004 0.006 0.008 0.01 Normalized test error 0.0 0 10 20 30 40 Simulation runs Modeling methods Sampling methods Our method LASSO Least squares Error maximization Single Latin hypercube Carnegie Mellon University 25

COMPUTATIONAL EXPERIMENTS Model accuracy Modeling efficiency 1.0 1.0 Fraction of problems solved 0.8 0.6 0.4 0.2 80% of the runs yielded <0.1% error Fraction of problems solved 0.8 0.6 0.4 0.2 0.0 0 0.002 0.004 0.006 0.008 0.01 Normalized test error 0.0 0 10 20 30 40 Simulation runs Modeling methods Sampling methods Our method LASSO Least squares Error maximization Single Latin hypercube Carnegie Mellon University 26

MODEL COMPLEXITY Carnegie Mellon University 27

BUBBLING FLUIDIZED BED Bubbling fluidized bed adsorber diagram Outlet gas Solid feed Cooling water Model inputs (14 total) Geometry (3) Operating conditions (4) Gas mole fractions (2) Solid compositions (2) Flow rates (4) CO 2 rich gas Model created by Andrew Lee at the National Energy and Technology Laboratory CO 2 rich solid outlet Model outputs (13 total) Geometry required (2) Operating condition required (1) Gas mole fractions (2) Solid compositions (2) Flow rates (2) Outlet temperatures (3) Design constraint (1) Carnegie Mellon University 28

ADAPTIVE SAMPLING Progression of mean error through the algorithm 16% Estimated relative error 12% 8% 4% 0% 1 3 5 7 9 Iterations FGas_out Gas_In_P THX_out Tgas_out Tsorb_out dt gamma_out lp vtr xh2o_ads_out xhco3_ads_out zco2_gas_out zh2o_gas_out false yes true Initial data set: 137 pts Final data set: 261 Carnegie Mellon University 29

EXAMPLE MODELS Cooling water CO 2 rich gas Solid feed Carnegie Mellon University 30

CONCLUSIONS The algorithm we developed is able to model black-box functions for use in optimization such that the models are Accurate Tractable in an optimization framework (low-complexity models) Generated from a minimal number of function evaluations Surrogate models can then be incorporated within an optimization framework with complex objective functions and additional constraints ALAMO site: archimedes.cheme.cmu.edu/?q=alamo Carnegie Mellon University 31