Solving singularity issues in the estimation of econometric models

Similar documents
Biogeme & Binary Logit Model Estimation

Specification, estimation and validation of a pedestrian walking behavior model

Three challenges in route choice modeling

Optimization and Simulation

Modern Methods of Data Analysis - WS 07/08

Modelling competition in demand-based optimization models

Dynamic facial expression recognition using a behavioural model

Estimation of discrete choice models: BIOGEME. Michel Bierlaire Mamy Fetiarison

PandasBiogeme: a short introduction

RANDOM SAMPLING OF ALTERNATIVES IN A ROUTE CHOICE CONTEXT

Using Machine Learning to Optimize Storage Systems

Modelling Human Perception of Facial Expressions by Discrete Choice Models

Ali Abur Northeastern University Department of Electrical and Computer Engineering Boston, MA 02115

Unsupervised Learning: Clustering

Clustering Lecture 5: Mixture Model

I How does the formulation (5) serve the purpose of the composite parameterization

THE RECURSIVE LOGIT MODEL TUTORIAL

Javad Lavaei. Department of Electrical Engineering Columbia University

Computer Lab I Datasets and BIOGEME

UNBIASED ESTIMATION OF DESTINATION CHOICE MODELS WITH ATTRACTION CONSTRAINTS

LSRN: A Parallel Iterative Solver for Strongly Over- or Under-Determined Systems

The fastclime Package for Linear Programming and Large-Scale Precision Matrix Estimation in R

Locally Weighted Least Squares Regression for Image Denoising, Reconstruction and Up-sampling

ME 555: Distributed Optimization

Discrete Choice Modeling William Greene Stern School of Business, New York University. Lab Session 3

Algebraic Iterative Methods for Computed Tomography

Numerical schemes for Hamilton-Jacobi equations, control problems and games

Energy Minimization for Segmentation in Computer Vision

Logit Mixture Models

Segmentation: Clustering, Graph Cut and EM

The Mesh Adaptive Direct Search Algorithm for Discrete Blackbox Optimization

Robot Mapping. TORO Gradient Descent for SLAM. Cyrill Stachniss

Index. ,a-possibility efficient solution, programming, 275

CS281 Section 3: Practical Optimization

Prof. Fanny Ficuciello Robotics for Bioengineering Visual Servoing

DS Machine Learning and Data Mining I. Alina Oprea Associate Professor, CCIS Northeastern University

Lecture 6 - Multivariate numerical optimization

Branch-price-and-cut for vehicle routing. Guy Desaulniers

A novel method for identification of critical points in flow sheet synthesis under uncertainty

A derivative-free trust-region algorithm for reliability-based optimization

CS 450 Numerical Analysis. Chapter 7: Interpolation

Theoretical Concepts of Machine Learning

CHAPTER 2 MULTI-OBJECTIVE REACTIVE POWER OPTIMIZATION

COMPARISON OF ALGORITHMS FOR NONLINEAR REGRESSION ESTIMATES

Complexity Challenges to the Discovery of Relationships in Eddy Current Non-destructive Test Data

Optimization of Noisy Fitness Functions by means of Genetic Algorithms using History of Search with Test of Estimation

Note Set 4: Finite Mixture Models and the EM Algorithm

Study and implementation of computational methods for Differential Equations in heterogeneous systems. Asimina Vouronikoy - Eleni Zisiou

Computer Lab I Datasets and BIOGEME

Model Building in Operations Research MA4260 Instructor: Sun Defeng Office: S Phone:

Expectation Propagation

Data mining with sparse grids

Constraints in Particle Swarm Optimization of Hidden Markov Models

Ensemble methods in machine learning. Example. Neural networks. Neural networks

Integration of automatic differentiation tools within object-oriented codes: accuracy and timings

REDUCTION OF CODING ARTIFACTS IN LOW-BIT-RATE VIDEO CODING. Robert L. Stevenson. usually degrade edge information in the original image.

A primal-dual framework for mixtures of regularizers

Unsupervised: no target value to predict

Inverse KKT Motion Optimization: A Newton Method to Efficiently Extract Task Spaces and Cost Parameters from Demonstrations

Comparison of Interior Point Filter Line Search Strategies for Constrained Optimization by Performance Profiles

MTTTS17 Dimensionality Reduction and Visualization. Spring 2018 Jaakko Peltonen. Lecture 11: Neighbor Embedding Methods continued

Tracking Minimum Distances between Curved Objects with Parametric Surfaces in Real Time

Bilevel Sparse Coding

ECS289: Scalable Machine Learning

Large Scale Manifold Transduction

Poisson Regression and Model Checking

Blackbox Optimization

Multi-view stereo. Many slides adapted from S. Seitz

of Iterative Closest Point ICP algorithm to

Towards the completion of assignment 1

Super-resolution on Text Image Sequences

Clustering Relational Data using the Infinite Relational Model

Scan Matching. Pieter Abbeel UC Berkeley EECS. Many slides adapted from Thrun, Burgard and Fox, Probabilistic Robotics

Optimization Plugin for RapidMiner. Venkatesh Umaashankar Sangkyun Lee. Technical Report 04/2012. technische universität dortmund

Experimental Data and Training

Splines and penalized regression

INVERSE PROBLEMS IN GROUNDWATER MODELING

Lecture 3 January 22

Adaptive Multiple-Frame Image Super- Resolution Based on U-Curve

Predicting Web Service Levels During VM Live Migrations

Data mining with sparse grids using simplicial basis functions

ADVANCED IMAGE PROCESSING METHODS FOR ULTRASONIC NDE RESEARCH C. H. Chen, University of Massachusetts Dartmouth, N.

PATTERN CLASSIFICATION AND SCENE ANALYSIS

MR IMAGE SEGMENTATION

Visual Servoing for Floppy Robots Using LWPR

Epipolar geometry. x x

An algorithm for censored quantile regressions. Abstract

DATA MINING II - 1DL460

Optimization. Industrial AI Lab.

Inclusion of Aleatory and Epistemic Uncertainty in Design Optimization

A Brief Look at Optimization

Lecture 25 Nonlinear Programming. November 9, 2009

Projection Technique for Vortex-Free Image Registration

Optimization Methods. Final Examination. 1. There are 5 problems each w i t h 20 p o i n ts for a maximum of 100 points.

An Introduction to Model-Fitting with the R package glmm

HYPERDRIVE IMPLEMENTATION AND ANALYSIS OF A PARALLEL, CONJUGATE GRADIENT LINEAR SOLVER PROF. BRYANT PROF. KAYVON 15618: PARALLEL COMPUTER ARCHITECTURE

Random projection for non-gaussian mixture models

Semi-Supervised Construction of General Visualization Hierarchies

Introduction to Stochastic Combinatorial Optimization

DESIGN OF EXPERIMENTS and ROBUST DESIGN

Transcription:

Solving singularity issues in the estimation of econometric models Michel Bierlaire Michaël Thémans Transport and Mobility Laboratory École Polytechnique Fédérale de Lausanne Solving singularity issues in the estimation of econometric models p.1/23

Outline Estimation of advanced discrete choice models Singularity issues Sources and drawbacks of singularity Structural singularity Singularity at the solution Adaptation of optimization algorithms Numerical results Conclusions and perspectives Solving singularity issues in the estimation of econometric models p.2/23

Recent advances in DCM More Logit-like models in the GEV family Mixtures of Logit models Mixtures of GEV models Discrete mixtures of GEV models Estimating those models (via maximum log-likelihood) becomes more and more problematic Solving singularity issues in the estimation of econometric models p.3/23

Difficulties in the estimation Objective function becomes highly nonlinear and non concave Computational cost of evaluating the objective function and its derivatives can be significantly high Constraints imposed on parameters to overcome overspecification or to obtain meaningful values of parameters Identification issues singularity in the log-likelihood function Estimation requires specific optimization algorithms Solving singularity issues in the estimation of econometric models p.4/23

Sources of singularity Structural Identication issue for the parameters of the error terms e.g. std. errors in heteroscedastic EC models Contextual Identification issue for the parameters in the deterministic part irrelevant attributes data limitations Solving singularity issues in the estimation of econometric models p.5/23

Types of singularity We want to solve the maximum log-likelihood estimation problem Structural singularity max β R n L(β) 2 L(β) is singular β R n Singularity at solution 2 L(β) is singular at β R n where β is the vector of estimated values of parameters Solving singularity issues in the estimation of econometric models p.6/23

Impacts of singularity The convergence of the estimation process can be much slower The estimation time can be huge! The variance-covariance matrix cannot be obtained Not possible to assess the quality of the calibrated model Numerical difficulties can arise in the estimation Develop robust optimization algorithms able to solve efficiently singular problems Solving singularity issues in the estimation of econometric models p.7/23

Dealing with singularity issues Some of the algorithms implemented in the optimization package BIOGEME are already robust to face structural singularity issues Focus our work on singularities issues at solution Even if only 2 L(β ) is singular, the convergence of the overall sequences of iterates {β k } k N is significantly deteriorated The associated optimization problem is ill-conditioned Adapt existing optimization algorithms Solving singularity issues in the estimation of econometric models p.8/23

Adaptation of optimization algorithms Two main issues have to be addressed Detection of a singularity during the course of the optimization algorithm Strategies to fix the singularity by adding constraints No details about optimization algorithms themselves Description of the identification process Fixing the singularity by adding constraints to the problem Solving singularity issues in the estimation of econometric models p.9/23

Problem formulation Defining x = β and f(x) = L(β) 8 < : max L(β) β R n 8 < : min f(x) x R n L is the log-likelihood function β is the vector of parameters to be estimated f : R n R twice continuously differentiable Solving singularity issues in the estimation of econometric models p.10/23

Problem formulation Defining x = β and f(x) = L(β) 8 < : max L(β) β R n 8 < : min f(x) x R n L is the log-likelihood function β is the vector of parameters to be estimated f : R n R twice continuously differentiable We assume that the problem is singular at x, that is 2 f(x ) is singular x is a local minimum of the problem Solving singularity issues in the estimation of econometric models p.10/23

Ideas of the algorithm Identification of the singularity Issue 1: must analyze 2 f(x ), without knowing x Issue 2: eigenstructure analysis is time consuming Fixing the singularity Issue 1: how to correct the singularity? Issue 2: how to adapt existing algorithms? Solving singularity issues in the estimation of econometric models p.11/23

Ideas of the algorithm Identification of the singularity Solution 1: analyze 2 f(x k ) instead of 2 f(x ) Solution 2: generalized inverse iteration Fixing the singularity Solution 1: add curvature Solution 2: trust-region framework is well designed for easy adaptations Solving singularity issues in the estimation of econometric models p.12/23

Numerical tests Around 75 test problems All containing a singularity at solution Dimension between 2 and 40 4 optimization algorithms Basic trust-region algorithm (in BIOGEME) Trust-region algorithm designed to handle singularity Filter-trust-region algorithm Filter-trust-region algorithm designed to handle singularity 2 measures of performance Number of iterations CPU time Solving singularity issues in the estimation of econometric models p.13/23

Performance profile 1 All algorithms Number of iterations 1 0.8 Probability ( r <= Pi ) 0.6 0.4 0.2 basic trust-region new trust-region filter new filter 0 1 1.5 2 2.5 3 Pi Solving singularity issues in the estimation of econometric models p.14/23

Performance profile 2 Two filter-trust-region variants Number of iterations 1 0.8 Probability ( r <= Pi ) 0.6 0.4 0.2 filter new filter 0 1 1.5 2 2.5 3 Pi Solving singularity issues in the estimation of econometric models p.15/23

Performance profile 3 Two filter-trust-region variants CPU time 1 0.8 Probability ( r <= Pi ) 0.6 0.4 0.2 filter new filter 0 1 1.5 2 2.5 3 Pi Solving singularity issues in the estimation of econometric models p.16/23

Performance profile 4 Two trust-region variants Number of iterations 1 0.8 Probability ( r <= Pi ) 0.6 0.4 0.2 basic trust-region new trust-region 0 1 1.5 2 2.5 3 Pi Solving singularity issues in the estimation of econometric models p.17/23

Performance profile 5 Two trust-region variants CPU time 1 0.8 Probability ( r <= Pi ) 0.6 0.4 0.2 basic trust-region new trust-region 0 1 1.5 2 2.5 3 Pi Solving singularity issues in the estimation of econometric models p.18/23

Test against preconditioning 1 New variants vs preconditioned version of algorithms Number of iterations 1 0.8 Probability ( r <= Pi ) 0.6 0.4 0.2 new trust-region preconditioned trust-region new filter preconditioned filter 0 1 1.5 2 2.5 3 Pi Solving singularity issues in the estimation of econometric models p.19/23

Test against preconditioning 2 New variants vs preconditioned version of algorithms CPU time 1 0.8 Probability ( r <= Pi ) 0.6 0.4 0.2 new trust-region preconditioned trust-region new filter preconditioned filter 0 1 1.5 2 2.5 3 Pi Solving singularity issues in the estimation of econometric models p.20/23

Conclusions Issues of singularity often arise in the estimation of DCM Adaptation of optimization algorithms to deal with a type of singularity Numerical results are very good Significant improvement in term of number of iterations Computational overhead highly compensed by the better efficiency Significant gain expected in the estimation time of advanced DCM (Simulated Maximum Likelihood) Solving singularity issues in the estimation of econometric models p.21/23

Perspectives Singularity issues Perform tests on real DCM involving singularities at the solution Generalization of both theoretical and algorithmic ideas to singular constrained nonlinear optimization Non-concavity issues Nonlinear global optimization Adapt optimization algorithms in order to be able to identify the global maximum of the optimization problem A global maximum makes much more sense Solving singularity issues in the estimation of econometric models p.22/23

Thank you for your attention! Solving singularity issues in the estimation of econometric models p.23/23