Smoothing Spline ANOVA for variable screening

Similar documents
The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

Machine Learning 9. week

Support Vector Machines

Feature Reduction and Selection

Support Vector Machines

S1 Note. Basis functions.

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Review of approximation techniques

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

GSLM Operations Research II Fall 13/14

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Unsupervised Learning

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Solving two-person zero-sum game by Matlab

Polyhedral Compilation Foundations

5 The Primal-Dual Method

Lecture 4: Principal components

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION

Cost-efficient deployment of distributed software services

Hermite Splines in Lie Groups as Products of Geodesics

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

A New Approach For the Ranking of Fuzzy Sets With Different Heights

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION

Discriminative Dictionary Learning with Pairwise Constraints

Classification / Regression Support Vector Machines

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

CS 534: Computer Vision Model Fitting

Multi-objective Optimization Using Adaptive Explicit Non-Dominated Region Sampling

Cluster Analysis of Electrical Behavior

Greedy Technique - Definition

Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms

Meta-heuristics for Multidimensional Knapsack Problems

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

Data Mining For Multi-Criteria Energy Predictions

Sequential Projection Maximin Distance Sampling Method

NGPM -- A NSGA-II Program in Matlab

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

Classifier Selection Based on Data Complexity Measures *

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Wavefront Reconstructor

K-means and Hierarchical Clustering

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

A MULTI-OBJECTIVE GENETIC ALGORITHM FOR EXTEND

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe

A Binarization Algorithm specialized on Document Images and Photos

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

A Statistical Model Selection Strategy Applied to Neural Networks

A Semi-parametric Regression Model to Estimate Variability of NO 2

Edge Detection in Noisy Images Using the Support Vector Machines

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Multiobjective fuzzy optimization method

Today Using Fourier-Motzkin elimination for code generation Using Fourier-Motzkin elimination for determining schedule constraints

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

LECTURE NOTES Duality Theory, Sensitivity Analysis, and Parametric Programming

ANN MODELING AND MULTI OBJECTIVE OPTIMIZATION OF ELECTRICAL DISCHARGE MACHINING PROCESS

Biostatistics 615/815

Title: A Novel Protocol for Accuracy Assessment in Classification of Very High Resolution Images

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Recognizing Faces. Outline

X- Chart Using ANOM Approach

Optimizing Document Scoring for Query Retrieval

Optimal Design of Nonlinear Fuzzy Model by Means of Independent Fuzzy Scatter Partition

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Tuning of Fuzzy Inference Systems Through Unconstrained Optimization Techniques

Vectorization in the Polyhedral Model

Optimization Methods: Integer Programming Integer Linear Programming 1. Module 7 Lecture Notes 1. Integer Linear Programming

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

Support Vector Machines. CS534 - Machine Learning

Distributed Model Predictive Control Methods For Improving Transient Response Of Automated Irrigation Channels

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law)

The Research of Support Vector Machine in Agricultural Data Classification

LECTURE : MANIFOLD LEARNING

User Authentication Based On Behavioral Mouse Dynamics Biometrics

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

Graph-based Clustering

On the Efficiency of Swap-Based Clustering

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

An Efficient Pareto Set Identification Approach for Multi-objective Optimization on Black-box Functions

A COMPARISON OF TWO METHODS FOR FITTING HIGH DIMENSIONAL RESPONSE SURFACES

Report on On-line Graph Coloring

Efficient Load-Balanced IP Routing Scheme Based on Shortest Paths in Hose Model. Eiji Oki May 28, 2009 The University of Electro-Communications

Angle-Independent 3D Reconstruction. Ji Zhang Mireille Boutin Daniel Aliaga

Concurrent Apriori Data Mining Algorithms

Analysis of Continuous Beams in General

An Optimal Algorithm for Prufer Codes *

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Using Neural Networks and Support Vector Machines in Data Mining

Modeling, Manipulating, and Visualizing Continuous Volumetric Data: A Novel Spline-based Approach

11. APPROXIMATION ALGORITHMS

An Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc.

CONCURRENT OPTIMIZATION OF MULTI RESPONCE QUALITY CHARACTERISTICS BASED ON TAGUCHI METHOD. Ümit Terzi*, Kasım Baynal

Adaptive Virtual Support Vector Machine for the Reliability Analysis of High-Dimensional Problems

Active Contours/Snakes

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem

Transcription:

Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco

Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory Possble rankngs Test Cases Interestng Applcaton

SS-Anova Smoothng Splne ANOVA (SS-ANOVA) s a statstcal modelng algorthm based on a functon decomposton smlar to the classcal analyss of varance (ANOVA) decomposton and the assocated notons of man effects and nteractons. Each term man effects and nteractons exhbts the measure of ts contrbuton to the global varance (so ts relatve sgnfcance). SS-ANOVA s a sutable screenng technque for detectng mportant varables n a gven dataset

Unvarate case: Cubc Smoothng Splne In the smple unvarate case, the SS-ANOVA model f(x) s the soluton of ths mnmzaton problem: mn 1 n n 1 2 1 '' 2 f f ( x ) f ( x) dx 0 The left term guarantees a good ft to the data. The rght term represents a penalty on the roughness of the model. The soluton s called Cubc Smoothng Splne, and t corresponds to the usual natural cubc splne.

General multvarate case In the general multvarate case, the model s the soluton of ths penalzed least square problem: mn 1 n n 1 2 f f ( x ) J( f ) The ANOVA decomposton can be bult nto the above formulaton through the proper constructon of the roughness functonal J(f). The theory behnd ths formulaton s based on the so-called reproducng kernel Hlbert space.

Screenng wth SS-ANOVA The functon ANOVA decomposton of the traned model, evaluated at samplng ponts, can be wrtten as: f * The relatve sgnfcance of the dfferent terms can be assessed by means of the contrbuton ndces k p k 1 * ( f k f Gven the aforementoned formulas, we obtan a sort of decomposton of unty: p k 1 k,f * f * k 2 1 * )

Metamodelng A RSM use the nformaton gven by a tranng database to predct the response of the system at unsampled ponts.

Metamodelng PROs: Tranng and evaluatng a RSM s usually less costly than runnng a real smulaton, both consderng computatonal resources and tme. CONs: The curse of dmensonalty s the man restrcton. For example the number of monomals composng a full polynomal of degree deg usng nvar varables s equal to: (deg nvar )! deg! nvar!

SS-Anova and Metamodelng SS-Anova s able to scan a complex and possbly large database. Once the most relevant nput varables are detected, t s possble to restrct the tranng mprovng tme and accuracy performances.

An Example (mmckng a true database) The ntroducton of SS-Anova n our software s lnked to a customer request havng a very pecular database. We cannot show ther data, but we bult a toy model wth smlar propertes, whch helped us n the development phase. We consder a polynomal of degree 3 n 6 varables, but we set to zero all the coeffcents of the terms nvolvng the last 3 varables.

An Example (mmckng a true database) We buld randomly a tranng (140 ponts) and a valdaton (60 ponts) database and we compare the performances of dfferent RSM before and after the use of SS-Anova. Interpolaton methods performed better on ths toy problem, whle Krgng model (ncludng nose) was the best n the orgnal test. Mean absolute error comparson Full tranng Krgng 0.01 0.0001 RBF 3.0E-5 1.0E-7 SVD 2.0E-15 6.0E-16 Usng SS-ANOVA The orgnal test case results n a mprovement of about 3 orders of magntude

Mult Objectve Optmzaton Mathematcal formulaton: When m>1 and the functons are n contrast, we speak about multobjectve optmzaton. The goal of MOO s to fnd the Pareto front composed by nondomnated ponts.

Game Theory and MOO

Game Theory and MOO SS - Anova

Rankng Varables We desgned two possble rankng and selecton strateges. 1) Determnstc: Normalze SS-Anova coeffcents Look for the hghest coeffcent and record ts varable-objectve assocaton. Remove all the coeffcents related to the assgned varable and terate 2) Stochastc: Normalze SS-Anova coeffcents For each varable perform roulette wheel selecton usng the dfferent coeffcents as weghts n order to assgn the varable to an objectve. SS-Anova coeffcent for varable x In both cases we check for objectves wthout assgned varables. Obj A Obj B Obj C

Test Cases: full separaton of varables Ths problem has a clear and unque answer to the varable-objectve partton problem, but t s a dffcult optmzaton task, snce the objectve functons exhbt several local optma. f f 1 2 k 1 2k k 1 ( x ( x 20) 0.3(sn(2 ( x 30) 0.3(sn(2 ( x 20))) 30))) k=2 # eval = 300 mn(f1+f2) Old MOGT 1.3536 New MOGT 0.0071 MOGA2 6.2607 NSGA2 1.7086 k=5 # eval = 1600 mn(f1+f2) Old MOGT 5.8995 New MOGT 2.6502 MOGA2 2.7829 NSGA2 2.6939 k=10 # eval = 6000 mn(f1+f2) Old MOGT 14.365 New MOGT 3.0213 MOGA2 4.4598 NSGA2 2.2674

Test Cases: shared varables KUR100 problem: f f 1 2 99 1 100 1 10exp x 0.8 0.2 5sn x 3 x 2 x 2 1 As hghlghted n the orgnal paper, ths algorthm s able to reach almost-optmal confguratons n few teratons. The stochastc varableobjectve couplng s used for ths problem.

Applcaton: boomerang optmzaton

Applcaton: boomerang optmzaton The optmzaton follows a b-level scheme, the outer loop tres to optmze the shape of the boomerang, the nner one ensures a satsfactory trajectory. Wth the new MOGT algorthm we are tryng to go further and to fully optmze the launch: maxmze range, mnmze force, loop constrant.