Model Evaluation. ACT-R, IMPRINT, and Matlab Comparisons, Parameter Optimizations, and Opportunities. Bengt Fornberg

Similar documents
Lecture 9: Ultra-Fast Design of Ring Oscillator

ADAPTIVE TILE CODING METHODS FOR THE GENERALIZATION OF VALUE FUNCTIONS IN THE RL STATE SPACE A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL

A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2

Metaheuristic Optimization with Evolver, Genocop and OptQuest

Webinar. Machine Tool Optimization with ANSYS optislang

Post Silicon Electrical Validation

CITS2401 Computer Analysis & Visualisation

Numerical Quadrature over the Surface of a Sphere

Multidisciplinary Analysis and Optimization

Executable Requirements: Opportunities and Impediments

RESPONSE SURFACE METHODOLOGIES - METAMODELS

UvA-DARE (Digital Academic Repository) Memory-type control charts in statistical process control Abbas, N. Link to publication

Dealing with Data in Excel 2013/2016

Chapter 2: The Normal Distribution

Modeling with Uncertainty Interval Computations Using Fuzzy Sets

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

Evolutionary Computation Algorithms for Cryptanalysis: A Study

We deliver Global Engineering Solutions. Efficiently. This page contains no technical data Subject to the EAR or the ITAR

Analysis of Sorting as a Streaming Application

Weka ( )

Here is the data collected.

Accurate Thermo-Fluid Simulation in Real Time Environments. Silvia Poles, Alberto Deponti, EnginSoft S.p.A. Frank Rhodes, Mentor Graphics

Metaheuristic Development Methodology. Fall 2009 Instructor: Dr. Masoud Yaghini

HARNESSING CERTAINTY TO SPEED TASK-ALLOCATION ALGORITHMS FOR MULTI-ROBOT SYSTEMS

CPSC 340: Machine Learning and Data Mining. Regularization Fall 2016

Applied Regression Modeling: A Business Approach

Recent Developments in Model-based Derivative-free Optimization

Structural Graph Matching With Polynomial Bounds On Memory and on Worst-Case Effort

Simulation: Solving Dynamic Models ABE 5646 Week 12, Spring 2009

Machine Learning for Pre-emptive Identification of Performance Problems in UNIX Servers Helen Cunningham

CHAPTER 6 ORTHOGONAL PARTICLE SWARM OPTIMIZATION

Investigation of Sampling and Interpolation Techniques for DEMs Derived from Different Data Sources

Chapter 4. The Classification of Species and Colors of Finished Wooden Parts Using RBFNs

Numerical approach estimate

Introduction to Genetic Algorithms

User-Centered Analysis & Design

^ Springer. Computational Intelligence. A Methodological Introduction. Rudolf Kruse Christian Borgelt. Matthias Steinbrecher Pascal Held

A Genetic Algorithm-Based Approach for Building Accurate Decision Trees

Optimizing Pharmaceutical Production Processes Using Quality by Design Methods

STATISTICAL CALIBRATION: A BETTER APPROACH TO INTEGRATING SIMULATION AND TESTING IN GROUND VEHICLE SYSTEMS.

A grid representation for Distributed Virtual Environments

Chapter 8: Elevation Data

STATISTICS (STAT) Statistics (STAT) 1

Error Analysis, Statistics and Graphing

EE613 Machine Learning for Engineers LINEAR REGRESSION. Sylvain Calinon Robot Learning & Interaction Group Idiap Research Institute Nov.

Mathematical Optimization of Clamping Processes in Car-Body Production

Introduction to Scientific Computing with Matlab

Origins of Operations Research: World War II

Design of Experiments

DESIGNER S NOTEBOOK Proximity Detection and Link Budget By Tom Dunn July 2011

CPSC 340: Machine Learning and Data Mining. More Regularization Fall 2017

Advanced data visualization (charts, graphs, dashboards, fever charts, heat maps, etc.)

Train schedule diagram drawing algorithm considering interrelationship between labels

Scale Free Network Growth By Ranking. Santo Fortunato, Alessandro Flammini, and Filippo Menczer

Measuring the Stack Height of Nested Styrofoam Cups

On-line handwriting recognition using Chain Code representation

Performance Characterization in Computer Vision

Computationally Efficient Serial Combination of Rotation-invariant and Rotation Compensating Iris Recognition Algorithms

Results of computer search for a perfect cuboid

Webinar Parameter Identification with optislang. Dynardo GmbH

Dijkstra's Algorithm

Parallel Simulated Annealing for VLSI Cell Placement Problem

What can we represent as a Surface?

Active contour: a parallel genetic algorithm approach

3D object comparison with geometric guides for Interactive Evolutionary CAD

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

5. Computational Geometry, Benchmarks and Algorithms for Rectangular and Irregular Packing. 6. Meta-heuristic Algorithms and Rectangular Packing

1. Assumptions. 1. Introduction. 2. Terminology

Where s the Boss? : Monte Carlo Localization for an Autonomous Ground Vehicle using an Aerial Lidar Map

Flight Systems are Cyber-Physical Systems

Design World. Graphical Integrity. largely from Edward Tufte, The Visual Display of Quantitative Information, Graphics Press, 1983.

Application of Genetic Algorithms to CFD. Cameron McCartney

EXECUTION PLAN OPTIMIZATION TECHNIQUES

Genetic Analysis. Page 1

SIMULATED ANNEALING TECHNIQUES AND OVERVIEW. Daniel Kitchener Young Scholars Program Florida State University Tallahassee, Florida, USA

A Novel Approach to Planar Mechanism Synthesis Using HEEDS

Chapter 1: Introducing Roadway Modeling

Thomas R Kronhamn Ericsson Microwave Systems AB Mölndal, Sweden

Variability in Annual Temperature Profiles

Experimental Comparison of Different Techniques to Generate Adaptive Sequences

A Binary Model on the Basis of Cuckoo Search Algorithm in Order to Solve the Problem of Knapsack 1-0

Chapter 2: Linear Equations and Functions

An introduction to plotting data

KS4 Course Outline: Mathematics

Ancient System of Operations: Computing with Mathematical Devices. By: Calvin L. Fitts Jr. History of Mathematics. Ms.

Getting Started with MATLAB

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016

Determining optimal value of the shape parameter c in RBF for unequal distances topographical points by Cross-Validation algorithm

Intro to Stata for Political Scientists

A/D Converter. Sampling. Figure 1.1: Block Diagram of a DSP System

International Journal of Current Research and Modern Education (IJCRME) ISSN (Online): & Impact Factor: Special Issue, NCFTCCPS -

CHAPTER 2 CONVENTIONAL AND NON-CONVENTIONAL TECHNIQUES TO SOLVE ORPD PROBLEM

FFT-Based Astronomical Image Registration and Stacking using GPU

Statistical graphics in analysis Multivariable data in PCP & scatter plot matrix. Paula Ahonen-Rainio Maa Visual Analysis in GIS

Using Mplus Monte Carlo Simulations In Practice: A Note On Non-Normal Missing Data In Latent Variable Models

DESIGNER S NOTEBOOK Proximity Calibration and Test by Kerry Glover August 2011

x y

EE613 Machine Learning for Engineers LINEAR REGRESSION. Sylvain Calinon Robot Learning & Interaction Group Idiap Research Institute Nov.

W7 DATA ANALYSIS 2. Your graph should look something like that in Figure W7-2. It shows the expected bell shape of the Gaussian distribution.

Using the DATAMINE Program

Transcription:

Model Evaluation Slide 1 of 12 ACT-R, IMPRINT, and Matlab Comparisons, Parameter Optimizations, and Opportunities - 'Unified test problems' Keystroke entry task, and RADAR - RADAR Some modeling results - Parameter optimization Genetic algorithms / Simulated annealing - Radial Basis Functions (RBFs) Opportunity to remove stochastic 'noise' - Summarizing observations Bengt Fornberg University of Colorado, Boulder Department of Applied Mathematics Presentation at annual MURI meeting, held at Carnegie Mellon University, Pittsburgh, August 14, 2009.

Modeling Systems: ACT-R IMPRINT Matlab Designed for: Slide 2 of 12 cognitive modeling human and system performances in military tasks science and engineering applications Key issues highlighted in recent MURI presentations: Accuracy and Speed comparisons on Keystroke Entry task: - Accuracies equivalent: Expected: IMPRINT vs. Matlab (near line-by-line conversion, identical algorithm) Less expected: ACT-R vs. IMPRINT (Matlab) - Speed: 10,000 times advantage with Matlab (on comparable hardware) Parameter optimizations: Local vs. Global optimization Genetic Algorithms and Simulated Annealing New issues: - RADAR task now implemented in Matlab (no-tone counting only): Speed, Accuracy, Parameter Optimizations - Ability to extract LARGE amounts of information from relatively little experimental data - Opportunity with Radial Basis Functions

Unified Test Problems Slide 3 of 12 Tasks implemented in all three systems (ACT-R, IMPRINT, and Matlab) in order to allow most direct guides to key characteristics 1. Keystroke Data Entry task: TASK: Healy, Kole, Buck-Gengler, and Bourne (2004) ACT-R, IMPRINT, & Matlab: Fornberg, Raymond, and Best (2007) Type on a number key pad a series of 4-digit numbers (e.g. 1395) as quickly and accurately as possible when they are presented; terminate with pressing the "Enter" key 2. RADAR: TASK, ACT-R: Best, Gonzales, Young, Healy, and Bourne (2007) IMPRINT: Buck-Gengler, Work in progress RADAR task combines: - Mapping type: Targets and distractors from same or different character sets - Load level: Number of items in target set, Number of items to look at to see if target - Tone counting: Concurrent secondary task using auditory modality

Codes for modeling the RADAR task: Slide 4 of 12 - The ACT-R and the IMPRINT codes were developed separately. - The Matlab code (so far, 'no-tone' case only) is mathematically equivalent to the IMPRINT code. Implemented by B. Fornberg and B. Raymond, with help from C. Buck-Gengler. Two main differences compared to Keystroke Data Entry task: - Stochastic features enter in the RADAR task in such a way that Matlab's array processing features are no longer practical to apply, - General programming style of Matlab (shared with Fortran, C/C++) allows for particularly simple and effective code structure; nested loops instead of many interacting modules. Roughly offsetting advantages/disadvantages - Matlab speed again about 10,000 times faster than ACT-R / IMPRINT on equivalent hardware. The Matlab code is extraordinarily compact and readable - about 100 lines only (plus about 30 lines for entering parameters)

Example of output of Matlab RADAR model (Run on Dell D430 notebook computer, 1.33 GHz processor) Means per block for 3 different performance measures (total of 48 data points) used to optimize model performance. Slide 5 of 12 In the sample display to the right: Model_RT Response times (for hits) Experiment_RT RMSE_RT 0.0497 Model_hits Proportion of hits Experiment_hits RMSE_hits 0.0192 Model_FA Proportion of false alarms Experiment_FA RMSE_FA 0.0532 RMSE 0.0407 Elapsed time 0.1642 seconds Rows: Two Sessions ; Columns: Eight Blocks These show average over: 12 Subjects, each performing 20 Shifts, containing 7 Frames Very high level of stochastic fluctuations: RMSE = 0.0407. If averaged over 1000 runs: RMSE = 0.0369.

Parameter Optimization Slide 6 of 12 The model has about 30 parameters We want to optimize the model with respect to 16 parameters in four categories: 1. 4 parameters controlling the decision times during visual search; 2. 4 parameters controlling the probability that a statistical subject will respond that a target has been detected on a trial containing a target; 3. 5 parameters controlling the false alarm rate, as a function of block type; 4. 3 parameters controlling the rate at which statistical subjects learn to avoid false alarms. Fundamental question: Do the 48 data points (means of experimental data) provide enough information to separate the influences of the 16 parameters from each other?

Global vs. local Parameter Optimization 2-D Example: Find the GLOBAL maximum value of elevation in the Bryce Canyon amphitheater Slide 7 of 12 - Search for LOCAL maximum is trivial: Start anywhere and steps upwards until one reaches a top. - Is there anything more effective than exhaustive search to locate a GLOBAL maximum? - Is this illustration above at all relevant to the task at hand?

Second question first: We will encounter highly irregular functions We want to minimize the RMSE by searching a space defined by the 16 free parameters. Below is a plot of RMSE when we vary just 2 of the 16 parameters over their 'reasonable range' Slide 8 of 12 0.065 0.06 0.055 RMSE 0.05 0.045 0.04 0.035 0.03 5 4.5 4 3.5 3 2.5 2 1.5 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2 FABlockTypeRate 4 FABlockTypeRate 3 Analogy not perfect: - What we see is stochastic simulation noise, which varies from run to run. - We will be working in a lot more than 2-D. Can we reduce or - Can we find deterministic equations in terms of means, deviations, etc.? eliminate the noise? A long story... Answer: probably NOT effectively. - Fit model output with Radial Basis Functions (RBFs), then optimize? Answer: Excellent opportunity (novel in the field; will be pursued).

Effective approaches for global optimization Slide 9 of 12 Two major strategies for a much more efficient 'exploration' of the highdimensional space than exhaustive search: - Simulated annealing (SA) - Genetic algorithms (GA) The main challenges are: - To explore vast spaces effectively (here 16-D, rather than 2-D in the illustrations) - To find global minima without getting trapped in any local minima. Both strategies (SA and GA): - Borrow their concept from processes in nature (formation of a crystal, and evolution of species, respectively), - Are effective for functions both without and with 'stochastic noise' present in the data, - Are available in a Matlab 'tool box'; can be invoked by just 3-4 lines of extra programming. Experience from Unified Test Problem 1 - key stroke entry task: Both approaches (GA and SA) worked well, but possibly with a slight edge in favor of GA. Hence, we only use GA in the present RADAR task.

Example of RADAR parameter optimization results with GA Slide 10 of 12 The Matlab version of the RADAR model (like the IMPRINT model) has about thirty parameters describing cognitive factors. - Select a subset of key parameters: We have chosen a total of 16 parameters in 4 categories. - Optimize in each group separately, in order to improve agreement between model and experiment. Each optimization is made up of 20 GA runs with population size 40, run through 10-30 generations

Radial Basis Functions Topic discussed at length at the Training MURI kickoff meeting, November 4, 2005 (in Washington, DC; well before we had any actual models or data to work with). Notes below from that presentation: Brief history of RBF 1970 RBF proposed for drawing contour lines on maps from scattered elevation measurements (R.L. Hardy) 1985 Methodology proven to be robust for any number of points in any number of dimensions. Intense work on RBF starts at several centers. 1993 First industrial use of RBF for modeling /data mining (Exxon and Shell) 2000+ Literature exceeds 2000 research papers, 4 books published, Fast computational algorithms emerging, Vast number of applications What about modeling data in, say, 6-D? - RBF methodology capable of doing automatically what splines do in 1-D. - Permits multidimensional visualization - Can be used for interpolation or regression (to filter out noise) Slide 11 of 12 - Comparatively robust also for extrapolation Present idea is to re-represent the model output by RBFs before optimization with GA.

Slide 12 of 12 Some thoughts about opportunities, and what realistically can be achieved Stochastic vs. fully deterministic modeling - If stochastic quantity does not influence subsequent actions (such as eye movement time, response time), we can use distributions functions effectively - no need for 'Monte-Carlo' approach. - If it influences a chain of further actions that may or may not be taken, one gets in very few steps far too many possibilities for effective non-stochastic modeling. This most certainly becomes best choice. - Stochastic modeling can be followed up by RBF-based 'de-noising' for improved optimization. Parameter optimization: Ability to extract VAST amounts of information from relatively little data In present example, 48 data values - sufficed for extensive optimization (here of 16 parameters - this could easily have been extended). All results were found to be in good agreement with hand derived values. Opportunity: To introduce and then optimize parameters that are NOT directly measurable. This might allow assessment of cognitive variables that are more abstract or general (e.g. situation awareness, attentional allocation, or motivation). Computational scaling to much larger / more complex scenarios We are far from any present hardware (or software) limitations: - Matlab some 10,000 times faster than ACT-R / IMPRINT on both 'Unified Test Problems'. - Transfer of code from PC to large-scale supercomputer might offer another factor of 10,000. Visualization Very advanced multivariate visualization has been developed for many applications. Attempts should be made to apply these capabilities to psychological data: experimental as well as simulated.