Correction of Model Reduction Errors in Simulations

Size: px
Start display at page:

Download "Correction of Model Reduction Errors in Simulations"

Transcription

1 Correction of Model Reduction Errors in Simulations MUQ 15, June 2015 Antti Lipponen UEF // University of Eastern Finland

2 Janne Huttunen Ville Kolehmainen University of Eastern Finland University of Eastern Finland Sami Romakkaniemi Finnish Meteorological Institute Harri Kokkola Finnish Meteorological Institute

3 Contents 1. Background 2. Correction of model reduction errors in simulations 3. Examples & results 4. Conclusions

4 Background ACRONYM project at the UEF Aerosols and climate: reduction of uncertainty of the models Aerosol models Formation of cloud droplets Evolution of aerosol size distribution Used in global climate simulations

5 Background Black-box models

6 Background Black-box models Very strict time constraints

7 Background Black-box models Very strict time constraints High-dimensional models

8 Background Black-box models Very strict time constraints High-dimensional models Discontinuities / If-then-else-models

9 Background Black-box models Very strict time constraints High-dimensional models Discontinuities / If-then-else-models No knowledge on true/accurate model

10 Problem description Given Black-box model 1 (fast, approximative) To do

11 Problem description Given Black-box model 1 (fast, approximative) Black-box model 2 (slow, accurate) To do

12 Problem description Given Black-box model 1 (fast, approximative) Black-box model 2 (slow, accurate) A couple of experts To do

13 Problem description Given To do Black-box model 1 Combine models 1 and 2 (fast, approximative) to construct model 3 Black-box model 2 (fast, accurate) (slow, accurate) A couple of experts

14 Cloud droplet formation Inputs: ambient conditions, aerosol size distribution and chemical compositions etc. Output: Cloud droplet number concentration (number of cloud droplets / cm 3 )

15 Cloud droplet formation Inputs: ambient conditions, aerosol size distribution and chemical compositions etc. Output: Cloud droplet number concentration (number of cloud droplets / cm 3 ) Reduced model: Abdul-Razzak-Ghan model Accurate model: Air parcel model

16 Cloud droplet formation Computation times Reduced model: Abdul-Razzak-Ghan model Accurate model: Air parcel model

17 Cloud droplet formation Computation times Reduced model: Abdul-Razzak-Ghan model (0.3 ms) Accurate model: Air parcel model ( s)

18 Cloud droplet formation Computation times Reduced model: Abdul-Razzak-Ghan model (0.3 ms) Accurate model: Air parcel model ( s) Global simulation grid = minutes / timestep

19 Cloud droplet formation Computation times Reduced model: Abdul-Razzak-Ghan model (0.3 ms) Accurate model: Air parcel model ( s) Global simulation grid = minutes / timestep 1 year timesteps about 25 billion evaluations / year of simulation

20 Cloud droplet formation CDNC REDUCED [cm 3 ] CDNC AIR PARCEL [cm 3 ]

21 Approximation errors (AE) in inverse problems Bayesian inverse problems (Kaipio & Somersalo) Additive error term Probability density model for the AEs based on Monte Carlo sampling An approximative marginalization of the unknown and uninteresting AE

22 Approximation errors (AE) in inverse problems UEF // University of Eastern Finland MUQ 15, June 2015

23 Approximation error model Accurate model y = ˆf(x) Approximative model y f(x)

24 Approximation error model Accurate model y = ˆf(x)+f(x) f(x) Approximative model y f(x)

25 Approximation error model Accurate model ] y = f(x) + [ˆf(x) f(x) Approximative model y f(x)

26 Approximation error model Accurate model y = f(x) + ɛ Approximative model Approximation error y f(x) ɛ = ˆf(x) f(x)

27 Approximation error model Accurate model y = f(x) + ɛ Approximative model y f(x) In our approach, we construct a computationally low-cost predictor model for ɛ and use that model in the simulations to predict the realization of ɛ. Approximation error ɛ = ˆf(x) f(x)

28 Example: Cloud droplet formation 1. Construct prior model for x 2. Construct training set (x, ɛ) 3. Train/construct the predictor model for ɛ 4. Construct the final simulation model

29 Example: Cloud droplet formation 1. Construct prior model for x

30 Example: Cloud droplet formation 2. Construct training set (x, ɛ) ɛ = ˆf(x) f(x)

31 Example: Cloud droplet formation 2. Construct training set (x, ɛ) ɛ = ˆf(x) f(x) Here we use inputs x = (x, f(x))

32 Example: Cloud droplet formation 3. Train/construct the predictor model for ɛ Predictor model needs to be

33 Example: Cloud droplet formation 3. Train/construct the predictor model for ɛ Predictor model needs to be fast

34 Example: Cloud droplet formation 3. Train/construct the predictor model for ɛ Predictor model needs to be fast able to handle non-linearities / discontinuities

35 Example: Cloud droplet formation 3. Train/construct the predictor model for ɛ Predictor model needs to be fast able to handle non-linearities / discontinuities Random Forest regression model

36 Example: Cloud droplet formation Random Forest regression model - regression tree

37 Example: Cloud droplet formation Random Forest regression model - ensemble of regression trees...

38 Example: Cloud droplet formation 3. Train/construct the predictor model for ɛ

39 Example: Cloud droplet formation 4. Construct the final simulation model

40 Example: Cloud droplet formation Results: No correction CDNC REDUCED [cm 3 ] CDNC AIR PARCEL [cm 3 ]

41 Example: Cloud droplet formation Results: RF for ˆf(x) RF ONLY [cm 3 ] CDNC AIR PARCEL [cm 3 ]

42 Example: Cloud droplet formation Results: RF for AE 10 4 CDNC REDUCED WITH RF [cm 3 ] CDNC AIR PARCEL [cm 3 ]

43 Example: Cloud droplet formation Results Model Root mean squared Median relative error (cm 3 ) error (%) ARG RF ARG+RF

44 Example: Cloud droplet formation Relative errors

45 Example: Cloud droplet formation Computation times Air parcel model s Abdul-Razzak-Ghan model 0.3 ms RF models about ms

46 Results: Navier-Stokes equations v v µ ρ v + 1 ρ p f = 0 v = 0

47 Results: Navier-Stokes equations v v µ ρ v + 1 ρ p f = 0 v = 0

48 Results: Navier-Stokes equations v v µ ρ v + 1 ρ p f = 0 v = 0 v = ˆf(ρ, µ, v x )

49 Results: Navier-Stokes equations 1.0 Reduced mesh (2.6s) 1.0 Accurate mesh (15.0s) y (m) y (m) x (m) x (m)

50 Results: Navier-Stokes equations

51 Results: Navier-Stokes equations Prior models ρ U(2.5, 5.0) µ U(0.001, 1.0) v x U(1.0, 2.0) 5000 training samples RF predictor model for the errors

52 Results: Navier-Stokes equations

53 Results: Navier-Stokes equations

54 Results: Navier-Stokes equations

55 Results: Lotka-Volterra model Lotka-Volterra model is used to describe the dynamics of prey-predator systems da t dt db t dt = a t (α βb t ) = b t (γ δa t )

56 Results: Lotka-Volterra model Numerical approximation with explicit Euler scheme Models Accurate: t = Reduced: t = 0.25

57 Results: Lotka-Volterra model Prior models α U(0.5, 1.5) β U(0.5, 1.5) γ U(0.5, 1.5) δ U(0.5, 1.5) training samples RF predictor model for the errors

58 Results: Lotka-Volterra model b b a b a a a b

59 Results: Lotka-Volterra model

60 Results: Lotka-Volterra model

61 Results: Lotka-Volterra model

62 Results: Lotka-Volterra model

63 Results: Lotka-Volterra model

64 Results: Lotka-Volterra model

65 Results: Lotka-Volterra model

66 Results: Lotka-Volterra model

67 Results: Lotka-Volterra model

68 Results: Lotka-Volterra model

69 Results: Lotka-Volterra model

70 Results: Lotka-Volterra model

71 Results: Lotka-Volterra model

72 Results: Lotka-Volterra model

73 Results: Lotka-Volterra model

74 Conclusions An approach to reduce the inaccuracies and uncertainties in a simulation model was developed In all tests, the approach significantly improved the accuracy of the models There was only a minor increase in the computational costs Use of the approach with commercial/black-box codes is mostly straightforward

75 Thank you! References Breiman, L., Random forests, Machine Learning 45(1), 5 32, Kaipio, J. and Somersalo, E., Statistical and Computational Inverse Problems, Springer, New York, Lipponen, A., Kolehmainen, V., Romakkaniemi, S. and Kokkola, H., Correction of approximation errors with Random Forests applied to modelling of cloud droplet formation, Geoscientific Model Development, 6, , uef.fi Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M. and Duchesnay, E., Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research, 12, , 2011.

Machine Learning Final Project

Machine Learning Final Project Machine Learning Final Project Team: hahaha R01942054 林家蓉 R01942068 賴威昇 January 15, 2014 1 Introduction In this project, we are asked to solve a classification problem of Chinese characters. The training

More information

Big Data Regression Using Tree Based Segmentation

Big Data Regression Using Tree Based Segmentation Big Data Regression Using Tree Based Segmentation Rajiv Sambasivan Sourish Das Chennai Mathematical Institute H1, SIPCOT IT Park, Siruseri Kelambakkam 603103 Email: rsambasivan@cmi.ac.in, sourish@cmi.ac.in

More information

A Model-based Application Autonomic Manager with Fine Granular Bandwidth Control

A Model-based Application Autonomic Manager with Fine Granular Bandwidth Control A Model-based Application Autonomic Manager with Fine Granular Bandwidth Control Nasim Beigi-Mohammadi, Mark Shtern, and Marin Litoiu Department of Computer Science, York University, Canada Email: {nbm,

More information

arxiv: v1 [cs.ne] 2 Jul 2013

arxiv: v1 [cs.ne] 2 Jul 2013 Comparing various regression methods on ensemble strategies in differential evolution Iztok Fister Jr., Iztok Fister, and Janez Brest Abstract Differential evolution possesses a multitude of various strategies

More information

Effectiveness of Sparse Features: An Application of Sparse PCA

Effectiveness of Sparse Features: An Application of Sparse PCA 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Clustering Startups Based on Customer-Value Proposition

Clustering Startups Based on Customer-Value Proposition Clustering Startups Based on Customer-Value Proposition Daniel Semeniuta Stanford University dsemeniu@stanford.edu Meeran Ismail Stanford University meeran@stanford.edu Abstract K-means clustering is a

More information

Interactive Image Segmentation with GrabCut

Interactive Image Segmentation with GrabCut Interactive Image Segmentation with GrabCut Bryan Anenberg Stanford University anenberg@stanford.edu Michela Meister Stanford University mmeister@stanford.edu Abstract We implement GrabCut and experiment

More information

Author profiling using stylometric and structural feature groupings

Author profiling using stylometric and structural feature groupings Author profiling using stylometric and structural feature groupings Notebook for PAN at CLEF 2015 Andreas Grivas, Anastasia Krithara, and George Giannakopoulos Institute of Informatics and Telecommunications,

More information

arxiv: v1 [cs.ai] 10 May 2017

arxiv: v1 [cs.ai] 10 May 2017 Mind the Gap: a Well Log Data Analysis Rui L. Lopes 1,2 rmlc@inesctec.pt Alípio M. Jorge 1,3 amjorge@fc.up.pt 1 LIAAD, INESC TEC, Porto, Portugal 2 HASLab, INESC TEC, Porto, Portugal 3 DCC - FCUP, Universidade

More information

Forecasting Lung Cancer Diagnoses with Deep Learning

Forecasting Lung Cancer Diagnoses with Deep Learning Forecasting Lung Cancer Diagnoses with Deep Learning Data Science Bowl 2017 Technical Report Daniel Hammack April 22, 2017 Abstract We describe our contribution to the 2nd place solution for the 2017 Data

More information

MLweb: A toolkit for machine learning on the web

MLweb: A toolkit for machine learning on the web MLweb: A toolkit for machine learning on the web Fabien Lauer To cite this version: Fabien Lauer. MLweb: A toolkit for machine learning on the web. Neurocomputing, Elsevier, 2018, 282, pp.74-77.

More information

Predicting the Outcome of H-1B Visa Applications

Predicting the Outcome of H-1B Visa Applications Predicting the Outcome of H-1B Visa Applications CS229 Term Project Final Report Beliz Gunel bgunel@stanford.edu Onur Cezmi Mutlu cezmi@stanford.edu 1 Introduction In our project, we aim to predict the

More information

Model Parameter Estimation

Model Parameter Estimation Model Parameter Estimation Shan He School for Computational Science University of Birmingham Module 06-23836: Computational Modelling with MATLAB Outline Outline of Topics Concepts about model parameter

More information

Fault Detection in Hard Disk Drives Based on Mixture of Gaussians

Fault Detection in Hard Disk Drives Based on Mixture of Gaussians Fault Detection in Hard Disk Drives Based on Mixture of Gaussians Lucas P. Queiroz, Francisco Caio M. Rodrigues, Joao Paulo P. Gomes, Felipe T. Brito, Iago C. Brito, Javam C. Machado LSBD - Department

More information

Predicting Parallelization of Sequential Programs Using Supervised Learning

Predicting Parallelization of Sequential Programs Using Supervised Learning Predicting Parallelization of Sequential Programs Using Supervised Learning Daniel Fried, Zhen Li, Ali Jannesari, and Felix Wolf University of Arizona, Tucson, Arizona, USA German Research School for Simulation

More information

A data-driven approach for sensor data reconstruction for bridge monitoring

A data-driven approach for sensor data reconstruction for bridge monitoring A data-driven approach for sensor data reconstruction for bridge monitoring * Kincho H. Law 1), Seongwoon Jeong 2), Max Ferguson 3) 1),2),3) Dept. of Civil and Environ. Eng., Stanford University, Stanford,

More information

Position Estimation for Control of a Partially-Observable Linear Actuator

Position Estimation for Control of a Partially-Observable Linear Actuator Position Estimation for Control of a Partially-Observable Linear Actuator Ethan Li Department of Computer Science Stanford University ethanli@stanford.edu Arjun Arora Department of Electrical Engineering

More information

HUMAN activities associated with computational devices

HUMAN activities associated with computational devices INTRUSION ALERT PREDICTION USING A HIDDEN MARKOV MODEL 1 Intrusion Alert Prediction Using a Hidden Markov Model Udaya Sampath K. Perera Miriya Thanthrige, Jagath Samarabandu and Xianbin Wang arxiv:1610.07276v1

More information

Visualizing Deep Network Training Trajectories with PCA

Visualizing Deep Network Training Trajectories with PCA with PCA Eliana Lorch Thiel Fellowship ELIANALORCH@GMAIL.COM Abstract Neural network training is a form of numerical optimization in a high-dimensional parameter space. Such optimization processes are

More information

Determining Aircraft Sizing Parameters through Machine Learning

Determining Aircraft Sizing Parameters through Machine Learning h(y) Determining Aircraft Sizing Parameters through Machine Learning J. Michael Vegh, Tim MacDonald, Brian Munguía I. Introduction Aircraft conceptual design is an inherently iterative process as many

More information

Supervised Ranking for Plagiarism Source Retrieval

Supervised Ranking for Plagiarism Source Retrieval Supervised Ranking for Plagiarism Source Retrieval Notebook for PAN at CLEF 2013 Kyle Williams, Hung-Hsuan Chen, and C. Lee Giles, Information Sciences and Technology Computer Science and Engineering Pennsylvania

More information

No more questions will be added

No more questions will be added CSC 2545, Spring 2017 Kernel Methods and Support Vector Machines Assignment 2 Due at the start of class, at 2:10pm, Thurs March 23. No late assignments will be accepted. The material you hand in should

More information

X-Ray Photoelectron Spectroscopy Enhanced by Machine Learning

X-Ray Photoelectron Spectroscopy Enhanced by Machine Learning X-Ray Photoelectron Spectroscopy Enhanced by Machine Learning Alexander Gabourie (gabourie@stanford.edu), Connor McClellan (cjmcc@stanford.edu), Sanchit Deshmukh (sanchitd@stanford.edu) 1. Introduction

More information

Classification of Building Information Model (BIM) Structures with Deep Learning

Classification of Building Information Model (BIM) Structures with Deep Learning Classification of Building Information Model (BIM) Structures with Deep Learning Francesco Lomio, Ricardo Farinha, Mauri Laasonen, Heikki Huttunen Tampere University of Technology, Tampere, Finland Sweco

More information

CME 345: MODEL REDUCTION

CME 345: MODEL REDUCTION CME 345: MODEL REDUCTION Parameterized Partial Differential Equations Charbel Farhat Stanford University cfarhat@stanford.edu 1 / 19 Outline 1 Initial Boundary Value Problems 2 Typical Parameters of Interest

More information

Efficiency Comparison of Unstable Transductive and Inductive Conformal Classifiers

Efficiency Comparison of Unstable Transductive and Inductive Conformal Classifiers Efficiency Comparison of Unstable Transductive and Inductive Conformal Classifiers Henrik Linusson 1, Ulf Johansson 1,HenrikBoström 2,andTuveLöfström 1 1 School of Business and IT University of Borås,

More information

ACCELERATING THE HIERARCHICAL MULTI-SCALE SOFTWARE BY SPATIAL CLUSTERING STRATEGIES

ACCELERATING THE HIERARCHICAL MULTI-SCALE SOFTWARE BY SPATIAL CLUSTERING STRATEGIES Accelerating the Hierarchical Multi-Scale (HMS) software by spatial clustering strategies XIII International Conference on Computational Plasticity. Fundamentals and Applications COMPLAS XIII E. Oñate,

More information

Optimal Generalized Decision Trees via Integer Programming

Optimal Generalized Decision Trees via Integer Programming Optimal Generalized Decision Trees via Integer Programming Matt Menickelly Department of Industrial and Systems Engineering, Lehigh University, Harold S. Mohler Laboratory, 200 West Packer Avenue, Bethlehem,

More information

Sampling Method for Fast Training of Support Vector Data Description

Sampling Method for Fast Training of Support Vector Data Description Sampling Method for Fast Training of Support Vector Data Description Arin Chaudhuri and Deovrat Kakde and Maria Jahja and Wei Xiao and Seunghyun Kong and Hansi Jiang and Sergiy Peredriy SAS Institute Cary,

More information

PyCP: An Open-Source Conformal Predictions Toolkit

PyCP: An Open-Source Conformal Predictions Toolkit PyCP: An Open-Source Conformal Predictions Toolkit Vineeth N. Balasubramanian, Aaron Baker, Matthew Yanez, Shayok Chakraborty, and Sethuraman Panchanathan Center for Cognitive Ubiquitous Computing School

More information

The Remote Sensing and GIS Software Library

The Remote Sensing and GIS Software Library January 2017 The Software System Course Overview 1 Introduction The Software System Course Overview 2 Part 1 Part 2 Part 3 Pixel Base Object Base 3 The Software System Course Overview Paradigm Shifting

More information

arxiv: v1 [cs.sy] 9 Apr 2018

arxiv: v1 [cs.sy] 9 Apr 2018 Sample-Derived Disjunctive Rules for Secure Power System Operation Jochen. Cremer, Ioannis Konstantelos, Goran Strbac Department of Electrical and Electronic Engineering Imperial College ondon ondon, nited

More information

hyperspace: Automated Optimization of Complex Processing Pipelines for pyspace

hyperspace: Automated Optimization of Complex Processing Pipelines for pyspace hyperspace: Automated Optimization of Complex Processing Pipelines for pyspace Torben Hansing, Mario Michael Krell, Frank Kirchner Robotics Research Group University of Bremen Robert-Hooke-Str. 1, 28359

More information

TouchML: A Machine Learning Toolkit for Modelling Spatial Touch Targeting Behaviour

TouchML: A Machine Learning Toolkit for Modelling Spatial Touch Targeting Behaviour TouchML: A Machine Learning Toolkit for Modelling Spatial Touch Targeting Behaviour Daniel Buschek, Florian Alt Media Informatics Group, University of Munich (LMU) {daniel.buschek, florian.alt}@ifi.lmu.de

More information

Learning when to use a decomposition

Learning when to use a decomposition Learning when to use a decomposition Markus Kruber 1, Marco E. Lübbecke 1, and Axel Parmentier 2 1 Chair of Operations Research, RWTH Aachen University, Kackertstrasse 7, 52072 Aachen, Germany, {kruber,

More information

Distance-based Kernels for Dynamical Movement Primitives

Distance-based Kernels for Dynamical Movement Primitives Distance-based Kernels for Dynamical Movement Primitives Diego ESCUDERO-RODRIGO a,1, René ALQUEZAR a, a Institut de Robotica i Informatica Industrial, CSIC-UPC, Spain Abstract. In the Anchoring Problem

More information

Detecting People in Large Crowded Spaces Using 3D Data from Multiple Cameras

Detecting People in Large Crowded Spaces Using 3D Data from Multiple Cameras Detecting People in Large Crowded Spaces Using 3D Data from Multiple Cameras João Carvalho 1, Manuel Marques 1, João Paulo Costeira 1 and Pedro Mendes Jorge 2 1 Institute for Systems and Robotics (ISR/IST),

More information

Tapkee: An Efficient Dimension Reduction Library

Tapkee: An Efficient Dimension Reduction Library Journal of Machine Learning Research 14 (2013) 2355-2359 Submitted 2/12; Revised 5/13; Published 8/13 Tapkee: An Efficient Dimension Reduction Library Sergey Lisitsyn Samara State Aerospace University

More information

Sifaka: Text Mining Above a Search API

Sifaka: Text Mining Above a Search API Sifaka: Text Mining Above a Search API ABSTRACT Cameron VandenBerg Language Technologies Institute Carnegie Mellon University Pittsburgh, PA 15213, USA cmw2@cs.cmu.edu Text mining and analytics software

More information

Appliance Level Energy Disaggregation

Appliance Level Energy Disaggregation Appliance Level Energy Disaggregation Samir Sen samirsen@stanford.edu Fred Lu fredlu@stanford.edu Abstract We build and experiment with various models for separating total electric energy usage (kwh) into

More information

Bayesian aerosol retrieval algorithm for MODIS AOD retrieval over land

Bayesian aerosol retrieval algorithm for MODIS AOD retrieval over land https://doi.org/10.5194/amt-11-1529-2018 Author(s) 2018. This work is distributed under the Creative Commons Attribution 4.0 License. Bayesian aerosol retrieval algorithm for MODIS AOD retrieval over land

More information

Global Optimization in Learning with Important Data: an FCA-Based Approach

Global Optimization in Learning with Important Data: an FCA-Based Approach Global Optimization in Learning with Important Data: an FCA-Based Approach Yury Kashnitsky, Sergei O. Kuznetsov National Research University Higher School of Economics, Moscow, Russia {ykashnitsky, skuznetsov}@hse.ru

More information

SCC: Automatic Classification of Code Snippets

SCC: Automatic Classification of Code Snippets SCC: Automatic Classification of Code Snippets Kamel Alreshedy, Dhanush Dharmaretnam, Daniel M. German, Venkatesh Srinivasan and T. Aaron Gulliver University of Victoria Victoria, BC, Canada V8W 2Y2 {kamel,

More information

arxiv: v1 [cs.ro] 11 Mar 2018

arxiv: v1 [cs.ro] 11 Mar 2018 Low-cost Autonomous Navigation System Based on Optical Flow Classification Michel C. Meneses 1, Leonardo N. Matos 1, Bruno O. Prado 1 1 Department of Computer Science Federal University of Sergipe (UFS)

More information

SoccEval: An Annotation Schema for Rating Soccer Players

SoccEval: An Annotation Schema for Rating Soccer Players SoccEval: An Annotation Schema for Rating Soccer Players Jose Ramirez Matthew Garber Department of Computer Science Brandeis University Xinhao Wang {jramirez, mgarber, xinhao}@brandeis.edu Abstract This

More information

Surrogate Modeling to Enable Higher Fidelity Optimization

Surrogate Modeling to Enable Higher Fidelity Optimization AIAA AVIATION Forum 5-9 June 2017, Denver, Colorado 18th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference 10.2514/6.2017-3661 Surrogate Modeling to Enable Higher Fidelity Optimization

More information

P4ML: A Phased Performance-Based Pipeline Planner for Automated Machine Learning

P4ML: A Phased Performance-Based Pipeline Planner for Automated Machine Learning Proceedings of Machine Learning Research 1:1 8, 2018 ICML2018AutoMLWorkshop P4ML: A Phased Performance-Based Pipeline Planner for Automated Machine Learning Yolanda Gil, Ke-Thia Yao, Varun Ratnakar, Daniel

More information

An Introduction to Machine Learning

An Introduction to Machine Learning TRIPODS Summer Boorcamp: Topology and Machine Learning August 6, 2018 General Set-up Introduction Set-up and Goal Suppose we have X 1,X 2,...,X n data samples. Can we predict properites about any given

More information

Variational autoencoders for tissue heterogeneity exploration from (almost) no preprocessed mass spectrometry imaging data.

Variational autoencoders for tissue heterogeneity exploration from (almost) no preprocessed mass spectrometry imaging data. arxiv:1708.07012v2 [q-bio.qm] 24 Aug 2017 Variational autoencoders for tissue heterogeneity exploration from (almost) no preprocessed mass spectrometry imaging data. Paolo Inglese, James L. Alexander,

More information

Continuum-Microscopic Models

Continuum-Microscopic Models Scientific Computing and Numerical Analysis Seminar October 1, 2010 Outline Heterogeneous Multiscale Method Adaptive Mesh ad Algorithm Refinement Equation-Free Method Incorporates two scales (length, time

More information

Discovery of the Source of Contaminant Release

Discovery of the Source of Contaminant Release Discovery of the Source of Contaminant Release Devina Sanjaya 1 Henry Qin Introduction Computer ability to model contaminant release events and predict the source of release in real time is crucial in

More information

FAILURE CLASSIFICATION IN ELECTRIC SWITCH MACHINES USING SYMBOLIC CONCEPTS AND COMPUTATIONAL INTELLIGENCE

FAILURE CLASSIFICATION IN ELECTRIC SWITCH MACHINES USING SYMBOLIC CONCEPTS AND COMPUTATIONAL INTELLIGENCE FAILURE CLASSIFICATION IN ELECTRIC SWITCH MACHINES USING SYMBOLIC CONCEPTS AND COMPUTATIONAL INTELLIGENCE Nielson Soares Eduardo Pestana de Aguiar Leonardo Goliatt da Fonseca soares.nielson@engenharia.ufjf.br

More information

Computer Vision for Head Pose Estimation: Review of a Competition

Computer Vision for Head Pose Estimation: Review of a Competition Computer Vision for Head Pose Estimation: Review of a Competition Heikki Huttunen 1, Ke Chen 1, Abhishek Thakur 2, Artus Krohn-Grimberghe 2, Oguzhan Gencoglu 1, Xingyang Ni 1, Mohammed Al-Musawi 1, Lei

More information

Classifying and Ranking Search Engine Results as Potential Sources of Plagiarism

Classifying and Ranking Search Engine Results as Potential Sources of Plagiarism Classifying and Ranking Search Engine Results as Potential Sources of Plagiarism Kyle Williams, Hung-Hsuan Chen, C. Lee Giles Information Sciences and Technology, Computer Science and Engineering Pennsylvania

More information

song2vec: Determining Song Similarity using Deep Unsupervised Learning

song2vec: Determining Song Similarity using Deep Unsupervised Learning song2vec: Determining Song Similarity using Deep Unsupervised Learning CS229 Final Project Report (category: Music & Audio) Brad Ross (bross35), Prasanna Ramakrishnan (pras1712) 1 Introduction Humans are

More information

MULTIMODAL CLASSIFICATION OF DANCE MOVEMENTS USING BODY JOINT TRAJECTORIES AND STEP SOUNDS

MULTIMODAL CLASSIFICATION OF DANCE MOVEMENTS USING BODY JOINT TRAJECTORIES AND STEP SOUNDS MULTIMODAL CLASSIFICATION OF DANCE MOVEMENTS USING BODY JOINT TRAJECTORIES AND STEP SOUNDS Aymeric Masurelle, Slim Essid, Gaël Richard To cite this version: Aymeric Masurelle, Slim Essid, Gaël Richard.

More information

PROCEEDINGS OF SPIE. Integrated dielectric scatterers for fast optical classification of biological cells

PROCEEDINGS OF SPIE. Integrated dielectric scatterers for fast optical classification of biological cells PROCEEDINGS OF SPIE SPIEDigitalLibrary.org/conference-proceedings-of-spie Integrated dielectric scatterers for fast optical classification of biological cells Alessio Lugnan, Joni Dambre, Peter Bienstman

More information

Sliced Curvature Scale Space for Representing and Recognizing 3D objects

Sliced Curvature Scale Space for Representing and Recognizing 3D objects Sliced Curvature Scale Space for Representing and Recognizing 3D objects Billy Okal Social Robotics Laboratory Albert-Ludwigs-Universität Freiburg Georges-Köhler-Allee 74, D-79110 Freiburg Email: okal@informatik.uni-freiburg.de

More information

Using Cumulative Distribution Based Performance Analysis to Benchmark Models

Using Cumulative Distribution Based Performance Analysis to Benchmark Models Using Cumulative Distribution Based Performance Analysis to Benchmark Models Scott M. Jordan Daniel Cohen University of Massachusetts Amherst [sjordan,dcohen,pthomas]@cs.umass.edu Philip S. Thomas Abstract

More information

Large synthetic data sets to compare different data mining methods

Large synthetic data sets to compare different data mining methods Large synthetic data sets to compare different data mining methods Victoria Ivanova, Yaroslav Nalivajko Superviser: David Pfander, IPVS ivanova.informatics@gmail.com yaroslav.nalivayko@gmail.com June 3,

More information

Recognizing Compound Events in Spatio-Temporal Football Data

Recognizing Compound Events in Spatio-Temporal Football Data Recognizing Compound Events in Spatio-Temporal Football Data Keywords: Abstract: football analytics, event recognition, spatio-temporal data analysis, supervised machine learning In the world of football,

More information

RGB-D-Based Features for Recognition of Textureless Objects

RGB-D-Based Features for Recognition of Textureless Objects RGB-D-Based Features for Recognition of Textureless Objects Santosh Thoduka, Stepan Pazekha, Alexander Moriarty, and Gerhard K. Kraetzschmar Bonn-Rhein-Sieg University of Applied Sciences Department of

More information

Chat Disentanglement: Identifying Semantic Reply Relationships with Random Forests and Recurrent Neural Networks

Chat Disentanglement: Identifying Semantic Reply Relationships with Random Forests and Recurrent Neural Networks Chat Disentanglement: Identifying Semantic Reply Relationships with Random Forests and Recurrent Neural Networks Shikib Mehri Department of Computer Science University of British Columbia Vancouver, Canada

More information

Search Engine Switching Detection Based on User Personal Preferences and Behavior Patterns

Search Engine Switching Detection Based on User Personal Preferences and Behavior Patterns Search Engine Switching Detection Based on User Personal Preferences and Behavior Patterns Denis Savenkov, Dmitry Lagun, Qiaoling Liu Mathematics & Computer Science Emory University Atlanta, US {denis.savenkov,

More information

arxiv: v1 [cs.ir] 7 Nov 2017

arxiv: v1 [cs.ir] 7 Nov 2017 Quality-Efficiency Trade-offs in Machine Learning for Text Processing arxiv:1711.02295v1 [cs.ir] 7 Nov 2017 Abstract Nowadays, the amount of available digital documents is rapidly growing from a variety

More information

An Adaptive Scaling Mechanism for Managing Performance Variations in Network Functions Virtualization: A Case Study in an NFV-based EPC

An Adaptive Scaling Mechanism for Managing Performance Variations in Network Functions Virtualization: A Case Study in an NFV-based EPC An Adaptive Scaling Mechanism for Managing Performance Variations in Network Functions Virtualization: A Case Study in an NFV-based EPC Carlos Hernán Tobar Arteaga, Fulvio Risso and Oscar Mauricio Caicedo

More information

Mondrian Forests: Efficient Online Random Forests

Mondrian Forests: Efficient Online Random Forests Mondrian Forests: Efficient Online Random Forests Balaji Lakshminarayanan (Gatsby Unit, UCL) Daniel M. Roy (Cambridge Toronto) Yee Whye Teh (Oxford) September 4, 2014 1 Outline Background and Motivation

More information

Dealing with overlapping clustering: a constraint-based approach to algorithm selection

Dealing with overlapping clustering: a constraint-based approach to algorithm selection Dealing with overlapping clustering: a constraint-based approach to algorithm selection Antoine Adam and Hendrik Blockeel KULeuven, Department of Computer Science, Celestijnenlaan 200A, 3001 Leuven, Belgium

More information

arxiv: v6 [cs.ai] 17 Feb 2018

arxiv: v6 [cs.ai] 17 Feb 2018 Scott M. Lundberg SLUND1@CS.WASHINGTON.EDU Paul G. Allen School of Computer Science, University of Washington, Seattle, WA 98105 USA Su-In Lee SUINLEE@CS.WASHINGTON.EDU Paul G. Allen School of Computer

More information

Final Report: Kaggle Soil Property Prediction Challenge

Final Report: Kaggle Soil Property Prediction Challenge Final Report: Kaggle Soil Property Prediction Challenge Saurabh Verma (verma076@umn.edu, (612)598-1893) 1 Project Goal Low cost and rapid analysis of soil samples using infrared spectroscopy provide new

More information

CO2 Forest: Improved Random Forest by Continuous Optimization of Oblique Splits

CO2 Forest: Improved Random Forest by Continuous Optimization of Oblique Splits 1 : Improved by Continuous Optimization of Oblique Splits Mohammad Norouzi, Maxwell D. Collins, David J. Fleet, Pushmeet Kohli typical splitting criteria for a one-level decision tree (decision stump)

More information

Mondrian Forests: Efficient Online Random Forests

Mondrian Forests: Efficient Online Random Forests Mondrian Forests: Efficient Online Random Forests Balaji Lakshminarayanan Joint work with Daniel M. Roy and Yee Whye Teh 1 Outline Background and Motivation Mondrian Forests Randomization mechanism Online

More information

arxiv: v1 [stat.ml] 20 May 2017

arxiv: v1 [stat.ml] 20 May 2017 Forward Thinking: Building Deep Random Forests arxiv:1705.07366v1 [stat.ml] 20 May 2017 Kevin Miller, Chris Hettinger, Jeffrey Humpherys, Tyler Jarvis, and David Kartchner Department of Mathematics Brigham

More information

Learning Document Image Binarization from. Data

Learning Document Image Binarization from. Data Learning Document Image Binarization from 1 Data Yue Wu, Stephen Rawls, Wael AbdAlmageed and Premkumar Natarajan arxiv:1505.00529v1 [cs.cv] 4 May 2015 Abstract In this paper we present a fully trainable

More information

Use of look-up tables (LUT) for aerosol optics and activation to cloud droplets

Use of look-up tables (LUT) for aerosol optics and activation to cloud droplets Use of look-up tables (LUT) for aerosol optics and activation to cloud droplets Alf Kirkevåg 28.11.2013 Norwegian Meteorological Institute Aerosol growth by: - condensation of H 2 SO 4 - coagulation of

More information

Auto-Extraction of Modelica Code from Finite Element Analysis or Measurement Data

Auto-Extraction of Modelica Code from Finite Element Analysis or Measurement Data Auto-Extraction of Modelica Code from Finite Element Analysis or Measurement Data The-Quan Pham, Alfred Kamusella, Holger Neubert OptiY ek, Aschaffenburg Germany, e-mail: pham@optiyeu Technische Universität

More information

Importance Sampled Learning Ensembles

Importance Sampled Learning Ensembles Importance Sampled Learning Ensembles Jerome H. Friedman Bogdan E. Popescu September 9, 2003 Abstract Learning a function of many arguments is viewed from the perspective of high dimensional numerical

More information

Fast Constrained Non-negative Matrix Factorization for Whole-Brain Calcium Imaging Data

Fast Constrained Non-negative Matrix Factorization for Whole-Brain Calcium Imaging Data Fast Constrained Non-negative Matrix Factorization for Whole-rain Calcium Imaging Data Johannes Friedrich 1,2, Daniel Soudry 1, Yu Mu 2, Jeremy Freeman 2, Misha hrens 2, Liam Paninski 1 1 Department of

More information

SYDNEY CMCRC at TAC

SYDNEY CMCRC at TAC SYDNEY CMCRC at TAC 2013 Glen Pink Will Radford Will Cannings Andrew Naoum Joel Nothman Daniel Tse James R. Curran -lab, School of Information Technologies Capital Markets CRC University of Sydney 55 Harrington

More information

arxiv: v1 [cs.lg] 28 Jan 2016

arxiv: v1 [cs.lg] 28 Jan 2016 Automating biomedical data science through tree-based pipeline optimization Randal S. Olson 1, Ryan J. Urbanowicz 1, Peter C. Andrews 1, Nicole A. Lavender 2, La Creis Kidd 2, and Jason H. Moore 1 arxiv:1601.07925v1

More information

Pruning Decision Trees via Max-Heap Projection

Pruning Decision Trees via Max-Heap Projection Pruning Decision Trees via Max-Heap Projection Downloaded 06/15/17 to 72.66.114.89. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php Abstract Zhi Nie Binbin

More information

Solution of 2D Euler Equations and Application to Airfoil Design

Solution of 2D Euler Equations and Application to Airfoil Design WDS'6 Proceedings of Contributed Papers, Part I, 47 52, 26. ISBN 8-86732-84-3 MATFYZPRESS Solution of 2D Euler Equations and Application to Airfoil Design J. Šimák Charles University, Faculty of Mathematics

More information

SUAVE: An Open-Source Environment Enabling Multi-Fidelity Vehicle Optimization

SUAVE: An Open-Source Environment Enabling Multi-Fidelity Vehicle Optimization SUAVE: An Open-Source Environment Enabling Multi-Fidelity Vehicle Optimization Timothy MacDonald, Matthew Clarke, Emilio Botero, J. Michael Vegh Juan J. Alonso Stanford University, Stanford, CA 94305,

More information

Nested Mini-Batch K-Means

Nested Mini-Batch K-Means Nested Mini-Batch K-Means James Newling Idiap Research Institue & EPFL james.newling@idiap.ch François Fleuret Idiap Research Institue & EPFL francois.fleuret@idiap.ch Abstract A new algorithm is proposed

More information

Global modelling of air pollution using multiple data sources

Global modelling of air pollution using multiple data sources Global modelling of air pollution using multiple data sources Matthew Thomas M.L.Thomas@bath.ac.uk Supervised by Dr. Gavin Shaddick In collaboration with IHME and WHO June 14, 2016 1/ 1 MOTIVATION Air

More information

On the Value of Incorporating Domain Knowledge Into Robot Design Tools

On the Value of Incorporating Domain Knowledge Into Robot Design Tools On the Value of Incorporating Domain Knowledge Into Robot Design Tools hervin hasemlou and Jason M. O Kane Abstract This paper aims to improve the practical scalability of automated tools to assist in

More information

Bayesian approach to single-tree detection in airborne laser scanning use of training data for prior and likelihood modeling

Bayesian approach to single-tree detection in airborne laser scanning use of training data for prior and likelihood modeling Bayesian approach to single-tree detection in airborne laser scanning use of training data for prior and likelihood modeling Teemu Luostari 1, Timo Lähivaara 1, Petteri Packalen 2 and Aku Seppänen 1 1

More information

An Exploratory Search User Interface Concept Supporting Vague Querying and a Novel Result Representation

An Exploratory Search User Interface Concept Supporting Vague Querying and a Novel Result Representation An Exploratory Search User Interface Concept Supporting Vague Querying and a Novel Result Representation Timo Lüddecke Markus Jüttner Marcus Nitsche Andreas Nürnberger Data and Knowledge Engineering Group,

More information

Manifold Spanning Graphs

Manifold Spanning Graphs Manifold Spanning Graphs CJ Carey and Sridhar Mahadevan School of Computer Science University of Massachusetts, Amherst Amherst, Massachusetts, 01003 {ccarey,mahadeva}@cs.umass.edu Abstract Graph construction

More information

Improving Object Recognition with Entity Co-Occurrence

Improving Object Recognition with Entity Co-Occurrence Improving Object Recognition with Entity Co-Occurrence Willie Boag wboag@cs.uml.edu Pete Maniatis maniatisp@gmail.com Abstract State-of-the-art object detection is powerful, but limited. Most systems use

More information

Timo Lähivaara, Tomi Huttunen, Simo-Pekka Simonaho University of Kuopio, Department of Physics P.O.Box 1627, FI-70211, Finland

Timo Lähivaara, Tomi Huttunen, Simo-Pekka Simonaho University of Kuopio, Department of Physics P.O.Box 1627, FI-70211, Finland Timo Lähivaara, Tomi Huttunen, Simo-Pekka Simonaho University of Kuopio, Department of Physics P.O.Box 627, FI-72, Finland timo.lahivaara@uku.fi INTRODUCTION The modeling of the acoustic wave fields often

More information

3D Interest Point Detection via Discriminative Learning

3D Interest Point Detection via Discriminative Learning 3D Interest Point Detection via Discriminative Learning Leizer Teran 1 and Philippos Mordohai 2 1 Drexel University 2 Stevens Institute of Technology Abstract. The task of detecting the interest points

More information

Towards High-Performance Prediction Serving Systems

Towards High-Performance Prediction Serving Systems Towards High-Performance Prediction Serving Systems Yunseong Lee Seoul National University Seoul, South Korea yunseong@snu.ac.kr Alberto Scolari Politecnico di Milano Milano, Italy alberto.scolari@polimi.it

More information

A peer-reviewed version of this preprint was published in PeerJ on 15 March 2018.

A peer-reviewed version of this preprint was published in PeerJ on 15 March 2018. A peer-reviewed version of this preprint was published in PeerJ on 15 March 2018. View the peer-reviewed version (peerj.com/articles/4411), which is the preferred citable publication unless you specifically

More information

Random Forests and Boosting

Random Forests and Boosting Random Forests and Boosting Tree-based methods are simple and useful for interpretation. However they typically are not competitive with the best supervised learning approaches in terms of prediction accuracy.

More information

Performance Prediction for Graph Queries

Performance Prediction for Graph Queries Performance Prediction for Graph Queries Mohammad Hossein Namaki, Keyvan Sasani, Yinghui Wu, Assefaw H. Gebremedhin Washington State University {mnamaki, ksasani, yinghui, assefaw}@eecs.wsu.edu ABSTRACT

More information

Online quality assessment of human motion from skeleton data

Online quality assessment of human motion from skeleton data 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 AUTHOR(S): ONLINE

More information

Identifying Support Surfaces of Climbable Structures from 3D Point Clouds

Identifying Support Surfaces of Climbable Structures from 3D Point Clouds Identifying Support Surfaces of Climbable Structures from 3D Point Clouds Anna Eilering 1, Victor Yap 2, Jeff Johnson 1, and Kris Hauser 1 Abstract This paper presents a probabilistic technique for identifying

More information

TITLE. Bruno Bogaz Zarpelão Department of Computer Science of State University of Londrina.

TITLE. Bruno Bogaz Zarpelão Department of Computer Science of State University of Londrina. TITLE Victor Guilherme Turrisi da Costa Department of Computer Science of State University of Londrina Londrina, Paraná 86057-970 Email: vt.turrisi@gmail.com Bruno Bogaz Zarpelão Department of Computer

More information

Robust Calibration Marker Detection in Powder Bed Images from Laser Beam Melting Processes

Robust Calibration Marker Detection in Powder Bed Images from Laser Beam Melting Processes Robust Calibration Marker Detection in Powder Bed Images from Laser Beam Melting Processes Joschka zur Jacobsmühlen, Jan Achterhold, Stefan Kleszczynski, Gerd Witt, Dorit Merhof Institute of Imaging and

More information