Model Based Impact Location Estimation Using Machine Learning Techniques
|
|
- Alvin Norman
- 5 years ago
- Views:
Transcription
1 Model Based Impact Location Estimation Using Machine Learning Techniques 1. Introduction Impacts on composite structures result in invisible damages that need to be detected and corrected before they lead to catastrophic failure of the structures. Location of the impact is a crucial factor that determines the overall response of the system to the impact. Several attempts using interpolation methods have been made to estimate impact location based on sensor measurements. The results in most cases, however, are not satisfying due to the limitation on training impacts. With the aid of a numerical model of the structure, more training impacts can be taken numerically at low cost. This report presents an alternative way of estimating the impact location using machine- learning techniques. 2. Data 2.1 System Setup Figure 1: system setup (physical panel) Figure 2: system setup (FEM simulation) The overall purpose of the project is to identify an impact on a composite structure. A T800H/ graphite- epoxy panel is used to validate the concept of model- based training technique for general purpose (Figure 1). To analyze this problem, a simulation model is built using finite element method (FEM). As a result, impact experiments can be taken on the FEM simulation at virtually no cost (Figure 2). This allows us to collect a large amount of samples during the training process. 2.2 Data Collection The main purpose of the data collection is to create a database that captures the characteristics of impacts over the entire panel (Figure 3). The impacts (yellow arrow) are simulated on the FEM model at different locations. For this specific panel, there are 800 training impacts uniformly distributed. For each impact, 20 sensor measurements (red dot) are recorded and each measurement contains 50 data points. The true value of the impact location (x, y) is also recorded for each of the impact. The total sample space, as a result, contains the following data that is stored as vectors and matrices in the following format. 1
2 X (!) = y (!) = x, y!!!!"" s!,! s!,!" s!",! s!",!"!!!!"" 2.3 Feature Figure 3: sample data collection for the panel For a given sensor signal of 50 data points, several important characteristics can be extracted. Three are examined in this case: (1) windowed energy, (2) time of flight, and (3) peak amplitude (Figure 4). Those features are ideal because they have strong correlation with impact locations. By extracting those important features of the data, the size of the input space is decreased from to (1) (2) (3) Figure 4: features used for sensor measurements 3. Methods Using the data and features collected above, the goal is to estimate the impact location based on the given set of sensor measurements. Linear regression is an intuitive and reasonable model because the target variables are continuous rather than discrete classes. Finite element method is also a linear model, although the sensor response due to change of impact location is non- linear. 2
3 3.1 holdout cross validation To validate the learning model, the samples are divided into training (70%) and testing (30%) samples. To ensure sufficient coverage of the entire panel, the training samples are chosen at even- spaced distances. A random selection method has also been tried assuming uniform distribution. It has a lower training result due to additional selection uncertainty. 3.2 k- means clustering There are 20 sensors in total and 3 features for each measurement. To model the entire panel at once would require 60 parameters. Moreover, the sensor measurements far away from the impact location have low signal. It is insufficient and biased to use all the signals at once. One way to deal with this issue is to divide the panel into different regions and develop models in each sub region. It is not straightforward, but meaningless, to divide the panel by physical coordinates (say small rectangles), because it is impossible to assign a new impact to any region without knowing the location. K- means clustering technique is used to divide samples into 12 groups for the specific panel. Only energy feature is used to perform clustering, because there is no need to develop a perfect clustering, as long as it is meaningful and distinctive. The energy of the sensor measurement are quite sensitive to impact location, providing 12 set of clusters in the 20- dimension feature space. 3.3 Linear Regression After dividing the training sample set into 12 different groups, linear regression is performed on each group consisting 47 training samples ( /12). To further simplify the model, instead of using all the features from all the sensors (60 parameters in total), each are modified to have a weighted- averaged contribution from all the sensors (3 parameters in total). Different combinations of the features are used. For the most comprehensive case: x! = Energy ToF Amp!!!!" y! = x y!!!!" X = (x! )! (x!" )!!"! Y = (y! )! (y!" )!!"! By setting Θ = (X! X)!! X! Y, the parameter [Θ]!! is created. The fist and second column of the Θ correspond to the parameters of location x and location y respectively. 3
4 4. Results 4.1 training error and test error First, the training errors and test errors are studied as a function of data size. As mentioned above, the total sample size is 800. It is purposely reduced to study the accuracy of the model. For total sample size lower than 51 (3 12/0.7), the training error is 0 because the system is underdetermined. The training error increases as the number of sample size increases. The test set, on the other hand, decreases as the sample size increases. In this model, the samples are taken evenly spaced over the entire panel. As a result, the training error and test error converge fairly well as sample size goes up (Figure 5). The effective error used to evaluate the accuracy is calculated by averaging the normalized Euclidean distance between the real and estimated location. It is also noted that the error is higher than expected, suggesting future improvement Test Error Feature Combination Figure 5: model accuracy as a function of sample size Figure 6: accuracy vs. features The modeling process is also repeated for different combinations of features. It is observed that there is a significant increase in accuracy by adding the energy (1) feature. Adding an additional feature of peak amplitude (2), which is a lower order version of the energy, does not further increase the accuracy (Figure 6). Using only time of flight (2) is not sufficient in this case. 4.2 individual cases To further understand the hidden flaws of the model, errors from the test samples are studied in details. Impact samples are chosen in three categories: (1) flat region, (2) edges, and (3) stiffened region. It is found, in general, that the estimation errors appear larger when the impact location is around the edges and certain locations 4
5 with stiffeners on the panel. This is contributed to the high variance in the FEM model of the structure. The signals are more sensitive in these regions. 5. Future Improvements From analyzing the results and the error distribution on the panel, certain improvements can be made in the future. As mentioned above, a significant part of the error is contributed due to the high variance around certain locations. Thus a signal- preprocessing scheme can be developed to detect data with high variance and eliminate them from the training set. This will decrease the test error in many cases, but will increase the error in some corner cases. Secondly, the sensor location is not used in this model. The error could be further reduced if those parameters are used in the model. 6. Conclusion Machine learning techniques can be used to estimate the impact location during numerical training process. Compare to traditional methods, it explores more features of the signals. The accuracy is comparable with other methods, which takes sensor locations into account. The training and test errors converge as sample size gets larger, although the accuracy is lower than expected. The error is mainly contributed to some outlier signals with high variance. In sum, k- clustering and linear regression provides an efficient way of estimating the impact location from sensor signals. Modeling and understanding the features of the signal is the main difficulty. 5
Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces
Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces Eric Christiansen Michael Gorbach May 13, 2008 Abstract One of the drawbacks of standard reinforcement learning techniques is that
More informationChapter 2 Basic Structure of High-Dimensional Spaces
Chapter 2 Basic Structure of High-Dimensional Spaces Data is naturally represented geometrically by associating each record with a point in the space spanned by the attributes. This idea, although simple,
More informationSYDE Winter 2011 Introduction to Pattern Recognition. Clustering
SYDE 372 - Winter 2011 Introduction to Pattern Recognition Clustering Alexander Wong Department of Systems Design Engineering University of Waterloo Outline 1 2 3 4 5 All the approaches we have learned
More informationk-means, k-means++ Barna Saha March 8, 2016
k-means, k-means++ Barna Saha March 8, 2016 K-Means: The Most Popular Clustering Algorithm k-means clustering problem is one of the oldest and most important problem. K-Means: The Most Popular Clustering
More informationLecture 2 The k-means clustering problem
CSE 29: Unsupervised learning Spring 2008 Lecture 2 The -means clustering problem 2. The -means cost function Last time we saw the -center problem, in which the input is a set S of data points and the
More informationComputer Experiments: Space Filling Design and Gaussian Process Modeling
Computer Experiments: Space Filling Design and Gaussian Process Modeling Best Practice Authored by: Cory Natoli Sarah Burke, Ph.D. 30 March 2018 The goal of the STAT COE is to assist in developing rigorous,
More information3D Modeling from Range Images
1 3D Modeling from Range Images A Comprehensive System for 3D Modeling from Range Images Acquired from a 3D ToF Sensor Dipl.-Inf. March 22th, 2007 Sensor and Motivation 2 3D sensor PMD 1k-S time-of-flight
More informationEfficient Orienteering-Route Search over Uncertain Spatial Datasets
Efficient Orienteering-Route Search over Uncertain Spatial Datasets Mr. Nir DOLEV, Israel Dr. Yaron KANZA, Israel Prof. Yerach DOYTSHER, Israel 1 Route Search A standard search engine on the WWW returns
More informationClustering CS 550: Machine Learning
Clustering CS 550: Machine Learning This slide set mainly uses the slides given in the following links: http://www-users.cs.umn.edu/~kumar/dmbook/ch8.pdf http://www-users.cs.umn.edu/~kumar/dmbook/dmslides/chap8_basic_cluster_analysis.pdf
More informationModel Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer
Model Assessment and Selection Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Model Training data Testing data Model Testing error rate Training error
More informationCRF Based Point Cloud Segmentation Jonathan Nation
CRF Based Point Cloud Segmentation Jonathan Nation jsnation@stanford.edu 1. INTRODUCTION The goal of the project is to use the recently proposed fully connected conditional random field (CRF) model to
More informationFinal Exam Assigned: 11/21/02 Due: 12/05/02 at 2:30pm
6.801/6.866 Machine Vision Final Exam Assigned: 11/21/02 Due: 12/05/02 at 2:30pm Problem 1 Line Fitting through Segmentation (Matlab) a) Write a Matlab function to generate noisy line segment data with
More informationNote Set 4: Finite Mixture Models and the EM Algorithm
Note Set 4: Finite Mixture Models and the EM Algorithm Padhraic Smyth, Department of Computer Science University of California, Irvine Finite Mixture Models A finite mixture model with K components, for
More informationTransient Rendering. Adam Smith James Skorupski. University of California, Santa Cruz December 11, 2007 CMPS 290B
Transient Rendering Adam Smith James Skorupski University of California, Santa Cruz December 11, 2007 CMPS 290B {amsmith,jskorups}@cs.ucsc.edu 2/32 Motivation There is growing interest in time-of-flight
More informationBBS654 Data Mining. Pinar Duygulu. Slides are adapted from Nazli Ikizler
BBS654 Data Mining Pinar Duygulu Slides are adapted from Nazli Ikizler 1 Classification Classification systems: Supervised learning Make a rational prediction given evidence There are several methods for
More informationCombining Top-down and Bottom-up Segmentation
Combining Top-down and Bottom-up Segmentation Authors: Eran Borenstein, Eitan Sharon, Shimon Ullman Presenter: Collin McCarthy Introduction Goal Separate object from background Problems Inaccuracies Top-down
More informationData: a collection of numbers or facts that require further processing before they are meaningful
Digital Image Classification Data vs. Information Data: a collection of numbers or facts that require further processing before they are meaningful Information: Derived knowledge from raw data. Something
More informationChapter 4: Analyzing Bivariate Data with Fathom
Chapter 4: Analyzing Bivariate Data with Fathom Summary: Building from ideas introduced in Chapter 3, teachers continue to analyze automobile data using Fathom to look for relationships between two quantitative
More informationAAL 217: DATA STRUCTURES
Chapter # 4: Hashing AAL 217: DATA STRUCTURES The implementation of hash tables is frequently called hashing. Hashing is a technique used for performing insertions, deletions, and finds in constant average
More informationDiscovery of the Source of Contaminant Release
Discovery of the Source of Contaminant Release Devina Sanjaya 1 Henry Qin Introduction Computer ability to model contaminant release events and predict the source of release in real time is crucial in
More informationWorksheet Answer Key: Scanning and Mapping Projects > Mine Mapping > Investigation 2
Worksheet Answer Key: Scanning and Mapping Projects > Mine Mapping > Investigation 2 Ruler Graph: Analyze your graph 1. Examine the shape formed by the connected dots. i. Does the connected graph create
More informationCross-validation. Cross-validation is a resampling method.
Cross-validation Cross-validation is a resampling method. It refits a model of interest to samples formed from the training set, in order to obtain additional information about the fitted model. For example,
More informationMachine Learning for Signal Processing Clustering. Bhiksha Raj Class Oct 2016
Machine Learning for Signal Processing Clustering Bhiksha Raj Class 11. 13 Oct 2016 1 Statistical Modelling and Latent Structure Much of statistical modelling attempts to identify latent structure in the
More informationUsing Machine Learning to Optimize Storage Systems
Using Machine Learning to Optimize Storage Systems Dr. Kiran Gunnam 1 Outline 1. Overview 2. Building Flash Models using Logistic Regression. 3. Storage Object classification 4. Storage Allocation recommendation
More information1. INTRODUCTION. AMS Subject Classification. 68U10 Image Processing
ANALYSING THE NOISE SENSITIVITY OF SKELETONIZATION ALGORITHMS Attila Fazekas and András Hajdu Lajos Kossuth University 4010, Debrecen PO Box 12, Hungary Abstract. Many skeletonization algorithms have been
More informationComputational Methods. Randomness and Monte Carlo Methods
Computational Methods Randomness and Monte Carlo Methods Manfred Huber 2010 1 Randomness and Monte Carlo Methods Introducing randomness in an algorithm can lead to improved efficiencies Random sampling
More informationReddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011
Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011 1. Introduction Reddit is one of the most popular online social news websites with millions
More informationStress Intensity Factors for Finite Width Plates
Stress Intensity Factors for Finite Width Plates 10 September 2013 AFGROW Workshop 2013 US Air Force Academy CAStLE Matthew Hammond * Scott Fawaz * James Greer, Technical Director, CAStLE ** *SAFE Inc.
More informationScan Matching. Pieter Abbeel UC Berkeley EECS. Many slides adapted from Thrun, Burgard and Fox, Probabilistic Robotics
Scan Matching Pieter Abbeel UC Berkeley EECS Many slides adapted from Thrun, Burgard and Fox, Probabilistic Robotics Scan Matching Overview Problem statement: Given a scan and a map, or a scan and a scan,
More informationData Statistics Population. Census Sample Correlation... Statistical & Practical Significance. Qualitative Data Discrete Data Continuous Data
Data Statistics Population Census Sample Correlation... Voluntary Response Sample Statistical & Practical Significance Quantitative Data Qualitative Data Discrete Data Continuous Data Fewer vs Less Ratio
More information3 Nonlinear Regression
CSC 4 / CSC D / CSC C 3 Sometimes linear models are not sufficient to capture the real-world phenomena, and thus nonlinear models are necessary. In regression, all such models will have the same basic
More informationTransactions on Modelling and Simulation vol 20, 1998 WIT Press, ISSN X
r-adaptive boundary element method Eisuke Kita, Kenichi Higuchi & Norio Kamiya Department of Mechano-Informatics and Systems, Nagoya University, Nagoya 464-01, Japan Email: kita@mech.nagoya-u.ac.jp Abstract
More informationThe Problem of Calculating Vertex Normals for Unevenly Subdivided Smooth Surfaces
The Problem of Calculating Vertex Normals for Unevenly Subdivided Smooth Surfaces Ted Schundler tschundler (a) gmail _ com Abstract: Simply averaging normals of the faces sharing a vertex does not produce
More informationComputer Experiments. Designs
Computer Experiments Designs Differences between physical and computer Recall experiments 1. The code is deterministic. There is no random error (measurement error). As a result, no replication is needed.
More informationA Random Number Based Method for Monte Carlo Integration
A Random Number Based Method for Monte Carlo Integration J Wang and G Harrell Department Math and CS, Valdosta State University, Valdosta, Georgia, USA Abstract - A new method is proposed for Monte Carlo
More informationThe latest trend of hybrid instrumentation
Multivariate Data Processing of Spectral Images: The Ugly, the Bad, and the True The results of various multivariate data-processing methods of Raman maps recorded with a dispersive Raman microscope are
More informationThe Fly & Anti-Fly Missile
The Fly & Anti-Fly Missile Rick Tilley Florida State University (USA) rt05c@my.fsu.edu Abstract Linear Regression with Gradient Descent are used in many machine learning applications. The algorithms are
More informationHidden-Surface Removal.
Hidden-Surface emoval. Here we need to discover whether an object is visible or another one obscures it. here are two fundamental approaches to remove the hidden surfaces: ) he object-space approach )
More informationFigure 1. Figure 2. The BOOTSTRAP
The BOOTSTRAP Normal Errors The definition of error of a fitted variable from the variance-covariance method relies on one assumption- that the source of the error is such that the noise measured has a
More informationSelect, Move, Rotate and Scale
Select, Move, Rotate and Scale In this tutorial we will cover the basic tools necessary for navigating, moving, and manipulating objects in 3DS Max. It is less of a how to tutorial and more of a guide
More informationRETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2
Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2014, 6, 1907-1911 1907 Web-Based Data Mining in System Design and Implementation Open Access Jianhu
More informationDecision Trees Oct
Decision Trees Oct - 7-2009 Previously We learned two different classifiers Perceptron: LTU KNN: complex decision boundary If you are a novice in this field, given a classification application, are these
More informationMultiple Regression White paper
+44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms
More information3 Nonlinear Regression
3 Linear models are often insufficient to capture the real-world phenomena. That is, the relation between the inputs and the outputs we want to be able to predict are not linear. As a consequence, nonlinear
More informationIntroduction to Mobile Robotics
Introduction to Mobile Robotics Clustering Wolfram Burgard Cyrill Stachniss Giorgio Grisetti Maren Bennewitz Christian Plagemann Clustering (1) Common technique for statistical data analysis (machine learning,
More informationExpensive to drill wells when all goes to plan. Drilling problems add costs throughout the well s life.
Beau Rollins Map View Cross Section View Expensive to drill wells when all goes to plan. Surface Location Drilling problems add costs throughout the well s life. The best scenario for minimizing costs
More informationChapter 7: Linear regression
Chapter 7: Linear regression Objective (1) Learn how to model association bet. 2 variables using a straight line (called "linear regression"). (2) Learn to assess the quality of regression models. (3)
More informationULTRASONIC DECONVOLUTION TOOLBOX User Manual v1.0
ULTRASONIC DECONVOLUTION TOOLBOX User Manual v1.0 Acknowledgment The authors would like to acknowledge the financial support of European Commission within the project SPIQNAR FIKS-CT-2000-00065 copyright
More informationClustering and Visualisation of Data
Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some
More informationChapter 1. Introduction
Chapter 1 Introduction A Monte Carlo method is a compuational method that uses random numbers to compute (estimate) some quantity of interest. Very often the quantity we want to compute is the mean of
More informationDI TRANSFORM. The regressive analyses. identify relationships
July 2, 2015 DI TRANSFORM MVstats TM Algorithm Overview Summary The DI Transform Multivariate Statistics (MVstats TM ) package includes five algorithm options that operate on most types of geologic, geophysical,
More informationCS5620 Intro to Computer Graphics
So Far wireframe hidden surfaces Next step 1 2 Light! Need to understand: How lighting works Types of lights Types of surfaces How shading works Shading algorithms What s Missing? Lighting vs. Shading
More informationLocally Weighted Learning for Control. Alexander Skoglund Machine Learning Course AASS, June 2005
Locally Weighted Learning for Control Alexander Skoglund Machine Learning Course AASS, June 2005 Outline Locally Weighted Learning, Christopher G. Atkeson et. al. in Artificial Intelligence Review, 11:11-73,1997
More informationCIE L*a*b* color model
CIE L*a*b* color model To further strengthen the correlation between the color model and human perception, we apply the following non-linear transformation: with where (X n,y n,z n ) are the tristimulus
More informationData Representation in Visualisation
Data Representation in Visualisation Visualisation Lecture 4 Taku Komura Institute for Perception, Action & Behaviour School of Informatics Taku Komura Data Representation 1 Data Representation We have
More informationThe Artifact Subspace Reconstruction method. Christian A Kothe SCCN / INC / UCSD January 2013
The Artifact Subspace Reconstruction method Christian A Kothe SCCN / INC / UCSD January 2013 Artifact Subspace Reconstruction New algorithm to remove non-stationary highvariance signals from EEG Reconstructs
More informationADAPTIVE FINITE ELEMENT
Finite Element Methods In Linear Structural Mechanics Univ. Prof. Dr. Techn. G. MESCHKE SHORT PRESENTATION IN ADAPTIVE FINITE ELEMENT Abdullah ALSAHLY By Shorash MIRO Computational Engineering Ruhr Universität
More informationHere is the probability distribution of the errors (i.e. the magnitude of the errorbars):
The BOOTSTRAP Normal Errors The definition of error of a fitted variable from the variance-covariance method relies on one assumption- that the source of the error is such that the noise measured has a
More informationSimulation Supported POD Methodology and Validation for Automated Eddy Current Procedures
4th International Symposium on NDT in Aerospace 2012 - Th.1.A.1 Simulation Supported POD Methodology and Validation for Automated Eddy Current Procedures Anders ROSELL, Gert PERSSON Volvo Aero Corporation,
More informationCS 229 Final Project - Using machine learning to enhance a collaborative filtering recommendation system for Yelp
CS 229 Final Project - Using machine learning to enhance a collaborative filtering recommendation system for Yelp Chris Guthrie Abstract In this paper I present my investigation of machine learning as
More informationCSC 2515 Introduction to Machine Learning Assignment 2
CSC 2515 Introduction to Machine Learning Assignment 2 Zhongtian Qiu(1002274530) Problem 1 See attached scan files for question 1. 2. Neural Network 2.1 Examine the statistics and plots of training error
More informationCS534: Introduction to Computer Vision Edges and Contours. Ahmed Elgammal Dept. of Computer Science Rutgers University
CS534: Introduction to Computer Vision Edges and Contours Ahmed Elgammal Dept. of Computer Science Rutgers University Outlines What makes an edge? Gradient-based edge detection Edge Operators Laplacian
More informationA Course in Machine Learning
A Course in Machine Learning Hal Daumé III 13 UNSUPERVISED LEARNING If you have access to labeled training data, you know what to do. This is the supervised setting, in which you have a teacher telling
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Agenda Features and Patterns The Curse of Size and
More informationEstimation of Item Response Models
Estimation of Item Response Models Lecture #5 ICPSR Item Response Theory Workshop Lecture #5: 1of 39 The Big Picture of Estimation ESTIMATOR = Maximum Likelihood; Mplus Any questions? answers Lecture #5:
More informationBased on Raymond J. Mooney s slides
Instance Based Learning Based on Raymond J. Mooney s slides University of Texas at Austin 1 Example 2 Instance-Based Learning Unlike other learning algorithms, does not involve construction of an explicit
More informationistrdyn - integrated Stress, Thermal, and Rotor Dynamics
istrdyn - integrated Stress, Thermal, and Rotor Dynamics Jeffcott Rotor Analysis Example istrdyn Modeling, Solutions, and Result Processing July 2007 This presentation shows an analysis sequence using
More informationEx.1 constructing tables. a) find the joint relative frequency of males who have a bachelors degree.
Two-way Frequency Tables two way frequency table- a table that divides responses into categories. Joint relative frequency- the number of times a specific response is given divided by the sample. Marginal
More informationSemester Final Report
CSUMS SemesterFinalReport InLaTex AnnKimball 5/20/2009 ThisreportisageneralsummaryoftheaccumulationofknowledgethatIhavegatheredthroughoutthis semester. I was able to get a birds eye view of many different
More informationEdge detection. Winter in Kraków photographed by Marcin Ryczek
Edge detection Winter in Kraków photographed by Marcin Ryczek Edge detection Goal: Identify sudden changes (discontinuities) in an image Intuitively, most semantic and shape information from the image
More informationCluster analysis. Agnieszka Nowak - Brzezinska
Cluster analysis Agnieszka Nowak - Brzezinska Outline of lecture What is cluster analysis? Clustering algorithms Measures of Cluster Validity What is Cluster Analysis? Finding groups of objects such that
More informationEconomics Nonparametric Econometrics
Economics 217 - Nonparametric Econometrics Topics covered in this lecture Introduction to the nonparametric model The role of bandwidth Choice of smoothing function R commands for nonparametric models
More informationTutorial: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and Expression measures
: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and February 24, 2014 Sample to Insight : RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and : RNA-Seq Analysis
More informationCPSC 340: Machine Learning and Data Mining
CPSC 340: Machine Learning and Data Mining Fundamentals of learning (continued) and the k-nearest neighbours classifier Original version of these slides by Mark Schmidt, with modifications by Mike Gelbart.
More informationTraining Intelligent Stoplights
Training Intelligent Stoplights Thomas Davids, Michael Celentano, and Luke Knepper December 14, 2012 1 Introduction Traffic is a huge problem for the American economy. In 2010, the average American commuter
More informationDocumentation for BayesAss 1.3
Documentation for BayesAss 1.3 Program Description BayesAss is a program that estimates recent migration rates between populations using MCMC. It also estimates each individual s immigrant ancestry, the
More informationUsing a multipoint interferometer to measure the orbital angular momentum of light
CHAPTER 3 Using a multipoint interferometer to measure the orbital angular momentum of light Recently it was shown that the orbital angular momentum of light can be measured using a multipoint interferometer,
More informationRaytracing & Epsilon. Today. Last Time? Forward Ray Tracing. Does Ray Tracing Simulate Physics? Local Illumination
Raytracing & Epsilon intersects light @ t = 25.2 intersects sphere1 @ t = -0.01 & Monte Carlo Ray Tracing intersects sphere1 @ t = 10.6 Solution: advance the ray start position epsilon distance along the
More informationRobert Collins CSE598G. Intro to Template Matching and the Lucas-Kanade Method
Intro to Template Matching and the Lucas-Kanade Method Appearance-Based Tracking current frame + previous location likelihood over object location current location appearance model (e.g. image template,
More informationComputer Graphics. Sampling Theory & Anti-Aliasing. Philipp Slusallek
Computer Graphics Sampling Theory & Anti-Aliasing Philipp Slusallek Dirac Comb (1) Constant & δ-function flash Comb/Shah function 2 Dirac Comb (2) Constant & δ-function Duality f(x) = K F(ω) = K (ω) And
More informationA Multiple-Line Fitting Algorithm Without Initialization Yan Guo
A Multiple-Line Fitting Algorithm Without Initialization Yan Guo Abstract: The commonest way to fit multiple lines is to use methods incorporate the EM algorithm. However, the EM algorithm dose not guarantee
More informationMapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008.
Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008. Introduction There is much that is unknown regarding
More informationTh A4 11 Interpretational Aspects of Multispectral Coherence
Th A4 11 Interpretational Aspects of Multispectral oherence K.J. Marfurt* (University of Oklahoma) Summary Seismic coherence volumes are routinely used to delineate geologic features that might otherwise
More informationRecommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu
Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu 1 Introduction Yelp Dataset Challenge provides a large number of user, business and review data which can be used for
More informationVisualizing Metaclusters
Visualizing Metaclusters Casey Smith May, 007 Introduction The metaclustering process results in many clusterings of a data set, a handful of which may be most useful to a user. Examining all the clusterings
More informationCS839: Probabilistic Graphical Models. Lecture 10: Learning with Partially Observed Data. Theo Rekatsinas
CS839: Probabilistic Graphical Models Lecture 10: Learning with Partially Observed Data Theo Rekatsinas 1 Partially Observed GMs Speech recognition 2 Partially Observed GMs Evolution 3 Partially Observed
More informationLogger Pro Resource Sheet
Logger Pro Resource Sheet Entering and Editing Data Data Collection How to Begin How to Store Multiple Runs Data Analysis How to Scale a Graph How to Determine the X- and Y- Data Points on a Graph How
More informationCAD - How Computer Can Aid Design?
CAD - How Computer Can Aid Design? Automating Drawing Generation Creating an Accurate 3D Model to Better Represent the Design and Allowing Easy Design Improvements Evaluating How Good is the Design and
More informationStatistics 1 - Basic Commands. Basic Commands. Consider the data set: {15, 22, 32, 31, 52, 41, 11}
Statistics 1 - Basic Commands http://mathbits.com/mathbits/tisection/statistics1/basiccommands.htm Page 1 of 3 Entering Data: Basic Commands Consider the data set: {15, 22, 32, 31, 52, 41, 11} Data is
More informationUnderstanding Clustering Supervising the unsupervised
Understanding Clustering Supervising the unsupervised Janu Verma IBM T.J. Watson Research Center, New York http://jverma.github.io/ jverma@us.ibm.com @januverma Clustering Grouping together similar data
More informationRobust PDF Table Locator
Robust PDF Table Locator December 17, 2016 1 Introduction Data scientists rely on an abundance of tabular data stored in easy-to-machine-read formats like.csv files. Unfortunately, most government records
More informationCIS 520, Machine Learning, Fall 2015: Assignment 7 Due: Mon, Nov 16, :59pm, PDF to Canvas [100 points]
CIS 520, Machine Learning, Fall 2015: Assignment 7 Due: Mon, Nov 16, 2015. 11:59pm, PDF to Canvas [100 points] Instructions. Please write up your responses to the following problems clearly and concisely.
More informationOLS Assumptions and Goodness of Fit
OLS Assumptions and Goodness of Fit A little warm-up Assume I am a poor free-throw shooter. To win a contest I can choose to attempt one of the two following challenges: A. Make three out of four free
More informationA Hybrid Magnetic Field Solver Using a Combined Finite Element/Boundary Element Field Solver
A Hybrid Magnetic Field Solver Using a Combined Finite Element/Boundary Element Field Solver Abstract - The dominant method to solve magnetic field problems is the finite element method. It has been used
More informationDecision Trees Dr. G. Bharadwaja Kumar VIT Chennai
Decision Trees Decision Tree Decision Trees (DTs) are a nonparametric supervised learning method used for classification and regression. The goal is to create a model that predicts the value of a target
More informationSUPPLEMENTARY INFORMATION
DOI: 1.138/NPHOTON.234 Detection and tracking of moving objects hidden from view Genevieve Gariepy, 1 Francesco Tonolini, 1 Robert Henderson, 2 Jonathan Leach 1 and Daniele Faccio 1 1 Institute of Photonics
More informationEdge detection. Winter in Kraków photographed by Marcin Ryczek
Edge detection Winter in Kraków photographed by Marcin Ryczek Edge detection Goal: Identify sudden changes (discontinuities) in an image Intuitively, edges carry most of the semantic and shape information
More informationSpatial Interpolation & Geostatistics
(Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 1 Tobler s Law All places are related, but nearby places are related more than distant places Corollary:
More information( ) =cov X Y = W PRINCIPAL COMPONENT ANALYSIS. Eigenvectors of the covariance matrix are the principal components
Review Lecture 14 ! PRINCIPAL COMPONENT ANALYSIS Eigenvectors of the covariance matrix are the principal components 1. =cov X Top K principal components are the eigenvectors with K largest eigenvalues
More informationObtaining Feature Correspondences
Obtaining Feature Correspondences Neill Campbell May 9, 2008 A state-of-the-art system for finding objects in images has recently been developed by David Lowe. The algorithm is termed the Scale-Invariant
More information