Finding Holes in Data

Size: px
Start display at page:

Download "Finding Holes in Data"

Transcription

1 Finding Holes in Data Leonid Pekelis Stat 208 June 18th, 2009 Abstract Attempts are made to employ persistent homology to infer topological properties of point cloud data. Two potential approaches are introducted and briefly evaluated, both relying on Monte Carlo estimation and the program PLEX [9]. First, homology signatures are compared to a null distribution that exhibits complete spatial randomness. This can be regarded as similar to hypothesis testing with the bootstrap. The second applies permutation testing to landmark selection. 1

2 1 Intro Topological Statistics is a relatively new and exciting field that lies in the intersection of Mathematics (Algebraic Topology) and Statistics. The main points are that methods from geometry and topology attempt to make precise the notions that concepts such as measures and coordinate systems should also be intrinsic to the data. There is no a priori reason that the Euclidean distance function and IR n are the best choices. Although the mathematical theory behind Topological Statistics has at least a solid foundation (see [2] for example), the statistical application is not to the same level of rigor.this paper employs the bootstrap in order to study the statistics of some topological methods used to study data sets. In order to present a complete argument, we briefly define some terminology and explain the general procedures behind estimating persistent homology of data. Much of what follows is taken from some combination of [2], [3], and [11], and contains no proofs whatsoever. Let X be a subset of IR N ; the space that our data lives in. And let X X be the, assumed random, sample of X that we observe. The purpose of persistent homology is to estimate the topological structure of X by using X to build a simplicial complex, S, that is a close approximatino to X itself. The simplicial complex is defined by three pieces of information: A vertex set Z X, with fixed, but arbirtrary ordering {z 0,...,z m } A rule that specifies when a p-simplex ω = [z 0,...,z p ] belongs to S And the recursive rule that if we define the faces of σ by the set of p- 1 simplexes U = {z [0]...z [p 1] : []is any permulation of0,...,p}, then U S. Note that a p-simplex is defined as the convex hull of a set of p + 1 independent points, or the p-dimensional analogue of a triangle. Standard examples of simplicial complexes include Cech, Rips and α- shape complexes. Generally, the vertex set is chosen to be Z = X. The problem is the resulting simplicial complex quickly becomes too unweildy for computation. For example, it is not uncommon for a Rips complex to have 10 times the number of points in the sample. The witness complex, defined next, instead operates by choosing a small set L X of landmark points (usually th to 20th of the sample) as the vertex set, and the rest of the unchosen points are witnesses in determining the simplexes forming S. We define the lazy witness complex,w t (X, ν), as follows: 2

3 The vertex set is L. For each xinx, define m(x) to be 0 for ν = 0, and the distance from x to the vth closest landmark, l L, for v > 0. The edge, or line, [ab] is then in W t (X, ν) if there exists a witness x X such that max{ a x, b x } t + m(x). A higher dimensional simplex is in W t (X, ν) if and only if all of it s edges are. There does not seem to be much theory motivating this definition, but de Silva and Carlsson (2004) note that for ν = 1, W t (X, ν) can be linked to Vitori-like regions surrounding each point. For the remainder of the paper we take ν = 1, and drop both the parameters ν and X, except for where the sample taken is not obvious. At the heart of topological statistics rests the fact that the witness complex, W t, can be characterized by a sequence of Betti numbers, b(w t ; t) = (b 0, b 0,...), for all t 0. Each integer, b k, corresponds to te number of k- dimensional holes found in W t. For example, b 1 corresponds to the number of 1-dimensional holes, or the number of closed curves (or loops) that cannot be deformed into a single point without tearing. Analogously, b 2, corresponds to when a closed 2-dimensional surface on the simplicial complex cannot be deformed. The b 0 number denotes the number of connected components, and can be seen analogous to the above descriptions when considering line segments in IR 1. Examples of Betti sequences include (1, 0, 0,...) for a single point, (1, 2, 1, 0, 0,...) for a torus, and (1, 0, 1, 0, 0,...) for a sphere. Furthermore, these sequences are usually not constant in t for the general simplicial complex W t. We do not go into details, but it should be intuitive from the definition of W t that many disconnected components should exist for small t, but that they will most likely merge to form bigger, but less numerous, connected components as t increases. At the same time, other features may appear only with larger t values. Thus, the persistent homology of W t can be defined by a series if intervals [t, t ] k, that dictate when a k- dimensional hole opens in the complex, and again closes. Combining such intervals for all holes observed produces a bar code of intervals. See Figure 5 in the appendix for an example barcode. The map back to Betti sequences from bar codes forms by selecting any value of t on the x-axis. The number of intervals that t is included in is the Betti sequence, b(w t ; t). In this sense, t can be thought of as an index of time, even though it really denotes a distance threshold in selecting simplexes. 3

4 The main assumption is that if a certain interval persists for a long time, it can be regarded as a genuine hole in the data, while shorter intervals are noise, either from the sample itself, or introduced in the homology algorithm. This paper attempts some headway into quantifying a long time. We propose, and evaluate, 2 potential approaches. The first borrows strength from spatial statistics and compares barcodes observed in a sample to a distribution of barcodes obtained from a process exhibiting no topological strucure in X. The second applies permutation testing to landmark selection. Both methods use Monte Carlo sampling to approximate underlying distributions. It should be commented that such sampling techniques are practically indispensible at this stage since hardly any theory behind persistent homology has been established. Furthermore, buth approaches exhibit reasonable assumptions, a priori. When testing for topological structure in X, our null hypothesis should be its absence: H 0 : X exhibits no topological structure. Although it is not entirely obvious what this means, we discuss in the next section why a homogeneous Poisson process may be a good approximation. Second, the largest source of uncertainty in the homology algorithm is the choice of landmark points. Currnet methods involves sequential max-min or random sampling [2], but neither method is particularly well supported, or even seems to give consistent results. We demonstrate this inconsistency, and test a third approach that involves a simple scheme of weighted permutations. The next section outlines the null-hypothesis method and presents some results concerning estimated distribution of barcodes for the homogeneous Poisson process, as well as one example. Section three gives findings for permutation tests. Section four concludes. 4

5 2 The Hypothesis Testing Approach 2.1 Definition and Null Distributions As per the null hypothesis presented earlier, we wish to compare the barcodes obtained from our sample X to a process with no topological structure. A candidate, the homogeneous Poisson process, P, is characterized by the following properties: [1] 1. The number N(P B) of points in any region B is distributed Poisson(λm(B)), where m is a measure on the space X, that P is defined on, and λ > If B 1 B 2 is the null set, then N(P B 1 ) N(P B 2 ) are independent. 3. Given N(P B) = n, the points n are independently and uniformly distributed in B. From the above properties we see that any topological structure is possible in any region, B X so long as m(b) > 0. Since the parameter λ and the space itself comletely define P, we can use these characterizations to define our test. Given the sample X X, let ˆλ = #{x X } and ˆX = X, be the parameters defining ˆP. The hope is then that such a process is a good approximation of P for X, which is itself a good approximation of H 0. Figure 1 demonstrates the distribution of a few parameter functions under H 0. To form the images, we selected 1000 random samples of P(λ, F), where λ = 50 and F = [0, 1] [0, 1]. Image (a) shows the distribution of lengths of b 0 intervals. This can be interpreted as the distribution of the length of time a connected component is expected to appear. Interestingly, it almost resembles a Poisson distribution. This makes some intuitive sense if we consider the b 0 interval length to be proportionate to the size of a cluster of points, which is distributed Poisson according to property 1. Parts (b) and (c) demonstrate the distribution of b 1 intervals, or 1-dimensional holes, and the time to coalesce, defined by the first time where the Betti code signature is (1, 0, 0,...). We see that 1-dimensional holes are rather uncommon when sampling from the null distribution, but it is possible for holes to form over varying lengths of t. The time to coalesce is instructive as after this time we are certain to see the barcode of a single point forever. This means that t has grown so large that 1 connected simplex covers the data and we cannot infer any topological structure afterwards. Again this parameter seems to be roughly Poisson or exponential in distribution. We do not make any further inferences other than to note that these result should be taken with 5

6 a grain of salt. We will later note that even though the homogeneous Poisson exhibits interesting barcode distributions, the lengths themselves are very small compared to data with any actual structure. Frequency Frequency β 0 means (t) (a) Average b 0 interval length β 1 means (t) (b) Average b 1 interval length Frequency Time to coalesce (t) (c) Average time to coalesce Figure 1: Approximated Distributions for various parameters of interest of the homogeneous Poisson Process. 6

7 2.2 The Circle Example The circle example is defined as follows. Pick n points uniformly distributed on a unit circle, and apply to each x i, i {1,...,n}, noise ǫ i N(0, ǫ) iid. In the first example, let n = 50 and ǫ {0,.5,.10,...,.95}. This gives 20 models, indexed by increasing noise (see Figure 6 in the appendix for an image of a sample dataset). If X i is a sample from model i, we let W i be the smallest rectangle parallel to the x and y axis that encompasses the data. Then the null-distribution process corresponding to X i is gives by P(λ = 50, W i ), and the achieved significanle level (ASL) is defined by ASL i = P H0 {ˆl b 1 ˆl b1 }, where l b1 denotes the average length of a b 1 interval. The idea is that we expect ASL i to increase with higher noise, corresponding to a lower probability of detecting structure. Such a result would give evidence to the sensitivity of persistent homology in detecting topological structure. The results are summarized in Table 1. Each null-distribution is obtained by 1000 samples from P(λ, W i ). We see that for the majority of models, the ASL is virtually zero. This corresponds to zero or 1 b 1 interval lengths being larger than the observed length in the circle data, given the null hypothesis. We also see two ASL values of 1, corresponding to the models of ǫ =.40 and.60. This occured because among the landmarks selected for those models, not once did a simplicial complex occur that encircled some area of IR 2. We attribute this to two possible explanations: one, that the random points were selected such that they all bunched up along one side of the circle, or, two, that the lard mark points were. The first hypothesis is not interesting. Since we see that longer b 1 intervals were correctly expressed for models with larger noise, it seems obvious that if we sampled data multiple times from the models, a sort of averaging of b 1 interval lengths would occur. The second hypothesis, though, deserves more investigation. In the current set of trials we chose a landmark set of 5 points samples without replacement from 50. It seems that the choice of these 5 points has a strong effect on the resulting barcode structure. In fact, the above trial took averages over 10 independent landmark selections for each model in order to get sensible results. Figure 2 (a) shows a boxplot of the b 1 lengths when each trial for each model is viewed separately, 10trials 20models = 200 values in all. Almost 70% of landmarks selected did not find evidence of a b 1 -hole at any level of t. The next trial demonstrates that this is most likely due to a combination of landmark size and number of sample points. We repeat the same test now 7

8 Model Noise ASL value Table 1: ASL values for n = 50 test. for 100 points in a unit curve, and landmarks of size 10 instead of 5. Figure 2 (b) shows the same box plot of all b 1 lengths for 10 trials under each model noise, ǫ i {0,...,.95}. The median length of a 1-dimensional hole is no longer 0. In face, only about 10% of all samples mistakenly did not observe a 1-dimensional hole for any interval. If we take this to be an estimate of type I error for testing the presense of 1-dimensional holes in the data, then taking 3 or 4 landmark sample should reduce the chance of effor sufficiently. It is reasonable to believe that increasing sample size and landmark amounts further would reduce the error rate even more, but we do not rule out a leveling-off effect. The ASL values for the second trial are not shown. It is enough to say that all were identically zero, but one which had a p-value of It seems that even low amounts of circular structure produce b 1 intervals that exceed those found under structureless conditions. In fact, one worries that the hypothesis test may be too strong to allow for comparison between samples. 8

9 (a) Sample size 50 (b) 100 Figure 2: Bot Plot of 200 b 1 interval lengths, 10 each sampled by landmark from model i in {1,...,20}. Avg β 1 interval length Avg β 1 interval length Model index (a) Data Points Model index (b) Bootstrap Nonparametric Regression Figure 3: Data Points and Nonparametric loess regression bootstrap for length of b 1 interval, with n = 100 data. A perhaps more interesting figure is 3, giving the average b 1 interval length among the 10 trials for each model. As theorized, we see what apears 9

10 to be a gradual decrease in the intervals lengths for which a 1-dimensional hole appeared. A local polynomial regression of degree 2 with a moving neighborhood of 75% of the data is superimposed on the points to suggest a trend. We examine this trend by running a simple pairwise bootstrap. Figure 3 (b) plots the resulting polynomial regression for 200 multinomial samples of the data points. A decreasing trend is clearly visible up to about a noise level of.4 to.5 (models 8-10), after which the interval length appears to level off. Note that this level still lies within the top 0.01 percentile fo b 1 lengths under the null hypothesis. Intuitively, with more noise, landmarks have more trouble closing around a central hole as they are inflienced noisy witnesses and hence the underlying structure is expressed less. We have seen that the size of a landmark set and sample have a reasonable effect on 2 dimensional data. Also, the persistent homology algorithm seems to do well in detectin a single 1-dimensional hole in the data described. The next section attempts to apply permutation of landmark sets on real data in many dimensions. 10

11 3 Landmark Permutation It has been of interest to the author to use the persistent homology algorithm on a set of real data. The notable examples of applications to real data have so far been largely confined to two examples: neuron firing in the visual cortex [11], and a database of high contrast images [4]. A further, related example used the Mapper algorithm to compare three dimensional meshes of various figures [10]. In this section, we attempt a topological examination of the dataset - noisyapples. The data consists of NIR spectroscopy signals measured on 5 different types of apples, at 51 uniformy spaced frequencies with added white noise. There are 5 varieties of apples: Red Delicious, Golden Delicious, Rome, Granny Smith, and Macintosh. As can be shown with standard techniques (smoothed PCA for example) the five varieties can be roughly clustered according to NIR signature. Since the 0th betti interval measures connected components, it would seem plausible that one could test for clustering in a data set. If the data contained more than one cluster, it seems plausible that it could be described by multiple simplicial complexes, one at each cluster. Topologically speaking, this corresponds to a b 0 value equal to the number of clusters. In terms of persistent homology, if the barcode of a sample has a long period of time where more than 1 b 0 interval overlaps, one could conclude that there is some clustering in the data. Moreover, since it is possible to extract the simplicial complex itself (although we do not do it here), the location and even rough shape of the cluster can be found. Persistent homology also has the advantage of being unsupervised as compared to other grouping methods such as LDA. Hence it could be useful where collecting any training data is too costly. There are a number of issues that must be confronted to see if persistent homology can be used for clustering. In the following paragraphs we review some of these issues and pose potential solutions for further study. The term long is used abstractly above as no definitive measures exist for significance of barcodes. How do we tell if an interval is long enough to be considered intrinsic to the data, or simply noise from the algorithm? The null hypothesis approach of the previous section could be applied here, but is currently untested. Also, there is still question as to whether a homogeneous poisson process is a correct choice for the null. Second, it was shown previously that homology results are sensitive to the particular choice of landmark set, at least when data sets are small. It appears that parameters may become more robust as sample size increases, 11

12 but there is currently no evidence on what an appropriate sample size may be. One direction may be a study on how robustness to random landmark choices changes with increased sample size. It would be instructive to compare results to other landmark choosing algorithms, such as maxmin. The other, which we illustrate below, instead attempts to reduce random landmark error directly, by permutation resampling of landmark sets. In other words, make many landmark choices and describe parameter results in terms of the resulting distribuion. The following presents an introduction to this approach. Figure 4 shows the results of 100 independent landmark selections. Each landmark consisted of 50 points selected without replacement from the 320 NIR signals observed. The y axis records the number of b 0 intervals observed at any point in time. Now the horizontal bars correspond to the length of time n connected components are observed in the sample. The x axis terminates at the maximum time to coalesce (described in the previous section) among all 100 resamples. Figure (a) plots results for all 100 landmarks simultaneously. Here the transparency of any single point corresponds to how many landmark samples had that many connected components at that particular point in time. Figure (b) shows the middle 90th percent quantile, calculated pointwise in each bar. For reference an arrow points to 5 connected components, the desired result. We do not attempt any description of the results, other than to say it is expected that many connected components are seen while the threshold is very low. Also, 50 is the highest b 0 number, corresponding to each landmark having its own complex. Finally, the resemblence to a transposed plot of explained variances for PCA suggests looking for an elbow may also be possible here. 12

13 90% CI β 0 barcodes # Connected Components Correct number t (a) (b) Figure 4: b 0 interval distributions for 100 independent random landmark choices of noisyapples data. 13

14 4 Conclusion In this study we examined the uses of Monte Carlo sampling in studying the persistent homology of data. The results were largely inconclusive. On the one hand, it looks as though the bar code distributions under a null hypothesis of no topological structure are concentrated around interval lengths that are too small to compare to any data with structure. On the other hand, this does suggest that if data can be characterized by having n-dimensional holes then null hypothesis testing will detect it. The problem of effectively selecting landmarks sets still exists. One possible avenue for future study is to weigh the results from a landmark set by the value max z Z {min l L z l }, which can be thought of as respresenting how poorly the landmark set was chosen. Another future test will be to take arbitrary rotations of low dimensional figures (with noise) in a high dimensional space and see if the correct bar code values are still expressed. Of course, there is no end to such toy examples. The real interest lies in devising methods to employ persistent homology in studying arbitrary data sets. We have presented the possibility that topological structure is present in any high dimensional data set, but it is as open question as to how best to harness this structure in describing data. One existing method, Mapper, identified earlier and described in Singh et. al. (2007), has effectively used persistent homology analysis to create simplicial complexes that illustrate the structure of data. It may be instructive to see if Monte Carlo sampling can be of help. 14

15 5 Appendix For the analysis in this paper we use a combination of the Java-based software JPLEX ( the spatstat R package by Adrian Baddeley and Rolf Turner ( and self created R and Java code. For the latter 2 see attached code appendix. 5.1 Images Figure 5: Sample Barcodes from PLEX. 15

16 Noise sd = 0 Noise sd = 0.1 Noise sd = 0.2 Noise sd = Noise sd = Noise sd = Noise sd = 0.25 Noise sd = (a) Noise sd = 0.4 Noise sd = 0.5 Noise sd = 0.6 Noise sd = Noise sd = Noise sd = Noise sd = 0.65 Noise sd = (b) Figure 6: Example of Circle data used in Section 2.2

17 References [1] Adrian Baddeley. Analysing spatial point patterns in r. Workshop Notes Version 3, [2] Gunnar Carlsson. Topology and data. Bull. Amer. Math. Soc. (N.S.), 46(2): , [3] Gunnar Carlsson and Vin de Silva. Topological estimation using witness complexes. In M. Alexa and S. Rusinkiewicz, editors, Eurographics Symposium on Point-Based Graphics (2004), page na, [4] Gunnar Carlsson, Tigran Ishkhanov, Vin Silva, and Afra Zomorodian. On the local behavior of spaces of natural images. Int. J. Comput. Vision, 76(1):1 12, [5] M. Greenbug and Harper J. Algebraic Topology. Westview Press, [6] Trevor Hastie and Patrice Y. Simard. Metrics and models for handwritten character recognition. Statistical Science, 13(1):54 65, [7] V. Sankrithi Krishnan. An introduction to category theory / V. Sankrithi Krishnan. North Holland, New York :, [8] J Munkres. Topology: A First Course. Prentice Hall College Div, [9] Parry P. and de Silva V. Plex: Simplicial complexes in matlab. available at [10] Gurjeet Singh, Facundo Memoli, and Gunnar Carlsson. Topological Methods for the Analysis of High Dimensional Data Sets and 3D Object Recognition. In Point Based Graphics 2007, pages , Prague, [11] Gurjeet Singh, Facundo Memoli, Tigran Ishkhanov, Guillermo Sapiro, Gunnar Carlsson, and Dario L. Ringach. Topological analysis of population activity in visual cortex. J. Vis., 8(8):1 18,

Topological Classification of Data Sets without an Explicit Metric

Topological Classification of Data Sets without an Explicit Metric Topological Classification of Data Sets without an Explicit Metric Tim Harrington, Andrew Tausz and Guillaume Troianowski December 10, 2008 A contemporary problem in data analysis is understanding the

More information

JPlex. CSRI Workshop on Combinatorial Algebraic Topology August 29, 2009 Henry Adams

JPlex. CSRI Workshop on Combinatorial Algebraic Topology August 29, 2009 Henry Adams JPlex CSRI Workshop on Combinatorial Algebraic Topology August 29, 2009 Henry Adams Plex: Vin de Silva and Patrick Perry (2000-2006) JPlex: Harlan Sexton and Mikael Vejdemo- Johansson (2008-present) JPlex

More information

Topological Data Analysis

Topological Data Analysis Topological Data Analysis Deepak Choudhary(11234) and Samarth Bansal(11630) April 25, 2014 Contents 1 Introduction 2 2 Barcodes 2 2.1 Simplical Complexes.................................... 2 2.1.1 Representation

More information

Topology and the Analysis of High-Dimensional Data

Topology and the Analysis of High-Dimensional Data Topology and the Analysis of High-Dimensional Data Workshop on Algorithms for Modern Massive Data Sets June 23, 2006 Stanford Gunnar Carlsson Department of Mathematics Stanford University Stanford, California

More information

Analysis of high dimensional data via Topology. Louis Xiang. Oak Ridge National Laboratory. Oak Ridge, Tennessee

Analysis of high dimensional data via Topology. Louis Xiang. Oak Ridge National Laboratory. Oak Ridge, Tennessee Analysis of high dimensional data via Topology Louis Xiang Oak Ridge National Laboratory Oak Ridge, Tennessee Contents Abstract iii 1 Overview 1 2 Data Set 1 3 Simplicial Complex 5 4 Computation of homology

More information

On the Nonlinear Statistics of Range Image Patches

On the Nonlinear Statistics of Range Image Patches SIAM J. IMAGING SCIENCES Vol. 2, No. 1, pp. 11 117 c 29 Society for Industrial and Applied Mathematics On the Nonlinear Statistics of Range Image Patches Henry Adams and Gunnar Carlsson Abstract. In [A.

More information

JPlex Software Demonstration. AMS Short Course on Computational Topology New Orleans Jan 4, 2011 Henry Adams Stanford University

JPlex Software Demonstration. AMS Short Course on Computational Topology New Orleans Jan 4, 2011 Henry Adams Stanford University JPlex Software Demonstration AMS Short Course on Computational Topology New Orleans Jan 4, 2011 Henry Adams Stanford University What does JPlex do? Input: a filtered simplicial complex or finite metric

More information

An Analysis of Spaces of Range Image Small Patches

An Analysis of Spaces of Range Image Small Patches Send Orders for Reprints to reprints@benthamscience.ae The Open Cybernetics & Systemics Journal, 2015, 9, 275-279 275 An Analysis of Spaces of Range Image Small Patches Open Access Qingli Yin 1,* and Wen

More information

On the Local Behavior of Spaces of Natural Images

On the Local Behavior of Spaces of Natural Images On the Local Behavior of Spaces of Natural Images Gunnar Carlsson Tigran Ishkhanov Vin de Silva Afra Zomorodian Abstract In this study we concentrate on qualitative topological analysis of the local behavior

More information

Topological estimation using witness complexes. Vin de Silva, Stanford University

Topological estimation using witness complexes. Vin de Silva, Stanford University Topological estimation using witness complexes, Acknowledgements Gunnar Carlsson (Mathematics, Stanford) principal collaborator Afra Zomorodian (CS/Robotics, Stanford) persistent homology software Josh

More information

Western TDA Learning Seminar. June 7, 2018

Western TDA Learning Seminar. June 7, 2018 The The Western TDA Learning Seminar Department of Mathematics Western University June 7, 2018 The The is an important tool used in TDA for data visualization. Input point cloud; filter function; covering

More information

Topological Data Analysis - I. Afra Zomorodian Department of Computer Science Dartmouth College

Topological Data Analysis - I. Afra Zomorodian Department of Computer Science Dartmouth College Topological Data Analysis - I Afra Zomorodian Department of Computer Science Dartmouth College September 3, 2007 1 Acquisition Vision: Images (2D) GIS: Terrains (3D) Graphics: Surfaces (3D) Medicine: MRI

More information

Observing Information: Applied Computational Topology.

Observing Information: Applied Computational Topology. Observing Information: Applied Computational Topology. Bangor University, and NUI Galway April 21, 2008 What is the geometric information that can be gleaned from a data cloud? Some ideas either already

More information

Stratified Structure of Laplacian Eigenmaps Embedding

Stratified Structure of Laplacian Eigenmaps Embedding Stratified Structure of Laplacian Eigenmaps Embedding Abstract We construct a locality preserving weight matrix for Laplacian eigenmaps algorithm used in dimension reduction. Our point cloud data is sampled

More information

Topological Data Analysis

Topological Data Analysis Mikael Vejdemo-Johansson School of Computer Science University of St Andrews Scotland 22 November 2011 M Vejdemo-Johansson (St Andrews) TDA 22 November 2011 1 / 32 e Shape of Data Outline 1 e Shape of

More information

Simplicial Complexes: Second Lecture

Simplicial Complexes: Second Lecture Simplicial Complexes: Second Lecture 4 Nov, 2010 1 Overview Today we have two main goals: Prove that every continuous map between triangulable spaces can be approximated by a simplicial map. To do this,

More information

arxiv: v2 [cs.cg] 8 Feb 2010 Abstract

arxiv: v2 [cs.cg] 8 Feb 2010 Abstract Topological De-Noising: Strengthening the Topological Signal Jennifer Kloke and Gunnar Carlsson arxiv:0910.5947v2 [cs.cg] 8 Feb 2010 Abstract February 8, 2010 Topological methods, including persistent

More information

A noninformative Bayesian approach to small area estimation

A noninformative Bayesian approach to small area estimation A noninformative Bayesian approach to small area estimation Glen Meeden School of Statistics University of Minnesota Minneapolis, MN 55455 glen@stat.umn.edu September 2001 Revised May 2002 Research supported

More information

On the Topology of Finite Metric Spaces

On the Topology of Finite Metric Spaces On the Topology of Finite Metric Spaces Meeting in Honor of Tom Goodwillie Dubrovnik Gunnar Carlsson, Stanford University June 27, 2014 Data has shape The shape matters Shape of Data Regression Shape of

More information

A TESSELLATION FOR ALGEBRAIC SURFACES IN CP 3

A TESSELLATION FOR ALGEBRAIC SURFACES IN CP 3 A TESSELLATION FOR ALGEBRAIC SURFACES IN CP 3 ANDREW J. HANSON AND JI-PING SHA In this paper we present a systematic and explicit algorithm for tessellating the algebraic surfaces (real 4-manifolds) F

More information

Mapper, Manifolds, and More! Topological Data Analysis and Mapper

Mapper, Manifolds, and More! Topological Data Analysis and Mapper Mapper, Manifolds, and More! Topological Data Analysis and Mapper Matt Piekenbrock Data Science and Security Cluster (DSSC) Talk: March 28th, 2018 Topology Topology is a branch of mathematics concerned

More information

Homology and Persistent Homology Bootcamp Notes 2017

Homology and Persistent Homology Bootcamp Notes 2017 Homology and Persistent Homology Bootcamp Notes Summer@ICERM 2017 Melissa McGuirl September 19, 2017 1 Preface This bootcamp is intended to be a review session for the material covered in Week 1 of Summer@ICERM

More information

More Summer Program t-shirts

More Summer Program t-shirts ICPSR Blalock Lectures, 2003 Bootstrap Resampling Robert Stine Lecture 2 Exploring the Bootstrap Questions from Lecture 1 Review of ideas, notes from Lecture 1 - sample-to-sample variation - resampling

More information

Euler s Theorem. Brett Chenoweth. February 26, 2013

Euler s Theorem. Brett Chenoweth. February 26, 2013 Euler s Theorem Brett Chenoweth February 26, 2013 1 Introduction This summer I have spent six weeks of my holidays working on a research project funded by the AMSI. The title of my project was Euler s

More information

Chapter 4. Clustering Core Atoms by Location

Chapter 4. Clustering Core Atoms by Location Chapter 4. Clustering Core Atoms by Location In this chapter, a process for sampling core atoms in space is developed, so that the analytic techniques in section 3C can be applied to local collections

More information

9 About Intersection Graphs

9 About Intersection Graphs 9 About Intersection Graphs Since this lecture we focus on selected detailed topics in Graph theory that are close to your teacher s heart... The first selected topic is that of intersection graphs, i.e.

More information

Preferred directions for resolving the non-uniqueness of Delaunay triangulations

Preferred directions for resolving the non-uniqueness of Delaunay triangulations Preferred directions for resolving the non-uniqueness of Delaunay triangulations Christopher Dyken and Michael S. Floater Abstract: This note proposes a simple rule to determine a unique triangulation

More information

Data Analysis, Persistent homology and Computational Morse-Novikov theory

Data Analysis, Persistent homology and Computational Morse-Novikov theory Data Analysis, Persistent homology and Computational Morse-Novikov theory Dan Burghelea Department of mathematics Ohio State University, Columbuus, OH Bowling Green, November 2013 AMN theory (computational

More information

arxiv: v1 [cs.it] 10 May 2016

arxiv: v1 [cs.it] 10 May 2016 Separating Topological Noise from Features using Persistent Entropy Nieves Atienza 1, Rocio Gonzalez-Diaz 1, and Matteo Rucco 2 arxiv:1605.02885v1 [cs.it] 10 May 2016 1 Applied Math Department, School

More information

TOPOLOGICAL DATA ANALYSIS

TOPOLOGICAL DATA ANALYSIS TOPOLOGICAL DATA ANALYSIS BARCODES Ghrist, Barcodes: The persistent topology of data Topaz, Ziegelmeier, and Halverson 2015: Topological Data Analysis of Biological Aggregation Models 1 Questions in data

More information

One-Point Geometric Crossover

One-Point Geometric Crossover One-Point Geometric Crossover Alberto Moraglio School of Computing and Center for Reasoning, University of Kent, Canterbury, UK A.Moraglio@kent.ac.uk Abstract. Uniform crossover for binary strings has

More information

Clustering and Visualisation of Data

Clustering and Visualisation of Data Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some

More information

On the Local Behavior of Spaces of Natural Images

On the Local Behavior of Spaces of Natural Images Int J Comput Vis (2008) 76: 1 12 DOI 10.1007/s11263-007-0056-x POSITION PAPER On the Local Behavior of Spaces of Natural Images Gunnar Carlsson Tigran Ishkhanov Vin de Silva Afra Zomorodian Received: 19

More information

Clustering algorithms and introduction to persistent homology

Clustering algorithms and introduction to persistent homology Foundations of Geometric Methods in Data Analysis 2017-18 Clustering algorithms and introduction to persistent homology Frédéric Chazal INRIA Saclay - Ile-de-France frederic.chazal@inria.fr Introduction

More information

Manifolds. Chapter X. 44. Locally Euclidean Spaces

Manifolds. Chapter X. 44. Locally Euclidean Spaces Chapter X Manifolds 44. Locally Euclidean Spaces 44 1. Definition of Locally Euclidean Space Let n be a non-negative integer. A topological space X is called a locally Euclidean space of dimension n if

More information

CLASSIFICATION OF SURFACES

CLASSIFICATION OF SURFACES CLASSIFICATION OF SURFACES JUSTIN HUANG Abstract. We will classify compact, connected surfaces into three classes: the sphere, the connected sum of tori, and the connected sum of projective planes. Contents

More information

Lofting 3D Shapes. Abstract

Lofting 3D Shapes. Abstract Lofting 3D Shapes Robby Prescott Department of Computer Science University of Wisconsin Eau Claire Eau Claire, Wisconsin 54701 robprescott715@gmail.com Chris Johnson Department of Computer Science University

More information

Persistence stability for geometric complexes

Persistence stability for geometric complexes Lake Tahoe - December, 8 2012 NIPS workshop on Algebraic Topology and Machine Learning Persistence stability for geometric complexes Frédéric Chazal Joint work with Vin de Silva and Steve Oudot (+ on-going

More information

CSE 5559 Computational Topology: Theory, algorithms, and applications to data analysis. Lecture 0: Introduction. Instructor: Yusu Wang

CSE 5559 Computational Topology: Theory, algorithms, and applications to data analysis. Lecture 0: Introduction. Instructor: Yusu Wang CSE 5559 Computational Topology: Theory, algorithms, and applications to data analysis Lecture 0: Introduction Instructor: Yusu Wang Lecture 0: Introduction What is topology Why should we be interested

More information

Clique homological simplification problem is NP-hard

Clique homological simplification problem is NP-hard Clique homological simplification problem is NP-hard NingNing Peng Department of Mathematics, Wuhan University of Technology pnn@whut.edu.cn Foundations of Mathematics September 9, 2017 Outline 1 Homology

More information

A Multicover Nerve for Geometric Inference

A Multicover Nerve for Geometric Inference A Multicover Nerve for Geometric Inference Donald R. Sheehy Abstract We show that filtering the barycentric decomposition of a Čech complex by the cardinality of the vertices captures precisely the topology

More information

8.B. The result of Regiomontanus on tetrahedra

8.B. The result of Regiomontanus on tetrahedra 8.B. The result of Regiomontanus on tetrahedra We have already mentioned that Plato s theory that the five regular polyhedra represent the fundamental elements of nature, and in supplement (3.D) to the

More information

A Flavor of Topology. Shireen Elhabian and Aly A. Farag University of Louisville January 2010

A Flavor of Topology. Shireen Elhabian and Aly A. Farag University of Louisville January 2010 A Flavor of Topology Shireen Elhabian and Aly A. Farag University of Louisville January 2010 In 1670 s I believe that we need another analysis properly geometric or linear, which treats place directly

More information

Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis

Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis Xavier Le Faucheur a, Brani Vidakovic b and Allen Tannenbaum a a School of Electrical and Computer Engineering, b Department of Biomedical

More information

A Singular Example for the Averaged Mean Curvature Flow

A Singular Example for the Averaged Mean Curvature Flow To appear in Experimental Mathematics Preprint Vol. No. () pp. 3 7 February 9, A Singular Example for the Averaged Mean Curvature Flow Uwe F. Mayer Abstract An embedded curve is presented which under numerical

More information

Algebraic Topology: A brief introduction

Algebraic Topology: A brief introduction Algebraic Topology: A brief introduction Harish Chintakunta This chapter is intended to serve as a brief, and far from comprehensive, introduction to Algebraic Topology to help the reading flow of this

More information

Statistical properties of topological information inferred from data

Statistical properties of topological information inferred from data Saclay, July 8, 2014 DataSense Research day Statistical properties of topological information inferred from data Frédéric Chazal INRIA Saclay Joint works with B.T. Fasy (CMU), M. Glisse (INRIA), C. Labruère

More information

CS 534: Computer Vision Segmentation and Perceptual Grouping

CS 534: Computer Vision Segmentation and Perceptual Grouping CS 534: Computer Vision Segmentation and Perceptual Grouping Ahmed Elgammal Dept of Computer Science CS 534 Segmentation - 1 Outlines Mid-level vision What is segmentation Perceptual Grouping Segmentation

More information

4. Simplicial Complexes and Simplicial Homology

4. Simplicial Complexes and Simplicial Homology MATH41071/MATH61071 Algebraic topology Autumn Semester 2017 2018 4. Simplicial Complexes and Simplicial Homology Geometric simplicial complexes 4.1 Definition. A finite subset { v 0, v 1,..., v r } R n

More information

High Dimensional Indexing by Clustering

High Dimensional Indexing by Clustering Yufei Tao ITEE University of Queensland Recall that, our discussion so far has assumed that the dimensionality d is moderately high, such that it can be regarded as a constant. This means that d should

More information

Ranking Clustered Data with Pairwise Comparisons

Ranking Clustered Data with Pairwise Comparisons Ranking Clustered Data with Pairwise Comparisons Alisa Maas ajmaas@cs.wisc.edu 1. INTRODUCTION 1.1 Background Machine learning often relies heavily on being able to rank the relative fitness of instances

More information

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs Advanced Operations Research Techniques IE316 Quiz 1 Review Dr. Ted Ralphs IE316 Quiz 1 Review 1 Reading for The Quiz Material covered in detail in lecture. 1.1, 1.4, 2.1-2.6, 3.1-3.3, 3.5 Background material

More information

Topology and Data. Gunnar Carlsson Department of Mathematics, Stanford University Stanford, California October 2, 2008

Topology and Data. Gunnar Carlsson Department of Mathematics, Stanford University Stanford, California October 2, 2008 Topology and Data Gunnar Carlsson Department of Mathematics, Stanford University Stanford, California 94305 October 2, 2008 1 Introduction An important feature of modern science and engineering is that

More information

66 III Complexes. R p (r) }.

66 III Complexes. R p (r) }. 66 III Complexes III.4 Alpha Complexes In this section, we use a radius constraint to introduce a family of subcomplexes of the Delaunay complex. These complexes are similar to the Čech complexes but differ

More information

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov Structured Light II Johannes Köhler Johannes.koehler@dfki.de Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov Introduction Previous lecture: Structured Light I Active Scanning Camera/emitter

More information

EXTREME POINTS AND AFFINE EQUIVALENCE

EXTREME POINTS AND AFFINE EQUIVALENCE EXTREME POINTS AND AFFINE EQUIVALENCE The purpose of this note is to use the notions of extreme points and affine transformations which are studied in the file affine-convex.pdf to prove that certain standard

More information

arxiv: v5 [cs.dm] 9 May 2016

arxiv: v5 [cs.dm] 9 May 2016 Tree spanners of bounded degree graphs Ioannis Papoutsakis Kastelli Pediados, Heraklion, Crete, reece, 700 06 October 21, 2018 arxiv:1503.06822v5 [cs.dm] 9 May 2016 Abstract A tree t-spanner of a graph

More information

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order. Chapter 2 2.1 Descriptive Statistics A stem-and-leaf graph, also called a stemplot, allows for a nice overview of quantitative data without losing information on individual observations. It can be a good

More information

FUZZY INFERENCE SYSTEMS

FUZZY INFERENCE SYSTEMS CHAPTER-IV FUZZY INFERENCE SYSTEMS Fuzzy inference is the process of formulating the mapping from a given input to an output using fuzzy logic. The mapping then provides a basis from which decisions can

More information

Chordal deletion is fixed-parameter tractable

Chordal deletion is fixed-parameter tractable Chordal deletion is fixed-parameter tractable Dániel Marx Institut für Informatik, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany. dmarx@informatik.hu-berlin.de Abstract. It

More information

The equal tangents property

The equal tangents property The equal tangents property J. Jerónimo-Castro, G. Ruiz-Hernández and S. Tabachnikov April 29, 2012 Abstract Let M be a C 2 -smooth strictly convex closed surface in R 3 and denote by H the set of points

More information

10701 Machine Learning. Clustering

10701 Machine Learning. Clustering 171 Machine Learning Clustering What is Clustering? Organizing data into clusters such that there is high intra-cluster similarity low inter-cluster similarity Informally, finding natural groupings among

More information

Power Functions and Their Use In Selecting Distance Functions for. Document Degradation Model Validation. 600 Mountain Avenue, Room 2C-322

Power Functions and Their Use In Selecting Distance Functions for. Document Degradation Model Validation. 600 Mountain Avenue, Room 2C-322 Power Functions and Their Use In Selecting Distance Functions for Document Degradation Model Validation Tapas Kanungo y ; Robert M. Haralick y and Henry S. Baird z y Department of Electrical Engineering,

More information

SOM+EOF for Finding Missing Values

SOM+EOF for Finding Missing Values SOM+EOF for Finding Missing Values Antti Sorjamaa 1, Paul Merlin 2, Bertrand Maillet 2 and Amaury Lendasse 1 1- Helsinki University of Technology - CIS P.O. Box 5400, 02015 HUT - Finland 2- Variances and

More information

Random Simplicial Complexes

Random Simplicial Complexes Random Simplicial Complexes Duke University CAT-School 2015 Oxford 8/9/2015 Part I Random Combinatorial Complexes Contents Introduction The Erdős Rényi Random Graph The Random d-complex The Random Clique

More information

Hidden Loop Recovery for Handwriting Recognition

Hidden Loop Recovery for Handwriting Recognition Hidden Loop Recovery for Handwriting Recognition David Doermann Institute of Advanced Computer Studies, University of Maryland, College Park, USA E-mail: doermann@cfar.umd.edu Nathan Intrator School of

More information

THE BASIC THEORY OF PERSISTENT HOMOLOGY

THE BASIC THEORY OF PERSISTENT HOMOLOGY THE BASIC THEORY OF PERSISTENT HOMOLOGY KAIRUI GLEN WANG Abstract Persistent homology has widespread applications in computer vision and image analysis This paper first motivates the use of persistent

More information

Equation to LaTeX. Abhinav Rastogi, Sevy Harris. I. Introduction. Segmentation.

Equation to LaTeX. Abhinav Rastogi, Sevy Harris. I. Introduction. Segmentation. Equation to LaTeX Abhinav Rastogi, Sevy Harris {arastogi,sharris5}@stanford.edu I. Introduction Copying equations from a pdf file to a LaTeX document can be time consuming because there is no easy way

More information

Exploring Domains of Approximation in R 2 : Expository Essay

Exploring Domains of Approximation in R 2 : Expository Essay Exploring Domains of Approximation in R 2 : Expository Essay Nicolay Postarnakevich August 12, 2013 1 Introduction In this paper I explore the concept of the Domains of Best Approximations. These structures

More information

Conjectures concerning the geometry of 2-point Centroidal Voronoi Tessellations

Conjectures concerning the geometry of 2-point Centroidal Voronoi Tessellations Conjectures concerning the geometry of 2-point Centroidal Voronoi Tessellations Emma Twersky May 2017 Abstract This paper is an exploration into centroidal Voronoi tessellations, or CVTs. A centroidal

More information

coding of various parts showing different features, the possibility of rotation or of hiding covering parts of the object's surface to gain an insight

coding of various parts showing different features, the possibility of rotation or of hiding covering parts of the object's surface to gain an insight Three-Dimensional Object Reconstruction from Layered Spatial Data Michael Dangl and Robert Sablatnig Vienna University of Technology, Institute of Computer Aided Automation, Pattern Recognition and Image

More information

Fractals: Self-Similarity and Fractal Dimension Math 198, Spring 2013

Fractals: Self-Similarity and Fractal Dimension Math 198, Spring 2013 Fractals: Self-Similarity and Fractal Dimension Math 198, Spring 2013 Background Fractal geometry is one of the most important developments in mathematics in the second half of the 20th century. Fractals

More information

Sheaf Theory Approach to Distributed Applications: Analysing Heterogeneous Data in Air Traffic Monitoring

Sheaf Theory Approach to Distributed Applications: Analysing Heterogeneous Data in Air Traffic Monitoring International Journal of Data Science and Analysis 2017; 3(5): 34-39 http://www.sciencepublishinggroup.com/j/ijdsa doi: 10.11648/j.ijdsa.20170305.11 ISSN: 2575-1883 (Print); ISSN: 2575-1891 (Online) Sheaf

More information

6. Concluding Remarks

6. Concluding Remarks [8] K. J. Supowit, The relative neighborhood graph with an application to minimum spanning trees, Tech. Rept., Department of Computer Science, University of Illinois, Urbana-Champaign, August 1980, also

More information

Statistical Methods in Topological Data Analysis for Complex, High-Dimensional Data

Statistical Methods in Topological Data Analysis for Complex, High-Dimensional Data Libraries Conference on Applied Statistics in Agriculture 2015-27th Annual Conference Proceedings Statistical Methods in Topological Data Analysis for Complex, High-Dimensional Data Patrick S. Medina Purdue

More information

CRF Based Point Cloud Segmentation Jonathan Nation

CRF Based Point Cloud Segmentation Jonathan Nation CRF Based Point Cloud Segmentation Jonathan Nation jsnation@stanford.edu 1. INTRODUCTION The goal of the project is to use the recently proposed fully connected conditional random field (CRF) model to

More information

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,

More information

Computational Topology in Neuroscience

Computational Topology in Neuroscience Computational Topology in Neuroscience Bernadette Stolz Lincoln College University of Oxford A dissertation submitted for the degree of Master of Science in Mathematical Modelling & Scientific Computing

More information

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University. 3D Computer Vision Structured Light II Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Introduction

More information

Topological Persistence

Topological Persistence Topological Persistence Topological Persistence (in a nutshell) X topological space f : X f persistence X Dg f signature: persistence diagram encodes the topological structure of the pair (X, f) 1 Topological

More information

Computational Statistics and Mathematics for Cyber Security

Computational Statistics and Mathematics for Cyber Security and Mathematics for Cyber Security David J. Marchette Sept, 0 Acknowledgment: This work funded in part by the NSWC In-House Laboratory Independent Research (ILIR) program. NSWCDD-PN--00 Topics NSWCDD-PN--00

More information

Linear Programming in Small Dimensions

Linear Programming in Small Dimensions Linear Programming in Small Dimensions Lekcija 7 sergio.cabello@fmf.uni-lj.si FMF Univerza v Ljubljani Edited from slides by Antoine Vigneron Outline linear programming, motivation and definition one dimensional

More information

A convenient way to construct large simplicial complexes is through specifying sets and recording their intersection patterns.

A convenient way to construct large simplicial complexes is through specifying sets and recording their intersection patterns. III.2 Convex Set Systems 53 III.2 Convex Set Systems A convenient way to construct large simplicial complexes is through specifying sets and recording their intersection patterns. Nerves. Let F be a finite

More information

Edge-exchangeable graphs and sparsity

Edge-exchangeable graphs and sparsity Edge-exchangeable graphs and sparsity Tamara Broderick Department of EECS Massachusetts Institute of Technology tbroderick@csail.mit.edu Diana Cai Department of Statistics University of Chicago dcai@uchicago.edu

More information

Level-set MCMC Curve Sampling and Geometric Conditional Simulation

Level-set MCMC Curve Sampling and Geometric Conditional Simulation Level-set MCMC Curve Sampling and Geometric Conditional Simulation Ayres Fan John W. Fisher III Alan S. Willsky February 16, 2007 Outline 1. Overview 2. Curve evolution 3. Markov chain Monte Carlo 4. Curve

More information

Nonparametric Regression

Nonparametric Regression Nonparametric Regression John Fox Department of Sociology McMaster University 1280 Main Street West Hamilton, Ontario Canada L8S 4M4 jfox@mcmaster.ca February 2004 Abstract Nonparametric regression analysis

More information

A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY

A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY KARL L. STRATOS Abstract. The conventional method of describing a graph as a pair (V, E), where V and E repectively denote the sets of vertices and edges,

More information

An Introduction to Belyi Surfaces

An Introduction to Belyi Surfaces An Introduction to Belyi Surfaces Matthew Stevenson December 16, 2013 We outline the basic theory of Belyi surfaces, up to Belyi s theorem (1979, [1]), which characterizes these spaces as precisely those

More information

Figure : Example Precedence Graph

Figure : Example Precedence Graph CS787: Advanced Algorithms Topic: Scheduling with Precedence Constraints Presenter(s): James Jolly, Pratima Kolan 17.5.1 Motivation 17.5.1.1 Objective Consider the problem of scheduling a collection of

More information

GEOMETRY CCR MATH STANDARDS

GEOMETRY CCR MATH STANDARDS CONGRUENCE, PROOF, AND CONSTRUCTIONS M.GHS. M.GHS. M.GHS. GEOMETRY CCR MATH STANDARDS Mathematical Habits of Mind. Make sense of problems and persevere in solving them.. Use appropriate tools strategically..

More information

Lecture 3: Linear Classification

Lecture 3: Linear Classification Lecture 3: Linear Classification Roger Grosse 1 Introduction Last week, we saw an example of a learning task called regression. There, the goal was to predict a scalar-valued target from a set of features.

More information

Some Applications of Graph Bandwidth to Constraint Satisfaction Problems

Some Applications of Graph Bandwidth to Constraint Satisfaction Problems Some Applications of Graph Bandwidth to Constraint Satisfaction Problems Ramin Zabih Computer Science Department Stanford University Stanford, California 94305 Abstract Bandwidth is a fundamental concept

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervised Learning and Clustering Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2008 CS 551, Spring 2008 c 2008, Selim Aksoy (Bilkent University)

More information

Clustering CS 550: Machine Learning

Clustering CS 550: Machine Learning Clustering CS 550: Machine Learning This slide set mainly uses the slides given in the following links: http://www-users.cs.umn.edu/~kumar/dmbook/ch8.pdf http://www-users.cs.umn.edu/~kumar/dmbook/dmslides/chap8_basic_cluster_analysis.pdf

More information

APPROXIMATING THE MAXMIN AND MINMAX AREA TRIANGULATIONS USING ANGULAR CONSTRAINTS. J. Mark Keil, Tzvetalin S. Vassilev

APPROXIMATING THE MAXMIN AND MINMAX AREA TRIANGULATIONS USING ANGULAR CONSTRAINTS. J. Mark Keil, Tzvetalin S. Vassilev Serdica J. Computing 4 00, 3 334 APPROXIMATING THE MAXMIN AND MINMAX AREA TRIANGULATIONS USING ANGULAR CONSTRAINTS J. Mark Keil, Tzvetalin S. Vassilev Abstract. We consider sets of points in the two-dimensional

More information

Course of study- Algebra Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by

Course of study- Algebra Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by Course of study- Algebra 1-2 1. Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by students in Grades 9 and 10, but since all students must

More information

Lecture 5: Simplicial Complex

Lecture 5: Simplicial Complex Lecture 5: Simplicial Complex 2-Manifolds, Simplex and Simplicial Complex Scribed by: Lei Wang First part of this lecture finishes 2-Manifolds. Rest part of this lecture talks about simplicial complex.

More information

Topology and fmri Data

Topology and fmri Data Topology and fmri Data Adam Jaeger Statistical and Applied Mathematical Sciences Institute & Duke University May 5, 2016 fmri and Classical Methodology Most statistical analyses examine data with a variable

More information

[8] that this cannot happen on the projective plane (cf. also [2]) and the results of Robertson, Seymour, and Thomas [5] on linkless embeddings of gra

[8] that this cannot happen on the projective plane (cf. also [2]) and the results of Robertson, Seymour, and Thomas [5] on linkless embeddings of gra Apex graphs with embeddings of face-width three Bojan Mohar Department of Mathematics University of Ljubljana Jadranska 19, 61111 Ljubljana Slovenia bojan.mohar@uni-lj.si Abstract Aa apex graph is a graph

More information

Area of Lattice Point Polygons. Andrejs Treibergs. Wednesday, January 16, 2019

Area of Lattice Point Polygons. Andrejs Treibergs. Wednesday, January 16, 2019 Undergraduate Colloquium: Area of Lattice Point Polygons Andrejs Treibergs University of Utah Wednesday, January 16, 2019 2. Undergraduate Colloquium Lecture on Area of Lattice Point Polygons The URL for

More information