Analysis of Symbolic Data
|
|
- Kathlyn Benson
- 5 years ago
- Views:
Transcription
1 Hans-Hermann Bock Edwin Diday (Eds.) Analysis of Symbolic Data Exploratory Methods for Extracting Statistical Information from Complex Data Springer
2 Contents Preface of the Scientific Editors Preface of the Project Managers v viii 1 Symbolic Data Analysis and the SODAS Project: Purpose, History, Perspective 1 E.Diday 1.1 Introduction Symbolic Data Tables and Symbolic Objects The Input of SDA: Symbolic Data Tables, Rules and Taxonomies Sources of Symbolic Data Symbolic Objects Tools and Operations for Symbolic Objects History and Evolution of SDA The Content of the SODAS Project SDA Methods Realized in SODAS An Illustrative Example Overview on the SODAS Software Examples for the SODAS Strategy in Applications Philosophical Background: Concepts and Symbolic Objects First- and Second-Order Individuais Intent and Extent, the Two Kinds of Concepts Concepts: The Four Traditions and 'Symbolic Objects' Advantages of Using Symbolic Data Analysis The Future Development of SODAS 22 2 The Classical Data Situation 24 H.H. Bock 2.1 Introduction Variables as Input Data Quantitative Variables Qualitative Variables 26
3 XU Contents Nominal Variables Ordinal Variables and Generalized Ordinal Variables Data Vectors and the Data Matrix Dependent Variables Logical Dependence Hierarchical Dependence (Mother-Daughter) Stochastic Dependence Missing Values 37 3 Symbolic Data 39 H.H. Bock 3.1 Three Introductory Examples Multi-Valued and Interval Variables Modal Variables ' A Synthesis of Symbolic Data Types The Symbolic Data Array 49 4 Symbolic Objects 54 H.H. Bock, E. Diday 4.1 Introduction and Examples Relations and Descriptions Relations Descriptions, Description Vectors and Description Sets Product Relations Events and Assertion Objects Boolean Symbolic Objects as Triples Modal Symbolic Objects 75 5 Generation of Symbolic Objects from Relational Databases 78 V. Stephan, G. Hebrail, Y. Lechevallier 5.1 Introduction to Relational Databases Principles of Symbolic Object Acquisition from Relational Databases Interaction with the Database Interpretation of SQL Queries 85
4 Contents xiii Sampling Individuais Dependent Variables and Missing Values A Generalization Operator Basic Generalization Operator Problem of Over-Generalization A Quality Criterion to Evaluate a Generalized Description Coding by Testing for a Uniform Distribution Among Intervals A Reduction Algorithm A Numerical Example Further Operations on Generated Assertions Joining Two Arrays of Assertions Validation of Generated Assertions Descriptive Statistics for Symbolic Data 106 P. Bertrand, F. Goupil 6.1 Descriptive Statistics for a Classical Numerical Variable The Observed Symbolic Data Set The Data Table Logical Dependencies The Virtual Extension of a Description Vector The Case of Multi-Valued Variables Frequency Distribution for a Categorical or Quantitative Multi-Valued Variable Summary Measures for a Numerical Multi-Valued Variable The Case of an Interval-Valued Variable Visualizing and Editing Symbolic Objects 125 M. Noirhomme-Fraiture, M. Rouard 7.1 The Zoom Star Representation Existing Solutions Our Graphical Representation Use of Zoom Star Conclusion Editing Symbolic Objects Modification of an Existing Symbolic Object Modification of Labels 138
5 xiv Contents 8 Similarity and Dissimilarity Classical Resemblance Measures 139 F. Esposito, D. Malerba, V. Tamma, H.H. Bock Resemblance Measures Dissimilarity and Distance: Special Cases Distance Measures from a Classical Data Matrix Similarity Measures from a Categorical Data Matrix Dissimilarity Measures for Probability Distributions 153 H.H. Bock Divergence Measures: The General Case Divergence Measures: Special Cases The Affinity Coefficient (H Bacelar-Nicolau) Dissimilarity Measures for Symbolic Objects 165 F. Esposito, D. Malerba, V. Tamma Gowda and Diday's Dissimilarity Measure The Approach by Ichino and Yaguchi Dissimilarity Measures of De Carvalho De Carvalho's Dissimilarity: Constrained Case The Dissimilarity Options in the SODAS Package Matching Symbolic Objects 186 F. Esposito, D. Malerba, F.A. Lisi Canonical Matching of Boolean Symbolic Objects Flexible Matching of Boolean Symbolic Objects An Application Symbolic Factor Analysis Classical Principal Component Analysis 198 H.H. Bock 9.2 Symbolic Principal Component Analysis 200 A. Chouakria, P. Cazes, E. Diday Introduction: Interval Data The Purpose of the Method The VERTICES Method The CENTERS Method Representation by Rectangles Example of Oils and Fats Conclusions 212
6 Contents xv 9.3 Factorial Discriminant Analysis on Symbolic Objects 212 N.C. Lauro, R. Verde, F. Palumbo Introduction A Reminder of Factorial Discriminant Analysis FDA on Symbolic Data Illustrative Application to a Data Set Discrimination: Assigning Symbolic Objects to Classes Classical Methods of Discrimination 234 J.P. Rasson, S. Lissoir Introduction The Problem The Decision Rule The Classical Probabilistic Framework Density Estimation Symbolic Kernel Discriminant Analysis 240 J.P. Rasson, S. Lissoir Kernel Intensity Measures for Symbolic Data Determining the Prior Probabilities The Output Data Symbolic Discrimination Rules 244 E. Perinel, Y. Lechevallier Introduction The Underlying Population and the Variables The Set of Binary Questions and the Construction of a New Data Table from Binary Variables The Recursive Partition Algorithm Detailed Description of the Different Steps Decisional Considerations Example Segmentation Trees for Stratified Data 266 M.C. Bravo Llatas, J.M. Garcia-Santesmases Introduction Input and Output Data An Example; Distinction from Classical Decision Trees.. 271
7 xvi Contents Main Steps of the Algorithm Detailed Description of the Algorithm Choices in the Algorithm for Classical Data Choices in the Algorithm for Probabilistic Data Symbolic Object Description of Strata The Example Revisited Conclusion Clustering Methods for Symbolic Objects Clustering Problem, Clustering Methods for Classical Data M. Chavent, H.H. Bock 11.2 Criterion-Based Divisive Clustering for Symbolic Data 299 M. Chavent The Symbolic Data Matrix Two Distance Measures Extension of the Within-Class Variance Criterion Bipartitioning a Cluster Choice of the Cluster to be Split The Stopping Rule and the Output Example of a Classical Dataset Example of a Symbolic Data Set Hierarchical and Pyramidal Clustering with Complete Symbolic Objects 312 P. Brito Pyramidal Clustering Complete Symbolic Objects A Hierarchical-Pyramidal Clustering Algorithm for Symbolic Data Extension to More Complex Symbolic Data Types A Numerical Example Pyramidal Classification for Interval Data Using Galois Lattice Reduction 324 G. Polaillon Definition and Construction of Galois Lattices Reduction of a Galois Lattice into a Pyramid A Real-case Application 337
8 Contents xvn 12 Symbolic Approaches for Three-way Data 342 M. Gettler-Summa, C. Pardoux 12.1 Introduction The Input and Output Data Processing Temporal Data Two Approaches for Analysis Data Compression by Time Clustering Adapted Data Analysis Methods Interpretation of Outcomes from Processing of Temporal Changes Outcomes from a Factorial Analysis Symbolic Interpretation of Clustering Results Real-Case Examples Behavioural Data Resulting in Rule Objects On-site Telecommunication: Fuzzy Coding and Compression Fishery Study: Temporal Changes of Nominal Variables Fishing Tactics: Using Time Lines for Markings Illustrative Benchmark Analyses Introduction 355 R. Bisdorff 13.2 Professional Careers of Retired Working Persons 356 R. Bisdorff Basic Statistical Data Matrix Divisive Clustering of Professional Careers About the Discrimination of the Retiring Age from the Professional Careers Comparing European Labour Force Survey Results from the Basque Country and Portugal 374 A. Iztueta, P. Calvo The European Labour Force Survey Data Building Symbolic Objects Processing Census Data from ONS 382 F. Goupil, M. Touati, E. Diday, R. Moult Data Description Analysis of Census Data General Conclusion 385
9 xviii Contents 14 The SODAS Software Package 386 A. Morineau 14.1 Short Introduction to the SODAS Software Short Processing of a Chaining Short List of Methods in SODAS Software DB2SO: From Data Base to Symbolic Objects DI: Computing a Distance Matrix for Symbolic Objects DIV: Divisive Classification of Symbolic Data DKS: Symbolic Kernel Discriminant Analysis DSD: Symbolic Description of Groups FDA: Factorial Discriminant Analysis PCM: Principal Component Analysis SOE: Symbolic Object Editor STAT: Histograms and Elementary Statistics STD: Segmentation Tree for Stratified Data TREE: Decision Tree 391 Notations and Abbreviations 392 Bibliography 394 Addresses of Contributors to this Volume 414 Subject Index 417
Exporting symbolic objects to databases
3 Exporting symbolic objects to databases Donato Malerba, Floriana Esposito and Annalisa Appice 3.1 The method SO2DB is a SODAS module that exports a set of symbolic objects (SOs) to a relational database
More informationcomplex data Edwin Diday, CEREMADE, Beijing 2011
Symbolic data analysis of complex data Edwin Diday, CEREMADE, University i Paris Dauphine, France Beijing 2011 OUTLINE What is the Symbolic Data Analysis (SDA) paradigm? Why SDA is a good tool for Complex
More informationThe Knowledge Mining Suite (KMS)
The Knowledge Mining Suite (KMS) Oldemar Rodríguez 1 University of Costa Rica and Predisoft International S.A., San José Costa Rica. oldemar.rodriguez@predisoft.com Abstract. The Knowledge Mining Suite
More informationContents. Foreword to Second Edition. Acknowledgments About the Authors
Contents Foreword xix Foreword to Second Edition xxi Preface xxiii Acknowledgments About the Authors xxxi xxxv Chapter 1 Introduction 1 1.1 Why Data Mining? 1 1.1.1 Moving toward the Information Age 1
More informationS-Class, A Divisive Clustering Method, and Possible Dual Alternatives
S-Class, A Divisive Clustering Method, and Possible Dual Alternatives Jean-Paul Rasson, François Roland, Jean-Yves Pirçon, Séverine Adans, and Pascale Lallemand Facultés Universitaires Notre-Dame de la
More informationCLUSTER ANALYSIS. V. K. Bhatia I.A.S.R.I., Library Avenue, New Delhi
CLUSTER ANALYSIS V. K. Bhatia I.A.S.R.I., Library Avenue, New Delhi-110 012 In multivariate situation, the primary interest of the experimenter is to examine and understand the relationship amongst the
More informationTHE ENSEMBLE CONCEPTUAL CLUSTERING OF SYMBOLIC DATA FOR CUSTOMER LOYALTY ANALYSIS
THE ENSEMBLE CONCEPTUAL CLUSTERING OF SYMBOLIC DATA FOR CUSTOMER LOYALTY ANALYSIS Marcin Pełka 1 1 Wroclaw University of Economics, Faculty of Economics, Management and Tourism, Department of Econometrics
More informationSTATISTICS (STAT) Statistics (STAT) 1
Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).
More informationDivisive Monothetic Clustering for Interval and Histogram-valued Data
Divisive Monothetic Clustering for Interval and Histogram-valued Data Paula Brito, Marie Chavent To cite this version: Paula Brito, Marie Chavent. Divisive Monothetic Clustering for Interval and Histogram-valued
More informationPrincipal Component Analysis of Interval Data: a Symbolic Data Analysis Approach 1
Principal Component Analysis of Interval Data: a Symbolic Data Analysis Approach 1 Carlo N. Lauro 1 and Francesco Palumbo 2 1 Dipartimento di Matematica e Statistica Università Federico II Napoli, Italy
More informationTable Of Contents: xix Foreword to Second Edition
Data Mining : Concepts and Techniques Table Of Contents: Foreword xix Foreword to Second Edition xxi Preface xxiii Acknowledgments xxxi About the Authors xxxv Chapter 1 Introduction 1 (38) 1.1 Why Data
More information2. Background. 2.1 Clustering
2. Background 2.1 Clustering Clustering involves the unsupervised classification of data items into different groups or clusters. Unsupervised classificaiton is basically a learning task in which learning
More informationPart I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures
Part I, Chapters 4 & 5 Data Tables and Data Analysis Statistics and Figures Descriptive Statistics 1 Are data points clumped? (order variable / exp. variable) Concentrated around one value? Concentrated
More informationComparison of Forescasting Methods for Interval-Valued Time Series
International Journal of Statistics and Applications 2015, 5(6): 317-337 DOI: 10.5923/j.statistics.20150506.07 Comparison of Forescasting Methods for Interval-Valued Time Series Ebrucan Islamoglu 1,*,
More informationModern Multidimensional Scaling
Ingwer Borg Patrick Groenen Modern Multidimensional Scaling Theory and Applications With 116 Figures Springer Contents Preface vii I Fundamentals of MDS 1 1 The Four Purposes of Multidimensional Scaling
More informationIMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING
SECOND EDITION IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING ith Algorithms for ENVI/IDL Morton J. Canty с*' Q\ CRC Press Taylor &. Francis Group Boca Raton London New York CRC
More informationFEATURE SELECTION ON BOOLEAN SYMBOLIC OBJECTS
FEATURE SELECTION ON BOOLEAN SYMBOLIC OBJECTS Djamal Ziani 1 1 College of Computer Sciences and Information Systems, Information Systems Department, PO Box 51178 Riyadh 11543 Saudi Arabia ABSTRACT With
More informationJAVA Projects. 1. Enforcing Multitenancy for Cloud Computing Environments (IEEE 2012).
JAVA Projects I. IEEE based on CLOUD COMPUTING 1. Enforcing Multitenancy for Cloud Computing Environments 2. Practical Detection of Spammers and Content Promoters in Online Video Sharing Systems 3. An
More informationDecision Trees Dr. G. Bharadwaja Kumar VIT Chennai
Decision Trees Decision Tree Decision Trees (DTs) are a nonparametric supervised learning method used for classification and regression. The goal is to create a model that predicts the value of a target
More informationDigital Image Processing
Digital Image Processing Third Edition Rafael C. Gonzalez University of Tennessee Richard E. Woods MedData Interactive PEARSON Prentice Hall Pearson Education International Contents Preface xv Acknowledgments
More informationChapter DM:II. II. Cluster Analysis
Chapter DM:II II. Cluster Analysis Cluster Analysis Basics Hierarchical Cluster Analysis Iterative Cluster Analysis Density-Based Cluster Analysis Cluster Evaluation Constrained Cluster Analysis DM:II-1
More informationContents. List of Figures. List of Tables. List of Algorithms. I Clustering, Data, and Similarity Measures 1
Contents List of Figures List of Tables List of Algorithms Preface xiii xv xvii xix I Clustering, Data, and Similarity Measures 1 1 Data Clustering 3 1.1 Definition of Data Clustering... 3 1.2 The Vocabulary
More informationExpectation Maximization (EM) and Gaussian Mixture Models
Expectation Maximization (EM) and Gaussian Mixture Models Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 2 3 4 5 6 7 8 Unsupervised Learning Motivation
More informationA new topological clustering algorithm for interval data
A new topological clustering algorithm for interval data Guénaël Cabanes, Younès Bennani, Renaud Destenay, André Hardy To cite this version: Guénaël Cabanes, Younès Bennani, Renaud Destenay, André Hardy.
More informationPreprocessing Short Lecture Notes cse352. Professor Anita Wasilewska
Preprocessing Short Lecture Notes cse352 Professor Anita Wasilewska Data Preprocessing Why preprocess the data? Data cleaning Data integration and transformation Data reduction Discretization and concept
More informationHomework # 4. Example: Age in years. Answer: Discrete, quantitative, ratio. a) Year that an event happened, e.g., 1917, 1950, 2000.
Homework # 4 1. Attribute Types Classify the following attributes as binary, discrete, or continuous. Further classify the attributes as qualitative (nominal or ordinal) or quantitative (interval or ratio).
More informationCLASSIFICATION AND CHANGE DETECTION
IMAGE ANALYSIS, CLASSIFICATION AND CHANGE DETECTION IN REMOTE SENSING With Algorithms for ENVI/IDL and Python THIRD EDITION Morton J. Canty CRC Press Taylor & Francis Group Boca Raton London NewYork CRC
More informationContents. Preface to the Second Edition
Preface to the Second Edition v 1 Introduction 1 1.1 What Is Data Mining?....................... 4 1.2 Motivating Challenges....................... 5 1.3 The Origins of Data Mining....................
More informationPredict Outcomes and Reveal Relationships in Categorical Data
PASW Categories 18 Specifications Predict Outcomes and Reveal Relationships in Categorical Data Unleash the full potential of your data through predictive analysis, statistical learning, perceptual mapping,
More informationECLT 5810 Clustering
ECLT 5810 Clustering What is Cluster Analysis? Cluster: a collection of data objects Similar to one another within the same cluster Dissimilar to the objects in other clusters Cluster analysis Grouping
More informationData Statistics Population. Census Sample Correlation... Statistical & Practical Significance. Qualitative Data Discrete Data Continuous Data
Data Statistics Population Census Sample Correlation... Voluntary Response Sample Statistical & Practical Significance Quantitative Data Qualitative Data Discrete Data Continuous Data Fewer vs Less Ratio
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Agenda Features and Patterns The Curse of Size and
More informationCluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1
Cluster Analysis Mu-Chun Su Department of Computer Science and Information Engineering National Central University 2003/3/11 1 Introduction Cluster analysis is the formal study of algorithms and methods
More informationPattern Recognition Lecture Sequential Clustering
Pattern Recognition Lecture Prof. Dr. Marcin Grzegorzek Research Group for Pattern Recognition Institute for Vision and Graphics University of Siegen, Germany Pattern Recognition Chain patterns sensor
More informationAcknowledgments. Acronyms
Acknowledgments Preface Acronyms xi xiii xv 1 Basic Tools 1 1.1 Goals of inference 1 1.1.1 Population or process? 1 1.1.2 Probability samples 2 1.1.3 Sampling weights 3 1.1.4 Design effects. 5 1.2 An introduction
More informationCredit card Fraud Detection using Predictive Modeling: a Review
February 207 IJIRT Volume 3 Issue 9 ISSN: 2396002 Credit card Fraud Detection using Predictive Modeling: a Review Varre.Perantalu, K. BhargavKiran 2 PG Scholar, CSE, Vishnu Institute of Technology, Bhimavaram,
More informationLecture 7: Decision Trees
Lecture 7: Decision Trees Instructor: Outline 1 Geometric Perspective of Classification 2 Decision Trees Geometric Perspective of Classification Perspective of Classification Algorithmic Geometric Probabilistic...
More informationUNIT 4. Research Methods in Business
UNIT 4 Preparing Data for Analysis:- After data are obtained through questionnaires, interviews, observation or through secondary sources, they need to be edited. The blank responses, if any have to be
More informationIBM SPSS Categories. Predict outcomes and reveal relationships in categorical data. Highlights. With IBM SPSS Categories you can:
IBM Software IBM SPSS Statistics 19 IBM SPSS Categories Predict outcomes and reveal relationships in categorical data Highlights With IBM SPSS Categories you can: Visualize and explore complex categorical
More information2 Renata M.C.R. de Souza et al cluster with its own representation. The advantage of these adaptive distances is that the clustering algorithm is able
Dynamic Cluster Methods for Interval Data based on Mahalanobis Distances Renata M.C.R. de Souza 1,Francisco de A.T. de Carvalho 1, Camilo P. Tenório 1 and Yves Lechevallier 2 1 Centro de Informatica -
More informationSpecial Review Section. Copyright 2014 Pearson Education, Inc.
Special Review Section SRS-1--1 Special Review Section Chapter 1: The Where, Why, and How of Data Collection Chapter 2: Graphs, Charts, and Tables Describing Your Data Chapter 3: Describing Data Using
More informationECLT 5810 Clustering
ECLT 5810 Clustering What is Cluster Analysis? Cluster: a collection of data objects Similar to one another within the same cluster Dissimilar to the objects in other clusters Cluster analysis Grouping
More information2. (a) Briefly discuss the forms of Data preprocessing with neat diagram. (b) Explain about concept hierarchy generation for categorical data.
Code No: M0502/R05 Set No. 1 1. (a) Explain data mining as a step in the process of knowledge discovery. (b) Differentiate operational database systems and data warehousing. [8+8] 2. (a) Briefly discuss
More informationFundamentals of Digital Image Processing
\L\.6 Gw.i Fundamentals of Digital Image Processing A Practical Approach with Examples in Matlab Chris Solomon School of Physical Sciences, University of Kent, Canterbury, UK Toby Breckon School of Engineering,
More informationClustering from Data Streams
Clustering from Data Streams João Gama LIAAD-INESC Porto, University of Porto, Portugal jgama@fep.up.pt 1 Introduction 2 Clustering Micro Clustering 3 Clustering Time Series Growing the Structure Adapting
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2013 http://ce.sharif.edu/courses/91-92/2/ce725-1/ Agenda Features and Patterns The Curse of Size and
More informationDetermination of the number of clusters
Determination of the number of clusters P. Lallemand pascale.lallemand@fundp.ac.be Department of Mathematics University of Namur BELGIUM Symbolic Data Analysis in LISBON 1 Outline Clustering problem Clustering
More informationAnalysis of Complex Survey Data with SAS
ABSTRACT Analysis of Complex Survey Data with SAS Christine R. Wells, Ph.D., UCLA, Los Angeles, CA The differences between data collected via a complex sampling design and data collected via other methods
More informationImage Analysis, Classification and Change Detection in Remote Sensing
Image Analysis, Classification and Change Detection in Remote Sensing WITH ALGORITHMS FOR ENVI/IDL Morton J. Canty Taylor &. Francis Taylor & Francis Group Boca Raton London New York CRC is an imprint
More information2. Data Preprocessing
2. Data Preprocessing Contents of this Chapter 2.1 Introduction 2.2 Data cleaning 2.3 Data integration 2.4 Data transformation 2.5 Data reduction Reference: [Han and Kamber 2006, Chapter 2] SFU, CMPT 459
More informationBoolean Reasoning. The Logic of Boolean Equations. Frank Markham Brown Air Force Institute of Technology
Boolean Reasoning The Logic of Boolean Equations by Frank Markham Brown Air Force Institute of Technology ff Kluwer Academic Publishers Boston/Dordrecht/London Contents Preface Two Logical Languages Boolean
More informationAnalytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.
Glossary of data mining terms: Accuracy Accuracy is an important factor in assessing the success of data mining. When applied to data, accuracy refers to the rate of correct values in the data. When applied
More informationBy Mahesh R. Sanghavi Associate professor, SNJB s KBJ CoE, Chandwad
By Mahesh R. Sanghavi Associate professor, SNJB s KBJ CoE, Chandwad Data Analytics life cycle Discovery Data preparation Preprocessing requirements data cleaning, data integration, data reduction, data
More informationBing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer
Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures Springer Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web
More informationLatent Class Modeling as a Probabilistic Extension of K-Means Clustering
Latent Class Modeling as a Probabilistic Extension of K-Means Clustering Latent Class Cluster Models According to Kaufman and Rousseeuw (1990), cluster analysis is "the classification of similar objects
More informationModern Multidimensional Scaling
Ingwer Borg Patrick J.F. Groenen Modern Multidimensional Scaling Theory and Applications Second Edition With 176 Illustrations ~ Springer Preface vii I Fundamentals of MDS 1 1 The Four Purposes of Multidimensional
More informationTABLE OF CONTENTS CHAPTER NO. TITLE PAGENO. LIST OF TABLES LIST OF FIGURES LIST OF ABRIVATION
vi TABLE OF CONTENTS ABSTRACT LIST OF TABLES LIST OF FIGURES LIST OF ABRIVATION iii xii xiii xiv 1 INTRODUCTION 1 1.1 WEB MINING 2 1.1.1 Association Rules 2 1.1.2 Association Rule Mining 3 1.1.3 Clustering
More informationGetting to Know Your Data
Chapter 2 Getting to Know Your Data 2.1 Exercises 1. Give three additional commonly used statistical measures (i.e., not illustrated in this chapter) for the characterization of data dispersion, and discuss
More informationCHAPTER 1 INTRODUCTION
Introduction CHAPTER 1 INTRODUCTION Mplus is a statistical modeling program that provides researchers with a flexible tool to analyze their data. Mplus offers researchers a wide choice of models, estimators,
More informationMethods for Intelligent Systems
Methods for Intelligent Systems Lecture Notes on Clustering (II) Davide Eynard eynard@elet.polimi.it Department of Electronics and Information Politecnico di Milano Davide Eynard - Lecture Notes on Clustering
More informationEnterprise Miner Software: Changes and Enhancements, Release 4.1
Enterprise Miner Software: Changes and Enhancements, Release 4.1 The correct bibliographic citation for this manual is as follows: SAS Institute Inc., Enterprise Miner TM Software: Changes and Enhancements,
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2012 http://ce.sharif.edu/courses/90-91/2/ce725-1/ Agenda Features and Patterns The Curse of Size and
More informationKnowledge libraries and information space
University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2009 Knowledge libraries and information space Eric Rayner University
More information"Charting the Course... MOC C: Developing SQL Databases. Course Summary
Course Summary Description This five-day instructor-led course provides students with the knowledge and skills to develop a Microsoft SQL database. The course focuses on teaching individuals how to use
More informationProximity and Data Pre-processing
Proximity and Data Pre-processing Slide 1/47 Proximity and Data Pre-processing Huiping Cao Proximity and Data Pre-processing Slide 2/47 Outline Types of data Data quality Measurement of proximity Data
More informationSTAT10010 Introductory Statistics Lab 2
STAT10010 Introductory Statistics Lab 2 1. Aims of Lab 2 By the end of this lab you will be able to: i. Recognize the type of recorded data. ii. iii. iv. Construct summaries of recorded variables. Calculate
More informationPreface to the Second Edition. Preface to the First Edition. 1 Introduction 1
Preface to the Second Edition Preface to the First Edition vii xi 1 Introduction 1 2 Overview of Supervised Learning 9 2.1 Introduction... 9 2.2 Variable Types and Terminology... 9 2.3 Two Simple Approaches
More informationAn Hausdorff distance between hyper-rectangles for clustering interval data
An Hausdorff distance between hyper-rectangles for clustering interval data M. Chavent Laboratoire de Mathématiques Appliquées de Bordeaux, UMR CNRS 5466 Universités Bordeaux et, France chavent@math.u-bordeaux.fr
More informationContents. Chapter 1 SPECIFYING SYNTAX 1
Contents Chapter 1 SPECIFYING SYNTAX 1 1.1 GRAMMARS AND BNF 2 Context-Free Grammars 4 Context-Sensitive Grammars 8 Exercises 8 1.2 THE PROGRAMMING LANGUAGE WREN 10 Ambiguity 12 Context Constraints in Wren
More informationHigh-Performance Parallel Database Processing and Grid Databases
High-Performance Parallel Database Processing and Grid Databases David Taniar Monash University, Australia Clement H.C. Leung Hong Kong Baptist University and Victoria University, Australia Wenny Rahayu
More informationCS 2750 Machine Learning. Lecture 19. Clustering. CS 2750 Machine Learning. Clustering. Groups together similar instances in the data sample
Lecture 9 Clustering Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square Clustering Groups together similar instances in the data sample Basic clustering problem: distribute data into k different groups
More informationUniversity of Florida CISE department Gator Engineering. Data Preprocessing. Dr. Sanjay Ranka
Data Preprocessing Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville ranka@cise.ufl.edu Data Preprocessing What preprocessing step can or should
More informationCS573 Data Privacy and Security. Li Xiong
CS573 Data Privacy and Security Anonymizationmethods Li Xiong Today Clustering based anonymization(cont) Permutation based anonymization Other privacy principles Microaggregation/Clustering Two steps:
More informationElysium Technologies Private Limited::IEEE Final year Project
Elysium Technologies Private Limited::IEEE Final year Project - o n t e n t s Data mining Transactions Rule Representation, Interchange, and Reasoning in Distributed, Heterogeneous Environments Defeasible
More informationBMEGUI Tutorial 1 Spatial kriging
BMEGUI Tutorial 1 Spatial kriging 1. Objective The primary objective of this exercise is to get used to the basic operations of BMEGUI using a purely spatial dataset. The analysis will consist in an exploratory
More informationDATA WAREHOUING UNIT I
BHARATHIDASAN ENGINEERING COLLEGE NATTRAMAPALLI DEPARTMENT OF COMPUTER SCIENCE SUB CODE & NAME: IT6702/DWDM DEPT: IT Staff Name : N.RAMESH DATA WAREHOUING UNIT I 1. Define data warehouse? NOV/DEC 2009
More informationData mining: concepts and algorithms
Data mining: concepts and algorithms Practice Data mining Objective Exploit data mining algorithms to analyze a real dataset using the RapidMiner machine learning tool. The practice session is organized
More informationDiscriminate Analysis
Discriminate Analysis Outline Introduction Linear Discriminant Analysis Examples 1 Introduction What is Discriminant Analysis? Statistical technique to classify objects into mutually exclusive and exhaustive
More informationData Preprocessing. Slides by: Shree Jaswal
Data Preprocessing Slides by: Shree Jaswal Topics to be covered Why Preprocessing? Data Cleaning; Data Integration; Data Reduction: Attribute subset selection, Histograms, Clustering and Sampling; Data
More informationFuzzy Set Theory and Its Applications. Second, Revised Edition. H.-J. Zimmermann. Kluwer Academic Publishers Boston / Dordrecht/ London
Fuzzy Set Theory and Its Applications Second, Revised Edition H.-J. Zimmermann KM ff Kluwer Academic Publishers Boston / Dordrecht/ London Contents List of Figures List of Tables Foreword Preface Preface
More informationData Preprocessing. Data Preprocessing
Data Preprocessing Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville ranka@cise.ufl.edu Data Preprocessing What preprocessing step can or should
More informationUSING SOFT COMPUTING TECHNIQUES TO INTEGRATE MULTIPLE KINDS OF ATTRIBUTES IN DATA MINING
USING SOFT COMPUTING TECHNIQUES TO INTEGRATE MULTIPLE KINDS OF ATTRIBUTES IN DATA MINING SARAH COPPOCK AND LAWRENCE MAZLACK Computer Science, University of Cincinnati, Cincinnati, Ohio 45220 USA E-mail:
More informationCluster Analysis. Angela Montanari and Laura Anderlucci
Cluster Analysis Angela Montanari and Laura Anderlucci 1 Introduction Clustering a set of n objects into k groups is usually moved by the aim of identifying internally homogenous groups according to a
More informationChapter 1 Introduction to Statistics
Corresponds to ELEMENTARY STATISTICS USING THE TI 83/84 PLUS CALCULATOR 3rd ed. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Chapter 1 Introduction
More informationMulti-Modal Data Fusion: A Description
Multi-Modal Data Fusion: A Description Sarah Coppock and Lawrence J. Mazlack ECECS Department University of Cincinnati Cincinnati, Ohio 45221-0030 USA {coppocs,mazlack}@uc.edu Abstract. Clustering groups
More informationCOSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor
COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality
More informationData Mining and Analytics. Introduction
Data Mining and Analytics Introduction Data Mining Data mining refers to extracting or mining knowledge from large amounts of data It is also termed as Knowledge Discovery from Data (KDD) Mostly, data
More informationSummary of Contents LIST OF FIGURES LIST OF TABLES
Summary of Contents LIST OF FIGURES LIST OF TABLES PREFACE xvii xix xxi PART 1 BACKGROUND Chapter 1. Introduction 3 Chapter 2. Standards-Makers 21 Chapter 3. Principles of the S2ESC Collection 45 Chapter
More informationPackage FPDclustering
Type Package Title PD-Clustering and Factor PD-Clustering Version 1.2 Date 2017-08-23 Package FPDclustering Author Cristina Tortora and Paul D. McNicholas August 23, 2017 Maintainer Cristina Tortora
More informationELEC Dr Reji Mathew Electrical Engineering UNSW
ELEC 4622 Dr Reji Mathew Electrical Engineering UNSW Review of Motion Modelling and Estimation Introduction to Motion Modelling & Estimation Forward Motion Backward Motion Block Motion Estimation Motion
More informationKeywords: clustering algorithms, unsupervised learning, cluster validity
Volume 6, Issue 1, January 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Clustering Based
More informationChapter 2: Descriptive Statistics
Chapter 2: Descriptive Statistics Student Learning Outcomes By the end of this chapter, you should be able to: Display data graphically and interpret graphs: stemplots, histograms and boxplots. Recognize,
More informationObject Classification Problem
HIERARCHICAL OBJECT CATEGORIZATION" Gregory Griffin and Pietro Perona. Learning and Using Taxonomies For Fast Visual Categorization. CVPR 2008 Marcin Marszalek and Cordelia Schmid. Constructing Category
More informationMODERN FACTOR ANALYSIS
MODERN FACTOR ANALYSIS Harry H. Harman «ö THE pigj UNIVERSITY OF CHICAGO PRESS Contents LIST OF ILLUSTRATIONS GUIDE TO NOTATION xv xvi Parti Foundations of Factor Analysis 1. INTRODUCTION 3 1.1. Brief
More informationLOGIC SYNTHESIS AND VERIFICATION ALGORITHMS. Gary D. Hachtel University of Colorado. Fabio Somenzi University of Colorado.
LOGIC SYNTHESIS AND VERIFICATION ALGORITHMS by Gary D. Hachtel University of Colorado Fabio Somenzi University of Colorado Springer Contents I Introduction 1 1 Introduction 5 1.1 VLSI: Opportunity and
More informationDATA CLASSIFICATORY TECHNIQUES
DATA CLASSIFICATORY TECHNIQUES AMRENDER KUMAR AND V.K.BHATIA Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 akjha@iasri.res.in 1. Introduction Rudimentary, exploratory
More informationBased on Raymond J. Mooney s slides
Instance Based Learning Based on Raymond J. Mooney s slides University of Texas at Austin 1 Example 2 Instance-Based Learning Unlike other learning algorithms, does not involve construction of an explicit
More informationA Self Organizing Map for dissimilarity data 0
A Self Organizing Map for dissimilarity data Aïcha El Golli,2, Brieuc Conan-Guez,2, and Fabrice Rossi,2,3 Projet AXIS, INRIA-Rocquencourt Domaine De Voluceau, BP 5 Bâtiment 8 7853 Le Chesnay Cedex, France
More informationThis article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and
This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and education use, including for instruction at the authors institution
More informationData mining techniques for actuaries: an overview
Data mining techniques for actuaries: an overview Emiliano A. Valdez joint work with Banghee So and Guojun Gan University of Connecticut Advances in Predictive Analytics (APA) Conference University of
More information