Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining. Data Analysis and Knowledge Discovery
|
|
- Austin Atkins
- 6 years ago
- Views:
Transcription
1 Lluis Belanche + Alfredo Vellido Intelligent Data Analysis and Data Mining or Data Analysis and Knowledge Discovery a.k.a. Data Mining II
2 MINING as a methodology (from previous session )
3 CRISP: a DM methodology CRoss Industry Standard Process for Data Mining: neutral methodology from the point of view of industry, tool and application (free &nonproprietary) Pete Chapman, Randy Kerber (NCR); Julian Clinton, Thomas Khabaza, Colin Shearer (SPSS), Thomas Reinartz, Rüdiger Wirth (DaimlerChrysler) CRISP DM was conceived in 1996 DaimlerChrysler: leaders in industrial application, SPSS: leaders in product development (Clementine, 1994), NCR: owners of large (huge!) databases (Teradata) Financed by the EU. Version 1.0 released officially in 1999
4 CRISP: Hierarchic structure of the methodology
5 CRISP: The virtuous loop of methodology phases
6 CRISP: Phases: Problem understanding PROBLEM UNDERSTANDING UNDERST ING PREPARATION MODELLING EVALUATION IMPLEMEN TATION DETERMINE PROBLEM GOAL BACKGROUND PROBLEM GOALS SUCCESS CRITERIA ASSESS SITUATION INVENTORY RESOURCES REQUERIMS. ASSUMPTIONS LIMITATIONS RISKS CONTINGEN. TERMINOLOG. COSTS & BENEFITS DETERMINE DM GOALS GOALS DM SUCCESS CRITERIA DM PRODUCE PROJECT PLAN PROJECT PLAN INITIAL SELECTION OF TOOLS
7 DM application areas ( 10 > 11)
8 end of last session wrap up
9 CRISP: Phases: Data understanding PROBLEM UNDERSTANDING UNDERST ING PREPARATION MODELLING EVALUATION IMPLEMEN TATION OBTAIN INITIAL INITIAL REPORT DESCRIPTION EXPLORATION VERIFICATION QUALITY DESCRIPTIVE REPORT EXPLORATION REPORT QUALITY REPORT
10 METROFANG: a real story about data understanding (1) (Barcelona Espa%25C3%25B1a)/
11 METROFANG: a real story about data understanding (2) caudal entrada 350,00 300,00 250,00 200,00 150,00 100,00 50,00 0, Par motor Secador A 140,00 120,00 100,00 80,00 Missing data Stationality Outliers Time Series Weekend? FORUM??? 60,00 40,00 20,00 0,
12 Storing data ( 07)
13 CRISP: Phases: Data preparation PROBLEM UNDERSTANDING UNDERST ING PREPARATION MODELLING EVALUATION IMPLEMEN TATION SELECTION ARGUMENTS FOR SELECTION CLEANING CLEANING REPORT RECONSTRUCT DERIVATED VARIABLES OSERVATIONS GENERATED INTEGRATE INTEGRATED FORMATTING WITH NEW FORMAT
14 Is data preparation that important?
15 Common data types analyzed ( 07) Compared to 2005 KDnuggets Poll on Types of data you analyzed/mined in last 12 months, the biggest increase was in anonymized data (perhaps and indicator of increasing importance of privacy issues).
16 Common data types analyzed ( 09) Compared to 2005 KDnuggets Poll on Types of data you analyzed/mined in last 12 months, the biggest increase was in anonymized data (perhaps and indicator of increasing importance of privacy issues). Comparing with 2008, the top 5 categories are unchanged.
17 Common data types analyzed ( > 12)
18 How large is it? ( 06 > 09)
19 How large is it? ( 09 > 13) The Big Data Challenge
20 How large is it? ( 09 > 13) Some fun facts: Google processes over 20 PB worth of data every day. Back in December 2007, YouTube generated 27 PB of traffic. The CERN Large Hadron Collider (HLC) generetes about 20 PB of usable data per year. The volume of global annual data traffic is expected exceed 60,000 PB in 2016, from 8,000 petabytes in 2011 In the next decade, astronomers expect to be processing 10 PB of data every hour from the Square Kilometre Array (SKA) telescope one exabyte every four days.
21 10 PB of data every hour from the Square Kilometre Array (SKA) telescope one exabyte every four days.
22 Data manipulation tools ( 08)
23 Data manipulation tools ( > 12)
24 Data manipulation tools ( > 13)
25 CRISP: Phases: Modelling PROBLEM UNDERSTANDING UNDERST ING PREPARATION MODELLING EVALUATION IMPLEMEN TATION SELECT MODELING TECHNIQUE SELECTED TECHNIQUE CREATE TEST DESIGN TEST DESIGN BUILD MODEL PARAMETER SELECTION MODEL MODEL DESCRIPTION VALIDATE MODEL MODEL VALIDATION
26 CRISP: A typology of DM problems PROBLEM DESCRIPTION EXAMPLES TECHNIQUES SUMMARY and DESCRIPTION SEGMENTATION CONCEPTUAL DESCRIPTION CLASIFICATION PREDICTION (REGRESSION, FORECASTING) DEPENDENCY ANALYSIS Compact and aggregated data description. Exploratory Analysis Finding data groups (unsupervised) segm / clust / classif Accessible and useful description of concepts / classes / groups. Knowledge comes first, then precissión. Linked to clasif / segmentation Assumed that different ítems can be assigned to a given closed cathegory (supervised) Continuous dependent variable. Given values of the predictive variables, predict (supervised) Looking for dependencies between variables (superv. or unsuperv.) Often with segmentation Almost any problem includes some elements of data description Market Segmentation, Shopping Basket analysis Ex.: Description of customer groups according to loyalty. Rule segment profiling if SEX=male and age>45 then CUST=loyal Bankruptcy prediction, Credit Scoring Markets, company benefit pred., Market share forec. Basket Analysis Ex.: 30% of those who bought peanuts also bought beer ERPs, stats., OLAP, EIS, control dashboards Clustering, NNs (SOM, GTM), visualización Rule Induction, Conceptual Clustering Discriminant Analysis, Rule Induction, Decision Trees, NNs, C-B Reasoning, GAs Regression Analysis, Regression Trees, NNs, Box-Jenkins, GAs Correlation Analysis, Association Rules, Bayesian Networks, Inductive Logic Prog.
27 CRISP: Selection of techniques U N I V E R S E OF T E C H N I Q U E S (Definided by tools) TECHNIQUES SUITED TO A PROBLEM POLITICAL REQUIREMENTS (Business, executive) Money, time, hh.rr. LIMITATIONS Data types, knowledge SELECTED TOOL(S)
Lluis Belanche + Alfredo Vellido Data Mining II An Introduction to Mining (2)
Lluis Belanche + Alfredo Vellido Data Mining II An Introduction to Mining (2) On dates & evaluation: Lectures expected to end on the week 14-18th Dec Likely essay deadline & presentation: 15th, 22nd Jan
More informationData Mining An Overview ITEV, F /18
Data Mining An Overview ITEV, F-2008 1/18 ITEV, F-2008 2/18 What is Data Mining?? ITEV, F-2008 2/18 What is Data Mining?? ITEV, F-2008 2/18 What is Data Mining?! ITEV, F-2008 3/18 What is Data Mining?
More informationInternational Journal of Computer Engineering and Applications, ICCSTAR-2016, Special Issue, May.16
The Survey Of Data Mining And Warehousing Architha.S, A.Kishore Kumar Department of Computer Engineering Department of computer engineering city engineering college VTU Bangalore, India ABSTRACT: Data
More informationA Comparative Study of Data Mining Process Models (KDD, CRISP-DM and SEMMA)
International Journal of Innovation and Scientific Research ISSN 2351-8014 Vol. 12 No. 1 Nov. 2014, pp. 217-222 2014 Innovative Space of Scientific Research Journals http://www.ijisr.issr-journals.org/
More informationThe CRISP-DM Process Model
CRISP-DM Discussion Paper March, 1999 The CRISP-DM Process Model Pete Chapman (NCR) Julian Clinton (SPSS) Thomas Khabaza (SPSS) Thomas Reinartz (DaimlerChrysler) Rüdiger Wirth (DaimlerChrysler) This discussion
More informationCRISP-DM 1.0. Step-by-step data mining guide
Step-by-step data mining guide Pete Chapman (NCR), Julian Clinton (SPSS), Randy Kerber (NCR), Thomas Khabaza (SPSS), Thomas Reinartz (DaimlerChrysler), Colin Shearer (SPSS) and Rüdiger Wirth (DaimlerChrysler)
More informationKnowledge Discovery. Javier Béjar URL - Spring 2019 CS - MIA
Knowledge Discovery Javier Béjar URL - Spring 2019 CS - MIA Knowledge Discovery (KDD) Knowledge Discovery in Databases (KDD) Practical application of the methodologies from machine learning/statistics
More informationChapter 1, Introduction
CSI 4352, Introduction to Data Mining Chapter 1, Introduction Young-Rae Cho Associate Professor Department of Computer Science Baylor University What is Data Mining? Definition Knowledge Discovery from
More information> Data Mining Overview with Clementine
> Data Mining Overview with Clementine This two-day course introduces you to the major steps of the data mining process. The course goal is for you to be able to begin planning or evaluate your firm s
More informationKnowledge Discovery. URL - Spring 2018 CS - MIA 1/22
Knowledge Discovery Javier Béjar cbea URL - Spring 2018 CS - MIA 1/22 Knowledge Discovery (KDD) Knowledge Discovery in Databases (KDD) Practical application of the methodologies from machine learning/statistics
More informationData Warehousing and Machine Learning
Data Warehousing and Machine Learning Introduction Thomas D. Nielsen Aalborg University Department of Computer Science Spring 2008 DWML Spring 2008 1 / 47 What is Data Mining?? Introduction DWML Spring
More informationEnhancing Preprocessing in Data-Intensive Domains using Online-Analytical Processing
Enhancing Preprocessing in Data-Intensive Domains using Online-Analytical Processing Alexander Maedche 1, Andreas Hotho 1, and Markus Wiese 2 1 Institute AIFB, Karlsruhe University, D-76128 Karlsruhe,
More informationR07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis.
www..com www..com Set No.1 1. a) What is data mining? Briefly explain the Knowledge discovery process. b) Explain the three-tier data warehouse architecture. 2. a) With an example, describe any two schema
More informationData Mining Concepts. Duen Horng (Polo) Chau Assistant Professor Associate Director, MS Analytics Georgia Tech
http://poloclub.gatech.edu/cse6242 CSE6242 / CX4242: Data & Visual Analytics Data Mining Concepts Duen Horng (Polo) Chau Assistant Professor Associate Director, MS Analytics Georgia Tech Partly based on
More informationTerabyte-class data analysis for CRM in service provider
Terabyte-class data analysis for CRM in service provider NTT COMWARE CORPORATION Ryo Mukae ( mukae.ryo ryo@nttcom.co..co.jp ) NTT COMWARE CORPORATION Achievement in building the fundamental operating system
More informationIntroduction to Data Mining and Data Analytics
1/28/2016 MIST.7060 Data Analytics 1 Introduction to Data Mining and Data Analytics What Are Data Mining and Data Analytics? Data mining is the process of discovering hidden patterns in data, where Patterns
More informationApplications and Trends in Data Mining
Applications and Trends in Data Mining Data mining applications Data mining system products and research prototypes Additional themes on data mining Social impacts of data mining Trends in data mining
More informationopensap Getting Started with Data Science
opensap Getting Started with Data Science Week 1 Unit 1 00:00:11 Hello and welcome to the opensap course "Getting Started with Data Science". My name is Stuart Clarke and I am a consultant with SAP, specializing
More informationThe Data Science Process. Polong Lin Big Data University Leader & Data Scientist IBM
The Data Science Process Polong Lin Big Data University Leader & Data Scientist IBM polong@ca.ibm.com Every day, we create 2.5 quintillion bytes of data so much that 90% of the data in the world today
More informationThis tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.
About the Tutorial Data Mining is defined as the procedure of extracting information from huge sets of data. In other words, we can say that data mining is mining knowledge from data. The tutorial starts
More informationENTERPRISE MINER: 1 DATA EXPLORATION AND VISUALISATION
ENTERPRISE MINER: 1 DATA EXPLORATION AND VISUALISATION JOZEF MOFFAT, ANALYTICS & INNOVATION PRACTICE, SAS UK 10, MAY 2016 DATA EXPLORATION AND VISUALISATION AGENDA SAS Webinar 10th May 2016 at 10:00 AM
More informationData warehouse and Data Mining
Data warehouse and Data Mining Lecture No. 14 Data Mining and its techniques Naeem A. Mahoto Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology
More informationTypes of Data Mining
Data Mining and The Use of SAS to Deploy Scoring Rules South Central SAS Users Group Conference Neil Fleming, Ph.D., ASQ CQE November 7-9, 2004 2W Systems Co., Inc. Neil.Fleming@2WSystems.com 972 733-0588
More informationD B M G Data Base and Data Mining Group of Politecnico di Torino
DataBase and Data Mining Group of Data mining fundamentals Data Base and Data Mining Group of Data analysis Most companies own huge databases containing operational data textual documents experiment results
More informationData Mining Concepts & Tasks
Data Mining Concepts & Tasks Duen Horng (Polo) Chau Georgia Tech CSE6242 / CX4242 Sept 9, 2014 Partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko, Christos Faloutsos Last Time
More informationCOMP 465 Special Topics: Data Mining
COMP 465 Special Topics: Data Mining Introduction & Course Overview 1 Course Page & Class Schedule http://cs.rhodes.edu/welshc/comp465_s15/ What s there? Course info Course schedule Lecture media (slides,
More informationData Mining. Ryan Benton Center for Advanced Computer Studies University of Louisiana at Lafayette Lafayette, La., USA.
Data Mining Ryan Benton Center for Advanced Computer Studies University of Louisiana at Lafayette Lafayette, La., USA January 13, 2011 Important Note! This presentation was obtained from Dr. Vijay Raghavan
More informationData mining fundamentals
Data mining fundamentals Elena Baralis Politecnico di Torino Data analysis Most companies own huge bases containing operational textual documents experiment results These bases are a potential source of
More informationADVANCED ANALYTICS USING SAS ENTERPRISE MINER RENS FEENSTRA
INSIGHTS@SAS: ADVANCED ANALYTICS USING SAS ENTERPRISE MINER RENS FEENSTRA AGENDA 09.00 09.15 Intro 09.15 10.30 Analytics using SAS Enterprise Guide Ellen Lokollo 10.45 12.00 Advanced Analytics using SAS
More informationCOMP 6838 Data MIning
COMP 6838 Data MIning LECTURE 1: Introduction Dr. Edgar Acuna Departmento de Matematicas Universidad de Puerto Rico- Mayaguez math.uprm.edu/~edgar 1 Course s Objectives Understand the basic concepts to
More informationData Mining Course Overview
Data Mining Course Overview 1 Data Mining Overview Understanding Data Classification: Decision Trees and Bayesian classifiers, ANN, SVM Association Rules Mining: APriori, FP-growth Clustering: Hierarchical
More informationNow, Data Mining Is Within Your Reach
Clementine Desktop Specifications Now, Data Mining Is Within Your Reach Data mining delivers significant, measurable value. By uncovering previously unknown patterns and connections in data, data mining
More informationA Complete Data Mining process to Manage the QoS of ADSL Services
A Complete Data Mining process to Manage the QoS of ADSL Services Name1 Surname1 and Name2 Surname2, paper Id 4 1 Abstract. 1 In this paper we explore the interest of computational intelligence tools in
More informationData Mining Technology Based on Bayesian Network Structure Applied in Learning
, pp.67-71 http://dx.doi.org/10.14257/astl.2016.137.12 Data Mining Technology Based on Bayesian Network Structure Applied in Learning Chunhua Wang, Dong Han College of Information Engineering, Huanghuai
More information1 of 5 1/28/2015 12:27 PM BDA Program Program Mission/Purpose The mission of the Bachelor of Science in Business Data Analytics (BDA) program is to prepare students to understand the foundation of business
More informationIntroduction to Data Mining S L I D E S B Y : S H R E E J A S W A L
Introduction to Data Mining S L I D E S B Y : S H R E E J A S W A L Books 2 Which Chapter from which Text Book? Chapter 1: Introduction from Han, Kamber, "Data Mining Concepts and Techniques", Morgan Kaufmann
More informationBasic Data Mining Technique
Basic Data Mining Technique What is classification? What is prediction? Supervised and Unsupervised Learning Decision trees Association rule K-nearest neighbor classifier Case-based reasoning Genetic algorithm
More informationData Mining. Chapter 1: Introduction. Adapted from materials by Jiawei Han, Micheline Kamber, and Jian Pei
Data Mining Chapter 1: Introduction Adapted from materials by Jiawei Han, Micheline Kamber, and Jian Pei 1 Any Question? Just Ask 3 Chapter 1. Introduction Why Data Mining? What Is Data Mining? A Multi-Dimensional
More informationAnalysis of Big Data Tim Miller, Sr. Analytics Consultant Teradata Alexander Kolovos, Ph.D., Advanced Analytics Software Engineer Teradata
Analysis of Big Data Tim Miller, Sr. Analytics Consultant Teradata Alexander Kolovos, Ph.D., Advanced Analytics Software Engineer Teradata March 28, 2017 Your Presenters Tim Miller Senior Analytics Consultant
More informationData Mining. Jeff M. Phillips. January 7, 2019 CS 5140 / CS 6140
Data Mining CS 5140 / CS 6140 Jeff M. Phillips January 7, 2019 What is Data Mining? What is Data Mining? Finding structure in data? Machine learning on large data? Unsupervised learning? Large scale computational
More informationKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 1 1 Acknowledgement Several Slides in this presentation are taken from course slides provided by Han and Kimber (Data Mining Concepts and Techniques) and Tan,
More informationKDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW. Ana Azevedo and M.F. Santos
KDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW Ana Azevedo and M.F. Santos ABSTRACT In the last years there has been a huge growth and consolidation of the Data Mining field. Some efforts are being done
More informationGUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV
GUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV Subject Name: Elective I Data Warehousing & Data Mining (DWDM) Subject Code: 2640005 Learning Objectives: To understand
More informationWinter Semester 2009/10 Free University of Bozen, Bolzano
Data Warehousing and Data Mining Winter Semester 2009/10 Free University of Bozen, Bolzano DW Lecturer: Johann Gamper gamper@inf.unibz.it DM Lecturer: Mouna Kacimi mouna.kacimi@unibz.it http://www.inf.unibz.it/dis/teaching/dwdm/index.html
More informationContents. Preface to the Second Edition
Preface to the Second Edition v 1 Introduction 1 1.1 What Is Data Mining?....................... 4 1.2 Motivating Challenges....................... 5 1.3 The Origins of Data Mining....................
More informationTable Of Contents: xix Foreword to Second Edition
Data Mining : Concepts and Techniques Table Of Contents: Foreword xix Foreword to Second Edition xxi Preface xxiii Acknowledgments xxxi About the Authors xxxv Chapter 1 Introduction 1 (38) 1.1 Why Data
More informationCS423: Data Mining. Introduction. Jakramate Bootkrajang. Department of Computer Science Chiang Mai University
CS423: Data Mining Introduction Jakramate Bootkrajang Department of Computer Science Chiang Mai University Jakramate Bootkrajang CS423: Data Mining 1 / 29 Quote of the day Never memorize something that
More informationDATA SCIENCE INTRODUCTION QSHORE TECHNOLOGIES. About the Course:
DATA SCIENCE About the Course: In this course you will get an introduction to the main tools and ideas which are required for Data Scientist/Business Analyst/Data Analyst/Analytics Manager/Actuarial Scientist/Business
More informationSlides for Data Mining by I. H. Witten and E. Frank
Slides for Data Mining by I. H. Witten and E. Frank 7 Engineering the input and output Attribute selection Scheme-independent, scheme-specific Attribute discretization Unsupervised, supervised, error-
More informationCS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University
CS377: Database Systems Data Warehouse and Data Mining Li Xiong Department of Mathematics and Computer Science Emory University 1 1960s: Evolution of Database Technology Data collection, database creation,
More informationContents. Foreword to Second Edition. Acknowledgments About the Authors
Contents Foreword xix Foreword to Second Edition xxi Preface xxiii Acknowledgments About the Authors xxxi xxxv Chapter 1 Introduction 1 1.1 Why Data Mining? 1 1.1.1 Moving toward the Information Age 1
More informationSeminars of Software and Services for the Information Society
DIPARTIMENTO DI INGEGNERIA INFORMATICA AUTOMATICA E GESTIONALE ANTONIO RUBERTI Master of Science in Engineering in Computer Science (MSE-CS) Seminars in Software and Services for the Information Society
More informationAn Effectual Approach to Swelling the Selling Methodology in Market Basket Analysis using FP Growth
An Effectual Approach to Swelling the Selling Methodology in Market Basket Analysis using FP Growth P.Sathish kumar, T.Suvathi K.S.Rangasamy College of Technology suvathi007@gmail.com Received: 03/01/2017,
More informationData warehouses Decision support The multidimensional model OLAP queries
Data warehouses Decision support The multidimensional model OLAP queries Traditional DBMSs are used by organizations for maintaining data to record day to day operations On-line Transaction Processing
More informationINSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad
INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad - 500 043 INFORMATION TECHNOLOGY DEFINITIONS AND TERMINOLOGY Course Name : DATA WAREHOUSING AND DATA MINING Course Code : AIT006 Program
More informationData mining techniques for actuaries: an overview
Data mining techniques for actuaries: an overview Emiliano A. Valdez joint work with Banghee So and Guojun Gan University of Connecticut Advances in Predictive Analytics (APA) Conference University of
More informationCS 521 Data Mining Techniques Instructor: Abdullah Mueen
CS 521 Data Mining Techniques Instructor: Abdullah Mueen LECTURE 2: DATA TRANSFORMATION AND DIMENSIONALITY REDUCTION Chapter 3: Data Preprocessing Data Preprocessing: An Overview Data Quality Major Tasks
More informationPython With Data Science
Course Overview This course covers theoretical and technical aspects of using Python in Applied Data Science projects and Data Logistics use cases. Who Should Attend Data Scientists, Software Developers,
More informationIntroduction to Data Mining
Introduction to Data Mining José Hernández-Orallo Dpto. de Sistemas Informáticos y Computación Universidad Politécnica de Valencia, Spain jorallo@dsic.upv.es Roma, 14-15th May 2009 1 Outline Motivation.
More informationPre-Requisites: CS2510. NU Core Designations: AD
DS4100: Data Collection, Integration and Analysis Teaches how to collect data from multiple sources and integrate them into consistent data sets. Explains how to use semi-automated and automated classification
More informationA Variability-Aware Design Approach to the Data Analysis Modeling Process
A Variability-Aware Design Approach to the Data Analysis Modeling Process Maria Cristina Vale Tavares David R. Cheriton School of Computer Science University of Waterloo Waterloo, Canada mvtavare@uwaterloo.ca
More informationComparative analysis of data mining methods for predicting credit default probabilities in a retail bank portfolio
Comparative analysis of data mining methods for predicting credit default probabilities in a retail bank portfolio Adela Ioana Tudor, Adela Bâra, Simona Vasilica Oprea Department of Economic Informatics
More informationMachine Learning - Regression. CS102 Fall 2017
Machine Learning - Fall 2017 Big Data Tools and Techniques Basic Data Manipulation and Analysis Performing well-defined computations or asking well-defined questions ( queries ) Data Mining Looking for
More informationFault Detection using Advanced Analytics at CERN's Large Hadron Collider: Too Hot or Too Cold BIWA Summit 2016
Fault Detection using Advanced Analytics at CERN's Large Hadron Collider: Too Hot or Too Cold BIWA Summit 2016 Mark Hornick, Director, Advanced Analytics January 27, 2016 Safe Harbor Statement The following
More informationOracle Big Data Science
Oracle Big Data Science Tim Vlamis and Dan Vlamis Vlamis Software Solutions 816-781-2880 www.vlamis.com @VlamisSoftware Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri
More informationJarek Szlichta
Jarek Szlichta http://data.science.uoit.ca/ Approximate terminology, though there is some overlap: Data(base) operations Executing specific operations or queries over data Data mining Looking for patterns
More informationThink & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI)
Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI) About the Speaker Dr. SubraMANI Paramasivam PhD., MCT, MCSE, MCITP, MCP, MCTS, MCSA CEO, Principal Consultant & Trainer
More informationData Mining. Vera Goebel. Department of Informatics, University of Oslo
Data Mining Vera Goebel Department of Informatics, University of Oslo 2012 1 Lecture Contents Knowledge Discovery in Databases (KDD) Definition and Applications OLAP Architectures for OLAP and KDD KDD
More informationMachine Learning - Clustering. CS102 Fall 2017
Machine Learning - Fall 2017 Big Data Tools and Techniques Basic Data Manipulation and Analysis Performing well-defined computations or asking well-defined questions ( queries ) Data Mining Looking for
More information9. Conclusions. 9.1 Definition KDD
9. Conclusions Contents of this Chapter 9.1 Course review 9.2 State-of-the-art in KDD 9.3 KDD challenges SFU, CMPT 740, 03-3, Martin Ester 419 9.1 Definition KDD [Fayyad, Piatetsky-Shapiro & Smyth 96]
More informationPredictive Analytics: Demystifying Current and Emerging Methodologies. Tom Kolde, FCAS, MAAA Linda Brobeck, FCAS, MAAA
Predictive Analytics: Demystifying Current and Emerging Methodologies Tom Kolde, FCAS, MAAA Linda Brobeck, FCAS, MAAA May 18, 2017 About the Presenters Tom Kolde, FCAS, MAAA Consulting Actuary Chicago,
More informationINTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING
CS 7265 BIG DATA ANALYTICS INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING * Some contents are adapted from Dr. Hung Huang and Dr. Chengkai Li at UT Arlington Mingon Kang, PhD Computer Science,
More informationData Mining. Jeff M. Phillips. January 12, 2015 CS 5140 / CS 6140
Data Mining CS 5140 / CS 6140 Jeff M. Phillips January 12, 2015 Data Mining What is Data Mining? Finding structure in data? Machine learning on large data? Unsupervised learning? Large scale computational
More informationIntroduction to Data Mining
Introduction to Data Mining José Hernández ndez-orallo Dpto.. de Sistemas Informáticos and Computación Universidad Politécnica de Valencia, Spain jorallo@dsic.upv.es Horsens, Denmark, 26th September 2005
More informationData Mining Techniques
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,
More informationSCHEME OF COURSE WORK. Data Warehousing and Data mining
SCHEME OF COURSE WORK Course Details: Course Title Course Code Program: Specialization: Semester Prerequisites Department of Information Technology Data Warehousing and Data mining : 15CT1132 : B.TECH
More informationEnd-to-End data mining feature integration, transformation and selection with Datameer Datameer, Inc. All rights reserved.
End-to-End data mining feature integration, transformation and selection with Datameer Fastest time to Insights Rapid Data Integration Zero coding data integration Wizard-led data integration & No ETL
More informationBig Data Analytics The Data Mining process. Roger Bohn March. 2016
1 Big Data Analytics The Data Mining process Roger Bohn March. 2016 Office hours HK thursday5 to 6 in the library 3115 If trouble, email or Slack private message. RB Wed. 2 to 3:30 in my office Some material
More informationBUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)
SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon
More informationBusiness Intelligence Roadmap HDT923 Three Days
Three Days Prerequisites Students should have experience with any relational database management system as well as experience with data warehouses and star schemas. It would be helpful if students are
More informationDatabase and Knowledge-Base Systems: Data Mining. Martin Ester
Database and Knowledge-Base Systems: Data Mining Martin Ester Simon Fraser University School of Computing Science Graduate Course Spring 2006 CMPT 843, SFU, Martin Ester, 1-06 1 Introduction [Fayyad, Piatetsky-Shapiro
More informationUsing Text Mining to Locate and Classify Research Papers
Using Text Mining to Locate and Classify Research Papers EDGAR ALAN CALVILLO MORENO Instituto Tecnologico de Aguascalientes Av. Adolfo Lopez Mateos 1801 Ote. CP 20256, Aguascalientes, Mexico alancalvillo@yahoo.com
More informationTime: 3 hours. Full Marks: 70. The figures in the margin indicate full marks. Answers from all the Groups as directed. Group A.
COPYRIGHT RESERVED End Sem (V) MCA (XXVIII) 2017 Time: 3 hours Full Marks: 70 Candidates are required to give their answers in their own words as far as practicable. The figures in the margin indicate
More informationAn Introduction to Data Mining in Institutional Research. Dr. Thulasi Kumar Director of Institutional Research University of Northern Iowa
An Introduction to Data Mining in Institutional Research Dr. Thulasi Kumar Director of Institutional Research University of Northern Iowa AIR/SPSS Professional Development Series Background Covering variety
More informationData Mining: The Scientific and Industry Revolution
Introduction Data Mining: The Scientific and Industry Revolution Alvaro J. Riascos Villegas University of los Andes and Quantil July 6 2018 Data Mining: The Scientific and Industry Revolution A. Riascos
More informationData Mining. Allan Tucker School of Information Systems Computing and Mathematics Brunel University, London. UB8 3PH. UK
Data Mining Allan Tucker School of Information Systems Computing and Mathematics Brunel University, London. UB8 3PH. UK The talk The Data Explosion Data Mining techniques & Application Data Mining in the
More informationDATA MINING II - 1DL460
DATA MINING II - 1DL460 Spring 2012 A second course in data mining!! http://www.it.uu.se/edu/course/homepage/infoutv2/vt12 Kjell Orsborn! Uppsala Database Laboratory! Department of Information Technology,
More informationData Mining. Introduction. Piotr Paszek. (Piotr Paszek) Data Mining DM KDD 1 / 44
Data Mining Piotr Paszek piotr.paszek@us.edu.pl Introduction (Piotr Paszek) Data Mining DM KDD 1 / 44 Plan of the lecture 1 Data Mining (DM) 2 Knowledge Discovery in Databases (KDD) 3 CRISP-DM 4 DM software
More informationDATA MINING II - 1DL460
DATA MINING II - 1DL460 Spring 2016 A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt16 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationWeek 1 Unit 1: Introduction to Data Science
Week 1 Unit 1: Introduction to Data Science The next 6 weeks What to expect in the next 6 weeks? 2 Curriculum flow (weeks 1-3) Business & Data Understanding 1 2 3 Data Preparation Modeling (1) Introduction
More informationVALLIAMMAI ENGNIEERING COLLEGE SRM Nagar, Kattankulathur 603203. DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Year & Semester : III & VI Section : CSE - 2 Subject Code : IT6702 Subject Name : Data warehousing
More informationData Mining. Outline. Motivation. Data Mining. Fraud division, some large telephone company: Sharma Chakravarthy IT Laboratory and CSE Department
Outline Data Mining Sharma Chakravarthy IT Laboratory and CSE Department The University of Texas at Arlington sharma@cse.uta.edu http://itlab.uta.edu/sharma Overview Association rules Will try to discuss
More informationMachine Learning Techniques for Data Mining
Machine Learning Techniques for Data Mining Eibe Frank University of Waikato New Zealand 10/25/2000 1 PART VII Moving on: Engineering the input and output 10/25/2000 2 Applying a learner is not all Already
More informationDATA MINING TEAM #1. Kristen Durst Mark Gillespie Banan Mandura. MBA 664: Database Management
DATA MINING TEAM #1 Kristen Durst Mark Gillespie Banan Mandura : Database Management OUTLINE INTRODUCTION 1 DATA MINING DEFINITION AND EXAMPLES 1 DATA MINING PRODUCTS 2 DATA MINING PROCESS 4 DATA MINING
More informationIntroduction to Data Mining
Introduction to JULY 2011 Afsaneh Yazdani What motivated? Wide availability of huge amounts of data and the imminent need for turning such data into useful information and knowledge What motivated? Data
More informationApplying big data analytics in practice
ARISTOTLE UNIVERSITY of THESSALONIKI Applying big data analytics in practice Anastasios Gounaris School of Informatics datalab.csd.auth.gr/~gounaris email: gounaria@csd.auth.gr New data every 1 min 2 What
More informationData Mining. Yi-Cheng Chen ( 陳以錚 ) Dept. of Computer Science & Information Engineering, Tamkang University
Data Mining Yi-Cheng Chen ( 陳以錚 ) Dept. of Computer Science & Information Engineering, Tamkang University Why Mine Data? Commercial Viewpoint Lots of data is being collected and warehoused Web data, e-commerce
More informationMEDICAL INFORMATICS & DATABASE MANAGEMENT MODULE 5: BIG DATA MANAGEMENT AND ANALYSIS DR.ORALUCK PATTANAPRATEEP
MEDICAL INFORMATICS & DATABASE MANAGEMENT MODULE 5: BIG DATA MANAGEMENT AND ANALYSIS DR.ORALUCK PATTANAPRATEEP Doctor of Philosophy Program in Clinical Epidemiology Section for Clinical Epidemiology &
More informationData Mining & Machine Learning F2.4DN1/F2.9DM1
Data Mining & Machine Learning F2.4DN1/F2.9DM1 Nick Taylor N.K.Taylor@hw.ac.uk Room EM1.62 Data Data Mining - Content Introduction to Data Mining What it is, Who does it and Why Data Warehousing Virtuous
More informationDecision Support and Business Intelligence Systems
Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 5: Data Mining for Business Intelligence Learning Objectives Define data mining as an enabling technology for business
More information