Next Steps in Data Mining. Sistemas de Apoio à Decisão Cláudia Antunes

Similar documents
D B M G Data Base and Data Mining Group of Politecnico di Torino

Data mining fundamentals

Knowledge Discovery. Javier Béjar URL - Spring 2019 CS - MIA

Deep Learning on Graphs

Knowledge Discovery. URL - Spring 2018 CS - MIA 1/22

COMP 465 Special Topics: Data Mining

Deep Learning on Graphs

INTRODUCTION TO DATA MINING

Data Mining. Yi-Cheng Chen ( 陳以錚 ) Dept. of Computer Science & Information Engineering, Tamkang University

CS423: Data Mining. Introduction. Jakramate Bootkrajang. Department of Computer Science Chiang Mai University

Object Detection Lecture Introduction to deep learning (CNN) Idar Dyrdal

CS6220: DATA MINING TECHNIQUES

Machine Learning. Deep Learning. Eric Xing (and Pengtao Xie) , Fall Lecture 8, October 6, Eric CMU,

Machine Learning and Sensor Fusion for Precision Farming. Solmaz Hajmohammadi, Christopher Schardt, Noah Fahlgren, Arash Abbasi, Stefan Paulus

A Deep Learning primer

Intrusion Detection Using Data Mining Technique (Classification)

DEEP LEARNING REVIEW. Yann LeCun, Yoshua Bengio & Geoffrey Hinton Nature Presented by Divya Chitimalla

Deep Learning Approach to Network Intrusion Detection

Machine Learning 13. week

Deep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon

NVIDIA FOR DEEP LEARNING. Bill Veenhuis

Data warehouse and Data Mining

International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.7, No.3, May Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani

Intelligent Edge Computing and ML-based Traffic Classifier. Kwihoon Kim, Minsuk Kim (ETRI) April 25.

Early detection of Crossfire attacks using deep learning

INTRUSION DETECTION SYSTEM USING BIG DATA FRAMEWORK

International Journal of Scientific & Engineering Research, Volume 4, Issue 7, July-2013 ISSN

Introduction to Data Mining and Data Analytics

Deconvolution Networks

Epilog: Further Topics

9. Conclusions. 9.1 Definition KDD

Chapter 28. Outline. Definitions of Data Mining. Data Mining Concepts

Deep Learning. Deep Learning provided breakthrough results in speech recognition and image classification. Why?

Broad Learning via Fusion of Heterogeneous Information

Flow-based Anomaly Intrusion Detection System Using Neural Network

DATA MINING II - 1DL460

Chapter 3. Foundations of Business Intelligence: Databases and Information Management

CS231N Section. Video Understanding 6/1/2018

Comparison Deep Learning Method to Traditional Methods Using for Network Intrusion Detection

Outlier detection using autoencoders

Data Mining An Overview ITEV, F /18

Contents PART I: CLOUD, BIG DATA, AND COGNITIVE COMPUTING 1

DATA MINING II - 1DL460

An Introduction to Data Mining in Institutional Research. Dr. Thulasi Kumar Director of Institutional Research University of Northern Iowa

Introduction to Data Mining

A Brief Introduction to Data Mining

Overview of Web Mining Techniques and its Application towards Web

Patterns that Matter

Neural Networks In Data Mining

Scalable Video Coding

Statistical Learning and Data Mining CS 363D/ SSC 358

Knowledge Discovery and Data Mining 1 (VO) ( )

A THREE LAYERED MODEL TO PERFORM CHARACTER RECOGNITION FOR NOISY IMAGES

Data Mining Concepts

Feature Selection in the Corrected KDD -dataset

CS 4510/9010 Applied Machine Learning. Neural Nets. Paula Matuszek Fall copyright Paula Matuszek 2016

Knowledge Discovery and Data Mining

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

Polytechnic University of Tirana

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?

Social Network Mining An Introduction

Disquisition of a Novel Approach to Enhance Security in Data Mining

Cluster Based detection of Attack IDS using Data Mining

Database and Knowledge-Base Systems: Data Mining. Martin Ester

Deep Learning. Deep Learning. Practical Application Automatically Adding Sounds To Silent Movies

Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey

Machine Learning in WAN Research

Contents. Part I Setting the Scene

Machine Learning in WAN Research

Efficient Algorithms may not be those we think

3 Object Detection. BVM 2018 Tutorial: Advanced Deep Learning Methods. Paul F. Jaeger, Division of Medical Image Computing

10 Years of Data Mining Research: Retrospect and Prospect

Connecting relevant video content to audiences CREDENTIALS DECK

Sensor networks. Ericsson

Data Mining. Neural Networks

Review on Data Mining Techniques for Intrusion Detection System

A Comparative Study of Data Mining Process Models (KDD, CRISP-DM and SEMMA)

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING

Different attack manifestations Network packets OS calls Audit records Application logs Different types of intrusion detection Host vs network IT

Summary. Machine Learning: Introduction. Marcin Sydow

Based on Big Data: Hype or Hallelujah? by Elena Baralis

Deep Tensor: Eliciting New Insights from Graph Data that Express Relationships between People and Things

Thanks to the advances of data processing technologies, a lot of data can be collected and stored in databases efficiently New challenges: with a

Introduction to Data Mining S L I D E S B Y : S H R E E J A S W A L

Lecture #11: The Perceptron

Character Recognition Using Convolutional Neural Networks

Chapter 1, Introduction

Wireless Sensor Networks --- Concepts and Challenges

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

SCENARIO BASED ADAPTIVE PREPROCESSING FOR STREAM DATA USING SVM CLASSIFIER

Neural Networks. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

An Introduction to Data Mining BY:GAGAN DEEP KAUSHAL

Deep (1) Matthieu Cord LIP6 / UPMC Paris 6

Data Mining. Chapter 1: Introduction. Adapted from materials by Jiawei Han, Micheline Kamber, and Jian Pei

Fuzzy Set Theory in Computer Vision: Example 3

IEEE Project Titles

Data Mining Course Overview

Wireless Sensor Networks --- Concepts and Challenges

Fully Convolutional Networks for Semantic Segmentation

Machine Learning (CSMML16) (Autumn term, ) Xia Hong

Transcription:

Next Steps in Data Mining Sistemas de Apoio à Decisão Cláudia Antunes

Temporal Data Mining Cláudia Antunes

Data Mining Knowledge Discovery is the nontrivial extraction of implicit, previously unknown, and potentially useful information from data. [Frawley, KDD 1995] Data sources Preprocessing Data Mining Evaluation Info 3

Data Mining: Open Issues New Visualization Forms Data sources Preprocessing Data Mining Evaluation Info 4 Privacy Issues

Data Sources Time Series Bio Sequences Social Nets Data Streams 5

Mining Data Streams Streams continuous, online process e.g. how to monitor network packets for intruders? concept drift and environment drift? RFID network and sensor network data Requirements: small constant time per record fixed amount of memory at most one scan of data model always available model up-to-date 6

Mining Networks Community and Social Networks Linked data between emails, Web pages, blogs, citations, sequences and people Static and dynamic structural behavior Mining in and for Computer Networks detect anomalies (e.g., sudden traffic spikes due to a DoS (Denial of Service) attacks Need to handle 10Gig Ethernet links (a) detect (b) trace back (c ) drop packet 7

Sequential and Time Series Data How to efficiently and accurately cluster, classify and predict the trends? Time series data used for predictions are contaminated by noise How to do accurate shortterm and long-term predictions? Signal processing techniques introduce lags in the filtered data, which reduces accuracy Key in source selection, domain knowledge in rules, and optimization methods 8

Mining Bio and Environmental Data New problems raise new questions Large scale problems especially so Biological data mining, such as HIV vaccine design DNA, chemical properties, 3D structures, and functional properties à need to be fused Environmental data mining Mining for solving the energy crisis 9

If you want a second opinion, I will ask my computer Inteligência Artificial - Aprendizagem by Cláudia Antunes 10

Guiding the Discovery Using domain knowledge to inform the methods How to represent knowledge? How to guide the process How to prevent discovery unknown patterns? 11

Security, Privacy and Data Integrity How to ensure the users privacy while their data are being mined? How to do data mining for protection of security and privacy? Knowledge integrity assessment Data are intentionally modified from in order to misinform the recipients 12

Scaling Up for Big Data Using Hadoop / MapReduce Distributed file system Must scale (linearly ) with Amount of data Number of machines Problem complexity 13

14

The traditional classification process (since the late 50's) Fixed/engineered features (or fixed kernel) + trainable classifier classifier hand-crafted Feature Extractor Simple Trainable Classifier 15

Deep Learning Deep learning = Feature learning Trainable features (or kernel) + trainable classifier classifier hand-crafted Trainable Feature Extractor Simple Trainable Trainable Classifier 16

Representation Learning a set of methods that allows a machine to be fed with raw data and to automatically discover the representations needed for detection or classification. pixels edges object parts (combination of edges) object models 17

Deep Learning Deep-learning methods are representationlearning methods with multiple levels of representation, obtained by composing simple but non-linear modules that each transform the representation at one level (starting with the raw input) into a representation at a higher, slightly more abstract level 18

Deep Learning Alternatives Feed-Forward: multilayer neural nets, convolutional nets Feed-Back: Stacked Sparse Coding, Deconvolutional Nets Bi-Drectional: Deep Boltzmann Machines, Stacked Auto-Encoders 19

20