Mining Significant Graph Patterns by Leap Search
|
|
- Felicia Shaw
- 5 years ago
- Views:
Transcription
1 Mining Significant Graph Patterns by Leap Search Xifeng Yan (IBM T. J. Watson) Hong Cheng, Jiawei Han (UIUC) Philip S. Yu (UIC)
2 Graphs Are Everywhere Magwene et al. Genome Biology :R100 Co-expression Network Program Flow Social Network Chemical Compound Protein Structure 2
3 Graph Pattern Mining 3
4 Graph Patterns Interestingness measures / Objective functions Frequency: frequent graph pattern Discriminative: information gain, Fisher score Significance: G-test 4
5 Frequent Graph Pattern 5
6 Optimal Graph Pattern (this work) 6
7 Objective Functions Challenge: Not Anti-Monotonic X 7
8 Challenge: Non Anti-Monotonic Non Monotonic Anti-Monotonic Enumerate subgraphs : small-size to large-size Non-Monotonic: Enumerate all subgraphs then check their score? 8
9 Frequent Pattern Based Mining Framework Exploratory task Graph clustering Graph classification Graph index Graph Database Frequent Patterns Optimal Patterns (SIGMOD 04, 05) (ISMB 05, 07) 1. Bottleneck : millions, even billions of patterns 2. No guarantee of quality 9
10 Direct Pattern Mining Framework Exploratory task Graph clustering Direct Graph classification Graph index Graph Database Optimal Patterns How? 10
11 Upper-Bound IBM T. J. Watson Research Center 11
12 Upper-Bound: Anti-Monotonic (cont.) Rule of Thumb : If the frequency difference of a graph pattern in the positive dataset and the negative dataset increases, the pattern becomes more interesting We can recycle the existing graph mining algorithms to accommodate non-monotonic functions. 12
13 Vertical Pruning Large <- small 13
14 Horizontal Pruning: Structural Proximity 14
15 Structural Proximity: Another Perspective # of frequent patterns >> # of possible frequency pairs Many patterns share the same score 15
16 Structural Leap Search 16
17 Frequency Association Significant patterns often fall into the high-quantile of frequency Starting with the most frequent patterns 17
18 Descending Leap Mine 1. Structural Leap Search with frequency threshold F(g*) converges 2. Frequency-Descending Mining 3. Structural Leap Search 18
19 Results: NCI Anti-Cancer Screen Datasets Chemical Compounds: anti-cancer or not # of vertices: 10 ~ 200 Name MCF-7 MOLT-4 NCI-H23 OVCAR-8 P388 PC-3 SF-295 SN12C SW-620 UACC257 YEAST # of Compounds 27,770 39,765 40,353 40,516 41,472 27,509 40,271 40,004 40,532 39,988 79,601 Tumor Description Breast Leukemia Non-Small Cell Lung Ovarian Leukemia Prostate Central Nerve System Renal Colon Melanoma Yeast anti-cancer 19 Link:
20 Efficiency IBM T. J. Watson Research Center Vertical Pruning Vertical Pruning + Horizontal Pruning 20
21 Effectiveness IBM T. J. Watson Research Center frequency descending frequency descending + structural leap search 21
22 Graph Classification Name OA Kernel LEAP OA Kernel (6x) LEAP (6x) Average (AUC) (6x) (6x) * OA Kernel: Optimal Assignment Kernel LEAP: LEAP search 22
23 Scalability Means Something! ~8000sec OA(6X) Quadratic ~200sec ~100sec ~20sec OA LEAP(6X) LEAP Linear 23
24 Beyond Graph Patterns Pattern-based categorical data classification (ICDE 07) 24
25 Beyond Graph Patterns (cont.) 1. Direct mining can be applied to itemsets, sequences, and trees Direct Exploratory task Clustering Classification Index itemset/sequence/tree Database Optimal Patterns 2. Existing algorithms can be recycled to mine patterns with sophisticated measures. 3. Pattern-based methods including indexing and classification are competitive. 25
26 Thank You 26
GRAPH MINING AND GRAPH KERNELS
GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan* ^University of Cambridge *IBM T. J. Watson Research Center August 24, 2008 ACM SIG KDD, Las Vegas Graphs Are Everywhere
More informationData Mining in Bioinformatics Day 5: Graph Mining
Data Mining in Bioinformatics Day 5: Graph Mining Karsten Borgwardt February 25 to March 10 Bioinformatics Group MPIs Tübingen from Borgwardt and Yan, KDD 2008 tutorial Graph Mining and Graph Kernels,
More informationData Mining in Bioinformatics Day 3: Graph Mining
Graph Mining and Graph Kernels Data Mining in Bioinformatics Day 3: Graph Mining Karsten Borgwardt & Chloé-Agathe Azencott February 6 to February 17, 2012 Machine Learning and Computational Biology Research
More informationTOWARDS ACCURATE AND EFFICIENT CLASSIFICATION: A DISCRIMINATIVE AND FREQUENT PATTERN-BASED APPROACH
c 28 Hong Cheng TOWARDS ACCURATE AND EFFICIENT CLASSIFICATION: A DISCRIMINATIVE AND FREQUENT PATTERN-BASED APPROACH BY HONG CHENG B.S., Zhejiang University, 21 M.Phil., Hong Kong University of Science
More informationHierarchical clustering
Hierarchical clustering Rebecca C. Steorts, Duke University STA 325, Chapter 10 ISL 1 / 63 Agenda K-means versus Hierarchical clustering Agglomerative vs divisive clustering Dendogram (tree) Hierarchical
More informationMulti-Label Feature Selection for Graph Classification
Multi-Label Feature Selection for Graph Classification Xiangnan Kong Department of Computer Science University of Illinois at Chicago, IL, USA xkong4@uic.edu Philip S. Yu Department of Computer Science
More informationgmlc: a multi-label feature selection framework for graph classification
Under consideration for publication in Knowledge and Information Systems gmlc: a multi-label feature selection framework for graph classification Xiangnan Kong, Philip S. Yu Department of Computer Science,
More informationExploratory data analysis for microarrays
Exploratory data analysis for microarrays Jörg Rahnenführer Computational Biology and Applied Algorithmics Max Planck Institute for Informatics D-66123 Saarbrücken Germany NGFN - Courses in Practical DNA
More informationMining Quantitative Maximal Hyperclique Patterns: A Summary of Results
Mining Quantitative Maximal Hyperclique Patterns: A Summary of Results Yaochun Huang, Hui Xiong, Weili Wu, and Sam Y. Sung 3 Computer Science Department, University of Texas - Dallas, USA, {yxh03800,wxw0000}@utdallas.edu
More informationFinding the Best Not the Most: Regularized Loss Minimization Subgraph Selection for Graph Classification
Finding the Best Not the Most: Regularized Loss Minimization Subgraph Selection for Graph Classification Shirui Pan a,, Jia Wu a, Xingquan Zhu b, Guodong Long a, Chengqi Zhang a a Centre for Quantum Computation
More informationManaging and Mining Graph Data
Managing and Mining Graph Data by Charu C. Aggarwal IBM T.J. Watson Research Center Hawthorne, NY, USA Haixun Wang Microsoft Research Asia Beijing, China
More informationIntegration of Classification and Pattern Mining: A Discriminative and Frequent Pattern-Based Approach
Integration of Classification and Pattern Mining: A Discriminative and Frequent Pattern-Based Approach Hong Cheng Jiawei Han Chinese Univ. of Hong Kong Univ. of Illinois at U-C hcheng@se.cuhk.edu.hk hanj@cs.uiuc.edu
More informationPositive and Unlabeled Learning for Graph Classification
Positive and Unlabeled Learning for Graph Classification Yuchen Zhao Department of Computer Science University of Illinois at Chicago Chicago, IL Email: yzhao@cs.uic.edu Xiangnan Kong Department of Computer
More informationStatistics 202: Data Mining. c Jonathan Taylor. Week 8 Based in part on slides from textbook, slides of Susan Holmes. December 2, / 1
Week 8 Based in part on slides from textbook, slides of Susan Holmes December 2, 2012 1 / 1 Part I Clustering 2 / 1 Clustering Clustering Goal: Finding groups of objects such that the objects in a group
More informationData Mining for Knowledge Management. Association Rules
1 Data Mining for Knowledge Management Association Rules Themis Palpanas University of Trento http://disi.unitn.eu/~themis 1 Thanks for slides to: Jiawei Han George Kollios Zhenyu Lu Osmar R. Zaïane Mohammad
More informationWhich Null-Invariant Measure Is Better? Which Null-Invariant Measure Is Better?
Which Null-Invariant Measure Is Better? D 1 is m,c positively correlated, many null transactions D 2 is m,c positively correlated, little null transactions D 3 is m,c negatively correlated, many null transactions
More informationExploratory data analysis for microarrays
Exploratory data analysis for microarrays Adrian Alexa Computational Biology and Applied Algorithmics Max Planck Institute for Informatics D-66123 Saarbrücken slides by Jörg Rahnenführer NGFN - Courses
More informationContents. Preface to the Second Edition
Preface to the Second Edition v 1 Introduction 1 1.1 What Is Data Mining?....................... 4 1.2 Motivating Challenges....................... 5 1.3 The Origins of Data Mining....................
More informationCS6220: DATA MINING TECHNIQUES
CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data: Part I Instructor: Yizhou Sun yzsun@ccs.neu.edu November 12, 2013 Announcement Homework 4 will be out tonight Due on 12/2 Next class will be canceled
More informationgprune: A Constraint Pushing Framework for Graph Pattern Mining
gprune: A Constraint Pushing Framework for Graph Pattern Mining Feida Zhu Xifeng Yan Jiawei Han Philip S. Yu Computer Science, UIUC, {feidazhu,xyan,hanj}@cs.uiuc.edu IBM T. J. Watson Research Center, psyu@us.ibm.com
More informationLecture Topic Projects 1 Intro, schedule, and logistics 2 Data Science components and tasks 3 Data types Project #1 out 4 Introduction to R,
Lecture Topic Projects 1 Intro, schedule, and logistics 2 Data Science components and tasks 3 Data types Project #1 out 4 Introduction to R, statistics foundations 5 Introduction to D3, visual analytics
More informationCategorization of Sequential Data using Associative Classifiers
Categorization of Sequential Data using Associative Classifiers Mrs. R. Meenakshi, MCA., MPhil., Research Scholar, Mrs. J.S. Subhashini, MCA., M.Phil., Assistant Professor, Department of Computer Science,
More informationTransfer String Kernel for Cross-Context Sequence Specific DNA-Protein Binding Prediction. by Ritambhara Singh IIIT-Delhi June 10, 2016
Transfer String Kernel for Cross-Context Sequence Specific DNA-Protein Binding Prediction by Ritambhara Singh IIIT-Delhi June 10, 2016 1 Biology in a Slide DNA RNA PROTEIN CELL ORGANISM 2 DNA and Diseases
More informationDual Active Feature and Sample Selection for Graph Classification
Dual Active Feature and Sample Selection for Graph Classification Xiangnan Kong University of Illinois at Chicago Chicago, IL, USA xkong4@uic.edu Wei Fan IBM T. J. Watson Research Hawthorn, NY, USA weifan@us.ibm.com
More informationgspan: Graph-Based Substructure Pattern Mining
University of Illinois at Urbana-Champaign February 3, 2017 Agenda What motivated the development of gspan? Technical Preliminaries Exploring the gspan algorithm Experimental Performance Evaluation Introduction
More informationCARPENTER Find Closed Patterns in Long Biological Datasets. Biological Datasets. Overview. Biological Datasets. Zhiyu Wang
CARPENTER Find Closed Patterns in Long Biological Datasets Zhiyu Wang Biological Datasets Gene expression Consists of large number of genes Knowledge Discovery and Data Mining Dr. Osmar Zaiane Department
More informationModeling Big Data Variety with Graph Mining Techniques
Modeling Big Data Variety with Graph Mining Techniques BY XIANGNAN KONG B.S., Nanjing University, 2006 M.S., Nanjing University, 2009 THESIS Submitted as partial fulfillment of the requirements for the
More informationExploratory data analysis for microarrays
Exploratory data analysis for microarrays Jörg Rahnenführer Computational Biology and Applied Algorithmics Max Planck Institute for Informatics D-66123 Saarbrücken Germany NGFN - Courses in Practical DNA
More informationFrequent Pattern Mining with Uncertain Data
Charu C. Aggarwal 1, Yan Li 2, Jianyong Wang 2, Jing Wang 3 1. IBM T J Watson Research Center 2. Tsinghua University 3. New York University Frequent Pattern Mining with Uncertain Data ACM KDD Conference,
More informationMining Frequent Patterns from Very High Dimensional Data: A Top-Down Row Enumeration Approach *
Mining Frequent Patterns from Very High Dimensional Data: A Top-Down Row Enumeration Approach * Hongyan Liu 1 Jiawei Han 2 Dong Xin 2 Zheng Shao 2 1 Department of Management Science and Engineering, Tsinghua
More informationMin-Hash Fingerprints for Graph Kernels: A Trade-off among Accuracy, Efficiency, and Compression
Min-Hash Fingerprints for Graph Kernels: A Trade-off among Accuracy, Efficiency, and Compression Carlos H. C. Teixeira, Arlei Silva, Wagner Meira Jr. Computer Science Department Universidade Federal de
More informationSurvey on Graph Query Processing on Graph Database. Presented by FAN Zhe
Survey on Graph Query Processing on Graph Database Presented by FA Zhe utline Introduction of Graph and Graph Database. Background of Subgraph Isomorphism. Background of Subgraph Query Processing. Background
More informationGraph Mining and Social Network Analysis
Graph Mining and Social Network Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References q Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann
More informationOn Demand Phenotype Ranking through Subspace Clustering
On Demand Phenotype Ranking through Subspace Clustering Xiang Zhang, Wei Wang Department of Computer Science University of North Carolina at Chapel Hill Chapel Hill, NC 27599, USA {xiang, weiwang}@cs.unc.edu
More informationPackage omicade4. June 29, 2018
Type Package Title Multiple co-inertia analysis of omics datasets Version 1.20.0 Date 2017-04-24 Package omicade4 June 29, 2018 Author, Aedin Culhane, Amin M. Gholami. Maintainer
More informationStability of Feature Selection Algorithms
Stability of Feature Selection Algorithms Alexandros Kalousis, Jullien Prados, Phong Nguyen Melanie Hilario Artificial Intelligence Group Department of Computer Science University of Geneva Stability of
More informationCLOSET+:Searching for the Best Strategies for Mining Frequent Closed Itemsets
CLOSET+:Searching for the Best Strategies for Mining Frequent Closed Itemsets Jianyong Wang, Jiawei Han, Jian Pei Presentation by: Nasimeh Asgarian Department of Computing Science University of Alberta
More informationAN EFFICIENT CLASSIFICATION OF FAULT DETECTION THROUGH COMPRESSED TREE (CT) APRIORI BASED APPROACH USING ARC-BC CLASSIFIER
AN EFFICIENT CLASSIFICATION OF FAULT DETECTION THROUGH COMPRESSED TREE (CT) APRIORI BASED APPROACH USING ARC-BC CLASSIFIER R. Jeevarathinam 1 and T. Santhanam 2 1 Department of Computer Science, SNR Sons
More informationData Mining: Concepts and Techniques. (3 rd ed.) Chapter 7
Data Mining: Concepts and Techniques (3 rd ed.) Chapter 7 Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign & Simon Fraser University 2013-2017 Han and Kamber & Pei.
More informationA Survey on Image Classification using Data Mining Techniques Vyoma Patel 1 G. J. Sahani 2
IJSRD - International Journal for Scientific Research & Development Vol. 2, Issue 10, 2014 ISSN (online): 2321-0613 A Survey on Image Classification using Data Mining Techniques Vyoma Patel 1 G. J. Sahani
More informationGraph Mining Sub Domains and a Framework for Indexing A Graphical Approach
Graph Mining Sub Domains and a Framework for Indexing A Graphical Approach K. Vivekanandan Professor BSMED A. Pankaj Moses Monickaraj (Correspoding author) Doctoral Scholar Department of Computer Science
More informationRECENT years have witnessed a wide range of applications
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 28, NO. 3, MARCH 2016 715 Joint Structure Feature Exploration and Regularization for Multi-Task Graph Classification Shirui Pan, Jia Wu, Xingquan
More informationEfficient Subgraph Matching by Postponing Cartesian Products
Efficient Subgraph Matching by Postponing Cartesian Products Computer Science and Engineering Lijun Chang Lijun.Chang@unsw.edu.au The University of New South Wales, Australia Joint work with Fei Bi, Xuemin
More informationSVM Classification in -Arrays
SVM Classification in -Arrays SVM classification and validation of cancer tissue samples using microarray expression data Furey et al, 2000 Special Topics in Bioinformatics, SS10 A. Regl, 7055213 What
More informationBehavior Query Discovery in System-Generated Temporal Graphs
Behavior Query Discovery in System-Generated Temporal Graphs Bo Zong,, Xusheng Xiao, Zhichun Li, Zhenyu Wu, Zhiyun Qian, Xifeng Yan, Ambuj K. Singh, Guofei Jiang UC Santa Barbara NEC Labs, America UC Riverside
More informationFP-Growth algorithm in Data Compression frequent patterns
FP-Growth algorithm in Data Compression frequent patterns Mr. Nagesh V Lecturer, Dept. of CSE Atria Institute of Technology,AIKBS Hebbal, Bangalore,Karnataka Email : nagesh.v@gmail.com Abstract-The transmission
More informationOrder Preserving Clustering by Finding Frequent Orders in Gene Expression Data
Order Preserving Clustering by Finding Frequent Orders in Gene Expression Data Li Teng and Laiwan Chan Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong Abstract.
More informationData Mining in Bioinformatics Day 5: Frequent Subgraph Mining
Data Mining in Bioinformatics Day 5: Frequent Subgraph Mining Chloé-Agathe Azencott & Karsten Borgwardt February 18 to March 1, 2013 Machine Learning & Computational Biology Research Group Max Planck Institutes
More informationAC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery
: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Hong Cheng Philip S. Yu Jiawei Han University of Illinois at Urbana-Champaign IBM T. J. Watson Research Center {hcheng3, hanj}@cs.uiuc.edu,
More informationSemi-supervised Clustering of Graph Objects: A Subgraph Mining Approach
Semi-supervised Clustering of Graph Objects: A Subgraph Mining Approach Xin Huang 1, Hong Cheng 1, Jiong Yang 2, Jeffery Xu Yu 1, Hongliang Fei 3, and Jun Huan 3 1 The Chinese University of Hong Kong 2
More informationCODENSE v
CODENSE v1.0 ----------------- INTRODUCTION Given a relation graph dataset, D={G 1,G 2, G n }, where G i =(V,E i ), Definition 1 (Support) The support of a graph g is the number of graphs (in D) where
More informationExtraction of Frequent Subgraph from Graph Database
Extraction of Frequent Subgraph from Graph Database Sakshi S. Mandke, Sheetal S. Sonawane Deparment of Computer Engineering Pune Institute of Computer Engineering, Pune, India. sakshi.mandke@cumminscollege.in;
More informationMeta-path based Multi-Network Collective Link Prediction
Meta-path based Multi-Network Collective Link Prediction Jiawei Zhang 1,2, Philip S. Yu 1, Zhi-Hua Zhou 2 University of Illinois at Chicago 2, Nanjing University 2 Traditional social link prediction in
More informationWIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity
WIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity Unil Yun and John J. Leggett Department of Computer Science Texas A&M University College Station, Texas 7783, USA
More informationUtility Mining: An Enhanced UP Growth Algorithm for Finding Maximal High Utility Itemsets
Utility Mining: An Enhanced UP Growth Algorithm for Finding Maximal High Utility Itemsets C. Sivamathi 1, Dr. S. Vijayarani 2 1 Ph.D Research Scholar, 2 Assistant Professor, Department of CSE, Bharathiar
More informationMaintaining Frequent Itemsets over High-Speed Data Streams
Maintaining Frequent Itemsets over High-Speed Data Streams James Cheng, Yiping Ke, and Wilfred Ng Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay, Kowloon,
More informationAppropriate Item Partition for Improving the Mining Performance
Appropriate Item Partition for Improving the Mining Performance Tzung-Pei Hong 1,2, Jheng-Nan Huang 1, Kawuu W. Lin 3 and Wen-Yang Lin 1 1 Department of Computer Science and Information Engineering National
More informationComparative Survey of Query Processing on Graph Databases
Comparative Survey of Query Processing on Graph Databases Project Report for COP5725: Spring 2013 Group name: Sunsteeds (Sharanya Jayaraman, Srinath Viswanathan) April 25, 2013 Abstract Graph Databases
More informationRHUIET : Discovery of Rare High Utility Itemsets using Enumeration Tree
International Journal for Research in Engineering Application & Management (IJREAM) ISSN : 2454-915 Vol-4, Issue-3, June 218 RHUIET : Discovery of Rare High Utility Itemsets using Enumeration Tree Mrs.
More informationStats Overview Ji Zhu, Michigan Statistics 1. Overview. Ji Zhu 445C West Hall
Stats 415 - Overview Ji Zhu, Michigan Statistics 1 Overview Ji Zhu 445C West Hall 734-936-2577 jizhu@umich.edu Stats 415 - Overview Ji Zhu, Michigan Statistics 2 What is Data Mining? Data mining is a multi-disciplinary
More informationChapter 4 Data Mining A Short Introduction
Chapter 4 Data Mining A Short Introduction Data Mining - 1 1 Today's Question 1. Data Mining Overview 2. Association Rule Mining 3. Clustering 4. Classification Data Mining - 2 2 1. Data Mining Overview
More informationPattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42
Pattern Mining Knowledge Discovery and Data Mining 1 Roman Kern KTI, TU Graz 2016-01-14 Roman Kern (KTI, TU Graz) Pattern Mining 2016-01-14 1 / 42 Outline 1 Introduction 2 Apriori Algorithm 3 FP-Growth
More informationPC Tree: Prime-Based and Compressed Tree for Maximal Frequent Patterns Mining
Chapter 42 PC Tree: Prime-Based and Compressed Tree for Maximal Frequent Patterns Mining Mohammad Nadimi-Shahraki, Norwati Mustapha, Md Nasir B Sulaiman, and Ali B Mamat Abstract Knowledge discovery or
More informationChapter 1, TUFTE STYLE GRIDDING FOR READABILITY. Chapter 5, SLICE (CROSS-SECTIONAL VIEWS)
Chapter, TUFTE STYLE GRIDDING FOR READABILITY Chapter 5, SLICE (CROSS-SECTIONAL VIEWS) Number of responses 8 7 6 5 4 3 2 9 8 7 6 5 4 3 2 Distribution of ethnicities in each income group of SF bay area
More informationMachine Learning: Symbolische Ansätze
Machine Learning: Symbolische Ansätze Unsupervised Learning Clustering Association Rules V2.0 WS 10/11 J. Fürnkranz Different Learning Scenarios Supervised Learning A teacher provides the value for the
More informationEFFICIENT TRANSACTION REDUCTION IN ACTIONABLE PATTERN MINING FOR HIGH VOLUMINOUS DATASETS BASED ON BITMAP AND CLASS LABELS
EFFICIENT TRANSACTION REDUCTION IN ACTIONABLE PATTERN MINING FOR HIGH VOLUMINOUS DATASETS BASED ON BITMAP AND CLASS LABELS K. Kavitha 1, Dr.E. Ramaraj 2 1 Assistant Professor, Department of Computer Science,
More informationClassifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped
More informationImproved Frequent Pattern Mining Algorithm with Indexing
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VII (Nov Dec. 2014), PP 73-78 Improved Frequent Pattern Mining Algorithm with Indexing Prof.
More informationDirect Local Pattern Sampling by Efficient Two-Step Random Procedures
Direct Local Pattern Sampling by Efficient Two-Step Random Procedures Mario Boley Fraunhofer IAIS and University of Bonn mario.boley@iais.fhg.de Claudio Lucchese I.S.T.I.-C.N.R. Pisa claudio.lucchese@isti.cnr.it
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu [Kumar et al. 99] 2/13/2013 Jure Leskovec, Stanford CS246: Mining Massive Datasets, http://cs246.stanford.edu
More informationA Combination Approach to Cluster Validation Based on Statistical Quantiles
A Combination Approach to Cluster Validation Based on Statistical Quantiles Amparo Albalate Institute of Information Technology University of Ulm Ulm, Germany amparo.albalate@uni-ulm.de David Suendermann
More informationLu.Getz.Miska_Nature.June.2005.mouse.lung
Lu.Getz.Miska_Nature.June.2005.mouse.lung Module name: Lu.Getz.Miska_Nature.June.2005.mouse.lung Description: Normal/tumor classifier and knn prediction of mouse lung samples Author: Gad Getz (Broad Institute),
More informationGene selection through Switched Neural Networks
Gene selection through Switched Neural Networks Marco Muselli Istituto di Elettronica e di Ingegneria dell Informazione e delle Telecomunicazioni Consiglio Nazionale delle Ricerche Email: Marco.Muselli@ieiit.cnr.it
More informationUAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA
UAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA METANAT HOOSHSADAT, SAMANEH BAYAT, PARISA NAEIMI, MAHDIEH S. MIRIAN, OSMAR R. ZAÏANE Computing Science Department, University
More informationData mining, 4 cu Lecture 8:
582364 Data mining, 4 cu Lecture 8: Graph mining Spring 2010 Lecturer: Juho Rousu Teaching assistant: Taru Itäpelto Frequent Subgraph Mining Extend association rule mining to finding frequent subgraphs
More informationBeyond Sliding Windows: Object Localization by Efficient Subwindow Search
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search Christoph H. Lampert, Matthew B. Blaschko, & Thomas Hofmann Max Planck Institute for Biological Cybernetics Tübingen, Germany Google,
More informationData Mining: Concepts and Techniques. Graph Mining. Graphs are Everywhere. Why Graph Mining? Chapter Graph mining
Data Mining: Concepts and Techniques Chapter 9 9.1. Graph mining Jiawei Han and Micheline Kamber Department of Computer Science University of Illinois at Urbana-Champaign www.cs.uiuc.edu/~hanj 2006 Jiawei
More information/ Computational Genomics. Normalization
10-810 /02-710 Computational Genomics Normalization Genes and Gene Expression Technology Display of Expression Information Yeast cell cycle expression Experiments (over time) baseline expression program
More informationFrequent Pattern Mining in Data Streams. Raymond Martin
Frequent Pattern Mining in Data Streams Raymond Martin Agenda -Breakdown & Review -Importance & Examples -Current Challenges -Modern Algorithms -Stream-Mining Algorithm -How KPS Works -Combing KPS and
More informationCS570 Introduction to Data Mining
CS570 Introduction to Data Mining Frequent Pattern Mining and Association Analysis Cengiz Gunay Partial slide credits: Li Xiong, Jiawei Han and Micheline Kamber George Kollios 1 Mining Frequent Patterns,
More informationWeb Usage Mining. Overview Session 1. This material is inspired from the WWW 16 tutorial entitled Analyzing Sequential User Behavior on the Web
Web Usage Mining Overview Session 1 This material is inspired from the WWW 16 tutorial entitled Analyzing Sequential User Behavior on the Web 1 Outline 1. Introduction 2. Preprocessing 3. Analysis 2 Example
More informationMODES v1.1 User Manual
----------------------- MODES v1.1 User Manual ----------------------- References Mining Coherent Dense Subgraphs Across Massive Biological Networks for Functional Discovery Haiyan Hu 1, Xifeng Yan 2,
More informationLecture notes for April 6, 2005
Lecture notes for April 6, 2005 Mining Association Rules The goal of association rule finding is to extract correlation relationships in the large datasets of items. Many businesses are interested in extracting
More informationGraph Classification Based on Pattern Co-occurrence
raph Classification ased on Pattern Co-occurrence Ning Jin University of North Carolina at Chapel Hill Chapel Hill, NC, USA njin@cs.unc.edu Calvin Young University of North Carolina at Chapel Hill Chapel
More informationA SURVEY OF DATA MINING & ITS APPLICATIONS
A SURVEY OF DATA MINING & ITS APPLICATIONS Pankaj jain M.Tech Student, Computer Science Siddhi Vinayak College of Science & Hr.Education, Alwar (Rajasthan) Abstract- Data mining consists of evolving set
More informationBiclustering with δ-pcluster John Tantalo. 1. Introduction
Biclustering with δ-pcluster John Tantalo 1. Introduction The subject of biclustering is chiefly concerned with locating submatrices of gene expression data that exhibit shared trends between genes. That
More informationEpilog: Further Topics
Ludwig-Maximilians-Universität München Institut für Informatik Lehr- und Forschungseinheit für Datenbanksysteme Knowledge Discovery in Databases SS 2016 Epilog: Further Topics Lecture: Prof. Dr. Thomas
More informationPrivacy-Preserving. Introduction to. Data Publishing. Concepts and Techniques. Benjamin C. M. Fung, Ke Wang, Chapman & Hall/CRC. S.
Chapman & Hall/CRC Data Mining and Knowledge Discovery Series Introduction to Privacy-Preserving Data Publishing Concepts and Techniques Benjamin C M Fung, Ke Wang, Ada Wai-Chee Fu, and Philip S Yu CRC
More informationIncremental SVM and Visualization Tools for Biomedical
Incremental SVM and Visualization Tools for Biomedical Data Mining Thanh-Nghi Do, François Poulet ESIEA Recherche 38, rue des Docteurs Calmette et Guérin Parc Universitaire de Laval-Changé 53000 Laval
More informationDATA MINING INTRODUCTION TO CLASSIFICATION USING LINEAR CLASSIFIERS
DATA MINING INTRODUCTION TO CLASSIFICATION USING LINEAR CLASSIFIERS 1 Classification: Definition Given a collection of records (training set ) Each record contains a set of attributes and a class attribute
More informationExploring high dimensional data with Butterfly: a novel classification algorithm based on discrete dynamical systems
Exploring high dimensional data with Butterfly: a novel classification algorithm based on discrete dynamical systems J o s e p h G e r a c i, M o y e z D h a r s e e, P a u l o N u i n, A l e x a n d r
More informationFPGP: Graph Processing Framework on FPGA
FPGP: Graph Processing Framework on FPGA Guohao DAI, Yuze CHI, Yu WANG, Huazhong YANG E.E. Dept., TNLIST, Tsinghua University dgh14@mails.tsinghua.edu.cn 1 Big graph is widely used Big graph is widely
More informationSeqIndex: Indexing Sequences by Sequential Pattern Analysis
SeqIndex: Indexing Sequences by Sequential Pattern Analysis Hong Cheng Xifeng Yan Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign {hcheng3, xyan, hanj}@cs.uiuc.edu
More informationCSE 5243 INTRO. TO DATA MINING
CSE 5243 INTRO. TO DATA MINING Mining Frequent Patterns and Associations: Basic Concepts (Chapter 6) Huan Sun, CSE@The Ohio State University 10/19/2017 Slides adapted from Prof. Jiawei Han @UIUC, Prof.
More informationData Mining Part 3. Associations Rules
Data Mining Part 3. Associations Rules 3.2 Efficient Frequent Itemset Mining Methods Fall 2009 Instructor: Dr. Masoud Yaghini Outline Apriori Algorithm Generating Association Rules from Frequent Itemsets
More informationAn overview of Graph Categories and Graph Primitives
An overview of Graph Categories and Graph Primitives Dino Ienco (dino.ienco@irstea.fr) https://sites.google.com/site/dinoienco/ Topics I m interested in: Graph Database and Graph Data Mining Social Network
More informationAPRIORI ALGORITHM FOR MINING FREQUENT ITEMSETS A REVIEW
International Journal of Computer Application and Engineering Technology Volume 3-Issue 3, July 2014. Pp. 232-236 www.ijcaet.net APRIORI ALGORITHM FOR MINING FREQUENT ITEMSETS A REVIEW Priyanka 1 *, Er.
More informationRole of Association Rule Mining in DNA Microarray Data - A Research
Role of Association Rule Mining in DNA Microarray Data - A Research T. Arundhathi Asst. Professor Department of CSIT MANUU, Hyderabad Research Scholar Osmania University, Hyderabad Prof. T. Adilakshmi
More informationCOMP 465: Data Mining Classification Basics
Supervised vs. Unsupervised Learning COMP 465: Data Mining Classification Basics Slides Adapted From : Jiawei Han, Micheline Kamber & Jian Pei Data Mining: Concepts and Techniques, 3 rd ed. Supervised
More informationEfficient homomorphism-free enumeration of conjunctive queries
Efficient homomorphism-free enumeration of conjunctive queries Jan Ramon 1, Samrat Roy 1, and Jonny Daenen 2 1 K.U.Leuven, Belgium, Jan.Ramon@cs.kuleuven.be, Samrat.Roy@cs.kuleuven.be 2 University of Hasselt,
More information