Data Mining in Bioinformatics Day 3: Graph Mining

Size: px
Start display at page:

Download "Data Mining in Bioinformatics Day 3: Graph Mining"

Transcription

1 Graph Mining and Graph Kernels Data Mining in Bioinformatics Day 3: Graph Mining Karsten Borgwardt & Chloé-Agathe Azencott February 6 to February 17, 2012 Machine Learning and Computational Biology Research Group MPIs Tübingen August 24, 2008 ACM SIG KDD, Las Vegas From Borgwardt & Yan, Graph Mining & Graph Kernels, KDD tutorial, 2008 with permission from Xifeng Yan. Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 1

2 Graphs Graph Are Mining Everywhere and Graph Kernels Magwene et al. Genome Biology :R100 Co-expression Network Program Flow Social Network Chemical Compound Protein Structure Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 2

3 Mining Graph Patterns Graph Mining and Graph Kernels Graph Pattern Mining Frequent graph patterns Pattern summarization Optimal graph patterns Graph patterns with constraints Approximate graph patterns Graph Classification Pattern-based approach Decision tree Decision stumps Graph Compression Other important topics (graph model, laws, graph dynamics, social network analysis, visualization, summarization, graph clustering, link analysis, ) Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 3

4 Applications Graph Mining of and Graph Kernels Pattern Mining Mining biochemical structures Finding biological conserved subnetworks Finding functional modules Program control flow analysis Intrusion network analysis Mining communication networks Anomaly detection Mining XML structures Building blocks for graph classification, clustering, compression, comparison, correlation analysis, and indexing Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 4

5 Graph Pattern Mining Graph Mining and Graph Kernels Graph pattern mining (single graph setting) Graph classification (multiple graphs setting) Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 5

6 Graph Mining and Kernels Interesting Graph Patterns Interestingness measures / Objective functions Frequency: frequent graph patterns Discriminative: information gain, Fisher score Significance: G-test Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 6

7 Graphs Graph Mining and Graph Kernels A graph is an ordered pair (V, E) V is a set of vertices (nodes) E is a set of edges (links) e = (v1, v2) v1, v2 vertices in V undirected / directed / mixed loops / multigraphs / simple graphs unlabeled / labeled unweighted / weighted Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 7

8 Graph isomorphism Graph Mining and Graph Kernels G and H are isomorphic if there exist a bijection f between the vertices of G and the vertices of H such that any two vertices u and v are adjacent in H iff f(u) and f(v) are adjacent in G. f(1) = A f(2) = C f(3) = D f(4) = B f(5) = F f(6) = E Solving graph isomorphism is NP, and it s not known whether it is P or NPC. In practice, it can be solved efficiently for a number of classes of graphs [McKay1981]. The subgraph isomorphism problem is NP-hard. Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 8

9 Searching Graphs Graph Mining and Graph Kernels Breadth-First Search Explore all vertices at distance k before exploring vertices at distance (k+1). Depth-First Search Explore each branch til its end before exploring another branch. Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 9

10 Graph Mining and Graph Kernels Frequent Pattern Mining Frequent Pattern freq(g) is called the support of g θ is called the minimum support Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 10

11 Example: Graph Chemical Mining and Graph Kernels compounds (a) caffeine (b) theobromine (c) viagra Frequent subgraph: Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 11

12 Example: Graph Program Mining and Graph call Kernels graphs Min support = 2 Frequent subgraphs: Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 12

13 Algorithms Graph Mining and Graph Kernels Inductive Logic Programming (WARMR, King et al. 2001) Graphs are represented by Datalog facts Graph Based Approaches Apriori-based approach AGM/AcGM: Inokuchi et al. (PKDD 00) FSG: Kuramochi and Karypis (ICDM 01) PATH # : Vanetik and Gudes (ICDM 02, ICDM 04) FFSM: Huan et al. (ICDM 03) SPIN: Huan et al. (KDD 04) FTOSM: Hórvath et al. (KDD 06) Pattern growth approach Subdue: Holder et al. (KDD 94) MoFa: Borgelt and Berthold (ICDM 02) gspan: Yan and Han (ICDM 02) Gaston: Nijssen and Kok (KDD 04) CMTreeMiner: Chi et al. (TKDE 05) LEAP: Yan et al. (SIGMOD 08) Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 13

14 Apriori Property Graph Mining and Graph Kernels If a graph is frequent, all of its subgraphs are frequent. Therefore you can stop extending non-frequent patterns. k edges (k+1) edges heuristics Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 14

15 Cost Analysis Graph Mining and Graph Kernels number of candidates frequent infrequent (X) duplicate (X) data isomorphism checking Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 15

16 Design Principles Graph Mining and Graph Kernels Search Order breadth vs. depth complete vs. incomplete Support Calculation compute for each new pattern vs. remember all subgraph isomorphism tests 1. Generation of Candidate Patterns apriori vs. pattern growth 2. Discovery Order of Patterns DFS order path tree graph 3. Elimination of Duplicate Subgraphs passive vs. active Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 16

17 Generation Graph Mining of Candidate Graph Kernels Patterns size k size (k+1) size k size (k+1) size (k+2) G Q P G 1 G 2 G n G G 1 G 2 G 1 G 2 Apriori-Based Approach join 2 patterns of size k that share a pattern of size (k-1) Pattern-Growth Approach extend patterns of size k by 1 edge / vertex Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 17

18 Discovery Graph Mining Order and Graph Kernels Free Extension 6 edges 7 edges 22 new patterns Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 18

19 Discovery Graph Mining Order and Graph Right-Most Kernels Extension [Yan and Han ICDM 02] 6 edges start end right-most path depth-first search 7 edges 4 new patterns Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 19

20 Duplicates Elimination Graph Mining and Graph Kernels Existing patterns Newly discovered pattern g1, g2,, gm g Option 1 Check graph isomorphism of g with each pattern g1, g2,, gm (slow) Option 2 Transform each graph to a canonical label, create a hash value for this canonical label, and check if there is a match with g (faster) Option 3 Build a canonical order and generate graph patterns in that order (fastest) Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 20

21 Duplicates Graph Mining Elimination and Graph Kernels Run Time AIDS antiviral screen compound dataset from NCI/NIH [Wörlein et al. PKDD 05] Run time per pattern (msec) MoFA: option 1 + free extension gspan: option 1 + right-most extension FFSM: option 2 Gaston: option 3 Minimum support (in %) Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 21

22 Duplicates Graph Mining Elimination and Graph Kernels Memory Usage [Wörlein et al. PKDD 05] Memory usage (GB) MoFA: option 1 + free extension gspan: option 1 + right-most extension FFSM: option 2 Gaston: option 3 Minimum support (in %) Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 22

23 Graph Pattern Graph Mining and Explosion Graph Kernels Problem Conclusion: Many enumeration algorithms (AGM, FSG, gspan, Path- Join, MoFa, FFSM, SPIN, Gaston.) But: If a graph is frequent, all of its subgraphs are frequent (apriori property) An n-edge frequent graph may have 2 n subgraphs E.g.: AIDS antiviral screen dataset with 400+ compounds, support level 5% over1m frequent graph patterns Problem 1: How to interpret frequent patterns? Pattern summarization Problem 2: How to reduce the size of the pattern set? Closed and maximal graphs Problem 3: How to set the minimum support? Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 23

24 Pattern Graph Summarization Mining and Graph Kernels [Xin et al. KDD 06, Chen et al. CIKM 08] Too many patterns may not lead to more explicit knowledge It can confuse users as well as further discovery (e.g., clustering, classification, indexing, etc.) a small set of representative patterns that preserve most of the information Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 24

25 Pattern Graph Summarization Mining and Graph Kernels Pattern Distance distance patterns patterns data measure 1: pattern-based pattern containment pattern similarity measure 2: data-based data similarity Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 25

26 Closed Graph and Mining Maximal and Graph Kernels Patterns Closed Frequent Graph A frequent graph G is closed if there exists no supergraph of G that carries the same support as G If some of G s subgraphs have the same support, it is unnecessary to output these subgraphs (nonclosed graphs) Lossless compression: still ensures that the mining result is complete Maximal Frequent Graph A frequent graph G is maximal if there exists no supergraph of G that is frequent Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 26

27 Closed Graph and Mining Maximal and Graph Kernels Patterns Examples Data: Therefeore D is not closed is a subgraph of A, B, C but so is which has the same support (3) No supergraph of E is also a subgraph of all 3 graphs and therefore E is closed. is a subgraph of A, B and is also closed: none of its supergraphs has support 2 If θ = 70%, E is maximal: E is frequent None of its supergraphs is frequent Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 27

28 Closed Graph and Mining Maximal and Graph Kernels Patterns Sizes Number of patterns Minimum support Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 28

29 CloseGraph Mining (Yan and Graph and Kernels Han, KDD 03) [Yan and Han KDD 03] Pattern-Growth Approach k-edge G (k+1)-edge G 1 G 2 G n If: Under which condition can we stop searching supergraphs? (early termination) G and H are frequent G is a subgraph of H in any part of graphs in the dataset where G occurs, H also occurs, then we need not grow G, since none of G s supergraphs will be closed except those of H. Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 29

30 References Graph Mining & and Further Graph Kernels Reading B. McKay. Practical graph isomorphism. Congressus Numerantium, 30:45 87, M. Wörlein, T. Meinl, I. Fischer, and M. Philippsen, A quantitative comparison of the subgraph miners MoFa, gspan, FFSM, and Gaston, PKDD 2005 X. Yan and J. Han, gspan: graph-based substructure pattern mining, ICDM 2002 X. Yan and J. Han, CloseGraph: mining closed frequent graph patterns, KDD 2003 Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 30

31 More References Graph Mining and Graph Kernels C. Borgelt and M. R. Berthold, Mining molecular fragments: finding relevant substructures of molecules, ICDM 2002 C. Chen, C. X. Lin, X. Yan, and J. Han. On effective presentation of graph patterns: a structural representative approach, CIKM 2008 Y. Chi, Y. Xia, Y. Yang, and R. Muntz, Mining closed and maximal frequent subtrees from databases of labeled rooted trees, TKDE 2005 T. Horváth, J. Ramon, and S. Wrobel, Frequent subgraph mining in outerplanar graphs, KDD 2006 J. Huan, W. Wang, and J. Prins, Efficient mining of frequent subgraph in the presence of isomorphism, ICDM 2003 J. Huan, W. Wang, and J. Prins, and J. Yang, SPIN: Mining maximal frequent subgraphs from graph databases, KDD 2004 A. Inokuchi, T. Washio, and H. Motoda. An apriori-based algorithm for mining frequent substructures from graph data, PKDD 2000 R. King, A Srinivasan, and L Dehaspe, WARMR: a data mining tool for chemical data, J. Comput. Aided Mol. Des M. Kuramochi and G. Karypis. Frequent subgraph discovery, ICDM 2001 S. Nijssen and J. Kok, A quickstart in frequent structure mining can make a difference, KDD 2004 N. Vanetik, E. Gudes, and S. E. Shimony. Computing frequent graph patterns from semistructured data, ICDM 2002 D. Xin, H. Cheng, X. Yan, and J. Han, Extracting redundancy-aware top-k patterns, KDD 2006 X. Yan, H. Cheng, J. Han, and P. S. Yu, Mining significant graph patterns by leap search, SIGMOD 2008 Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 31

32 The end Graph Mining and Graph Kernels See you soon! Next: Feature Selection Karsten Borgwardt & Chloé-Agathe Azencott Data mining in Bioinformatics 32

Data Mining in Bioinformatics Day 5: Graph Mining

Data Mining in Bioinformatics Day 5: Graph Mining Data Mining in Bioinformatics Day 5: Graph Mining Karsten Borgwardt February 25 to March 10 Bioinformatics Group MPIs Tübingen from Borgwardt and Yan, KDD 2008 tutorial Graph Mining and Graph Kernels,

More information

Data Mining in Bioinformatics Day 5: Frequent Subgraph Mining

Data Mining in Bioinformatics Day 5: Frequent Subgraph Mining Data Mining in Bioinformatics Day 5: Frequent Subgraph Mining Chloé-Agathe Azencott & Karsten Borgwardt February 18 to March 1, 2013 Machine Learning & Computational Biology Research Group Max Planck Institutes

More information

GRAPH MINING AND GRAPH KERNELS

GRAPH MINING AND GRAPH KERNELS GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan* ^University of Cambridge *IBM T. J. Watson Research Center August 24, 2008 ACM SIG KDD, Las Vegas Graphs Are Everywhere

More information

Graph Pattern Mining

Graph Pattern Mining : Lecture VIII Graph Pattern Mining Computer Science Department Data Mining Research Nov 26, 2014 Announcement No Homework Slides available at www.cs.ucsb.edu/~xyan/classes/ns201 Two Quizzes (Dec 3, 10),

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data: Part I Instructor: Yizhou Sun yzsun@ccs.neu.edu November 12, 2013 Announcement Homework 4 will be out tonight Due on 12/2 Next class will be canceled

More information

Subdue: Compression-Based Frequent Pattern Discovery in Graph Data

Subdue: Compression-Based Frequent Pattern Discovery in Graph Data Subdue: Compression-Based Frequent Pattern Discovery in Graph Data Nikhil S. Ketkar University of Texas at Arlington ketkar@cse.uta.edu Lawrence B. Holder University of Texas at Arlington holder@cse.uta.edu

More information

Data Mining: Concepts and Techniques. Graph Mining. Graphs are Everywhere. Why Graph Mining? Chapter Graph mining

Data Mining: Concepts and Techniques. Graph Mining. Graphs are Everywhere. Why Graph Mining? Chapter Graph mining Data Mining: Concepts and Techniques Chapter 9 9.1. Graph mining Jiawei Han and Micheline Kamber Department of Computer Science University of Illinois at Urbana-Champaign www.cs.uiuc.edu/~hanj 2006 Jiawei

More information

MINING AND SEARCHING GRAPHS AND STRUCTURES

MINING AND SEARCHING GRAPHS AND STRUCTURES MINING AND SEARCHING GRAPHS AND STRUCTURES Jiawei Han Xifeng Yan Department of Computer Science University of Illinois at Urbana-Champaign Philip S. Yu IBM T. J. Watson Research Center http://ews.uiuc.edu/~xyan/tutorial/kdd06_graph.htm

More information

Extraction of Frequent Subgraph from Graph Database

Extraction of Frequent Subgraph from Graph Database Extraction of Frequent Subgraph from Graph Database Sakshi S. Mandke, Sheetal S. Sonawane Deparment of Computer Engineering Pune Institute of Computer Engineering, Pune, India. sakshi.mandke@cumminscollege.in;

More information

Mining Minimal Contrast Subgraph Patterns

Mining Minimal Contrast Subgraph Patterns Mining Minimal Contrast Subgraph Patterns Roger Ming Hieng Ting James Bailey Abstract In this paper, we introduce a new type of contrast pattern, the minimal contrast subgraph. It is able to capture structural

More information

gprune: A Constraint Pushing Framework for Graph Pattern Mining

gprune: A Constraint Pushing Framework for Graph Pattern Mining gprune: A Constraint Pushing Framework for Graph Pattern Mining Feida Zhu Xifeng Yan Jiawei Han Philip S. Yu Computer Science, UIUC, {feidazhu,xyan,hanj}@cs.uiuc.edu IBM T. J. Watson Research Center, psyu@us.ibm.com

More information

Survey on Graph Query Processing on Graph Database. Presented by FAN Zhe

Survey on Graph Query Processing on Graph Database. Presented by FAN Zhe Survey on Graph Query Processing on Graph Database Presented by FA Zhe utline Introduction of Graph and Graph Database. Background of Subgraph Isomorphism. Background of Subgraph Query Processing. Background

More information

gspan: Graph-Based Substructure Pattern Mining

gspan: Graph-Based Substructure Pattern Mining University of Illinois at Urbana-Champaign February 3, 2017 Agenda What motivated the development of gspan? Technical Preliminaries Exploring the gspan algorithm Experimental Performance Evaluation Introduction

More information

Graph Mining: Repository vs. Canonical Form

Graph Mining: Repository vs. Canonical Form Graph Mining: Repository vs. Canonical Form Christian Borgelt and Mathias Fiedler European Center for Soft Computing c/ Gonzalo Gutiérrez Quirós s/n, 336 Mieres, Spain christian.borgelt@softcomputing.es,

More information

Canonical Forms for Frequent Graph Mining

Canonical Forms for Frequent Graph Mining Canonical Forms for Frequent Graph Mining Christian Borgelt Dept. of Knowledge Processing and Language Engineering Otto-von-Guericke-University of Magdeburg borgelt@iws.cs.uni-magdeburg.de Summary. A core

More information

Chapters 11 and 13, Graph Data Mining

Chapters 11 and 13, Graph Data Mining CSI 4352, Introduction to Data Mining Chapters 11 and 13, Graph Data Mining Young-Rae Cho Associate Professor Department of Computer Science Balor Universit Graph Representation Graph An ordered pair GV,E

More information

Mining Top K Large Structural Patterns in a Massive Network

Mining Top K Large Structural Patterns in a Massive Network Mining Top K Large Structural Patterns in a Massive Network Feida Zhu Singapore Management University fdzhu@smu.edu.sg Xifeng Yan University of California at Santa Barbara xyan@cs.ucsb.edu Qiang Qu Peking

More information

Review Article Performance Evaluation of Frequent Subgraph Discovery Techniques

Review Article Performance Evaluation of Frequent Subgraph Discovery Techniques Mathematical Problems in Engineering, rticle ID 869198, 6 pages http://dx.doi.org/10.1155/2014/869198 Review rticle Performance Evaluation of Frequent Subgraph Discovery Techniques Saif Ur Rehman, 1 Sohail

More information

Mining frequent Closed Graph Pattern

Mining frequent Closed Graph Pattern Mining frequent Closed Graph Pattern Seminar aus maschninellem Lernen Referent: Yingting Fan 5.November Fachbereich 21 Institut Knowledge Engineering Prof. Fürnkranz 1 Outline Motivation and introduction

More information

Efficient homomorphism-free enumeration of conjunctive queries

Efficient homomorphism-free enumeration of conjunctive queries Efficient homomorphism-free enumeration of conjunctive queries Jan Ramon 1, Samrat Roy 1, and Jonny Daenen 2 1 K.U.Leuven, Belgium, Jan.Ramon@cs.kuleuven.be, Samrat.Roy@cs.kuleuven.be 2 University of Hasselt,

More information

9.1. Graph Mining, Social Network Analysis, and Multirelational Data Mining. Graph Mining

9.1. Graph Mining, Social Network Analysis, and Multirelational Data Mining. Graph Mining 9 Graph Mining, Social Network Analysis, and Multirelational Data Mining 9.1 We have studied frequent-itemset mining in Chapter 5 and sequential-pattern mining in Section 3 of Chapter 8. Many scientific

More information

EGDIM - Evolving Graph Database Indexing Method

EGDIM - Evolving Graph Database Indexing Method EGDIM - Evolving Graph Database Indexing Method Shariful Islam Department of Computer Science and Engineering University of Dhaka, Bangladesh tulip.du@gmail.com Chowdhury Farhan Ahmed Department of Computer

More information

Pattern Mining in Frequent Dynamic Subgraphs

Pattern Mining in Frequent Dynamic Subgraphs Pattern Mining in Frequent Dynamic Subgraphs Karsten M. Borgwardt, Hans-Peter Kriegel, Peter Wackersreuther Institute of Computer Science Ludwig-Maximilians-Universität Munich, Germany kb kriegel wackersr@dbs.ifi.lmu.de

More information

Dual Active Feature and Sample Selection for Graph Classification

Dual Active Feature and Sample Selection for Graph Classification Dual Active Feature and Sample Selection for Graph Classification Xiangnan Kong University of Illinois at Chicago Chicago, IL, USA xkong4@uic.edu Wei Fan IBM T. J. Watson Research Hawthorn, NY, USA weifan@us.ibm.com

More information

Combining Ring Extensions and Canonical Form Pruning

Combining Ring Extensions and Canonical Form Pruning Combining Ring Extensions and Canonical Form Pruning Christian Borgelt European Center for Soft Computing c/ Gonzalo Gutiérrez Quirós s/n, 00 Mieres, Spain christian.borgelt@softcomputing.es Abstract.

More information

FP-GROWTH BASED NEW NORMALIZATION TECHNIQUE FOR SUBGRAPH RANKING

FP-GROWTH BASED NEW NORMALIZATION TECHNIQUE FOR SUBGRAPH RANKING FP-GROWTH BASED NEW NORMALIZATION TECHNIQUE FOR SUBGRAPH RANKING E.R.Naganathan 1 S.Narayanan 2 K.Ramesh kumar 3 1 Department of Computer Applications, Velammal Engineering College Ambattur-Redhills Road,

More information

Les Cahiers du GERAD ISSN:

Les Cahiers du GERAD ISSN: Les Cahiers du GERAD ISSN: 0711 2440 SyGMA: Reducing Symmetry in Graph Mining C. Desrosiers, Ph. Galinier, P. Hansen, A. Hertz G 2007 12 February 2007 Revised: February 2008 Les textes publiés dans la

More information

Edgar: the Embedding-baseD GrAph MineR

Edgar: the Embedding-baseD GrAph MineR Edgar: the Embedding-baseD GrAph MineR Marc Wörlein, 1 Alexander Dreweke, 1 Thorsten Meinl, 2 Ingrid Fischer 2, and Michael Philippsen 1 1 University of Erlangen-Nuremberg, Computer Science Department

More information

A Quantitative Comparison of the Subgraph Miners MoFa, gspan, FFSM, and Gaston

A Quantitative Comparison of the Subgraph Miners MoFa, gspan, FFSM, and Gaston A Quantitative omparison of the Subgraph Miners MoFa,,, and Marc Wörlein, Thorsten Meinl, Ingrid Fischer, and Michael Philippsen University of Erlangen-Nuremberg, omputer Science Department 2, Martensstr.

More information

Using a Hash-Based Method for Apriori-Based Graph Mining

Using a Hash-Based Method for Apriori-Based Graph Mining Using a Hash-Based Method for Apriori-Based Graph Mining Phu Chien Nguyen, Takashi Washio, Kouzou Ohara, and Hiroshi Motoda The Institute of Scientific and Industrial Research, Osaka University 8-1 Mihogaoka,

More information

Discovering Frequent Topological Structures from Graph Datasets

Discovering Frequent Topological Structures from Graph Datasets Discovering Frequent Topological Structures from Graph Datasets R. Jin C. Wang D. Polshakov S. Parthasarathy G. Agrawal Department of Computer Science and Engineering Ohio State University, Columbus OH

More information

Edgar: the Embedding-baseD GrAph MineR

Edgar: the Embedding-baseD GrAph MineR Edgar: the Embedding-baseD GrAph MineR Marc Wörlein, 1 Alexander Dreweke, 1 Thorsten Meinl, 2 Ingrid Fischer 2, and Michael Philippsen 1 1 University of Erlangen-Nuremberg, Computer Science Department

More information

Mining Significant Graph Patterns by Leap Search

Mining Significant Graph Patterns by Leap Search Mining Significant Graph Patterns by Leap Search Xifeng Yan (IBM T. J. Watson) Hong Cheng, Jiawei Han (UIUC) Philip S. Yu (UIC) Graphs Are Everywhere Magwene et al. Genome Biology 2004 5:R100 Co-expression

More information

The Weisfeiler-Lehman Kernel

The Weisfeiler-Lehman Kernel The Weisfeiler-Lehman Kernel Karsten Borgwardt and Nino Shervashidze Machine Learning and Computational Biology Research Group, Max Planck Institute for Biological Cybernetics and Max Planck Institute

More information

A COMPARATIVE STUDY OF FREQUENT SUBGRAPH MINING ALGORITHMS

A COMPARATIVE STUDY OF FREQUENT SUBGRAPH MINING ALGORITHMS A COMPARATIVE STUDY OF FREQUENT SUBGRAPH MINING ALGORITHMS K.Lakshmi 1 and Dr. T. Meyyappan 2 1. Department of MCA, Sir M.Visvesvaraya Institute of Technology, Bangalore. lakshmi_kes@rediffmail.com 2.

More information

Mining Closed Relational Graphs with Connectivity Constraints

Mining Closed Relational Graphs with Connectivity Constraints Mining Closed Relational Graphs with Connectivity Constraints Xifeng Yan Computer Science Univ. of Illinois at Urbana-Champaign xyan@uiuc.edu X. Jasmine Zhou Molecular and Computational Biology Univ. of

More information

Graph Mining Sub Domains and a Framework for Indexing A Graphical Approach

Graph Mining Sub Domains and a Framework for Indexing A Graphical Approach Graph Mining Sub Domains and a Framework for Indexing A Graphical Approach K. Vivekanandan Professor BSMED A. Pankaj Moses Monickaraj (Correspoding author) Doctoral Scholar Department of Computer Science

More information

Mining Interesting Itemsets in Graph Datasets

Mining Interesting Itemsets in Graph Datasets Mining Interesting Itemsets in Graph Datasets Boris Cule Bart Goethals Tayena Hendrickx Department of Mathematics and Computer Science University of Antwerp firstname.lastname@ua.ac.be Abstract. Traditionally,

More information

Graph-based Learning. Larry Holder Computer Science and Engineering University of Texas at Arlington

Graph-based Learning. Larry Holder Computer Science and Engineering University of Texas at Arlington Graph-based Learning Larry Holder Computer Science and Engineering University of Texas at Arlingt 1 Graph-based Learning Multi-relatial data mining and learning SUBDUE graph-based relatial learner Discovery

More information

Positive and Unlabeled Learning for Graph Classification

Positive and Unlabeled Learning for Graph Classification Positive and Unlabeled Learning for Graph Classification Yuchen Zhao Department of Computer Science University of Illinois at Chicago Chicago, IL Email: yzhao@cs.uic.edu Xiangnan Kong Department of Computer

More information

Semi-supervised Clustering of Graph Objects: A Subgraph Mining Approach

Semi-supervised Clustering of Graph Objects: A Subgraph Mining Approach Semi-supervised Clustering of Graph Objects: A Subgraph Mining Approach Xin Huang 1, Hong Cheng 1, Jiong Yang 2, Jeffery Xu Yu 1, Hongliang Fei 3, and Jun Huan 3 1 The Chinese University of Hong Kong 2

More information

In Mathematics and computer science, the study of graphs is graph theory where graphs are data structures used to model

In Mathematics and computer science, the study of graphs is graph theory where graphs are data structures used to model ISSN: 0975-766X CODEN: IJPTFI Available Online through Research Article www.ijptonline.com A BRIEF REVIEW ON APPLICATION OF GRAPH THEORY IN DATA MINING Abhinav Chanana*, Tanya Rastogi, M.Yamuna VIT University,

More information

A Roadmap to an Enhanced Graph Based Data mining Approach for Multi-Relational Data mining

A Roadmap to an Enhanced Graph Based Data mining Approach for Multi-Relational Data mining A Roadmap to an Enhanced Graph Based Data mining Approach for Multi-Relational Data mining D.Kavinya 1 Student, Department of CSE, K.S.Rangasamy College of Technology, Tiruchengode, Tamil Nadu, India 1

More information

MINING GRAPH DATA EDITED BY. Diane J. Cook School of Electrical Engineering and Computei' Science Washington State University Puliman, Washington

MINING GRAPH DATA EDITED BY. Diane J. Cook School of Electrical Engineering and Computei' Science Washington State University Puliman, Washington MINING GRAPH DATA EDITED BY Diane J. Cook School of Electrical Engineering and Computei' Science Washington State University Puliman, Washington Lawrence B. Holder School of Electrical Engineering and

More information

Mining Molecular Datasets on Symmetric Multiprocessor Systems

Mining Molecular Datasets on Symmetric Multiprocessor Systems Mining Molecular Datasets on Symmetric Multiprocessor Systems Thorsten Meinl ALTANA Chair for Bioinformatics and Information Mining, University of Konstanz, Germany meinl@inf.uni-konstanz.de Marc Wörlein,

More information

Multi-Label Feature Selection for Graph Classification

Multi-Label Feature Selection for Graph Classification Multi-Label Feature Selection for Graph Classification Xiangnan Kong Department of Computer Science University of Illinois at Chicago, IL, USA xkong4@uic.edu Philip S. Yu Department of Computer Science

More information

Lower and upper queries for graph-mining

Lower and upper queries for graph-mining Lower and upper queries for graph-mining Amina Kemmar, Yahia Lebbah, Samir Loudni, Mohammed Ouali To cite this version: Amina Kemmar, Yahia Lebbah, Samir Loudni, Mohammed Ouali. Lower and upper queries

More information

Chapter 7. Advanced Frequent Pattern Mining. Meng Jiang CS412 Summer 2017: Introduction to Data Mining

Chapter 7. Advanced Frequent Pattern Mining. Meng Jiang CS412 Summer 2017: Introduction to Data Mining Chapter 7. Advanced Frequent Pattern Mining Meng Jiang CS412 Summer 2017: Introduction to Data Mining Research on Pattern Mining: A Road Map 2 Advanced Frequent Pattern Mining Mining Diverse Patterns Constraint-Based

More information

Using Graphs to Improve Activity Prediction in Smart Environments based on Motion Sensor Data

Using Graphs to Improve Activity Prediction in Smart Environments based on Motion Sensor Data Using Graphs to Improve Activity Prediction in Smart Environments based on Motion Sensor Data S. Seth Long and Lawrence B. Holder Washington State University Abstract. Activity Recognition in Smart Environments

More information

Managing and Mining Graph Data

Managing and Mining Graph Data Managing and Mining Graph Data by Charu C. Aggarwal IBM T.J. Watson Research Center Hawthorne, NY, USA Haixun Wang Microsoft Research Asia Beijing, China

More information

Frequent Pattern Mining On Un-rooted Unordered Tree Using FRESTM

Frequent Pattern Mining On Un-rooted Unordered Tree Using FRESTM Frequent Pattern Mining On Un-rooted Unordered Tree Using FRESTM Dhananjay G. Telavekar 1, Hemant A. Tirmare 2 1M.Tech. Scholar, Dhananjay G. Telavekar, Dept. Of Technology, Shivaji University, Kolhapur,

More information

Subtree Testing and Closed Tree Mining Through Natural Representations

Subtree Testing and Closed Tree Mining Through Natural Representations Subtree Testing and Closed Tree Mining Through Natural Representations José L. Balcázar, Albert Bifet and Antoni Lozano Universitat Politècnica de Catalunya {balqui,abifet,antoni}@lsi.upc.edu Abstract

More information

Graph mining-based Image Indexing

Graph mining-based Image Indexing Graph mining-based Image Indeing Gábor Iváncs, Renáta Iváncs and István Vajk Department of Automation and Applied Informatics, Budapest Universit of Technolog and Economics,, Goldmann G. ter 3. Budapest,

More information

cmfsm: a scalable CPU-MIC coordinated drug-finding tool by frequent subgraph mining

cmfsm: a scalable CPU-MIC coordinated drug-finding tool by frequent subgraph mining Yang et al. BMC Bioinformatics 2018, 19(Suppl 4):98 https://doi.org/10.1186/s12859-018-2071-z RESEARCH cmfsm: a scalable CPU-MIC coordinated drug-finding tool by frequent subgraph mining Open Access Shunyun

More information

Data mining, 4 cu Lecture 8:

Data mining, 4 cu Lecture 8: 582364 Data mining, 4 cu Lecture 8: Graph mining Spring 2010 Lecturer: Juho Rousu Teaching assistant: Taru Itäpelto Frequent Subgraph Mining Extend association rule mining to finding frequent subgraphs

More information

EMPIRICAL COMPARISON OF GRAPH CLASSIFICATION AND REGRESSION ALGORITHMS. By NIKHIL S. KETKAR

EMPIRICAL COMPARISON OF GRAPH CLASSIFICATION AND REGRESSION ALGORITHMS. By NIKHIL S. KETKAR EMPIRICAL COMPARISON OF GRAPH CLASSIFICATION AND REGRESSION ALGORITHMS By NIKHIL S. KETKAR A dissertation submitted in partial fulfillment of the requirements for the degree of DOCTORATE OF PHILOSOPHY

More information

Graph Mining and Social Network Analysis

Graph Mining and Social Network Analysis Graph Mining and Social Network Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References q Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann

More information

Monotone Constraints in Frequent Tree Mining

Monotone Constraints in Frequent Tree Mining Monotone Constraints in Frequent Tree Mining Jeroen De Knijf Ad Feelders Abstract Recent studies show that using constraints that can be pushed into the mining process, substantially improves the performance

More information

gapprox: Mining Frequent Approximate Patterns from a Massive Network

gapprox: Mining Frequent Approximate Patterns from a Massive Network gapprox: Mining Frequent Approximate Patterns from a Massive Network Chen Chen Xifeng Yan Feida Zhu Jiawei Han University of Illinois at Urbana-Champaign IBM T. J. Watson Research Center {cchen3, feidazhu,

More information

Practical Graph Mining With R By Nagiza F Samatova

Practical Graph Mining With R By Nagiza F Samatova We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with practical graph mining

More information

Iliya Mitov 1, Krassimira Ivanova 1, Benoit Depaire 2, Koen Vanhoof 2

Iliya Mitov 1, Krassimira Ivanova 1, Benoit Depaire 2, Koen Vanhoof 2 Iliya Mitov 1, Krassimira Ivanova 1, Benoit Depaire 2, Koen Vanhoof 2 1: Institute of Mathematics and Informatics BAS, Sofia, Bulgaria 2: Hasselt University, Belgium 1 st Int. Conf. IMMM, 23-29.10.2011,

More information

FAST-ON*: AN EXTENDED ALGORITHM FOR GRAPH ISOMORPHISM PROBLEM AND GRAPH QUERY PROCESSING

FAST-ON*: AN EXTENDED ALGORITHM FOR GRAPH ISOMORPHISM PROBLEM AND GRAPH QUERY PROCESSING FAST-ON*: AN EXTENDED ALORITHM FOR RAPH ISOMORPHISM PROBLEM AND RAPH QUERY PROCESSIN Mosab Hassaan and Karam ouda Faculty of Computers and Informatics, Benha University, Egypt {mosab.hassaan, karam.gouda}@fci.bu.edu.eg

More information

Graph-based Relational Learning with Application to Security

Graph-based Relational Learning with Application to Security Fundamenta Informaticae 66 (2005) 1 19 1 IS Press Graph-based Relational Learning with Application to Security Lawrence Holder Λ, Diane Cook, Jeff Coble and Maitrayee Mukherjee Department of Computer Science

More information

MARGIN: Maximal Frequent Subgraph Mining Λ

MARGIN: Maximal Frequent Subgraph Mining Λ MARGIN: Maximal Frequent Subgraph Mining Λ Lini T Thomas Satyanarayana R Valluri Kamalakar Karlapalem enter For Data Engineering, IIIT, Hyderabad flini,satyag@research.iiit.ac.in, kamal@iiit.ac.in Abstract

More information

Graph Indexing: A Frequent Structure-based Approach

Graph Indexing: A Frequent Structure-based Approach Graph Indexing: A Frequent Structure-based Approach Xifeng Yan Philip S. Yu Jiawei Han Department of omputer Science University of Illinois at Urbana-hampaign {xyan, hanj}@cs.uiuc.edu IBM T. J. Watson

More information

Appropriate Item Partition for Improving the Mining Performance

Appropriate Item Partition for Improving the Mining Performance Appropriate Item Partition for Improving the Mining Performance Tzung-Pei Hong 1,2, Jheng-Nan Huang 1, Kawuu W. Lin 3 and Wen-Yang Lin 1 1 Department of Computer Science and Information Engineering National

More information

gmlc: a multi-label feature selection framework for graph classification

gmlc: a multi-label feature selection framework for graph classification Under consideration for publication in Knowledge and Information Systems gmlc: a multi-label feature selection framework for graph classification Xiangnan Kong, Philip S. Yu Department of Computer Science,

More information

Innovative Study to the Graph-based Data Mining: Application of the Data Mining

Innovative Study to the Graph-based Data Mining: Application of the Data Mining Innovative Study to the Graph-based Data Mining: Application of the Data Mining Amit Kr. Mishra, Pradeep Gupta, Ashutosh Bhatt, Jainendra Singh Rana Abstract Graph-based data mining represents a collection

More information

Discovering Geometric Patterns in Genomic Data

Discovering Geometric Patterns in Genomic Data Discovering Geometric Patterns in Genomic Data Wenxuan Gao Department of Computer Science University of Illinois at Chicago wgao5@uic.edu Lijia Ma ljma @uchicago.edu Christopher Brown caseybrown@uchicago.edu

More information

Behavior Query Discovery in System-Generated Temporal Graphs

Behavior Query Discovery in System-Generated Temporal Graphs Behavior Query Discovery in System-Generated Temporal Graphs Bo Zong,, Xusheng Xiao, Zhichun Li, Zhenyu Wu, Zhiyun Qian, Xifeng Yan, Ambuj K. Singh, Guofei Jiang UC Santa Barbara NEC Labs, America UC Riverside

More information

Frequent pattern mining: current status and future directions

Frequent pattern mining: current status and future directions Data Min Knowl Disc (2007) 15:55 86 DOI 10.1007/s10618-006-0059-1 Frequent pattern mining: current status and future directions Jiawei Han Hong Cheng Dong Xin Xifeng Yan Received: 22 June 2006 / Accepted:

More information

Numeric Ranges Handling for Graph Based Knowledge Discovery Oscar E. Romero A., Jesús A. González B., Lawrence B. Holder

Numeric Ranges Handling for Graph Based Knowledge Discovery Oscar E. Romero A., Jesús A. González B., Lawrence B. Holder Numeric Ranges Handling for Graph Based Knowledge Discovery Oscar E. Romero A., Jesús A. González B., Lawrence B. Holder Reporte Técnico No. CCC-08-003 27 de Febrero de 2008 2008 Coordinación de Ciencias

More information

Graph Mining: An Overview

Graph Mining: An Overview Graph Mining: An verview Christian Borgelt European Centre for Soft Computing Campus Mieres, Edificio Científico-Tecnológico c/ Gonzalo Gutiérrez Quirós s/n, E-33600 Mieres (Asturias), Spain Tel.: +34

More information

Efficient graphlet kernels for large graph comparison

Efficient graphlet kernels for large graph comparison Nino Shervashidze MPI for Biological Cybernetics MPI for Developmental Biology Tübingen, Germany S.V.N. Vishwanathan Department of Statistics Purdue University West Lafayette, IN, USA Tobias H. Petri Institute

More information

Algorithms for Extracting Frequent Episodes in the Process of Temporal Data Mining

Algorithms for Extracting Frequent Episodes in the Process of Temporal Data Mining Informatica Economică vol. 14, no. 3/2010 165 Algorithms for Extracting Frequent Episodes in the Process of Temporal Data Mining Alexandru PIRJAN Romanian-American University, Bucharest, Romania alexparjan@yahoo.com

More information

Sub-Graph Finding Information over Nebula Networks

Sub-Graph Finding Information over Nebula Networks ISSN (e): 2250 3005 Volume, 05 Issue, 10 October 2015 International Journal of Computational Engineering Research (IJCER) Sub-Graph Finding Information over Nebula Networks K.Eswara Rao $1, A.NagaBhushana

More information

Graph Classification Based on Pattern Co-occurrence

Graph Classification Based on Pattern Co-occurrence raph Classification ased on Pattern Co-occurrence Ning Jin University of North Carolina at Chapel Hill Chapel Hill, NC, USA njin@cs.unc.edu Calvin Young University of North Carolina at Chapel Hill Chapel

More information

Privacy-Preserving Subgraph Discovery

Privacy-Preserving Subgraph Discovery Privacy-Preserving Subgraph Discovery Danish Mehmood 1, Basit Shafiq 1,2, Jaideep Vaidya 2, Yuan Hong 2, Nabil Adam 2, and Vijayalakshmi Atluri 2 1 Lahore University of Management Sciences, Pakistan 2

More information

An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data

An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data Akihiro Inokuchi, Takashi Washio, and Hiroshi Motoda I.S.I.R., Osaka University 8-1, Mihogaoka, Ibarakishi, Osaka, 567-0047,

More information

Chapter 12 MINING GRAPH PATTERNS. Hong Cheng. Xifeng Yan. Jiawei Han

Chapter 12 MINING GRAPH PATTERNS. Hong Cheng. Xifeng Yan. Jiawei Han Chapter 12 MINING GRAPH PATTERNS Hong Cheng Department of Systems Engineering and Engineering Management Chinese University of Hong Kong hcheng@se.cuhk.edu.hk Xifeng Yan Department of Computer Science

More information

A New Approach To Graph Based Object Classification On Images

A New Approach To Graph Based Object Classification On Images A New Approach To Graph Based Object Classification On Images Sandhya S Krishnan,Kavitha V K P.G Scholar, Dept of CSE, BMCE, Kollam, Kerala, India Sandhya4parvathy@gmail.com Abstract: The main idea of

More information

This is the author s version of a work that was submitted/accepted for publication in the following source:

This is the author s version of a work that was submitted/accepted for publication in the following source: This is the author s version of a work that was submitted/accepted for publication in the following source: Chowdhury, Israt Jahan & Nayak, Richi (2014) BEST : an efficient algorithm for mining frequent

More information

A Graph-Based Web Usage Mining Considering Page Browsing Time

A Graph-Based Web Usage Mining Considering Page Browsing Time A Graph-Based We Usage Mining Considering Page Browsing Time Koichiro Mihara Masahiro Terae Kazuo Hashimoto Graduate School of Information Sciences, TOHOKU University {mihara, terae, kh}@aiet.ecei.tohoku.ac.jp

More information

Mining Complex Patterns

Mining Complex Patterns Mining Complex Data COMP 790-90 Seminar Spring 0 The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Mining Complex Patterns Common Pattern Mining Tasks: Itemsets (transactional, unordered data) Sequences

More information

Frequent Subgraph Retrieval in Geometric Graph Databases

Frequent Subgraph Retrieval in Geometric Graph Databases Frequent Subgraph Retrieval in Geometric Graph Databases Sebastian Nowozin Max Planck Institute for Biological Cybernetics Spemannstr. 38, 72076 Tübingen, Germany sebastian.nowozin@tuebingen.mpg.de Koji

More information

Efficient Subgraph Matching by Postponing Cartesian Products

Efficient Subgraph Matching by Postponing Cartesian Products Efficient Subgraph Matching by Postponing Cartesian Products Computer Science and Engineering Lijun Chang Lijun.Chang@unsw.edu.au The University of New South Wales, Australia Joint work with Fei Bi, Xuemin

More information

Epilog: Further Topics

Epilog: Further Topics Ludwig-Maximilians-Universität München Institut für Informatik Lehr- und Forschungseinheit für Datenbanksysteme Knowledge Discovery in Databases SS 2016 Epilog: Further Topics Lecture: Prof. Dr. Thomas

More information

HDB-SUBDUE, A RELATIONAL DATABASE APPROACH TO GRAPH MINING AND HIERARCHICAL REDUCTION SRIHARI PADMANABHAN

HDB-SUBDUE, A RELATIONAL DATABASE APPROACH TO GRAPH MINING AND HIERARCHICAL REDUCTION SRIHARI PADMANABHAN HDB-SUBDUE, A RELATIONAL DATABASE APPROACH TO GRAPH MINING AND HIERARCHICAL REDUCTION by SRIHARI PADMANABHAN Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial

More information

FP-Growth algorithm in Data Compression frequent patterns

FP-Growth algorithm in Data Compression frequent patterns FP-Growth algorithm in Data Compression frequent patterns Mr. Nagesh V Lecturer, Dept. of CSE Atria Institute of Technology,AIKBS Hebbal, Bangalore,Karnataka Email : nagesh.v@gmail.com Abstract-The transmission

More information

DFS-based frequent graph pattern extraction to characterize the content of RDF Triple Stores

DFS-based frequent graph pattern extraction to characterize the content of RDF Triple Stores DFS-based frequent graph pattern extraction to characterize the content of RDF Triple Stores Adrien Basse, Fabien Gandon, Isabelle Mirbel, Moussa o To cite this version: Adrien Basse, Fabien Gandon, Isabelle

More information

2. Department of Electronic Engineering and Computer Science, Case Western Reserve University

2. Department of Electronic Engineering and Computer Science, Case Western Reserve University Chapter MINING HIGH-DIMENSIONAL DATA Wei Wang 1 and Jiong Yang 2 1. Department of Computer Science, University of North Carolina at Chapel Hill 2. Department of Electronic Engineering and Computer Science,

More information

CMTreeMiner: Mining Both Closed and Maximal Frequent Subtrees

CMTreeMiner: Mining Both Closed and Maximal Frequent Subtrees MTreeMiner: Mining oth losed and Maximal Frequent Subtrees Yun hi, Yirong Yang, Yi Xia, and Richard R. Muntz University of alifornia, Los ngeles, 90095, US {ychi,yyr,xiayi,muntz}@cs.ucla.edu bstract. Tree

More information

Chapter 1, Introduction

Chapter 1, Introduction CSI 4352, Introduction to Data Mining Chapter 1, Introduction Young-Rae Cho Associate Professor Department of Computer Science Baylor University What is Data Mining? Definition Knowledge Discovery from

More information

SeqIndex: Indexing Sequences by Sequential Pattern Analysis

SeqIndex: Indexing Sequences by Sequential Pattern Analysis SeqIndex: Indexing Sequences by Sequential Pattern Analysis Hong Cheng Xifeng Yan Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign {hcheng3, xyan, hanj}@cs.uiuc.edu

More information

Social Network Analysis as Knowledge Discovery process: a case study on Digital Bibliography

Social Network Analysis as Knowledge Discovery process: a case study on Digital Bibliography Social etwork Analysis as Knowledge Discovery process: a case study on Digital Bibliography Michele Coscia, Fosca Giannotti, Ruggero Pensa ISTI-CR Pisa, Italy Email: name.surname@isti.cnr.it Abstract Today

More information

Pattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42

Pattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42 Pattern Mining Knowledge Discovery and Data Mining 1 Roman Kern KTI, TU Graz 2016-01-14 Roman Kern (KTI, TU Graz) Pattern Mining 2016-01-14 1 / 42 Outline 1 Introduction 2 Apriori Algorithm 3 FP-Growth

More information

Mining Edge-disjoint Patterns in Graph-relational Data

Mining Edge-disjoint Patterns in Graph-relational Data Mining Edge-disjoint Patterns in Graph-relational Data Christopher Besemann and Anne Denton North Dakota State University Department of Computer Science Fargo ND, USA christopher.besemann@ndsu.edu, anne.denton@ndsu.edu

More information

Efficient Graphlet Kernels for Large Graph Comparison

Efficient Graphlet Kernels for Large Graph Comparison Efficient Graphlet Kernels for Large Graph Comparison Abstract State-of-the-art graph kernels do not scale to large graphs with hundreds of nodes and thousands of edges. In this article we attempt to rectify

More information

Graph-Based Anomaly Detection Applied to Homeland Security Cargo Screening

Graph-Based Anomaly Detection Applied to Homeland Security Cargo Screening Proceedings of the Twenty-Fifth International Florida Artificial Intelligence Research Society Conference Graph-Based Anomaly Detection Applied to Homeland Security Cargo Screening William Eberle Department

More information

igraph: A Framework for Comparisons of Disk Based Graph Indexing Techniques

igraph: A Framework for Comparisons of Disk Based Graph Indexing Techniques igraph: A Framework for Comparisons of Disk Based Graph Indexing Techniques Wook Shin Han Department of Computer Engineering Kyungpook National University, Korea wshan@knu.ac.kr Jinsoo Lee Department of

More information