Managing and Mining Graph Data
|
|
- Blaze Shanon Hopkins
- 6 years ago
- Views:
Transcription
1 Managing and Mining Graph Data by Charu C. Aggarwal IBM T.J. Watson Research Center Hawthorne, NY, USA Haixun Wang Microsoft Research Asia Beijing, China <ÖSp nneer
2 Contents List of Figures List of Tables Preface 1 An Introduction to Graph Data Charit С. Aggarwal and Haixun Wang 1. Introduction 2. Graph Management and Mining Applications 3. Summary References 2 Graph Data Management and Mining: A Survey of Algorithmsand Cham C. Aggarwal and Haixun Wang 1. Introduction 2. Graph Data Management Algorithms 2.1 Indexing and Query Processing Techniques 2.2 Reachability Queries 2.3 Graph Matching 2.4 Keyword Search 2.5 Synopsis Construction of Massive Graphs 3. Graph Mining Algorithms 3.1 Pattern Mining in Graphs 3.2 Clustering Algorithms for Graph Data 3.3 Classification Algorithms for Graph Data 3.4 The Dynamics of Time-Evolving Graphs 4. Graph Applications 4.1 Chemical and Biological Applications 4.2 Web Applications 4.3 Software Bug Localization 5. Conclusions and Future Research References 3 Graph Mining: Laws and Generators Deepaxan Chakrabarti, Christos Faloiitsos and Mary McGlohon 1. Introduction 2. Graph Patterns
3 VI MANAGING AND MINING GRAPH DATA 2.1 Power Laws and Heavy-Tailed Distributions Small Diameters Other Static Graph Patterns Patterns in Evolving Graphs The Structure of Specific Graphs Graph Generators Random Graph Models Preferential Attachment and Variants Optimization-based generators Tensor-based Generators for specific graphs Graph Generators: A summary Conclusions 115 References Query Language and Access Methods for Graph Databases 125 Hiiahai He and Ambuj K. Singh 1. Introduction Graphs-at-a-time Queries Graph Specific Optimizations GraphQL Operations on Graph Structures Concatenation Disjunction Repetition Graph Query Language Data Model Graph Patterns Graph Algebra FLWR Expressions Expressive Power Implementation of the Selection Operator Graph Pattern Matching Local Pruning and Retrieval of Feasible Mates Joint Reduction of Search Space Optimization of Search Order Experimental Study Biological Network Synthetic Graphs Related Work Graph Query Languages Graph Indexing Future Research Directions Conclusion 156 Appendix: Query Syntax of GraphQL 156 References Graph Indexing 161 Xifeng Yan and Jiawei Han 1. Introduction 161
4 Contents VII 2. Feature-Based Graph Index Paths Frequent Structures Discriminative Structures Closed Frequent Structures Trees Hierarchical Indexing Structure Similarity Search Feature-Based Structural Filtering Feature Miss Estimation Frequency Difference Feature Set Selection Structures with Gaps Reverse Substructure Search Conclusions 177 References Graph Reachability Queries: A Survey 181 Jeffrey Xu Yu and Jiefeng Cheng 1. Introduction Traversal Approaches Tree+SSPI GRIPP Dual-Labeling Tree Cover Chain Cover Computing the Optimal Chain Cover Path-Tree Cover HOP Cover A Heuristic Ranking A Geometrical-Based Approach Graph Partitioning Approaches Hop Cover Maintenance Hop Cover Distance-Aware 2-Hop Cover Graph Pattern Matching A Special Case: A^D The General Cases Conclusions and Summary 212 References Exact and Inexact Graph Matching: Methodology and Applications 217 Kaspar Riesen, Xiaoyi Jiang and Horst Blinke 1. Introduction Basic Notations Exact Graph Matching Inexact Graph Matching Graph Edit Distance Other Inexact Graph Matching Techniques Graph Matching for Data Mining and Information Retrieval 231
5 Vlll MANAGING AND MINING GRAPH DATA 6. Vector Space Embeddings of Graphs via Graph Matching Conclusions 239 References A Survey of Algorithms for Keyword Search on Graph Data 249 Hai.xun Wang and Cham С Aggarwal 1. Introduction Keyword Search on XML Data Query Semantics Answer Ranking Algorithms for LCA-based Keyword Search Keyword Search on Relational Data Query Semantics DBXplorer and DISCOVER Keyword Search on Schema-Free Graphs Query Semantics and Answer Ranking Graph Exploration by Backward Search Graph Exploration by Bidirectional Search Index-based Graph Exploration - the BLINKS Algorithm The ObjectRank Algorithm Conclusions and Future Research 271 References A Survey of Clustering Algorithms for Graph Data 275 Cham С Aggarwal and Haixun Wang 1. Introduction Node Clustering Algorithms The Minimum Cut Problem Multi-way Graph Partitioning Conventional Generalizations and Network Structure Indices The Girvan-Newman Algorithm The Spectral Clustering Method Determining Quasi-Cliques The Case of Massive Graphs Clustering Graphs as Objects Extending Classical Algorithms to Structural Data The XProj Approach Applications of Graph Clustering Algorithms Community Detection in Web Applications and Social Networks Telecommunication Networks Analysis Conclusions and Future Research 297 References A Survey of Algorithms for Dense Subgraph Discovery 303 Victor E. Lee. Ning Ruan, Ruoming Jin and Cham Aggarwal 1. Introduction 304
6 Contents ix 2. Types of Dense Components Absolute vs. Relative Density Graph Terminology Definitions of Dense Components Dense Component Selection Relationship between Clusters and Dense Components Algorithms for Detecting Dense Components in a Single Graph 31 I 3.1 Exact Enumeration Approach Heuristic Approach Exact and Approximation Algorithms for Discovering Densest Components Frequent Dense Components Frequent Patterns with Density Constraints Dense Components with Frequency Constraint Enumerating Cross-Graph Quasi-Cliques Applications of Dense Component Analysis Conclusions and Future Research 331 References Graph Classification 337 Koji Tsuda and Hiroto Saigo 1. Introduction Graph Kernels Random Walks on Graphs Label Sequence Kernel Efficient Computation of Label Sequence Kernels Extensions Graph Boosting Formulation of Graph Boosting Optimal Pattern Search Computational Experiments Related Work Applications of Graph Classification Label Propagation Concluding Remarks 359 References Mining Graph Patterns 365 Hong Cheng, Xifeng Yan and Jiawei Han 1. Introduction Frequent Subgraph Mining Problem Definition Apriori-based Approach Pattern-Growth Approach Closed and Maximal Subgraphs Mining Subgraphs in a Single Graph The Computational Bottleneck Mining Significant Graph Patterns Problem Definition gboost: A Branch-and-Bound Approach 373
7 X MANAGING AND MINING GRAPH DATA 3.3 gpls: A Partial Least Squares Regression Approach LEAP: A Structural Leap Search Approach GraphSig: A Feature Representation Approach Mining Representative Orthogonal Graphs Problem Definition Randomized Maximal Subgraph Mining Orthogonal Representative Set Generation Conclusions 389 References A Survey on Streaming Algorithms for Massive Graphs 393 Jian Zhang 1. Introduction Streaming Model for Massive Graphs Statistics and Counting Triangles Graph Matching Unweighted Matching Weighted Matching Graph Distance Distance Approximation using Multiple Passes Distance Approximation in One Pass Random Walks on Graphs Conclusions 416 References A Survey of Privacy-Preservation of Graphs and Social Networks 421 Xintao Wit. Xiaowei Ying, Kim Liu and Lei Chen 1. Introduction Privacy in Publishing Social Networks Background Knowledge Utility Preservation Anonymization Approaches Notations Privacy Attacks on Naive Anonymized Networks Active Attacks and Passive Attacks Structural Queries Other Attacks Л -Anonymity Privacy Preservation via Edge Modification Л-Degree Generalization Tv'-Neighborhood Anonymity Л'-Automorphism Anonymity Privacy Preservation via Randomization Resilience to Structural Attacks Link Disclosure Analysis Reconstruction Feature Preserving Randomization Privacy Preservation via Generalization Anonymixing Rich Graphs 441
8 Contents XI 6.1 Link Protection in Rich Graphs Anonymizing Bipartite Graphs Anonymizing Rich Interaction Graphs Anonymizing Edge-Weighted Graphs Other Privacy Issues in Online Social Networks Deriving Link Structure of the Entire Network Deriving Personal Identifying Information from Social Networking Sites Conclusion and Future Work 448 Acknowledgments 449 References A Survey of Graph Mining for Web Applications 455 Debora Donato and Aristides Gionis 1. Introduction Preliminaries 2.1 Link Analysis Ranking Algorithms Mining High-Quality Items Prediction of Successful Items in a Co-citation Network Finding High-Quality Content in Question-Answering Portals Mining Query Logs Description of Query Logs Query Log Graphs Query Recommendations Conclusions 480 References Graph Mining Applications to Social Network Analysis 487 Lei Tang and Huan Liu 1. Introduction Graph Patterns in Large-Scale Networks Scale-Free Networks Small-World Effect Community Structures Graph Generators Community Detection 3.1 Node-Centric Community Detection Group-Centric Community Detection Network-Centric Community Detection Hierarchy-Centric Community Detection Community Structure Evaluation Research Issues 507 References Software-Bug Localization with Graph Mining 515 Frank Eichinger and Klemens Behm 1. Introduction Basics of Call Graph Based Bug Localization 517
9 Xll MANAGING AND MINING GRAPH DATA Acknowledgments References 2.1 Dynamic Call Graphs 2.2 Bugs in Software 2.3 Bug Localization with Call Graphs 2.4 Graph and Tree Mining Related Work Call-Graph Reduction 4.1 Total Reduction Iterations Temporal Order Recursion Comparison Call Graph Based Bug Localization 5.1 Structural Approaches 5.2 Frequency-based Approach 5.3 Combined Approaches 5.4 Comparison Conclusions and Future Directions A Survey of Graph Mining Techniques for Biological Datasets 547 S. Parthasarathy, S. Tatikonda and D. Ucar 1. Introduction Mining Trees Frequent Subtree Mining Tree Alignment and Comparison Statistical Models Mining Graphs for the Discovery of Frequent Substructures Frequent Subgraph Mining Motif Discovery in Biological Networks Mining Graphs for the Discovery of Modules Extracting Communities Clustering Discussion 569 References Trends in Chemical Graph Data Mining 581 Nikil Wale, Xia Ning and George Karypis 1. Introduction Topological Descriptors for Chemical Compounds Hashed Fingerprints (FP) Maces Keys (MK) Extended Connectivity Fingerprints (ECFP) Frequent Subgraphs (FS) Bounded-Size Graph Fragments (GF) Comparison of Descriptors Classification Algorithms for Chemical Compounds Approaches based on Descriptors Approaches based on Graph Kernels Searching Compound Libraries 590
10 Contents xin Methods Based on Direct Similarity Methods Based on Indirect Similarity Performance oflndirect Similarity Methods Identifying Potential Targets for Compounds Model-based Methods For Target Fishing Performance of Target Fishing Strategies Future Research Directions 600 References 602 Index 607
DATA STREAMS: MODELS AND ALGORITHMS
DATA STREAMS: MODELS AND ALGORITHMS DATA STREAMS: MODELS AND ALGORITHMS Edited by CHARU C. AGGARWAL IBM T. J. Watson Research Center, Yorktown Heights, NY 10598 Kluwer Academic Publishers Boston/Dordrecht/London
More informationData Mining in Bioinformatics Day 3: Graph Mining
Graph Mining and Graph Kernels Data Mining in Bioinformatics Day 3: Graph Mining Karsten Borgwardt & Chloé-Agathe Azencott February 6 to February 17, 2012 Machine Learning and Computational Biology Research
More informationData Mining in Bioinformatics Day 5: Graph Mining
Data Mining in Bioinformatics Day 5: Graph Mining Karsten Borgwardt February 25 to March 10 Bioinformatics Group MPIs Tübingen from Borgwardt and Yan, KDD 2008 tutorial Graph Mining and Graph Kernels,
More informationMINING GRAPH DATA EDITED BY. Diane J. Cook School of Electrical Engineering and Computei' Science Washington State University Puliman, Washington
MINING GRAPH DATA EDITED BY Diane J. Cook School of Electrical Engineering and Computei' Science Washington State University Puliman, Washington Lawrence B. Holder School of Electrical Engineering and
More informationWeb Structure Mining Community Detection and Evaluation
Web Structure Mining Community Detection and Evaluation 1 Community Community. It is formed by individuals such that those within a group interact with each other more frequently than with those outside
More informationMining Significant Graph Patterns by Leap Search
Mining Significant Graph Patterns by Leap Search Xifeng Yan (IBM T. J. Watson) Hong Cheng, Jiawei Han (UIUC) Philip S. Yu (UIC) Graphs Are Everywhere Magwene et al. Genome Biology 2004 5:R100 Co-expression
More informationCommunity Detection. Community
Community Detection Community In social sciences: Community is formed by individuals such that those within a group interact with each other more frequently than with those outside the group a.k.a. group,
More informationContents. Foreword to Second Edition. Acknowledgments About the Authors
Contents Foreword xix Foreword to Second Edition xxi Preface xxiii Acknowledgments About the Authors xxxi xxxv Chapter 1 Introduction 1 1.1 Why Data Mining? 1 1.1.1 Moving toward the Information Age 1
More informationA New Approach To Graph Based Object Classification On Images
A New Approach To Graph Based Object Classification On Images Sandhya S Krishnan,Kavitha V K P.G Scholar, Dept of CSE, BMCE, Kollam, Kerala, India Sandhya4parvathy@gmail.com Abstract: The main idea of
More informationGraph Mining and Social Network Analysis
Graph Mining and Social Network Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References q Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann
More informationPrivacy-Preserving. Introduction to. Data Publishing. Concepts and Techniques. Benjamin C. M. Fung, Ke Wang, Chapman & Hall/CRC. S.
Chapman & Hall/CRC Data Mining and Knowledge Discovery Series Introduction to Privacy-Preserving Data Publishing Concepts and Techniques Benjamin C M Fung, Ke Wang, Ada Wai-Chee Fu, and Philip S Yu CRC
More informationInvited Talk: GraphSM/DBKDA About Reachability in Graphs
Invited Talk: GraphSM/DBKDA-2014 The Sixth International Conference on Advances in Databases, Knowledge, and Data Applications April 20-26, 2014 - Chamonix, France About Reachability in Graphs Andreas
More informationBehavior Query Discovery in System-Generated Temporal Graphs
Behavior Query Discovery in System-Generated Temporal Graphs Bo Zong,, Xusheng Xiao, Zhichun Li, Zhenyu Wu, Zhiyun Qian, Xifeng Yan, Ambuj K. Singh, Guofei Jiang UC Santa Barbara NEC Labs, America UC Riverside
More informationPart I: Data Mining Foundations
Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?
More informationChapters 11 and 13, Graph Data Mining
CSI 4352, Introduction to Data Mining Chapters 11 and 13, Graph Data Mining Young-Rae Cho Associate Professor Department of Computer Science Balor Universit Graph Representation Graph An ordered pair GV,E
More informationAn overview of Graph Categories and Graph Primitives
An overview of Graph Categories and Graph Primitives Dino Ienco (dino.ienco@irstea.fr) https://sites.google.com/site/dinoienco/ Topics I m interested in: Graph Database and Graph Data Mining Social Network
More informationExtraction of Frequent Subgraph from Graph Database
Extraction of Frequent Subgraph from Graph Database Sakshi S. Mandke, Sheetal S. Sonawane Deparment of Computer Engineering Pune Institute of Computer Engineering, Pune, India. sakshi.mandke@cumminscollege.in;
More informationTable Of Contents: xix Foreword to Second Edition
Data Mining : Concepts and Techniques Table Of Contents: Foreword xix Foreword to Second Edition xxi Preface xxiii Acknowledgments xxxi About the Authors xxxv Chapter 1 Introduction 1 (38) 1.1 Why Data
More informationOn Dense Pattern Mining in Graph Streams
On Dense Pattern Mining in Graph Streams [Extended Abstract] Charu C. Aggarwal IBM T. J. Watson Research Ctr Hawthorne, NY charu@us.ibm.com Yao Li, Philip S. Yu University of Illinois at Chicago Chicago,
More informationData Mining in Bioinformatics Day 5: Frequent Subgraph Mining
Data Mining in Bioinformatics Day 5: Frequent Subgraph Mining Chloé-Agathe Azencott & Karsten Borgwardt February 18 to March 1, 2013 Machine Learning & Computational Biology Research Group Max Planck Institutes
More informationSurvey on Graph Query Processing on Graph Database. Presented by FAN Zhe
Survey on Graph Query Processing on Graph Database Presented by FA Zhe utline Introduction of Graph and Graph Database. Background of Subgraph Isomorphism. Background of Subgraph Query Processing. Background
More informationgsketch: On Query Estimation in Graph Streams
gsketch: On Query Estimation in Graph Streams Peixiang Zhao (Florida State University) Charu C. Aggarwal (IBM Research, Yorktown Heights) Min Wang (HP Labs, China) Istanbul, Turkey, August, 2012 Synopsis
More informationClusters and Communities
Clusters and Communities Lecture 7 CSCI 4974/6971 22 Sep 2016 1 / 14 Today s Biz 1. Reminders 2. Review 3. Communities 4. Betweenness and Graph Partitioning 5. Label Propagation 2 / 14 Today s Biz 1. Reminders
More informationMining Frequent Patterns without Candidate Generation
Mining Frequent Patterns without Candidate Generation Outline of the Presentation Outline Frequent Pattern Mining: Problem statement and an example Review of Apriori like Approaches FP Growth: Overview
More informationSIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE)
SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road 517583 QUESTION BANK (DESCRIPTIVE) Subject with Code : Data Warehousing and Mining (16MC815) Year & Sem: II-MCA & I-Sem Course
More informationDatabase Supports for Efficient Frequent Pattern Mining
Database Supports for Efficient Frequent Pattern Mining Ruoming Jin Kent State University Joint work with Dave Furhy (KSU), Scott McCallen (KSU), Dong Wang (KSU), Yuri Breitbart (KSU), and Gagan Agrawal
More informationDatabase and Knowledge-Base Systems: Data Mining. Martin Ester
Database and Knowledge-Base Systems: Data Mining Martin Ester Simon Fraser University School of Computing Science Graduate Course Spring 2006 CMPT 843, SFU, Martin Ester, 1-06 1 Introduction [Fayyad, Piatetsky-Shapiro
More informationTop-k Keyword Search Over Graphs Based On Backward Search
Top-k Keyword Search Over Graphs Based On Backward Search Jia-Hui Zeng, Jiu-Ming Huang, Shu-Qiang Yang 1College of Computer National University of Defense Technology, Changsha, China 2College of Computer
More informationFrequent Pattern Mining with Uncertain Data
Charu C. Aggarwal 1, Yan Li 2, Jianyong Wang 2, Jing Wang 3 1. IBM T J Watson Research Center 2. Tsinghua University 3. New York University Frequent Pattern Mining with Uncertain Data ACM KDD Conference,
More informationSocial-Network Graphs
Social-Network Graphs Mining Social Networks Facebook, Google+, Twitter Email Networks, Collaboration Networks Identify communities Similar to clustering Communities usually overlap Identify similarities
More informationLearning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li
Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,
More informationAC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery
: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Hong Cheng Philip S. Yu Jiawei Han University of Illinois at Urbana-Champaign IBM T. J. Watson Research Center {hcheng3, hanj}@cs.uiuc.edu,
More informationA Graph-Based Approach for Mining Closed Large Itemsets
A Graph-Based Approach for Mining Closed Large Itemsets Lee-Wen Huang Dept. of Computer Science and Engineering National Sun Yat-Sen University huanglw@gmail.com Ye-In Chang Dept. of Computer Science and
More informationIntroduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.
Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How
More informationPattern Mining in Frequent Dynamic Subgraphs
Pattern Mining in Frequent Dynamic Subgraphs Karsten M. Borgwardt, Hans-Peter Kriegel, Peter Wackersreuther Institute of Computer Science Ludwig-Maximilians-Universität Munich, Germany kb kriegel wackersr@dbs.ifi.lmu.de
More informationContents. Preface to the Second Edition
Preface to the Second Edition v 1 Introduction 1 1.1 What Is Data Mining?....................... 4 1.2 Motivating Challenges....................... 5 1.3 The Origins of Data Mining....................
More informationLecture Note: Computation problems in social. network analysis
Lecture Note: Computation problems in social network analysis Bang Ye Wu CSIE, Chung Cheng University, Taiwan September 29, 2008 In this lecture note, several computational problems are listed, including
More informationInternational Journal of Modern Engineering and Research Technology
Volume 2, Issue 3, July 2015 ISSN: 2348-8565 (Online) International Journal of Modern Engineering and Research Technology Website: http://www.ijmert.org Line Up: A Technique for Semantic-Synaptic Synaptic
More informationDENSITY BASED AND PARTITION BASED CLUSTERING OF UNCERTAIN DATA BASED ON KL-DIVERGENCE SIMILARITY MEASURE
DENSITY BASED AND PARTITION BASED CLUSTERING OF UNCERTAIN DATA BASED ON KL-DIVERGENCE SIMILARITY MEASURE Sinu T S 1, Mr.Joseph George 1,2 Computer Science and Engineering, Adi Shankara Institute of Engineering
More informationBig Data Management and NoSQL Databases
NDBI040 Big Data Management and NoSQL Databases Lecture 10. Graph databases Doc. RNDr. Irena Holubova, Ph.D. holubova@ksi.mff.cuni.cz http://www.ksi.mff.cuni.cz/~holubova/ndbi040/ Graph Databases Basic
More informationAdvanced Data Management
Advanced Data Management Medha Atre Office: KD-219 atrem@cse.iitk.ac.in Sept 26, 2016 defined Given a graph G(V, E) with V as the set of nodes and E as the set of edges, a reachability query asks does
More informationBing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer
Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures Springer Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web
More informationCS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks
CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks Archana Sulebele, Usha Prabhu, William Yang (Group 29) Keywords: Link Prediction, Review Networks, Adamic/Adar,
More informationA SURVEY OF CLUSTERING ALGORITHMS FOR GRAPH DATA
Chapter 9 A SURVEY OF CLUSTERING ALGORITHMS FOR GRAPH DATA Charu C. Aggarwal IBM T. J. Watson Research Center Hawthorne, NY 10532 charu@us.ibm.com Haixun Wang Microsoft Research Asia Beijing, China 100190
More informationImproved Frequent Pattern Mining Algorithm with Indexing
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VII (Nov Dec. 2014), PP 73-78 Improved Frequent Pattern Mining Algorithm with Indexing Prof.
More informationResearch on Data Mining Technology Based on Business Intelligence. Yang WANG
2018 International Conference on Mechanical, Electronic and Information Technology (ICMEIT 2018) ISBN: 978-1-60595-548-3 Research on Data Mining Technology Based on Business Intelligence Yang WANG Communication
More informationA Local Algorithm for Structure-Preserving Graph Cut
A Local Algorithm for Structure-Preserving Graph Cut Presenter: Dawei Zhou Dawei Zhou* (ASU) Si Zhang (ASU) M. Yigit Yildirim (ASU) Scott Alcorn (Early Warning) Hanghang Tong (ASU) Hasan Davulcu (ASU)
More informationExtracting Information from Complex Networks
Extracting Information from Complex Networks 1 Complex Networks Networks that arise from modeling complex systems: relationships Social networks Biological networks Distinguish from random networks uniform
More informationAppropriate Item Partition for Improving the Mining Performance
Appropriate Item Partition for Improving the Mining Performance Tzung-Pei Hong 1,2, Jheng-Nan Huang 1, Kawuu W. Lin 3 and Wen-Yang Lin 1 1 Department of Computer Science and Information Engineering National
More informationBiclustering with δ-pcluster John Tantalo. 1. Introduction
Biclustering with δ-pcluster John Tantalo 1. Introduction The subject of biclustering is chiefly concerned with locating submatrices of gene expression data that exhibit shared trends between genes. That
More informationData Clustering Hierarchical Clustering, Density based clustering Grid based clustering
Data Clustering Hierarchical Clustering, Density based clustering Grid based clustering Team 2 Prof. Anita Wasilewska CSE 634 Data Mining All Sources Used for the Presentation Olson CF. Parallel algorithms
More informationRandom Sampling over Data Streams for Sequential Pattern Mining
Random Sampling over Data Streams for Sequential Pattern Mining Chedy Raïssi LIRMM, EMA-LGI2P/Site EERIE 161 rue Ada 34392 Montpellier Cedex 5, France France raissi@lirmm.fr Pascal Poncelet EMA-LGI2P/Site
More informationData Mining for Knowledge Management. Association Rules
1 Data Mining for Knowledge Management Association Rules Themis Palpanas University of Trento http://disi.unitn.eu/~themis 1 Thanks for slides to: Jiawei Han George Kollios Zhenyu Lu Osmar R. Zaïane Mohammad
More informationSpectral Methods for Network Community Detection and Graph Partitioning
Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman Department of Physics, University of Michigan Presenters: Yunqi Guo Xueyin Yu Yuanqi Li 1 Outline: Community Detection
More informationA Framework for Clustering Massive Text and Categorical Data Streams
A Framework for Clustering Massive Text and Categorical Data Streams Charu C. Aggarwal IBM T. J. Watson Research Center charu@us.ibm.com Philip S. Yu IBM T. J.Watson Research Center psyu@us.ibm.com Abstract
More informationCut-And-Stitch: Efficient Parallel Learning of Linear Dynamical Systems on SMPs
School of Computer Science Cut-And-Stitch: Efficient Parallel Learning of Linear Dynamical Systems on SMPs Lei Li, WenjieFu, Fan Guo, Todd C. Mowry, Christos Faloutsos Computer Science Department Carnegie
More informationThe Transpose Technique to Reduce Number of Transactions of Apriori Algorithm
The Transpose Technique to Reduce Number of Transactions of Apriori Algorithm Narinder Kumar 1, Anshu Sharma 2, Sarabjit Kaur 3 1 Research Scholar, Dept. Of Computer Science & Engineering, CT Institute
More informationPattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42
Pattern Mining Knowledge Discovery and Data Mining 1 Roman Kern KTI, TU Graz 2016-01-14 Roman Kern (KTI, TU Graz) Pattern Mining 2016-01-14 1 / 42 Outline 1 Introduction 2 Apriori Algorithm 3 FP-Growth
More informationDepartment of Computer Science & Engineering University of Kalyani. Syllabus for Ph.D. Coursework
Department of Computer Science & Engineering University of Kalyani Syllabus for Ph.D. Coursework Paper 1: A) Literature Review: (Marks - 25) B) Research Methodology: (Marks - 25) Paper 2: Computer Applications:
More informationGraph-based Learning. Larry Holder Computer Science and Engineering University of Texas at Arlington
Graph-based Learning Larry Holder Computer Science and Engineering University of Texas at Arlingt 1 Graph-based Learning Multi-relatial data mining and learning SUBDUE graph-based relatial learner Discovery
More informationAn Edge-Swap Heuristic for Finding Dense Spanning Trees
Theory and Applications of Graphs Volume 3 Issue 1 Article 1 2016 An Edge-Swap Heuristic for Finding Dense Spanning Trees Mustafa Ozen Bogazici University, mustafa.ozen@boun.edu.tr Hua Wang Georgia Southern
More informationMining Data Streams. Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction. Summarization Methods. Clustering Data Streams
Mining Data Streams Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction Summarization Methods Clustering Data Streams Data Stream Classification Temporal Models CMPT 843, SFU, Martin Ester, 1-06
More informationIntroduction to Data Mining
Introduction to Data Mining Privacy preserving data mining Li Xiong Slides credits: Chris Clifton Agrawal and Srikant 4/3/2011 1 Privacy Preserving Data Mining Privacy concerns about personal data AOL
More informationData Mining and Data Warehousing Introduction to Data Mining
Data Mining and Data Warehousing Introduction to Data Mining Quiz Easy Q1. Which of the following is a data warehouse? a. Can be updated by end users. b. Contains numerous naming conventions and formats.
More informationSub-Graph Finding Information over Nebula Networks
ISSN (e): 2250 3005 Volume, 05 Issue, 10 October 2015 International Journal of Computational Engineering Research (IJCER) Sub-Graph Finding Information over Nebula Networks K.Eswara Rao $1, A.NagaBhushana
More informationCS6220: DATA MINING TECHNIQUES
CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data: Part I Instructor: Yizhou Sun yzsun@ccs.neu.edu November 12, 2013 Announcement Homework 4 will be out tonight Due on 12/2 Next class will be canceled
More informationIntroduction Types of Social Network Analysis Social Networks in the Online Age Data Mining for Social Network Analysis Applications Conclusion
Introduction Types of Social Network Analysis Social Networks in the Online Age Data Mining for Social Network Analysis Applications Conclusion References Social Network Social Network Analysis Sociocentric
More informationTowards New Heterogeneous Data Stream Clustering based on Density
, pp.30-35 http://dx.doi.org/10.14257/astl.2015.83.07 Towards New Heterogeneous Data Stream Clustering based on Density Chen Jin-yin, He Hui-hao Zhejiang University of Technology, Hangzhou,310000 chenjinyin@zjut.edu.cn
More informationCase Study: Social Network Analysis. Part II
Case Study: Social Network Analysis Part II https://sites.google.com/site/kdd2017iot/ Outline IoT Fundamentals and IoT Stream Mining Algorithms Predictive Learning Descriptive Learning Frequent Pattern
More informationCODENSE v
CODENSE v1.0 ----------------- INTRODUCTION Given a relation graph dataset, D={G 1,G 2, G n }, where G i =(V,E i ), Definition 1 (Support) The support of a graph g is the number of graphs (in D) where
More informationExploring graph mining approaches for dynamic heterogeneous networks
Georgetown University Institutional Repository http://www.library.georgetown.edu/digitalgeorgetown The author made this article openly available online. Please tell us how this access affects you. Your
More informationAn Algorithm for Frequent Pattern Mining Based On Apriori
An Algorithm for Frequent Pattern Mining Based On Goswami D.N.*, Chaturvedi Anshu. ** Raghuvanshi C.S.*** *SOS In Computer Science Jiwaji University Gwalior ** Computer Application Department MITS Gwalior
More informationDiscovery of Frequent Itemset and Promising Frequent Itemset Using Incremental Association Rule Mining Over Stream Data Mining
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 5, May 2014, pg.923
More informationEfficient Subgraph Matching by Postponing Cartesian Products
Efficient Subgraph Matching by Postponing Cartesian Products Computer Science and Engineering Lijun Chang Lijun.Chang@unsw.edu.au The University of New South Wales, Australia Joint work with Fei Bi, Xuemin
More informationBiology, Physics, Mathematics, Sociology, Engineering, Computer Science, Etc
Motivation Motifs Algorithms G-Tries Parallelism Complex Networks Networks are ubiquitous! Biology, Physics, Mathematics, Sociology, Engineering, Computer Science, Etc Images: UK Highways Agency, Uriel
More informationWIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity
WIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity Unil Yun and John J. Leggett Department of Computer Science Texas A&M University College Station, Texas 7783, USA
More informationParallel Approach for Implementing Data Mining Algorithms
TITLE OF THE THESIS Parallel Approach for Implementing Data Mining Algorithms A RESEARCH PROPOSAL SUBMITTED TO THE SHRI RAMDEOBABA COLLEGE OF ENGINEERING AND MANAGEMENT, FOR THE DEGREE OF DOCTOR OF PHILOSOPHY
More informationProbabilistic Graph Summarization
Probabilistic Graph Summarization Nasrin Hassanlou, Maryam Shoaran, and Alex Thomo University of Victoria, Victoria, Canada {hassanlou,maryam,thomo}@cs.uvic.ca 1 Abstract We study group-summarization of
More informationElysium Technologies Private Limited::IEEE Final year Project
Elysium Technologies Private Limited::IEEE Final year Project - o n t e n t s Data mining Transactions Rule Representation, Interchange, and Reasoning in Distributed, Heterogeneous Environments Defeasible
More informationMultilevel Data Aggregated Using Privacy Preserving Data mining
Multilevel Data Aggregated Using Privacy Preserving Data mining V.Nirupa Department of Computer Science and Engineering Madanapalle, Andhra Pradesh, India M.V.Jaganadha Reddy Department of Computer Science
More informationThe Discovery and Retrieval of Temporal Rules in Interval Sequence Data
The Discovery and Retrieval of Temporal Rules in Interval Sequence Data by Edi Winarko, B.Sc., M.Sc. School of Informatics and Engineering, Faculty of Science and Engineering March 19, 2007 A thesis presented
More informationQuery Independent Scholarly Article Ranking
Query Independent Scholarly Article Ranking Shuai Ma, Chen Gong, Renjun Hu, Dongsheng Luo, Chunming Hu, Jinpeng Huai SKLSDE Lab, Beihang University, China Beijing Advanced Innovation Center for Big Data
More informationAn Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization
An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science
More informationOverlay and P2P Networks. Unstructured networks. Prof. Sasu Tarkoma
Overlay and P2P Networks Unstructured networks Prof. Sasu Tarkoma 20.1.2014 Contents P2P index revisited Unstructured networks Gnutella Bloom filters BitTorrent Freenet Summary of unstructured networks
More informationAn Evolutionary Algorithm for Mining Association Rules Using Boolean Approach
An Evolutionary Algorithm for Mining Association Rules Using Boolean Approach ABSTRACT G.Ravi Kumar 1 Dr.G.A. Ramachandra 2 G.Sunitha 3 1. Research Scholar, Department of Computer Science &Technology,
More informationMining of Web Server Logs using Extended Apriori Algorithm
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
More informationIntroduction to Algorithms Third Edition
Thomas H. Cormen Charles E. Leiserson Ronald L. Rivest Clifford Stein Introduction to Algorithms Third Edition The MIT Press Cambridge, Massachusetts London, England Preface xiü I Foundations Introduction
More informationEfficient and Effective Clustering Methods for Spatial Data Mining. Raymond T. Ng, Jiawei Han
Efficient and Effective Clustering Methods for Spatial Data Mining Raymond T. Ng, Jiawei Han 1 Overview Spatial Data Mining Clustering techniques CLARANS Spatial and Non-Spatial dominant CLARANS Observations
More informationFrequent Subgraph Retrieval in Geometric Graph Databases
Frequent Subgraph Retrieval in Geometric Graph Databases Sebastian Nowozin Max Planck Institute for Biological Cybernetics Spemannstr. 38, 72076 Tübingen, Germany sebastian.nowozin@tuebingen.mpg.de Koji
More informationPrivacy-Preserving of Check-in Services in MSNS Based on a Bit Matrix
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 15, No 2 Sofia 2015 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.1515/cait-2015-0032 Privacy-Preserving of Check-in
More informationOutlier Ensembles. Charu C. Aggarwal IBM T J Watson Research Center Yorktown, NY Keynote, Outlier Detection and Description Workshop, 2013
Charu C. Aggarwal IBM T J Watson Research Center Yorktown, NY 10598 Outlier Ensembles Keynote, Outlier Detection and Description Workshop, 2013 Based on the ACM SIGKDD Explorations Position Paper: Outlier
More informationKeyLabel Algorithms for Keyword Search in Large Graphs
KeyLabel Algorithms for Keyword Search in Large Graphs Yue Wang, Ke Wang, Ada Wai-Chee Fu, and Raymond Chi-Wing Wong School of Computing Science, Simon Fraser University Email: {ywa138, wangk }@cs.sfu.ca
More informationTo Enhance Projection Scalability of Item Transactions by Parallel and Partition Projection using Dynamic Data Set
To Enhance Scalability of Item Transactions by Parallel and Partition using Dynamic Data Set Priyanka Soni, Research Scholar (CSE), MTRI, Bhopal, priyanka.soni379@gmail.com Dhirendra Kumar Jha, MTRI, Bhopal,
More information[Gidhane* et al., 5(7): July, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116
IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION Kiran V. Gaidhane*, Prof. L. H. Patil, Prof. C. U. Chouhan DOI: 10.5281/zenodo.58632
More informationImproving the Efficiency of Fast Using Semantic Similarity Algorithm
International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year
More informationA Hierarchical Document Clustering Approach with Frequent Itemsets
A Hierarchical Document Clustering Approach with Frequent Itemsets Cheng-Jhe Lee, Chiun-Chieh Hsu, and Da-Ren Chen Abstract In order to effectively retrieve required information from the large amount of
More informationUpper bound tighter Item caps for fast frequent itemsets mining for uncertain data Implemented using splay trees. Shashikiran V 1, Murali S 2
Volume 117 No. 7 2017, 39-46 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Upper bound tighter Item caps for fast frequent itemsets mining for uncertain
More informationA Survey On Different Text Clustering Techniques For Patent Analysis
A Survey On Different Text Clustering Techniques For Patent Analysis Abhilash Sharma Assistant Professor, CSE Department RIMT IET, Mandi Gobindgarh, Punjab, INDIA ABSTRACT Patent analysis is a management
More informationModel-Driven Matching and Segmentation of Trajectories
Model-Driven Matching and Segmentation of Trajectories Jiangwei Pan (Duke University) joint work with Swaminathan Sankararaman (Akamai Technologies) Pankaj K. Agarwal (Duke University) Thomas Mølhave (Scalable
More informationDOTNET PROJECTS. DOTNET Projects. I. IEEE based IOT IEEE BASED CLOUD COMPUTING
DOTNET PROJECTS I. IEEE based IOT 1. A Fuzzy Model-based Integration Framework for Vision-based Intelligent Surveillance Systems 2. Learning communities in social networks and their relationship with the
More information