Fast Reliability Search in Uncertain Graphs Arijit Khan, Francesco Bonchi, Aristides Gionis, Francesco Gullo
|
|
- Jonas Bennett
- 5 years ago
- Views:
Transcription
1 Fast Reliability earch in ncertain Graphs Arijit Khan, Francesco Bonchi, Aristides Gionis, Francesco Gullo ystems Group, ETH Zurich Yahoo Labs, pain Aalto niversity, Finland
2 ncertain Graphs T ocial Network Traffic Network Ad-hoc Mobile Network Protein-interaction Network ncertain Graph 1
3 Motivation Mobile Ad-hoc Network: find the set of sink nodes where a source node can deliver a packet with high probability T Traffic Network: find a set of target locations reachable from a source location with high probability ocial Network: find a set of users who could be influenced with high probability by a target user Packet Delivery Probability in Mobile Ad-hoc Network 2
4 Motivation Mobile Ad-hoc Network: find the set of sink nodes where a source node can deliver a packet with high probability T Traffic Network: find a set of target locations reachable from a source location with high probability ocial Network: find a set of users who could be influenced with high probability by a target user Packet Delivery Probability in Mobile Ad-hoc Network 2
5 Reliability in ncertain Graphs T T ample Edges ncertain Graph Certain Graph (Possible orld) 3
6 Reliability in ncertain Graphs T T ample Edges ncertain Graph Certain Graph (Possible orld) Identity Function 3
7 Reliability earch in ncertain Graphs Given an uncertain graph G, a probability threshold ɳ ϵ (0, 1), and a source node in G, find all nodes in G that are reachable from with probability greater than or equal to threshold ɳ #P - complete 4
8 Related ork Two-terminal reliability All-terminal reliability K-terminal reliability Monte-Carlo (MC) sampling Distance-constraint reliability RHT sampling (Jin et. al., LDB 2011) 5
9 Baseline MC imulation + BF T MC ampling + BF ncertain Graph T Number of amples Certain Graph (Possible orld) 6
10 Can e Be More Efficient? Given a source node and a probability threshold ɳ ϵ (0, 1), can we quickly determine the nodes that are certainly not reachable from with probability greater than or equal to ɳ T Indexing (offline) Filtering + erification (Online) ɳ = ncertain Graph 7
11 RQ-Tree Index T 0.3 ɳ = ncertain Graph out (, *)=0 out (, *)=0.496 out (, *)=0.8 out (, *)=0.8,,,, T ɳ =,,,T, T RQ-Tree Index 7
12 RQ-Tree: Filtering out (, *) =0 out (, *) =0.496,,,,,, T ɳ =,T Max-Flow Min-Cut Based pper Bound: Edge Capacity: out (, *) =0.8 out (, *) =0.8, T RQ-Tree Index c(a) = log (1 p(a)) Compute Max-Flow f from to Outside Cluster C out (, C) = 1 exp(-f) 8
13 RQ-Tree: Filtering out (, *) =0 out (, *) =0.496,,,,,, T ɳ =,T Max-Flow Min-Cut Based pper Bound: Edge Capacity: out (, *) =0.8 out (, *) =0.8, T RQ-Tree Index c(a) = log (1 p(a)) Compute Max-Flow f from to Outside Cluster C out (, C) = 1 exp(-f) Benefits: No false negative (recall = 1) Computation limited only inside cluster C Incremental Max-Flow computation 8
14 RQ-Tree: erification ampling-based erification: MC-ample + BF over the sub-graph formed by the candidate set Pros: high precision, high recall Cons: verification could still be relatively expensive 0.2 Lower-Bound-based erification: Most-Likely-Path Pros: precision = 1, high efficiency Cons: lower recall Pr(--) = * 0.2 = 0.10 Pr(--) = 0.7 * 0.3 = 0.21 Most-Likely-Path: (--) 9
15 RQ-Tree: Online Complexity MC ampling Recursive ampling [LDB 11] RQ-Tree + MC-ampling-based erification [Our Method] O ( m n K ( m n )) RQ-tree + Lower-Bound-based erification [Our Method] O ( m n O ( K ( m n)) O ( n 2 d ) ) K = No of amples m = No of edges n = No of nodes n = No of nodes in the candidate set m = No of edges induced by the candidate nodes d = Diameter of the graph 10
16 RQ-Tree Index Construction 0.1,,,, T T,,,T, T ncertain Graph RQ-Tree Index Hierarchical Clustering: Minimum-cut balanced bi-partition using METI Edge weight: w(a) = log (1 p(a)) 11
17 Experimental Results # Nodes # Edges #Arc Prob: Mean, D, Quartiles DBLP ± 0.11, {0.09, 0.09, 0.18} Flickr ± 0.06, {0.06, 0.07, 0.09} BioMine ± 0.21, {0.12, 0.22, 0.36} Dataset Characteristics 12
18 Accuracy Results RQ-Tree-MC RQ-Tree-LB ɳ=0.4 ɳ= ɳ=0.8 ɳ=0.4 ɳ= ɳ=0.8 DBLP Flickr BioMine Precision RQ-Tree-MC RQ-Tree-LB ɳ=0.4 ɳ= ɳ=0.8 ɳ=0.4 ɳ= ɳ=0.8 DBLP Flickr BioMine Recall 13
19 Efficiency Results RQ-Tree-MC RQ-Tree-LB MC ɳ=0.4 ɳ= ɳ=0.8 ɳ=0.4 ɳ= ɳ=0.8 All ɳ DBLP Flickr BioMine Online query-processing time (sec) 14
20 Pruning Capacity of Filtering Phase Precision of Filtering Phase 15
21 RQ-Tree in Influence Maximization RQ-Tree index in multi-source reliability query and in influence maximization Expected pread (Last.FM) Top-k eed Finding Time (Last.FM) 16
22 Conclusion Indexing method for answering online reliability queries efficiently and effectively. RQ-tree works very well with lower arc probabilities and with higher probability threshold. In future, we shall study reliability search queries when the arc probabilities are not independent. 17
23 Questions?
Fast Reliability Search in Uncertain Graphs
Fast Reliability Search in Uncertain Graphs Arijit Khan, Francesco Bonchi, Aristides Gionis, Francesco Gullo Systems Group, ETH Zurich Yahoo Labs, Spain Aalto University, Finland ABSTRACT Uncertain, or
More informationUncertain Data Management Non-relational models: Graphs
Uncertain Data Management Non-relational models: Graphs Antoine Amarilli 1, Silviu Maniu 2 1 Télécom ParisTech 2 Université Paris-Sud January 16th, 2017 1/21 Credits M. Potamias, F. Bonchi, A. Gionis,
More informationdoc. RNDr. Tomáš Skopal, Ph.D. Department of Software Engineering, Faculty of Information Technology, Czech Technical University in Prague
Praha & EU: Investujeme do vaší budoucnosti Evropský sociální fond course: Searching the Web and Multimedia Databases (BI-VWM) Tomáš Skopal, 2011 SS2010/11 doc. RNDr. Tomáš Skopal, Ph.D. Department of
More informationDiscovering Highly Reliable Subgraphs in Uncertain Graphs
Discovering Highly Reliable Subgraphs in Uncertain Graphs Ruoming Jin Kent State University Kent, OH, USA jin@cs.kent.edu Lin Liu Kent State University Kent, OH, USA lliu@cs.kent.edu Charu C. Aggarwal
More informationImproving the Efficiency of Multi-site Web Search Engines
Improving the Efficiency of Multi-site Web Search Engines Xiao Bai ( xbai@yahoo-inc.com) Yahoo Labs Joint work with Guillem Francès Medina, B. Barla Cambazoglu and Ricardo Baeza-Yates July 15, 2014 Web
More informationDistributed Exact Subgraph Matching in Small Diameter Dynamic Graphs
Distributed Exact Subgraph Matching in Small Diameter Dynamic Graphs Charith Wickramaarachchi, Rajgopal Kannan Charalampos Chelmis*, and Viktor K. Prasanna University of Southern California *University
More informationExperimental Study on Speed-Up Techniques for Timetable Information Systems
Experimental Study on Speed-Up Techniques for Timetable Information Systems Reinhard Bauer (rbauer@ira.uka.de) Daniel Delling (delling@ira.uka.de) Dorothea Wagner (wagner@ira.uka.de) ITI Wagner, Universität
More informationFennel: Streaming Graph Partitioning for Massive Scale Graphs
Fennel: Streaming Graph Partitioning for Massive Scale Graphs Charalampos E. Tsourakakis 1 Christos Gkantsidis 2 Bozidar Radunovic 2 Milan Vojnovic 2 1 Aalto University, Finland 2 Microsoft Research, Cambridge
More informationTotal Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al.
Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Presented by Brandon Smith Computer Vision Fall 2007 Objective Given a query image of an object,
More informationCore Decomposition of Uncertain Graphs
Core Decomposition of Uncertain Graphs Francesco Bonchi 1 Francesco Gullo 1 Andreas Kaltenbrunner 2 Yana Volkovich 2 1 Yahoo Labs, Spain 2 Barcelona Media - Innovation Centre, Spain {bonchi,gullo}@yahoo-inc.com
More information4. Ad-hoc I: Hierarchical clustering
4. Ad-hoc I: Hierarchical clustering Hierarchical versus Flat Flat methods generate a single partition into k clusters. The number k of clusters has to be determined by the user ahead of time. Hierarchical
More informationUncertainties: Representation and Propagation & Line Extraction from Range data
41 Uncertainties: Representation and Propagation & Line Extraction from Range data 42 Uncertainty Representation Section 4.1.3 of the book Sensing in the real world is always uncertain How can uncertainty
More informationEfficient Processing of Multiple DTW Queries in Time Series Databases
Efficient Processing of Multiple DTW Queries in Time Series Databases Hardy Kremer 1 Stephan Günnemann 1 Anca-Maria Ivanescu 1 Ira Assent 2 Thomas Seidl 1 1 RWTH Aachen University, Germany 2 Aarhus University,
More informationVisual Representations for Machine Learning
Visual Representations for Machine Learning Spectral Clustering and Channel Representations Lecture 1 Spectral Clustering: introduction and confusion Michael Felsberg Klas Nordberg The Spectral Clustering
More informationPartitioning of Public Transit Networks
Partitioning of Public Transit Networks [Bachelor s thesis] Matthias Hertel Albert-Ludwigs-Universität Freiburg.09.205 Contents Introduction Motivation Goal 2 Data model 3 Algorithms K-means Merging algorithm
More informationCSE 5243 INTRO. TO DATA MINING
CSE 53 INTRO. TO DATA MINING Locality Sensitive Hashing (LSH) Huan Sun, CSE@The Ohio State University Slides adapted from Prof. Jiawei Han @UIUC, Prof. Srinivasan Parthasarathy @OSU MMDS Secs. 3.-3.. Slides
More informationLearning Similarity Metrics for Event Identification in Social Media. Hila Becker, Luis Gravano
Learning Similarity Metrics for Event Identification in Social Media Hila Becker, Luis Gravano Columbia University Mor Naaman Rutgers University Event Content in Social Media Sites Event Content in Social
More informationHierarchical Intelligent Cuttings: A Dynamic Multi-dimensional Packet Classification Algorithm
161 CHAPTER 5 Hierarchical Intelligent Cuttings: A Dynamic Multi-dimensional Packet Classification Algorithm 1 Introduction We saw in the previous chapter that real-life classifiers exhibit structure and
More informationInterdomain Routing Design for MobilityFirst
Interdomain Routing Design for MobilityFirst October 6, 2011 Z. Morley Mao, University of Michigan In collaboration with Mike Reiter s group 1 Interdomain routing design requirements Mobility support Network
More informationMining Social Network Graphs
Mining Social Network Graphs Analysis of Large Graphs: Community Detection Rafael Ferreira da Silva rafsilva@isi.edu http://rafaelsilva.com Note to other teachers and users of these slides: We would be
More informationCourse : Data mining
Course : Data mining Lecture : Mining data streams Aristides Gionis Department of Computer Science Aalto University visiting in Sapienza University of Rome fall 2016 reading assignment LRU book: chapter
More informationQuerying Shortest Distance on Large Graphs
.. Miao Qiao, Hong Cheng, Lijun Chang and Jeffrey Xu Yu Department of Systems Engineering & Engineering Management The Chinese University of Hong Kong October 19, 2011 Roadmap Preliminary Related Work
More informationOn Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs
On Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs Sungpack Hong 2, Nicole C. Rodia 1, and Kunle Olukotun 1 1 Pervasive Parallelism Laboratory, Stanford University
More informationTopologies. Maurizio Palesi. Maurizio Palesi 1
Topologies Maurizio Palesi Maurizio Palesi 1 Network Topology Static arrangement of channels and nodes in an interconnection network The roads over which packets travel Topology chosen based on cost and
More informationClustering Algorithms for general similarity measures
Types of general clustering methods Clustering Algorithms for general similarity measures general similarity measure: specified by object X object similarity matrix 1 constructive algorithms agglomerative
More informationTrack Join. Distributed Joins with Minimal Network Traffic. Orestis Polychroniou! Rajkumar Sen! Kenneth A. Ross
Track Join Distributed Joins with Minimal Network Traffic Orestis Polychroniou Rajkumar Sen Kenneth A. Ross Local Joins Algorithms Hash Join Sort Merge Join Index Join Nested Loop Join Spilling to disk
More informationNew Directions in Traffic Measurement and Accounting. Need for traffic measurement. Relation to stream databases. Internet backbone monitoring
New Directions in Traffic Measurement and Accounting C. Estan and G. Varghese Presented by Aaditeshwar Seth 1 Need for traffic measurement Internet backbone monitoring Short term Detect DoS attacks Long
More informationEfficient AND/OR Search Algorithms for Exact MAP Inference Task over Graphical Models
Efficient AND/OR Search Algorithms for Exact MAP Inference Task over Graphical Models Akihiro Kishimoto IBM Research, Ireland Joint work with Radu Marinescu and Adi Botea Outline 1 Background 2 RBFAOO
More informationMidterm Examination CS540-2: Introduction to Artificial Intelligence
Midterm Examination CS540-2: Introduction to Artificial Intelligence March 15, 2018 LAST NAME: FIRST NAME: Problem Score Max Score 1 12 2 13 3 9 4 11 5 8 6 13 7 9 8 16 9 9 Total 100 Question 1. [12] Search
More informationOn Optimizing Load Balancing of Intrusion Detection and Prevention Systems. Anh Le, Ehab Al-Shaer, and Raouf Boutaba
On Optimizing Load Balancing of Intrusion Detection and Prevention Systems Anh Le, Ehab Al-Shaer, and Raouf Boutaba Outline 1. Motivation 2. Approach Overview 3. Problem Formalization 4. Online Clustering
More informationOpportunistic Application Flows in Sensor-based Pervasive Environments
Opportunistic Application Flows in Sensor-based Pervasive Environments Nanyan Jiang, Cristina Schmidt, Vincent Matossian, and Manish Parashar ICPS 2004 1 Outline Introduction to pervasive sensor-based
More informationFast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs
Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs Sungpack Hong 2, Nicole C. Rodia 1, and Kunle Olukotun 1 1 Pervasive Parallelism Laboratory, Stanford University 2 Oracle
More informationTracking Groups in Mobile Network Traces
Tracking Groups in Mobile Network Traces Kun Tu*, Bruno Ribeiro**, Ananthram Swami***, Don Towsley* *University of Massachusetts, Amherst **Purdue University ***Army Research Lab Presented by Gayane Vardoyan
More informationApproximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme
Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme Miao Qiao, Hong Cheng, Lijun Chang and Jeffrey Xu Yu The Chinese University of Hong Kong {mqiao, hcheng, ljchang, yu}@secuhkeduhk
More informationExtracting Rankings for Spatial Keyword Queries from GPS Data
Extracting Rankings for Spatial Keyword Queries from GPS Data Ilkcan Keles Christian S. Jensen Simonas Saltenis Aalborg University Outline Introduction Motivation Problem Definition Proposed Method Overview
More informationPROJECT PROPOSALS: COMMUNITY DETECTION AND ENTITY RESOLUTION. Donatella Firmani
PROJECT PROPOSALS: COMMUNITY DETECTION AND ENTITY RESOLUTION Donatella Firmani donatella.firmani@uniroma3.it PROJECT 1: COMMUNITY DETECTION What is Community Detection? What Social Network Analysis is?
More informationAd hoc and Sensor Networks Topology control
Ad hoc and Sensor Networks Topology control Goals of this chapter Networks can be too dense too many nodes in close (radio) vicinity This chapter looks at methods to deal with such networks by Reducing/controlling
More informationInformation Retrieval
Information Retrieval ETH Zürich, Fall 2012 Thomas Hofmann LECTURE 6 EVALUATION 24.10.2012 Information Retrieval, ETHZ 2012 1 Today s Overview 1. User-Centric Evaluation 2. Evaluation via Relevance Assessment
More informationFinding Optimal Probabilistic Generators for XML Collections Serge Abiteboul, Yael Amsterdamer, Daniel Deutch, Tova Milo, Pierre Senellart
Finding Optimal Probabilistic Generators for XML Collections Serge Abiteboul, Yael Amsterdamer, Daniel Deutch, Tova Milo, Pierre Senellart To appear in BDA 2011 Motivation Adding probabilities to an XML
More informationImage Segmentation continued Graph Based Methods
Image Segmentation continued Graph Based Methods Previously Images as graphs Fully-connected graph node (vertex) for every pixel link between every pair of pixels, p,q affinity weight w pq for each link
More informationOn Smart Query Routing: For Distributed Graph Querying with Decoupled Storage
On Smart Query Routing: For Distributed Graph Querying with Decoupled Storage Arijit Khan Nanyang Technological University (NTU), Singapore Gustavo Segovia ETH Zurich, Switzerland Donald Kossmann Microsoft
More informationData Mining Practical Machine Learning Tools and Techniques. Slides for Chapter 6 of Data Mining by I. H. Witten and E. Frank
Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 6 of Data Mining by I. H. Witten and E. Frank Implementation: Real machine learning schemes Decision trees Classification
More informationJinho Hwang (IBM Research) Wei Zhang, Timothy Wood, H. Howie Huang (George Washington Univ.) K.K. Ramakrishnan (Rutgers University)
Jinho Hwang (IBM Research) Wei Zhang, Timothy Wood, H. Howie Huang (George Washington Univ.) K.K. Ramakrishnan (Rutgers University) Background: Memory Caching Two orders of magnitude more reads than writes
More informationFast Nearest Neighbor Search on Large Time-Evolving Graphs
Fast Nearest Neighbor Search on Large Time-Evolving Graphs Leman Akoglu Srinivasan Parthasarathy Rohit Khandekar Vibhore Kumar Deepak Rajan Kun-Lung Wu Graphs are everywhere Leman Akoglu Fast Nearest Neighbor
More informationSolution for Homework set 3
TTIC 300 and CMSC 37000 Algorithms Winter 07 Solution for Homework set 3 Question (0 points) We are given a directed graph G = (V, E), with two special vertices s and t, and non-negative integral capacities
More informationPerception. Autonomous Mobile Robots. Sensors Vision Uncertainties, Line extraction from laser scans. Autonomous Systems Lab. Zürich.
Autonomous Mobile Robots Localization "Position" Global Map Cognition Environment Model Local Map Path Perception Real World Environment Motion Control Perception Sensors Vision Uncertainties, Line extraction
More informationBig Data Analytics. Special Topics for Computer Science CSE CSE Feb 11
Big Data Analytics Special Topics for Computer Science CSE 4095-001 CSE 5095-005 Feb 11 Fei Wang Associate Professor Department of Computer Science and Engineering fei_wang@uconn.edu Clustering II Spectral
More information6. Learning Partitions of a Set
6. Learning Partitions of a Set Also known as clustering! Usually, we partition sets into subsets with elements that are somewhat similar (and since similarity is often task dependent, different partitions
More informationGraphs and Network Flows IE411. Lecture 13. Dr. Ted Ralphs
Graphs and Network Flows IE411 Lecture 13 Dr. Ted Ralphs IE411 Lecture 13 1 References for Today s Lecture IE411 Lecture 13 2 References for Today s Lecture Required reading Sections 21.1 21.2 References
More informationRandomized algorithms have several advantages over deterministic ones. We discuss them here:
CS787: Advanced Algorithms Lecture 6: Randomized Algorithms In this lecture we introduce randomized algorithms. We will begin by motivating the use of randomized algorithms through a few examples. Then
More informationI/O Efficient Algorithms for Exact Distance Queries on Disk- Resident Dynamic Graphs
I/O Efficient Algorithms for Exact Distance Queries on Disk- Resident Dynamic Graphs Yishi Lin, Xiaowei Chen, John C.S. Lui The Chinese University of Hong Kong 9/4/15 EXACT DISTANCE QUERIES ON DYNAMIC
More informationDeployment of Collaborative Web Caching with Active Networks
Deployment of Collaborative Web Caching with Active Networks Laurent Lefèvre, Jean-Marc Pierson, Sid Ali Guebli Laurent.lefevre@inria.fr Jean-Marc.Pierson@insa-lyon.fr Agenda Introduction & Motivations
More informationGraph Partitioning for Scalable Distributed Graph Computations
Graph Partitioning for Scalable Distributed Graph Computations Aydın Buluç ABuluc@lbl.gov Kamesh Madduri madduri@cse.psu.edu 10 th DIMACS Implementation Challenge, Graph Partitioning and Graph Clustering
More informationClustering Lecture 4: Density-based Methods
Clustering Lecture 4: Density-based Methods Jing Gao SUNY Buffalo 1 Outline Basics Motivation, definition, evaluation Methods Partitional Hierarchical Density-based Mixture model Spectral methods Advanced
More informationCyber Security Analysis of State Estimators in Electric Power Systems
Cyber Security Analysis of State Estimators in Electric Power Systems H. Sandberg, G. Dán, A. Teixeira, K. C. Sou, O. Vukovic, K. H. Johansson ACCESS Linnaeus Center KTH Royal Institute of Technology,
More informationDistributed Fine-Grained Node Localization in Ad-Hoc Networks. A Scalable Location Service for Geographic Ad Hoc Routing. Presented by An Nguyen
Distributed Fine-Grained Node Localization in Ad-Hoc Networks A Scalable Location Service for Geographic Ad Hoc Routing Presented by An Nguyen Distributed Fine-Grained Node Localization in Ad-Hoc Networks
More informationPlanar: Parallel Lightweight Architecture-Aware Adaptive Graph Repartitioning
Planar: Parallel Lightweight Architecture-Aware Adaptive Graph Repartitioning Angen Zheng, Alexandros Labrinidis, and Panos K. Chrysanthis University of Pittsburgh 1 Graph Partitioning Applications of
More informationSociaLite: A Datalog-based Language for
SociaLite: A Datalog-based Language for Large-Scale Graph Analysis Jiwon Seo M OBIS OCIAL RESEARCH GROUP Overview Overview! SociaLite: language for large-scale graph analysis! Extensions to Datalog! Compiler
More informationClustering. So far in the course. Clustering. Clustering. Subhransu Maji. CMPSCI 689: Machine Learning. dist(x, y) = x y 2 2
So far in the course Clustering Subhransu Maji : Machine Learning 2 April 2015 7 April 2015 Supervised learning: learning with a teacher You had training data which was (feature, label) pairs and the goal
More informationMin-Hashing and Geometric min-hashing
Min-Hashing and Geometric min-hashing Ondřej Chum, Michal Perdoch, and Jiří Matas Center for Machine Perception Czech Technical University Prague Outline 1. Looking for representation of images that: is
More informationOn the Utility of Inference Mechanisms
On the Utility of Inference Mechanisms Ethan Blanton Sonia Fahmy Greg N. Frederickson June 23, 2009 On the Utility of Inference Mechanisms We present an algorithm for determining when and if an inference
More informationDynamic Skyline Queries in Large Graphs
Dynamic Skyline Queries in Large Graphs Lei Zou, Lei Chen 2, M. Tamer Özsu 3, and Dongyan Zhao,4 Institute of Computer Science and Technology, Peking University, Beijing, China, {zoulei,zdy}@icst.pku.edu.cn
More informationSCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING
SCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING ZEYNEP KORKMAZ CS742 - PARALLEL AND DISTRIBUTED DATABASE SYSTEMS UNIVERSITY OF WATERLOO OUTLINE. Background 2. What is Schism?
More informationCommunity Detection. Community
Community Detection Community In social sciences: Community is formed by individuals such that those within a group interact with each other more frequently than with those outside the group a.k.a. group,
More informationConvexization in Markov Chain Monte Carlo
in Markov Chain Monte Carlo 1 IBM T. J. Watson Yorktown Heights, NY 2 Department of Aerospace Engineering Technion, Israel August 23, 2011 Problem Statement MCMC processes in general are governed by non
More informationPower-Aware Routing in Mobile Ad Hoc Networks
Power-Aware Routing in Mobile Ad Hoc Networks Introduction 5 power aware metrics for shortest-cost routing will be presented Compared to the traditional shortest-hop routing, shortest-cost routing Reduces
More informationHierarchical Clustering
Hierarchical Clustering Hierarchical Clustering Produces a set of nested clusters organized as a hierarchical tree Can be visualized as a dendrogram A tree-like diagram that records the sequences of merges
More informationCMPSCI 311: Introduction to Algorithms Practice Final Exam
CMPSCI 311: Introduction to Algorithms Practice Final Exam Name: ID: Instructions: Answer the questions directly on the exam pages. Show all your work for each question. Providing more detail including
More informationWeb Structure Mining Community Detection and Evaluation
Web Structure Mining Community Detection and Evaluation 1 Community Community. It is formed by individuals such that those within a group interact with each other more frequently than with those outside
More informationJure Leskovec, Cornell/Stanford University. Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research
Jure Leskovec, Cornell/Stanford University Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research Network: an interaction graph: Nodes represent entities Edges represent interaction
More informationDepartment of Computer Science San Marcos, TX Report Number TXSTATE-CS-TR Clustering in the Cloud. Xuan Wang
Department of Computer Science San Marcos, TX 78666 Report Number TXSTATE-CS-TR-2010-24 Clustering in the Cloud Xuan Wang 2010-05-05 !"#$%&'()*+()+%,&+!"-#. + /+!"#$%&'()*+0"*-'(%,1$+0.23%(-)+%-+42.--3+52367&.#8&+9'21&:-';
More informationLatent Variable Models for Structured Prediction and Content-Based Retrieval
Latent Variable Models for Structured Prediction and Content-Based Retrieval Ariadna Quattoni Universitat Politècnica de Catalunya Joint work with Borja Balle, Xavier Carreras, Adrià Recasens, Antonio
More informationALEMBIC: AUTOMATED MODEL INFERENCE FOR STATEFUL NETWORK FUNCTIONS
ALEMBIC: AUTOMATED MODEL INFERENCE FOR STATEFUL NETWORK FUNCTIONS Soo-Jin Moon Jeffrey Helt, Yifei Yuan, Yves Bieri, Sujata Banerjee, Vyas Sekar, Wenfei Wu, Mihalis Yannakakis, Ying Zhang Carnegie Mellon
More informationModel Checking Dynamic Datapaths
Model Checking Dynamic Datapaths Aurojit Panda, Katerina Argyraki, Scott Shenker UC Berkeley, ICSI, EPFL Networks: Not Just for Delivery Enforce a variety of invariants: Packet Isolation: Packets from
More informationQuery Evaluation Strategies
Introduction to Search Engine Technology Term-at-a-Time and Document-at-a-Time Evaluation Ronny Lempel Yahoo! Labs (Many of the following slides are courtesy of Aya Soffer and David Carmel, IBM Haifa Research
More informationGeneric Topology Mapping Strategies for Large-scale Parallel Architectures
Generic Topology Mapping Strategies for Large-scale Parallel Architectures Torsten Hoefler and Marc Snir Scientific talk at ICS 11, Tucson, AZ, USA, June 1 st 2011, Hierarchical Sparse Networks are Ubiquitous
More informationCS 231A CA Session: Problem Set 4 Review. Kevin Chen May 13, 2016
CS 231A CA Session: Problem Set 4 Review Kevin Chen May 13, 2016 PS4 Outline Problem 1: Viewpoint estimation Problem 2: Segmentation Meanshift segmentation Normalized cut Problem 1: Viewpoint Estimation
More informationEfficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling with Application to Network Structure Prediction
Efficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling with Application to Network Structure Prediction Takuya Akiba U Tokyo Takanori Hayashi U Tokyo Nozomi Nori Kyoto
More informationDynamic and Historical Shortest-Path Distance Queries on Large Evolving Networks by Pruned Landmark Labeling
2014/04/09 @ WWW 14 Dynamic and Historical Shortest-Path Distance Queries on Large Evolving Networks by Pruned Landmark Labeling Takuya Akiba (U Tokyo) Yoichi Iwata (U Tokyo) Yuichi Yoshida (NII & PFI)
More informationClustering. Subhransu Maji. CMPSCI 689: Machine Learning. 2 April April 2015
Clustering Subhransu Maji CMPSCI 689: Machine Learning 2 April 2015 7 April 2015 So far in the course Supervised learning: learning with a teacher You had training data which was (feature, label) pairs
More informationOptimising Topical Query Decomposition
Optimising Topical Query Decomposition Marcin Sydow 1 Francesco Bonchi 2 Carlos Castillo 2 Debora Donato 2 1 Web Mining Lab, Polish-Japanese Institute 2 Yahoo! Research of Information Technology, Warszawa,
More informationChapter 6: Distributed Systems: The Web. Fall 2012 Sini Ruohomaa Slides joint work with Jussi Kangasharju et al.
Chapter 6: Distributed Systems: The Web Fall 2012 Sini Ruohomaa Slides joint work with Jussi Kangasharju et al. Chapter Outline Web as a distributed system Basic web architecture Content delivery networks
More informationClient-server architectures
Client-server architectures Client/Server approach Basic concept 1. Clients are requesting information (e.g., a specific web page) 2. Servers are responding to individual requests Response Client Server
More informationTowards Querying and Mining of Large-Scale Networks
UNIVERSITY OF CALIFORNIA Santa Barbara Towards Querying and Mining of Large-Scale Networks A Dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in
More informationAn introduction to Web Mining part II
An introduction to Web Mining part II Ricardo Baeza-Yates, Aristides Gionis Yahoo! Research Barcelona, Spain & Santiago, Chile ECML/PKDD 2008 Antwerp Yahoo! Research Agenda Statistical methods: the size
More informationPredictive Analysis: Evaluation and Experimentation. Heejun Kim
Predictive Analysis: Evaluation and Experimentation Heejun Kim June 19, 2018 Evaluation and Experimentation Evaluation Metrics Cross-Validation Significance Tests Evaluation Predictive analysis: training
More informationExtracting Information from Complex Networks
Extracting Information from Complex Networks 1 Complex Networks Networks that arise from modeling complex systems: relationships Social networks Biological networks Distinguish from random networks uniform
More informationNon-convex Multi-objective Optimization
Non-convex Multi-objective Optimization Multi-objective Optimization Real-world optimization problems usually involve more than one criteria multi-objective optimization. Such a kind of optimization problems
More informationTutorial on Association Rule Mining
Tutorial on Association Rule Mining Yang Yang yang.yang@itee.uq.edu.au DKE Group, 78-625 August 13, 2010 Outline 1 Quick Review 2 Apriori Algorithm 3 FP-Growth Algorithm 4 Mining Flickr and Tag Recommendation
More informationA Scheme of Automated Object and Facet Extraction for Faceted Search over XML Data
IDEAS 2014 A Scheme of Automated Object and Facet Extraction for Faceted Search over XML Data Takahiro Komamizu, Toshiyuki Amagasa, Hiroyuki Kitagawa University of Tsukuba Background Introduction Background
More informationWireless Sensor Networks: Clustering, Routing, Localization, Time Synchronization
Wireless Sensor Networks: Clustering, Routing, Localization, Time Synchronization Maurizio Bocca, M.Sc. Control Engineering Research Group Automation and Systems Technology Department maurizio.bocca@tkk.fi
More informationData Clustering via Graph Min-Cuts. Erin R. Rhode University of Michigan EECS 767 March 15, 2006
Data Clustering via Graph Min-Cuts Erin R. Rhode University of Michigan EECS 767 March 15, 2006 Outline Why Data Clustering? Introduction to Graph Min-Cuts Bipartite Graph Partitioning (Zha et. al., 2001)
More informationJoint Entity Resolution
Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute
More informationCS644 Advanced Networks
Outline CS644 Advanced Networks Lecture 9 Intra Domain Routing Andreas Terzis Spring 2004 1 So far we have talked about E2E mechanisms Routing is the other big component of the network Largest distributed
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:
Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,
More informationGenetic-Algorithm-Based Construction of Load-Balanced CDSs in Wireless Sensor Networks
Genetic-Algorithm-Based Construction of Load-Balanced CDSs in Wireless Sensor Networks Jing He, Shouling Ji, Mingyuan Yan, Yi Pan, and Yingshu Li Department of Computer Science Georgia State University,
More informationNearest neighbors. Focus on tree-based methods. Clément Jamin, GUDHI project, Inria March 2017
Nearest neighbors Focus on tree-based methods Clément Jamin, GUDHI project, Inria March 2017 Introduction Exact and approximate nearest neighbor search Essential tool for many applications Huge bibliography
More informationINF4820, Algorithms for AI and NLP: Evaluating Classifiers Clustering
INF4820, Algorithms for AI and NLP: Evaluating Classifiers Clustering Erik Velldal University of Oslo Sept. 18, 2012 Topics for today 2 Classification Recap Evaluating classifiers Accuracy, precision,
More informationTRIE BASED METHODS FOR STRING SIMILARTIY JOINS
TRIE BASED METHODS FOR STRING SIMILARTIY JOINS Venkat Charan Varma Buddharaju #10498995 Department of Computer and Information Science University of MIssissippi ENGR-654 INFORMATION SYSTEM PRINCIPLES RESEARCH
More informationScalable Evaluation of k-nn Queries on Large Uncertain Graphs
Scalable Evaluation of k-nn Queries on Large Uncertain Graphs Xiaodong Li 1, Reynold Cheng 1, Yixiang Fang 1, Jiafeng Hu 1, Silviu Maniu 2 1 The University of Hong Kong, China, 2 Université Paris-Sud,
More information