DANCer: Dynamic Attributed Network with Community Structure Generator


To cite this version: Oualid Benyahia, Christine Largeron, Baptiste Jeudy, Osmar Zaïane. DANCer: Dynamic Attributed Network with Community Structure Generator. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2016, Riva del Garda, Italy. 9853, pp. 41-44, 2016, Machine Learning and Knowledge Discovery in Databases. <10.1007/978-3-319-46131-1_9>. <hal-01377321>

HAL Id: hal-01377321
https://hal-auf.archives-ouvertes.fr/hal-01377321
Submitted on 6 Oct 2016

Oualid Benyahia¹, Christine Largeron¹, Baptiste Jeudy¹, and Osmar R. Zaïane²

¹ Univ Lyon, UJM-Saint-Etienne, CNRS, Institut d'Optique Graduate School, Laboratoire Hubert Curien UMR 5516, F-42023, Saint-Etienne, France. {oualid.benyahia, christine.largeron, baptiste.jeudy}@univ-st-etienne.fr
² Department of Computer Science, University of Alberta, Edmonton, Canada. zaiane@cs.ualberta.ca

Abstract. We propose a new generator for dynamic attributed networks with community structure that follow known properties of real-world networks, such as preferential attachment, small world and homophily. After the generation, the graphs forming the dynamic network, as well as its evolution, can be displayed in the interface. Several measures are also computed to evaluate the properties verified by each graph. Finally, the generated dynamic network, the parameters and the measures can be saved as a collection of files.

Keywords: Social network mining, Attributed graph, Synthetic data generator

1 Introduction

The proliferation of complex information networks in diverse fields of application has led to a panoply of methods for analyzing these networks and discovering relevant patterns in them. However, evaluating these methods and comparing the different approaches is difficult, because large real networks with ground truth are rarely freely accessible to researchers. The alternative is to use synthetic data provided by generators. There is a large literature on the generation of static graphs, including the classic Erdős-Rényi (ER) model, which generates random graphs, and the Barabási-Albert (BA) model, which generates random scale-free networks. Very few generators, however, allow the construction of evolving graphs, with or without a community structure, and none of them takes into account the attribute values of the vertices.
Our interest in community detection, link prediction and, more generally, pattern discovery in dynamic networks whose vertices are associated with attributes led us to develop DANCer, a generator for attributed dynamic graphs with embedded community structure. This generator extends a previous generator, ANC, dedicated to static graphs [1].

2 Model

An attributed dynamic network generated by DANCer is represented by (1) a sequence of $T$ attributed graphs $G_i = (V_i, E_i)$, $i \in \{1, \dots, T\}$, where $V_i$ is a set of vertices and $E_i$ a set of undirected edges, and where, for each vertex $v \in V_i$ and each real attribute $A \in \mathcal{A}$, $v_A$ denotes the value of attribute $A$ assigned to vertex $v$; and (2) a sequence of $T$ partitions $P_i$ of $V_i$, $i \in \{1, \dots, T\}$, each of which assigns a community to every vertex of the corresponding graph $G_i$. Each partition defines a community structure on a graph (i.e., the network at a single timestamp) such that the vertices are grouped into sets that are densely connected and relatively homogeneous with regard to the attributes, while being less connected to, and less similar to, vertices belonging to other groups.

The generation of the network is carried out in two phases. In the first phase, an initial graph $G_1 = (V_1, E_1)$ is built while respecting well-known network properties such as preferential attachment, small world and homophily. In the second phase, this initial graph is modified through two kinds of operations. The first kind, called micro operations, consists in removing or adding vertices and edges, or updating vertex attributes. The second kind is applied to the communities, i.e., at a macro level, and consists in (1) migrating members of a community to either a new community or an existing one, (2) splitting a community into two new sub-communities and (3) merging two existing communities into a single one.

3 Software Overview

The user interface has three panels, as shown in Fig. 1. In the parameter panel, the user selects the dynamic generator parameters presented in Table 1.³ A seed is used for the random number generator; it can be saved to reproduce exactly the same network. The visualization panel displays the generated network and its dynamic evolution.
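The macro operations and the role of the seed can be illustrated with a minimal Python sketch. It is not DANCer's implementation: the function names (`merge`, `split`, `migrate`, `evolve`), the uniform choice among operations, the half-and-half split and the representation of a partition as a dictionary from vertex to community label are all assumptions made for illustration. The initial graph of phase one is taken as given, and micro operations and attributes are left out.

```python
import random

def merge(partition, a, b):
    """Merge community b into community a (hypothetical helper)."""
    return {v: (a if c == b else c) for v, c in partition.items()}

def split(partition, edges, a, new_label, p_remove, rng):
    """Move half of community a to new_label; each edge now crossing the
    two halves is removed with probability p_remove."""
    members = sorted(v for v, c in partition.items() if c == a)
    rng.shuffle(members)
    moved = set(members[: len(members) // 2])
    new_partition = {v: (new_label if v in moved else c)
                     for v, c in partition.items()}
    kept = set()
    for u, v in sorted(edges):
        crosses = (partition[u] == a and partition[v] == a
                   and ((u in moved) != (v in moved)))
        if crosses and rng.random() < p_remove:
            continue  # drop this edge between the two new sub-communities
        kept.add((u, v))
    return new_partition, kept

def migrate(partition, vertices, target):
    """Move the given vertices to the target community."""
    moved = set(vertices)
    return {v: (target if v in moved else c) for v, c in partition.items()}

def evolve(partition, edges, timestamps, seed):
    """Return a list of (partition, edges) pairs, one per timestamp.
    The same seed always yields the same sequence."""
    rng = random.Random(seed)  # assumption: one seeded generator drives everything
    sequence = [(dict(partition), set(edges))]
    for _ in range(timestamps - 1):
        labels = sorted(set(partition.values()))
        op = rng.choice(["merge", "split", "migrate"])  # assumption: uniform choice
        if op == "merge" and len(labels) >= 2:
            a, b = rng.sample(labels, 2)
            partition = merge(partition, a, b)
        elif op == "split":
            a = rng.choice(labels)
            partition, edges = split(partition, edges, a,
                                     max(labels) + 1, 0.5, rng)
        elif op == "migrate":
            v = rng.choice(sorted(partition))
            partition = migrate(partition, [v], rng.choice(labels))
        sequence.append((dict(partition), set(edges)))
    return sequence
```

Calling `evolve` twice with the same inputs and the same `seed` returns identical sequences, which is the reproducibility property the seed is meant to provide.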
Each graph in the sequence can be selected with a timestamp scrollbar and viewed separately (Fig. 1). This panel can also display the size and the evolution of the communities across the sequence of graphs, as shaped by the macro dynamic operations (split, merge and migrate; see Fig. 2). The sequence of attributed graphs is built while preserving properties of real networks, and several measures, such as modularity, clustering coefficient, diameter, expected and observed homophily, and within inertia rate, are computed on each graph of the dynamic network to describe its properties.⁴ The changes in these measures over the sequence of graphs are presented at the bottom of the interface, in the measure panel (Fig. 3).

³ The reader is referred to [1] for the static network generator parameters: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0122777
⁴ See [1] for a more detailed presentation of the properties and corresponding measures.
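Of the measures listed above, modularity is the easiest to illustrate. The sketch below uses the standard Newman definition for an undirected, unweighted graph, $Q = \sum_c \left[ L_c / m - (d_c / 2m)^2 \right]$, where $m$ is the number of edges, $L_c$ the number of edges inside community $c$ and $d_c$ the sum of the degrees of its vertices. Whether DANCer computes modularity in exactly this form is an assumption; the function name and the data layout (edge list plus vertex-to-community dictionary) are illustrative.

```python
from collections import defaultdict

def modularity(edges, partition):
    """Newman modularity of a partition of an undirected, unweighted graph.

    edges: iterable of (u, v) pairs, each undirected edge listed once.
    partition: dict mapping each vertex to its community label.
    """
    edges = list(edges)
    m = len(edges)
    if m == 0:
        return 0.0
    internal = defaultdict(int)    # L_c: edges with both ends in community c
    degree_sum = defaultdict(int)  # d_c: sum of degrees of vertices in c
    for u, v in edges:
        degree_sum[partition[u]] += 1
        degree_sum[partition[v]] += 1
        if partition[u] == partition[v]:
            internal[partition[u]] += 1
    return sum(internal[c] / m - (degree_sum[c] / (2 * m)) ** 2
               for c in degree_sum)

# Two triangles joined by a single edge, one community per triangle.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
two_groups = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}
one_group = {v: 0 for v in range(6)}
q_two = modularity(edges, two_groups)  # 5/14, about 0.357
q_one = modularity(edges, one_group)   # 0.0
```

Applying such a function to every graph $G_i$ with its partition $P_i$ gives the kind of per-timestamp series that the measure panel plots.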

Fig. 1. User interface of the generator DANCer (panels: Visualization Panel, Parameters Panel, Measures / Degree Distribution Panel).

Fig. 2. Community dynamics display in visualization panel.

Fig. 3. Measures panel of the generator.

The bottom of the user interface also includes a panel displaying the distribution of vertex degrees for each graph of the sequence, as shown in Fig. 4. The generated dynamic network can be saved as a collection of files. For each graph of the sequence, a file gives the composition of the graph (vertices and edges), and a parameters file lists all the parameters used by the generator. The graph measures and community dynamics can also be saved in separate files.

| Parameter | Domain | Description |
| Micro operations | | |
| ProbaMicro | | A threshold to select if the micro dynamic updates are performed or not |
| AddVertex | | Ratio defining the number of vertices inserted |
| RemoveVertex | | Ratio defining the number of vertices removed |
| UpdateAttr. | | Ratio defining the number of attributes updated |
| AddBtw.Edges | | Ratio defining the number of between edges inserted |
| RemoveBtw.Edges | | Ratio defining the number of between edges removed |
| AddWth.Edges | | Ratio defining the number of within edges inserted |
| RemoveWth.Edges | | Ratio defining the number of within edges removed |
| Macro operations | | |
| PremoveEdgeSplit | | Probability of removing an edge between two vertices that were in the same community before a split |
| Timestamps | $\mathbb{N}^+$ | Number of graphs generated |
| ProbaMerge | | Probability to perform the merge operation |
| ProbaSplit | | Probability to perform the split operation |
| ProbaMigrate | | Probability to perform the migrate vertices operation |

Table 1. Description of the dynamic network generator parameters.

Fig. 4. Degree distribution panel.

4 Conclusion

The software DANCer and a detailed user manual⁵ are available under the terms of the GNU Public Licence. Note that our generator can trivially be extended to produce multiplex networks, where all nodes are present in all levels and inter-level edges connect the representations of a node from one level to the other. This conversion is possible by simply converting each timestamp graph into a layer of the multiplex network and adding the necessary inter-level edges.

References

1. Largeron C., Mougel P.N., Rabbany R., Zaïane O.R. Generating Attributed Networks with Communities. PLoS ONE 10(4): e0122777 (2015)

⁵ http://perso.univ-st-etienne.fr/largeron/dancer_generator/
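The multiplex conversion mentioned in the conclusion can be sketched in a few lines of Python. This is an illustration under stated assumptions, not part of DANCer: the function name `to_multiplex` is hypothetical, each timestamp graph is assumed to be a `(vertices, edges)` pair, and the copies of a vertex are linked only between consecutive layers (linking every pair of layers would be an equally valid reading of the text).

```python
def to_multiplex(graphs):
    """Turn a sequence of timestamp graphs into a multiplex network.

    graphs: list of (vertices, edges) pairs, one per timestamp.
    Returns (nodes, within_layer, coupling) where a node is a
    (vertex, layer) pair.
    """
    # Every vertex seen at any timestamp is present in every layer.
    all_vertices = set().union(*(set(vs) for vs, _ in graphs))
    layers = range(len(graphs))
    nodes = {(v, t) for t in layers for v in all_vertices}
    # Edges of timestamp t become edges inside layer t.
    within_layer = {((u, t), (v, t))
                    for t, (_, es) in enumerate(graphs)
                    for u, v in es}
    # Assumption: couple the copies of each vertex in consecutive layers.
    coupling = {((v, t), (v, t + 1))
                for t in range(len(graphs) - 1)
                for v in all_vertices}
    return nodes, within_layer, coupling

# Two timestamps: vertex 1 disappears and vertex 3 appears.
graphs = [({1, 2}, {(1, 2)}), ({2, 3}, {(2, 3)})]
nodes, within_layer, coupling = to_multiplex(graphs)
```

In this example the multiplex network has six nodes (three vertices in two layers), one edge inside each layer and three coupling edges, one per vertex.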