Towardsunderstanding: Astudy ofthe SourceForge.net community using modeling and simulation

Size: px
Start display at page:

Download "Towardsunderstanding: Astudy ofthe SourceForge.net community using modeling and simulation"

Transcription

1 Towardsunderstanding: Astudy ofthe SourceForge.net community using modeling and simulation YongqinGao,GregMadey DepartmentofComputerScience and Engineering UniversityofNotreDame Keywords: Complex Network, Open Source Software Research, Agent Based Simulation, Verification and Validation Abstract The Open Source Software (OSS) development movement is a classic example of a social network; also it is a prototype of a complex evolving network. In this research, we used a iterated method to generate models depicting the evolution of this network. Then, using Java/Swarm, we simulated in our model the evolution of the OSS developer and project network and calibrated the model using the two year empirical data we collected from SourceForge. In simulation, we implemented several different models to replicate the empirical data. From these simulation iterations we were able to verify and validate our model for this network and to increase our understanding of the OSS developer collaboration network. As a result, we generated a fit model which can replicate all measures we inspected in the set. INTRODUCTION Open source software, usually created by volunteer programmers dispersed worldwide, now competes with that developed by commercial software firms. This is due in part because closed-source proprietary software has associated risks to users. These risks may be summarized as systems taking too long to develop, costing too much and not working very well when delivered. Open source software offers an opportunity to surmount those disadvantage of proprietary software. SourceForge is one of the most popular OSS hosting web sites, which offers features like bug tracking, project management, forum service, mailing list distribution, CVS and more. In January 7 it had almost.5 millions registered users and almost thousands registered projects. It is a good sample of the OSS community to study the underlying mechanisms of OSS communities. The collaboration relationships between the developers in the OSS community are the focus of our study. These relationships can be studied if we model them as a complex network. Scientists and mathematicians have been studying such Open source is a certification mark owned by the Open Source Initiative( networks for some time. Different from the traditional networks computer scientists study, randomness is dominant in this kind of network, since there is no centralized control of the construction and evolution of the network, every node in the network will determine what nodes to connect to itself based on the local knowledge it has. In this paper, we will use different network models for the complex networks, including random networks [6], Small world networks [], and Scale-free networks []. Finally, we will propose a modified scale-free network model to fit the OSS community. Previous research involving the simulation of complex networks is based on snapshots [,, ], which focused on the topology of the network at a given point in time. Since we collected data over a two year period, we were able to look into the evolution of the network. This helped us understand the evolution of the network topology, the development patterns of any single object in the network and the impact of the interaction among objects on the evolution of the overall network system. Preliminary reports on this research have been published in proceedings of NAACSOS [8, 9] and ADS [7]. The remainder of this paper is structured as follows. In section, we explain our approach to do the modeling and simulation including methodology and experiment setup. In Section, we discuss the model we generated to depict the SourceForge community. The result and analysis of the simulation we build for the SourceForge community and the verification and validation of the simulation are provided in Section. Finally, we conclude in Section 5. OUR APPROACH Simulation Methodology We will use the same simulation procedure we proposed in the previous study [, ], which is an iterated method to do modeling and simulation. The main goal of the iterations is to get a fit model to describe the evolution of a complex network by simulation. Verification and validation is the most important step in the simulation iterations []. By comparing against the empirical data set repeatly, verification can make the model converge to the empirical system. There are two important components in the verification step - measures and methods. The measures are the criterion for the verification and the methods are how SpringSim Vol. 5 ISBN

2 the verification is carried out. In our study, we used both statistical and network measures:. Network measures includes degree distribution, average diameter, average betweenness and so on. These measures describe the characteristics of the network structure and its evolution pattern.. Statistical measures includes monthly pageviews, downloads, number of developers in the group and so on. These measures describe the characteristics of individual entity in the network and its evolution pattern. For the verification methods, we used two verification methods to verify - visual inspection and statistical hypothesis test. Experiment Environment Our simulations are executed on a small computer cluster, which includes multiple simulation servers and duplicate database servers. These machines were connected by a M(bps) switched LAN (Local Area Network). The simulation servers, which are used to run the simulations, are all dual Xeon.6GHz with G of memory and G of swap. The database servers are used to host the repository database and working database. One (the repository database server) is dual Xeon.6GHz with G of memory, 6G of swap and T storage and the other (the working database server) is dual Xeon.6GHz with G of memory and G of swap. The operating systems of the cluster are as follows: The simulation servers are Linux with..-.elsmp kernel. The database servers are Linux with..-.elsmp kernel with PostgreSQL 7.. The simulation toolkit we used was Swarm and the programming language we used was Java. MODEL DESCRIPTION The model for simulation is like a procedure definition with related parameters. We are using the same procedure definition as our previous study [7]. In the model, there were two kinds of entities developer and project. Developers, which are the active roles in real SourceForge network, will also be the active entities (agents) in our model. Projects will stored in the simulation as an accessory data structure. Our model is a time-step based simulation, which means that every agent in the simulation is triggered to act at every time step. We define the time step in our model as reflecting a single day in the real world. At every time step, a given number of new developers, N(ND), will be generated and added into the simulation. In our model, the actual number of new developers at every time step was a random number meaned at N(ND). Then every existing developer (existing and new developers) in the simulation was triggered to act one by one. The procedure definitions for new developer and existing developer were different, which we will discuss separately. New developer. Every new developer is triggered to act right after it is generated. There are two possible actions for a new developer creating or joining. Creating means that the developer would create a new project and thus add a link from this developer to a new project. Joining means that the developer would select an existing project according to some rules, and thus a link from this developer to the project is added. P(CN) and P(JN) are the two parameters that controlled the probability of creating and joining. The one exemption is that the very first developer in the simulation has to create a new project because there are no existing projects in the system yet. Existing developer. Every existing developer is triggered to act at every time step. There are four possible actions for an existing developer creating, joining, abandoning and idling. Creating and joining are similar to the definitions for the new developer. Dropping means that the developer would drop a randomly selected project in which he had participated. Idling means that the developer did nothing, which is normally the most frequent action a developer will take. P(CO), P(JO), P(DO) and P(IO) are the four parameters to control the probability of creating, joining, dropping and idling. The probabilities that a developer would choose a certain action are different in different models for simulations based on different considered attributes. This is only the skeleton of the procedure definitions in our models. Different simulation models may have different procedure definitions. Most of these differences are related to the rules a developer used to choose an action and the rules a developer used to decide which project to join. Also, the related parameters may be different for different simulation iterations. All the procedure definitions are based on the skeleton procedure described in this section. We will explain the detailed procedure definitions and related parameters for every model when we discuss that specific model. RESULTS AND DISCUSSIONS In our previous study [, ], we iterated through adaptive ER model, BA model, BA model with constant fitness and BA model with dynamic fitness. As the starting point of current study - model one, we improved the last model in the previous study - BA model with dynamic fitness in the following aspects to better match the real SourceForge.net community. ISBN SpringSim Vol.

3 The first improvement is on the stochastic procedures. Since the networks we are simulating are complex networks, we will utilize multiple stochastic procedures in the model to replicate these complex characteristics. In our models, we have the following stochastic procedures: Generating new developers every time step; Generating fitness for new projects; Picking action at every time step for every user; Picking a project to join. The latter two stochastic procedures are well defined by developer preferential attachment and project preferential attachment discussed in the previous study. On the other hand, the first two stochastic procedures are not well defined in the previous models. Redefinition of these two stochastic procedures will be the first improvement in our first iterated model. In previous study, we assume that these two stochastic procedures are based on discrete uniform distribution. However, this is not a reasonable assumption in real networks like SourceForge.net community networks. We will replace the underlying distributions for these two stochastic procedures. For the first stochastic procedure generating new developers every time step, Poisson distribution is a reasonable approximation for the underlying distribution. Normally, it expresses the probability of a number of events occurring in a fixed period of time if these events occur with a known average rate, and are independent of the time since the last event. The probability density function for the Poisson distribution is F(k;λ) = e λ λ k where e is the base of the natural logarithm, k! is the factorial of k and λ is the expected number of occurrences that occur during the given interval. So we replaced the underlying distribution for the random number generator in the first stochastic procedure with the Poisson distribution with average λ at the average we inspected in the. For the second stochastic procedure initial fitness for new project, we used log-normal distribution as the underlying distribution for the stochastic procedure. The probability density function is k! f (x;µ,σ) = xσ /σ e (lnx µ) π for x >, where µ and σ are the mean and standard deviation of the variable s logarithm. The actual mean and standard deviation used in our model are picked according to the set to achieve a good fit. The second improvement is applying a periodical update procedure for project list. By applying these two improvements, we got our first model in this study. As we discussed before, we verified this model by comparing the network measures calculated from the set with the network measures calculated () () from the set. Our simulation starts from the very beginning of the network (the first node) and up to three simulated virtual years. The set starts from January and intermittently after that. The comparison will be taken in the overlapped time period, reflecting January to December for and simulated month to simulated month 6 in the simulation. This and other data from SourceForge.net is available for scholarly research from the authors [, ]. In addition to visual inspection that we used in the previous study, we will also use statistical hypothesis test to verify the match of the network measures. There are two kinds of statistics hypothesis test - parametric hypothesis testing and non-parametric hypothesis test. In our study, we can not make the assumption that the data set is normally distributed, so we will use non-parametric hypothesis test to verify the similarity of the measures from the simulation and the set. One of the tests we used is Chi-squared χ test for categorized data. The statistics χ is calculated by finding the difference between each observed and expected value, squaring them, dividing each by the theoretical value, and taking the sum of the results: χ = n i= (O i E i ) E i () where O i is an observed value and E i is an expected(empirical) value. We will use the χ test on verifying the network measures like average degrees, approximate diameter, approximate clustering coefficient, average betweenness and average closeness. The other test we used is the Kolmogorov-Smirnov K- S test for determining whether two underlying distributions of two data sets differ, or whether an underlying distribution differs from a hypothesized distribution, in either case based on finite samples. The two one-sided Kolmogorov-Smirnov test statistics are given by D + n = max(f n (x) F(x)) () D n = max(f(x) F n(x)) (5) The details about K-S test can be found at [5]. In our study, we used the K-S test to verify the degree distributions in the simulated data set match the corresponding distributions in the. The first network measure we inspected is the average degree and their development through the interested time period. The results are shown in Figure. The X coordinate is the time. The time range is [,], which is reflecting time range [January, December ] in the set and time range [month, month 6] for the simulated data set. The Y coordinate is the average degree value. All the SpringSim Vol. 7 ISBN

4 . Developer in C NET Project in C NET Diameter.7.75 CC Developer in D NET Project in P NET. Figure. s Comparison. x represents the and + represents the simulated data Approximate Diameter Approximate Clustering Coefficient four average degrees yield similar data points in empirical data and simulated data. We also verified this by using statistical hypothesis test by applying the χ test for all the average degrees. The χ for these four average degrees are.,.7,. and.7 respectively. By choosing significance level α at. and the d f value at, the critical value is.66. These values are all well below the critical value. Thus the null hypotheses are not rejected, which means the average degrees in the simulated virtual data set have no statistically significant difference from the average degrees in the set. Next, we verify the simulation results using the approximate diameter and approximate clustering coefficient. Figure shows the diameter development and the clustering coefficient development. Using visual inspection, we found the developments of diameter and clustering coefficient from the simulated data is similar to those of the. We also used the χ test to verify the similarity of the developments in the simulated data and the developments in the with α as. and d f as. The χ value for the diameter development and clustering coefficient development are.6 and.e-, which are well below the critical value at.66. So the null hypotheses are not rejected and thus the simulated data yields similar diameter and clustering coefficient development trends as the. The next network measures are the betweenness and closeness. we show the measures in Figure. By using the χ test, we found the χ value for the betweenness development and the closeness development are 5.8e-8 and.6e-9, which are well below the critical value.66 using α as. and d f as. So the null hypotheses are not rejected and, thus, the simulated data yields similar average betweenness and average closeness development trends as the Figure. Diameter and Clustering Coefficient Comparison. x represents the and + represents the simulated data Average Btweenness.69 x Betweenness.6 Average Closeness.6 x Closeness. Figure. Betweenness and Closeness Comparison. x represents the and + represents the simulated data ISBN SpringSim Vol.

5 Developers Developers(LOG) 6 x 5 5 Participated projects 6 5 Dev Dist in C NET.5.5 Participated projects(log) Projects Projects(LOG).5 x Proj Dist in C NET 5 5 Participated Developers 6 5 Participated developers(log) Figure. Degree Distributions Comparison. x represents the and o represents the simulated data Finally, we also verified the simulation model using the degree distributions. The degree distributions are shown in Figure. We used the K-S test to test whether the degree distributions in the simulated data set are the same as the degree distributions in the. The statistics for the developer degree distribution and project degree distribution are.75 and.979. Using the common significance level at.5, the hypotheses are not rejected. Thus both the developer degree distribution and project degree distribution in the simulated data are the same as the corresponding distributions in the. However, by visual inspection, we can still find differences between the degree distributions of the set and the simulated data set. For the developer degree distribution, the simulated data set has a significantly shorter tail than the. For the project degree distribution, the simulated data set has a significantly longer tail than the empirical data. These deficiencies (the busiest developer in the simulated data set participated in just projects instead of 5 in the and the biggest project in the simulated data set has more than 5 developers instead of 95 in the ) are not reasonable for the real world community such as SourceForge.net. So we iterated more models to make these deficiencies. The second iterated model is trying to make up for the deficiencies discovered in the previous model. To achieve this, we added a new parameter into the model - user energy. In the previous models, we introduced the fitness factor into the project preference. On the other hand, user preference is based only on the number of projects he/she participated. The deficiency in the previous model could be the result of lacking user fitness, so in this model we introduced a fitness factor for user preference, which is called user energy. We define the energy as a constant parameter γ i for every user, where γ i is chosen for a uniform continuous distribution between (,]. This energy level can be described as the probability of a user will take a action instead of idling during every time step. This model yielded similar results for most of the network measures as previous models except the degree distributions. The biggest project in the project distribution decreased from more than 5 to below 5. However, the developer distribution still has significant deficiency at the tail, compared to the same measures in the. Further iteration on the modeling introduced dynamic user energy into the model. In this model, we made the energy decay linearly by time and meanwhile it can be self-adjusted by taking roles and responsibilities or giving up roles and responsibilities in the community. By using this dynamic energy, we expected to catch the details of the possible influence of a user to the related project, even the whole community, and then generate better matches in both the developer distribution and the project distribution. The results of the simulation of this model improve the matches of developer distribution and project distribution. The degree distributions of this model yield similar results as the previous models except that the busiest user is increased to projects, which is more close the the while the size of the largest project is kept below. This is more reasonable tail for the SourceForge.net and close to the set. Thus this model is able to replicate all the network measures we used in the verification by both visual inspection and statistics hypothesis test methods. After analyzing and iterating the models, our third model reproduced all the inspected network measures we chose of the and captured all the phenomenon we wanted to highlight in the community and can be used to simulate the community and possibly predict the future of the SourceForge.net community by extending the simulations. CONCLUSION In this study, we used simulation methods to assist in understanding the topology and evolution of the Source- Forge.net developer collaboration network. In the study, we used an interactive method to develop more models for the SourceForge.net community. We iterated three more models from our previous study to generate the virtual SourceForge.net. In the simulation iteration, we verified the result by comparing the measures from the simulated data and measures from the using both visual inspection and statistical hypothesis test. The final result of the study is a model to simulate the SourceForge.net with matched network measures. Using this model, we are able to simulate the SourceForge.net community and possibly SpringSim Vol. 9 ISBN

6 predict the future development of the community by simply extendingthesimulationtime. Acknowledgment.ThisworkwassupportedinpartbytheNationalScienceFoundation,CISE/IIS-DigitalSociety&Technology,underGrantNo.89[,]. REFERENCES [] OSS research( 6. [] OSS research portal ( 6. [] R. Albert and A.L. Barabási. Emergence of scaling in random networks. Science, 86:59 5, 999. [] R. Bellman. Adaptive control processes: A guided tour. Princeton University Press, Princeton, NJ, 96. [5] I. M. Chakravarti, R. G. Laha and J. Roy. Handbook of methods of applied statistics. John Wiley and Sons, I, 967. [6] P. Erdös and A. Rényi. On random graphs. Publicationes Mathematicae, 6:9 97, 959. [7] Y. Gao, G. Madey and V. Freeh. Modeling and simulation of the open source software community. ADS 5, San Diego, 5. [8] Y. Gao, V. Freeh and G. Madey. Analysis and modeling of the open source software community. NAACSOS, Pittsburgh,. [9] Y. Gao, V. Freeh and G. Madey. Conceptual framework for agent-based modeling and simulation. NAACSOS, Pittsburgh,. [] G. Madey, V. Freeh and R. Tynan. Agent-based modeling of open source using swarm. Eight Americas Conference of Information Systems,. [] G. Madey, V. Freeh and R. Tynan. The open source software development phenomenon: an analysis based on social network theory. Eight Americas Conference of Information Systems,. [] M.E.J. Newman and D.J. Watts. Random graph models of social networks. Physics Review, 6(68),. [] D.J. Watts and S. H. Strogatz. Collective dynamics of small-world networks. Nature, 9:, 998. ISBN SpringSim Vol.

An Investigation into the Free/Open Source Software Phenomenon using Data Mining, Social Network Theory, and Agent-Based

An Investigation into the Free/Open Source Software Phenomenon using Data Mining, Social Network Theory, and Agent-Based An Investigation into the Free/Open Source Software Phenomenon using Data Mining, Social Network Theory, and Agent-Based Greg Madey Computer Science & Engineering University of Notre Dame UIUC - NSF Workshop

More information

Network Analysis of the SourceForge.net Community

Network Analysis of the SourceForge.net Community Network Analysis of the SourceForge.net Community Yongqin Gao and Greg Madey Department of Computer Science and Engineering University of Notre Dame {ygao,gmadey}@nd.edu Abstract. Software is central to

More information

γ : constant Goett 2 P(k) = k γ k : degree

γ : constant Goett 2 P(k) = k γ k : degree Goett 1 Jeffrey Goett Final Research Paper, Fall 2003 Professor Madey 19 December 2003 Abstract: Recent observations by physicists have lead to new theories about the mechanisms controlling the growth

More information

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich (Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Outline Network Topological Analysis Network Models Random Networks Small-World Networks Scale-Free Networks

More information

An Evolving Network Model With Local-World Structure

An Evolving Network Model With Local-World Structure The Eighth International Symposium on Operations Research and Its Applications (ISORA 09) Zhangjiajie, China, September 20 22, 2009 Copyright 2009 ORSC & APORC, pp. 47 423 An Evolving Network odel With

More information

Erdős-Rényi Model for network formation

Erdős-Rényi Model for network formation Network Science: Erdős-Rényi Model for network formation Ozalp Babaoglu Dipartimento di Informatica Scienza e Ingegneria Università di Bologna www.cs.unibo.it/babaoglu/ Why model? Simpler representation

More information

Constructing a G(N, p) Network

Constructing a G(N, p) Network Random Graph Theory Dr. Natarajan Meghanathan Professor Department of Computer Science Jackson State University, Jackson, MS E-mail: natarajan.meghanathan@jsums.edu Introduction At first inspection, most

More information

Constructing a G(N, p) Network

Constructing a G(N, p) Network Random Graph Theory Dr. Natarajan Meghanathan Associate Professor Department of Computer Science Jackson State University, Jackson, MS E-mail: natarajan.meghanathan@jsums.edu Introduction At first inspection,

More information

Properties of Biological Networks

Properties of Biological Networks Properties of Biological Networks presented by: Ola Hamud June 12, 2013 Supervisor: Prof. Ron Pinter Based on: NETWORK BIOLOGY: UNDERSTANDING THE CELL S FUNCTIONAL ORGANIZATION By Albert-László Barabási

More information

Complex Networks. Structure and Dynamics

Complex Networks. Structure and Dynamics Complex Networks Structure and Dynamics Ying-Cheng Lai Department of Mathematics and Statistics Department of Electrical Engineering Arizona State University Collaborators! Adilson E. Motter, now at Max-Planck

More information

Overlay (and P2P) Networks

Overlay (and P2P) Networks Overlay (and P2P) Networks Part II Recap (Small World, Erdös Rényi model, Duncan Watts Model) Graph Properties Scale Free Networks Preferential Attachment Evolving Copying Navigation in Small World Samu

More information

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno Wednesday, March 8, 2006 Complex Networks Presenter: Jirakhom Ruttanavakul CS 790R, University of Nevada, Reno Presented Papers Emergence of scaling in random networks, Barabási & Bonabeau (2003) Scale-free

More information

Critical Phenomena in Complex Networks

Critical Phenomena in Complex Networks Critical Phenomena in Complex Networks Term essay for Physics 563: Phase Transitions and the Renormalization Group University of Illinois at Urbana-Champaign Vikyath Deviprasad Rao 11 May 2012 Abstract

More information

CSCI5070 Advanced Topics in Social Computing

CSCI5070 Advanced Topics in Social Computing CSCI5070 Advanced Topics in Social Computing Irwin King The Chinese University of Hong Kong king@cse.cuhk.edu.hk!! 2012 All Rights Reserved. Outline Graphs Origins Definition Spectral Properties Type of

More information

A generative model for feedback networks

A generative model for feedback networks A generative model for feedback networks D.R. White 1 N. Kejžar 2 C. Tsallis 3 D. Farmer 4 S. White 1 1 University of California Irvine, USA 2 University of Ljubljana, Slovenia 3 Centro Brasileiro de Pesquisas

More information

Exercise set #2 (29 pts)

Exercise set #2 (29 pts) (29 pts) The deadline for handing in your solutions is Nov 16th 2015 07:00. Return your solutions (one.pdf le and one.zip le containing Python code) via e- mail to Becs-114.4150@aalto.fi. Additionally,

More information

CAIM: Cerca i Anàlisi d Informació Massiva

CAIM: Cerca i Anàlisi d Informació Massiva 1 / 72 CAIM: Cerca i Anàlisi d Informació Massiva FIB, Grau en Enginyeria Informàtica Slides by Marta Arias, José Balcázar, Ricard Gavaldá Department of Computer Science, UPC Fall 2016 http://www.cs.upc.edu/~caim

More information

Detection of Resource-Drained Attacks on SIP-Based Wireless VoIP Networks

Detection of Resource-Drained Attacks on SIP-Based Wireless VoIP Networks Detection of Resource-Drained Attacks on SIP-Based Wireless VoIP Networks Jin Tang, Yong Hao, Yu Cheng and Chi Zhou Department of Electrical and Computer Engineering Illinois Institute of Technology, Chicago,

More information

How Do Real Networks Look? Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

How Do Real Networks Look? Networked Life NETS 112 Fall 2014 Prof. Michael Kearns How Do Real Networks Look? Networked Life NETS 112 Fall 2014 Prof. Michael Kearns Roadmap Next several lectures: universal structural properties of networks Each large-scale network is unique microscopically,

More information

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns Models of Network Formation Networked Life NETS 112 Fall 2017 Prof. Michael Kearns Roadmap Recently: typical large-scale social and other networks exhibit: giant component with small diameter sparsity

More information

Constructing Numerical Reference Modes from Sparse Time Series

Constructing Numerical Reference Modes from Sparse Time Series Constructing Numerical Reference Modes from Sparse Series Peter S. Hovmand, Ph.D. Washington University George Warren Brown School of Social Work Campus Box 1196 One Brookings Drive St. Louis, MO 63130

More information

M.E.J. Newman: Models of the Small World

M.E.J. Newman: Models of the Small World A Review Adaptive Informatics Research Centre Helsinki University of Technology November 7, 2007 Vocabulary N number of nodes of the graph l average distance between nodes D diameter of the graph d is

More information

A Generating Function Approach to Analyze Random Graphs

A Generating Function Approach to Analyze Random Graphs A Generating Function Approach to Analyze Random Graphs Presented by - Vilas Veeraraghavan Advisor - Dr. Steven Weber Department of Electrical and Computer Engineering Drexel University April 8, 2005 Presentation

More information

Use of Extreme Value Statistics in Modeling Biometric Systems

Use of Extreme Value Statistics in Modeling Biometric Systems Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision

More information

Modeling and Performance Analysis with Discrete-Event Simulation

Modeling and Performance Analysis with Discrete-Event Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation Chapter 10 Verification and Validation of Simulation Models Contents Model-Building, Verification, and Validation Verification

More information

Empirical Study on Impact of Developer Collaboration on Source Code

Empirical Study on Impact of Developer Collaboration on Source Code Empirical Study on Impact of Developer Collaboration on Source Code Akshay Chopra University of Waterloo Waterloo, Ontario a22chopr@uwaterloo.ca Parul Verma University of Waterloo Waterloo, Ontario p7verma@uwaterloo.ca

More information

Graph Sampling Approach for Reducing. Computational Complexity of. Large-Scale Social Network

Graph Sampling Approach for Reducing. Computational Complexity of. Large-Scale Social Network Journal of Innovative Technology and Education, Vol. 3, 216, no. 1, 131-137 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/1.12988/jite.216.6828 Graph Sampling Approach for Reducing Computational Complexity

More information

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008 Lesson 4 Random graphs Sergio Barbarossa Graph models 1. Uncorrelated random graph (Erdős, Rényi) N nodes are connected through n edges which are chosen randomly from the possible configurations 2. Binomial

More information

Bootstrapping Methods

Bootstrapping Methods Bootstrapping Methods example of a Monte Carlo method these are one Monte Carlo statistical method some Bayesian statistical methods are Monte Carlo we can also simulate models using Monte Carlo methods

More information

arxiv: v1 [cs.si] 11 Jan 2019

arxiv: v1 [cs.si] 11 Jan 2019 A Community-aware Network Growth Model for Synthetic Social Network Generation Furkan Gursoy furkan.gursoy@boun.edu.tr Bertan Badur bertan.badur@boun.edu.tr arxiv:1901.03629v1 [cs.si] 11 Jan 2019 Dept.

More information

Applying Social Network Analysis to the Information in CVS Repositories

Applying Social Network Analysis to the Information in CVS Repositories Applying Social Network Analysis to the Information in CVS Repositories Luis López-Fernández, Gregorio Robles-Martínez, Jesús M. González-Barahona GSyC, Universidad Rey Juan Carlos {llopez,grex,jgb}@gsyc.escet.urjc.es

More information

HEURISTIC OPTIMIZATION USING COMPUTER SIMULATION: A STUDY OF STAFFING LEVELS IN A PHARMACEUTICAL MANUFACTURING LABORATORY

HEURISTIC OPTIMIZATION USING COMPUTER SIMULATION: A STUDY OF STAFFING LEVELS IN A PHARMACEUTICAL MANUFACTURING LABORATORY Proceedings of the 1998 Winter Simulation Conference D.J. Medeiros, E.F. Watson, J.S. Carson and M.S. Manivannan, eds. HEURISTIC OPTIMIZATION USING COMPUTER SIMULATION: A STUDY OF STAFFING LEVELS IN A

More information

Analytic Performance Models for Bounded Queueing Systems

Analytic Performance Models for Bounded Queueing Systems Analytic Performance Models for Bounded Queueing Systems Praveen Krishnamurthy Roger D. Chamberlain Praveen Krishnamurthy and Roger D. Chamberlain, Analytic Performance Models for Bounded Queueing Systems,

More information

Microscopic Traffic Simulation

Microscopic Traffic Simulation Microscopic Traffic Simulation Lecture Notes in Transportation Systems Engineering Prof. Tom V. Mathew Contents Overview 2 Traffic Simulation Models 2 2. Need for simulation.................................

More information

Analysis of the Social Community Based on the Network Growing Model in Open Source Software Community

Analysis of the Social Community Based on the Network Growing Model in Open Source Software Community Analysis of the Social Community Based on the Networ Growing Model in Open Source Software Community arxiv:8.8v [cs.ma] 9 Apr 8 Taumi Ichimura Department of Management and Systems, Prefectural University

More information

Typeset with NdThesiS version 2.14 (2000/09/08) on November 17, 2003 THIS PAGE IS NOT PART OF THE THESIS, BUT SHOULD BE TURNED IN TO THE PROOFREADER!

Typeset with NdThesiS version 2.14 (2000/09/08) on November 17, 2003 THIS PAGE IS NOT PART OF THE THESIS, BUT SHOULD BE TURNED IN TO THE PROOFREADER! Typeset with NdThesiS version 2.14 (2000/09/08) on November 17, 2003 for Yongqin Gao entitled TOPOLOGY AND EVOLUTION OF THE OPEN SOURCE SOFTWARE COMMUNITY This class conforms to the University of Notre

More information

Slides 11: Verification and Validation Models

Slides 11: Verification and Validation Models Slides 11: Verification and Validation Models Purpose and Overview The goal of the validation process is: To produce a model that represents true behaviour closely enough for decision making purposes.

More information

Chapter 16. Microscopic Traffic Simulation Overview Traffic Simulation Models

Chapter 16. Microscopic Traffic Simulation Overview Traffic Simulation Models Chapter 6 Microscopic Traffic Simulation 6. Overview The complexity of traffic stream behaviour and the difficulties in performing experiments with real world traffic make computer simulation an important

More information

Higher order clustering coecients in Barabasi Albert networks

Higher order clustering coecients in Barabasi Albert networks Physica A 316 (2002) 688 694 www.elsevier.com/locate/physa Higher order clustering coecients in Barabasi Albert networks Agata Fronczak, Janusz A. Ho lyst, Maciej Jedynak, Julian Sienkiewicz Faculty of

More information

Excel 2010 with XLSTAT

Excel 2010 with XLSTAT Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with

More information

An Empirical Analysis of Communities in Real-World Networks

An Empirical Analysis of Communities in Real-World Networks An Empirical Analysis of Communities in Real-World Networks Chuan Sheng Foo Computer Science Department Stanford University csfoo@cs.stanford.edu ABSTRACT Little work has been done on the characterization

More information

Analytical reasoning task reveals limits of social learning in networks

Analytical reasoning task reveals limits of social learning in networks Electronic Supplementary Material for: Analytical reasoning task reveals limits of social learning in networks Iyad Rahwan, Dmytro Krasnoshtan, Azim Shariff, Jean-François Bonnefon A Experimental Interface

More information

Package fastnet. September 11, 2018

Package fastnet. September 11, 2018 Type Package Title Large-Scale Social Network Analysis Version 0.1.6 Package fastnet September 11, 2018 We present an implementation of the algorithms required to simulate largescale social networks and

More information

Introduction to Networks and Business Intelligence

Introduction to Networks and Business Intelligence Introduction to Networks and Business Intelligence Prof. Dr. Daning Hu Department of Informatics University of Zurich Sep 16th, 2014 Outline n Network Science A Random History n Network Analysis Network

More information

6.207/14.15: Networks Lecture 5: Generalized Random Graphs and Small-World Model

6.207/14.15: Networks Lecture 5: Generalized Random Graphs and Small-World Model 6.207/14.15: Networks Lecture 5: Generalized Random Graphs and Small-World Model Daron Acemoglu and Asu Ozdaglar MIT September 23, 2009 1 Outline Generalized random graph models Graphs with prescribed

More information

Clustering: Classic Methods and Modern Views

Clustering: Classic Methods and Modern Views Clustering: Classic Methods and Modern Views Marina Meilă University of Washington mmp@stat.washington.edu June 22, 2015 Lorentz Center Workshop on Clusters, Games and Axioms Outline Paradigms for clustering

More information

Specific Communication Network Measure Distribution Estimation

Specific Communication Network Measure Distribution Estimation Baller, D., Lospinoso, J. Specific Communication Network Measure Distribution Estimation 1 Specific Communication Network Measure Distribution Estimation Daniel P. Baller, Joshua Lospinoso United States

More information

Package fastnet. February 12, 2018

Package fastnet. February 12, 2018 Type Package Title Large-Scale Social Network Analysis Version 0.1.4 Package fastnet February 12, 2018 We present an implementation of the algorithms required to simulate largescale social networks and

More information

ARMA MODEL SELECTION USING PARTICLE SWARM OPTIMIZATION AND AIC CRITERIA. Mark S. Voss a b. and Xin Feng.

ARMA MODEL SELECTION USING PARTICLE SWARM OPTIMIZATION AND AIC CRITERIA. Mark S. Voss a b. and Xin Feng. Copyright 2002 IFAC 5th Triennial World Congress, Barcelona, Spain ARMA MODEL SELECTION USING PARTICLE SWARM OPTIMIZATION AND AIC CRITERIA Mark S. Voss a b and Xin Feng a Department of Civil and Environmental

More information

Introduction to the Special Issue on AI & Networks

Introduction to the Special Issue on AI & Networks Introduction to the Special Issue on AI & Networks Marie desjardins, Matthew E. Gaston, and Dragomir Radev March 16, 2008 As networks have permeated our world, the economy has come to resemble an ecology

More information

TELCOM2125: Network Science and Analysis

TELCOM2125: Network Science and Analysis School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2015 Figures are taken from: M.E.J. Newman, Networks: An Introduction 2

More information

On Reshaping of Clustering Coefficients in Degreebased Topology Generators

On Reshaping of Clustering Coefficients in Degreebased Topology Generators On Reshaping of Clustering Coefficients in Degreebased Topology Generators Xiafeng Li, Derek Leonard, and Dmitri Loguinov Texas A&M University Presented by Derek Leonard Agenda Motivation Statement of

More information

A COMPUTER-AIDED SIMULATION ANALYSIS TOOL FOR SIMAN MODELS AUTOMATICALLY GENERATED FROM PETRI NETS

A COMPUTER-AIDED SIMULATION ANALYSIS TOOL FOR SIMAN MODELS AUTOMATICALLY GENERATED FROM PETRI NETS A COMPUTER-AIDED SIMULATION ANALYSIS TOOL FOR SIMAN MODELS AUTOMATICALLY GENERATED FROM PETRI NETS Albert Peñarroya, Francesc Casado and Jan Rosell Institute of Industrial and Control Engineering Technical

More information

Note Set 4: Finite Mixture Models and the EM Algorithm

Note Set 4: Finite Mixture Models and the EM Algorithm Note Set 4: Finite Mixture Models and the EM Algorithm Padhraic Smyth, Department of Computer Science University of California, Irvine Finite Mixture Models A finite mixture model with K components, for

More information

Graph Model Selection using Maximum Likelihood

Graph Model Selection using Maximum Likelihood Graph Model Selection using Maximum Likelihood Ivona Bezáková Adam Tauman Kalai Rahul Santhanam Theory Canal, Rochester, April 7 th 2008 [ICML 2006 (International Conference on Machine Learning)] Overview

More information

Using the Kolmogorov-Smirnov Test for Image Segmentation

Using the Kolmogorov-Smirnov Test for Image Segmentation Using the Kolmogorov-Smirnov Test for Image Segmentation Yong Jae Lee CS395T Computational Statistics Final Project Report May 6th, 2009 I. INTRODUCTION Image segmentation is a fundamental task in computer

More information

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS Overview of Networks Instructor: Yizhou Sun yzsun@cs.ucla.edu January 10, 2017 Overview of Information Network Analysis Network Representation Network

More information

Supply chains involve complex webs of interactions among suppliers, manufacturers,

Supply chains involve complex webs of interactions among suppliers, manufacturers, D e p e n d a b l e A g e n t S y s t e m s Survivability of Multiagent-Based Supply Networks: A Topological Perspective Hari Prasad Thadakamalla, Usha Nandini Raghavan, Soundar Kumara, and Réka Albert,

More information

Distributed Detection in Sensor Networks: Connectivity Graph and Small World Networks

Distributed Detection in Sensor Networks: Connectivity Graph and Small World Networks Distributed Detection in Sensor Networks: Connectivity Graph and Small World Networks SaeedA.AldosariandJoséM.F.Moura Electrical and Computer Engineering Department Carnegie Mellon University 5000 Forbes

More information

Random Generation of the Social Network with Several Communities

Random Generation of the Social Network with Several Communities Communications of the Korean Statistical Society 2011, Vol. 18, No. 5, 595 601 DOI: http://dx.doi.org/10.5351/ckss.2011.18.5.595 Random Generation of the Social Network with Several Communities Myung-Hoe

More information

CLUSTERING. JELENA JOVANOVIĆ Web:

CLUSTERING. JELENA JOVANOVIĆ   Web: CLUSTERING JELENA JOVANOVIĆ Email: jeljov@gmail.com Web: http://jelenajovanovic.net OUTLINE What is clustering? Application domains K-Means clustering Understanding it through an example The K-Means algorithm

More information

Summary: What We Have Learned So Far

Summary: What We Have Learned So Far Summary: What We Have Learned So Far small-world phenomenon Real-world networks: { Short path lengths High clustering Broad degree distributions, often power laws P (k) k γ Erdös-Renyi model: Short path

More information

The importance of networks permeates

The importance of networks permeates Introduction to the Special Issue on AI and Networks Marie desjardins, Matthew E. Gaston, and Dragomir Radev This introduction to AI Magazine s special issue on networks and AI summarizes the seven articles

More information

Nonparametric Estimation of Distribution Function using Bezier Curve

Nonparametric Estimation of Distribution Function using Bezier Curve Communications for Statistical Applications and Methods 2014, Vol. 21, No. 1, 105 114 DOI: http://dx.doi.org/10.5351/csam.2014.21.1.105 ISSN 2287-7843 Nonparametric Estimation of Distribution Function

More information

Lecture: Simulation. of Manufacturing Systems. Sivakumar AI. Simulation. SMA6304 M2 ---Factory Planning and scheduling. Simulation - A Predictive Tool

Lecture: Simulation. of Manufacturing Systems. Sivakumar AI. Simulation. SMA6304 M2 ---Factory Planning and scheduling. Simulation - A Predictive Tool SMA6304 M2 ---Factory Planning and scheduling Lecture Discrete Event of Manufacturing Systems Simulation Sivakumar AI Lecture: 12 copyright 2002 Sivakumar 1 Simulation Simulation - A Predictive Tool Next

More information

THE EFFECT OF SEGREGATION IN NON- REPEATED PRISONER'S DILEMMA

THE EFFECT OF SEGREGATION IN NON- REPEATED PRISONER'S DILEMMA THE EFFECT OF SEGREGATION IN NON- REPEATED PRISONER'S DILEMMA Thomas Nordli University of South-Eastern Norway, Norway ABSTRACT This article consolidates the idea that non-random pairing can promote the

More information

CSE 190 Lecture 16. Data Mining and Predictive Analytics. Small-world phenomena

CSE 190 Lecture 16. Data Mining and Predictive Analytics. Small-world phenomena CSE 190 Lecture 16 Data Mining and Predictive Analytics Small-world phenomena Another famous study Stanley Milgram wanted to test the (already popular) hypothesis that people in social networks are separated

More information

UNIVERSITA DEGLI STUDI DI CATANIA FACOLTA DI INGEGNERIA

UNIVERSITA DEGLI STUDI DI CATANIA FACOLTA DI INGEGNERIA UNIVERSITA DEGLI STUDI DI CATANIA FACOLTA DI INGEGNERIA PhD course in Electronics, Automation and Complex Systems Control-XXIV Cycle DIPARTIMENTO DI INGEGNERIA ELETTRICA ELETTRONICA E DEI SISTEMI ing.

More information

Open Source Software Developer and Project Networks

Open Source Software Developer and Project Networks Open Source Software Developer and Project Networks Matthew Antwerp, Greg Madey To cite this version: Matthew Antwerp, Greg Madey. Open Source Software Developer and Project Networks. Pär Ågerfalk; Cornelia

More information

CS-E5740. Complex Networks. Scale-free networks

CS-E5740. Complex Networks. Scale-free networks CS-E5740 Complex Networks Scale-free networks Course outline 1. Introduction (motivation, definitions, etc. ) 2. Static network models: random and small-world networks 3. Growing network models: scale-free

More information

You ve already read basics of simulation now I will be taking up method of simulation, that is Random Number Generation

You ve already read basics of simulation now I will be taking up method of simulation, that is Random Number Generation Unit 5 SIMULATION THEORY Lesson 39 Learning objective: To learn random number generation. Methods of simulation. Monte Carlo method of simulation You ve already read basics of simulation now I will be

More information

Probability Models.S4 Simulating Random Variables

Probability Models.S4 Simulating Random Variables Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Probability Models.S4 Simulating Random Variables In the fashion of the last several sections, we will often create probability

More information

Small World Properties Generated by a New Algorithm Under Same Degree of All Nodes

Small World Properties Generated by a New Algorithm Under Same Degree of All Nodes Commun. Theor. Phys. (Beijing, China) 45 (2006) pp. 950 954 c International Academic Publishers Vol. 45, No. 5, May 15, 2006 Small World Properties Generated by a New Algorithm Under Same Degree of All

More information

On Complex Dynamical Networks. G. Ron Chen Centre for Chaos Control and Synchronization City University of Hong Kong

On Complex Dynamical Networks. G. Ron Chen Centre for Chaos Control and Synchronization City University of Hong Kong On Complex Dynamical Networks G. Ron Chen Centre for Chaos Control and Synchronization City University of Hong Kong 1 Complex Networks: Some Typical Examples 2 Complex Network Example: Internet (William

More information

The Simulation Study of Spread and Evolution about Network Opinion on Education

The Simulation Study of Spread and Evolution about Network Opinion on Education Asian Social Science; Vol. 10, No. 10; 2014 ISSN 1911-2017 E-ISSN 1911-2025 Published by Canadian Center of Science and Education The Simulation Study of Spread and Evolution about Network Opinion on Education

More information

Descriptive Statistics, Standard Deviation and Standard Error

Descriptive Statistics, Standard Deviation and Standard Error AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.

More information

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques Kouhei Sugiyama, Hiroyuki Ohsaki and Makoto Imase Graduate School of Information Science and Technology,

More information

The missing links in the BGP-based AS connectivity maps

The missing links in the BGP-based AS connectivity maps The missing links in the BGP-based AS connectivity maps Zhou, S; Mondragon, RJ http://arxiv.org/abs/cs/0303028 For additional information about this publication click this link. http://qmro.qmul.ac.uk/xmlui/handle/123456789/13070

More information

A Comparison of Evaluation Networks and Collaboration Networks in Open Source Software Communities

A Comparison of Evaluation Networks and Collaboration Networks in Open Source Software Communities Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2008 Proceedings Americas Conference on Information Systems (AMCIS) 2008 A Comparison of Evaluation Networks and Collaboration Networks

More information

Genetic Algorithms with Oracle for the Traveling Salesman Problem

Genetic Algorithms with Oracle for the Traveling Salesman Problem PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY VOLUME 7 AUGUST 25 ISSN 17-884 Genetic Algorithms with Oracle for the Traveling Salesman Problem Robin Gremlich, Andreas Hamfelt, Héctor

More information

arxiv: v1 [physics.soc-ph] 13 Jan 2011

arxiv: v1 [physics.soc-ph] 13 Jan 2011 An Estimation of the Shortest and Largest Average Path Length in Graphs of Given Density László Gulyás, Gábor Horváth, Tamás Cséri and George Kampis arxiv:1101.549v1 [physics.soc-ph] 1 Jan 011 Abstract

More information

Stable Statistics of the Blogograph

Stable Statistics of the Blogograph Stable Statistics of the Blogograph Mark Goldberg, Malik Magdon-Ismail, Stephen Kelley, Konstantin Mertsalov Rensselaer Polytechnic Institute Department of Computer Science Abstract. The primary focus

More information

Complex networks: A mixture of power-law and Weibull distributions

Complex networks: A mixture of power-law and Weibull distributions Complex networks: A mixture of power-law and Weibull distributions Ke Xu, Liandong Liu, Xiao Liang State Key Laboratory of Software Development Environment Beihang University, Beijing 100191, China Abstract:

More information

Metaheuristic Development Methodology. Fall 2009 Instructor: Dr. Masoud Yaghini

Metaheuristic Development Methodology. Fall 2009 Instructor: Dr. Masoud Yaghini Metaheuristic Development Methodology Fall 2009 Instructor: Dr. Masoud Yaghini Phases and Steps Phases and Steps Phase 1: Understanding Problem Step 1: State the Problem Step 2: Review of Existing Solution

More information

Models for the growth of the Web

Models for the growth of the Web Models for the growth of the Web Chi Bong Ho Introduction Yihao Ben Pu December 6, 2007 Alexander Tsiatas There has been much work done in recent years about the structure of the Web and other large information

More information

Graph Types. Peter M. Kogge. Graphs Types. Types of Graphs. Graphs: Sets (V,E) where E= {(u,v)}

Graph Types. Peter M. Kogge. Graphs Types. Types of Graphs. Graphs: Sets (V,E) where E= {(u,v)} Graph Types Peter M. Kogge Please Sir, I want more 1 Types of Graphs Graphs: Sets (V,E) where E= {(u,v)} Undirected: (u,v) = (v,u) Directed: (u,v)!= (v,u) Networks: Graphs with weights Multi-graphs: multiple

More information

A Statistical Method for Synthetic Power Grid Generation based on the U.S. Western Interconnection

A Statistical Method for Synthetic Power Grid Generation based on the U.S. Western Interconnection A Statistical Method for Synthetic Power Grid Generation based on the U.S. Western Interconnection Saleh Soltan, Gil Zussman Columbia University New York, NY Failures in Power Grids One of the most essential

More information

Topologies and Centralities of Replied Networks on Bulletin Board Systems

Topologies and Centralities of Replied Networks on Bulletin Board Systems Topologies and Centralities of Replied Networks on Bulletin Board Systems Qin Sen 1,2 Dai Guanzhong 2 Wang Lin 2 Fan Ming 2 1 Hangzhou Dianzi University, School of Sciences, Hangzhou, 310018, China 2 Northwestern

More information

Dynamic network generative model

Dynamic network generative model Dynamic network generative model Habiba, Chayant Tantipathanananandh, Tanya Berger-Wolf University of Illinois at Chicago. In this work we present a statistical model for generating realistic dynamic networks

More information

Estimating Local Decision-Making Behavior in Complex Evolutionary Systems

Estimating Local Decision-Making Behavior in Complex Evolutionary Systems Estimating Local Decision-Making Behavior in Complex Evolutionary Systems Zhenghui Sha Graduate Research Assistant School of Mechanical Engineering, Purdue University West Lafayette, Indiana 47907 E-mail:

More information

Attack Vulnerability of Network with Duplication-Divergence Mechanism

Attack Vulnerability of Network with Duplication-Divergence Mechanism Commun. Theor. Phys. (Beijing, China) 48 (2007) pp. 754 758 c International Academic Publishers Vol. 48, No. 4, October 5, 2007 Attack Vulnerability of Network with Duplication-Divergence Mechanism WANG

More information

Spectral Methods for Network Community Detection and Graph Partitioning

Spectral Methods for Network Community Detection and Graph Partitioning Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman Department of Physics, University of Michigan Presenters: Yunqi Guo Xueyin Yu Yuanqi Li 1 Outline: Community Detection

More information

Statistical Analysis of List Experiments

Statistical Analysis of List Experiments Statistical Analysis of List Experiments Kosuke Imai Princeton University Joint work with Graeme Blair October 29, 2010 Blair and Imai (Princeton) List Experiments NJIT (Mathematics) 1 / 26 Motivation

More information

Internet as a Complex Network. Guanrong Chen City University of Hong Kong

Internet as a Complex Network. Guanrong Chen City University of Hong Kong Internet as a Complex Network Guanrong Chen City University of Hong Kong 1 Complex Network: Internet (K. C. Claffy) 2 Another View of the Internet http://www.caida.org/analysis/topology/as_core_network/

More information

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011 Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011 1. Introduction Reddit is one of the most popular online social news websites with millions

More information

Rethinking Preferential Attachment Scheme: Degree centrality versus closeness centrality 1

Rethinking Preferential Attachment Scheme: Degree centrality versus closeness centrality 1 CONNECTIONS 27(3): 53-59 2007 INSNA http://www.insna.org/connections-web/volume27-3/ko.pdf Rethinking Preferential Attachment Scheme: Degree centrality versus closeness centrality 1 Kilkon Ko 2 University

More information

SIMULTANEOUS COMPUTATION OF MODEL ORDER AND PARAMETER ESTIMATION FOR ARX MODEL BASED ON MULTI- SWARM PARTICLE SWARM OPTIMIZATION

SIMULTANEOUS COMPUTATION OF MODEL ORDER AND PARAMETER ESTIMATION FOR ARX MODEL BASED ON MULTI- SWARM PARTICLE SWARM OPTIMIZATION SIMULTANEOUS COMPUTATION OF MODEL ORDER AND PARAMETER ESTIMATION FOR ARX MODEL BASED ON MULTI- SWARM PARTICLE SWARM OPTIMIZATION Kamil Zakwan Mohd Azmi, Zuwairie Ibrahim and Dwi Pebrianti Faculty of Electrical

More information

Developing Effect Sizes for Non-Normal Data in Two-Sample Comparison Studies

Developing Effect Sizes for Non-Normal Data in Two-Sample Comparison Studies Developing Effect Sizes for Non-Normal Data in Two-Sample Comparison Studies with an Application in E-commerce Durham University Apr 13, 2010 Outline 1 Introduction Effect Size, Complementory for Hypothesis

More information

ENTITIES WITH COMBINED DISCRETE-CONTINUOUS ATTRIBUTES IN DISCRETE- EVENT-DRIVEN SYSTEMS

ENTITIES WITH COMBINED DISCRETE-CONTINUOUS ATTRIBUTES IN DISCRETE- EVENT-DRIVEN SYSTEMS ENTITIES WITH COMBINED DISCRETE-CONTINUOUS ATTRIBUTES IN DISCRETE- EVENT-DRIVEN SYSTEMS Kristina Dammasch, Graham Horton (a) (a) University of Magdeburg, Department of Simulation and Graphics (a) kristina@sim-md.de,

More information

Sequences Modeling and Analysis Based on Complex Network

Sequences Modeling and Analysis Based on Complex Network Sequences Modeling and Analysis Based on Complex Network Li Wan 1, Kai Shu 1, and Yu Guo 2 1 Chongqing University, China 2 Institute of Chemical Defence People Libration Army {wanli,shukai}@cqu.edu.cn

More information