Sociotechnical Information From Software Repositories

Size: px
Start display at page:

Download "Sociotechnical Information From Software Repositories"

Transcription

1 Sociotechnical Information From Software Repositories Marco Aurélio Gerosa UNIVERSITY OF SÃO PAULO, BRAZIL UFU November/

2 Repositories of repositories 250K projects 93K projects 1 million users 11.3 million repositories 5.4 million users In 2013: 3 million new users 661K projects 30K projects 29 billion of lines of codes million pushes million users 25 million comments million issues 7 million pull requests 33K projects 36K projects s 324K projects 3.4 million developers 200 projects %20SourceForge.net ng_facilities 2

3 Mining Data mining = computational process of discovering patterns in largemining data sets The Software Repositories (MSR) field analyzes the rich data available in software repositories to uncover interesting and actionable information about software systems and projects. Mining 3

4 Programmers of a given language are happier than the others?

5 Sentiment analysis on commits GitHub Data Challenge 2nd place messages/ 5

6 If a programmer knows a language, which others does she know?

7 Programming language relations Visualise 7

8 When are the bugs inserted?

9 Don t program on Fridays that-matters 9

10 Which files are more buggy?

11 Which files are more buggy? 11

12 And what about social data? IS IT IMPORTANT FOR SOFTWARE ENGINEERING?

13 David Parnas David Parnas Software Engineering = Multiperson development of multi-version programs 13

14 Frederick Brooks To avoid disaster, all the teams working on a project should remain in contact with each other in as many ways as possible Frederick Brooks 14

15 Conway s law Organizations which design systems are constrained to produce designs which are copies of the communication structures of these organizations 15

16 Agile manifesto Individuals and interactions ov er processes and tools 16

17 Social aspects matter! 17

18 Mining software repositories Source codediscussion lists Issue trackers and artifacts Comments on issues Project management systems Code comments Reputation systems User reports Q&A sites Social media laboration and software production Mining Practitioner Researcher Applications Information about a project Decision making Information about an ecosystem Software understanding Information about Software Engineering Support maintenance Empirical validation of ideas & techniques Tag cloud from MSR CFP 18

19 Coordination requirements Cataldo, M., Dependencies in geographically distribut software development: Overcoming the limits of modularity, PhD Thesis Santana, F. et a. XFlow: An Extensible Tool for Empirical Analysis of Software Systems Evolutio ESELAW

20 Programmers who changed this function also changed Thomas Zimmermann, Peter Weissgerber, Stephan Diehl, and Andreas Zeller Mining Version Histories to Guide Software Changes. IEEE Trans. Software Eng. 31, 6 (June 2005), DOI= /TSE

21 Some of our work

22 Change Coupling

23 Change coupling Files frequently changed together share some sort of dependency [Gall et al. 1998] Strong change dependency from B to A (the opposite is a much weaker dependency) A logical dependency denotes an implicit and evolutionary relationship between software artifacts The concept has proven useful in several different studies: Software quality analysis [Cataldo & Nambiar, 2010] Bugs prediction [D Ambros et al., 2009a] Change prediction and change impact analysis [Zimmermann et al., 2005] Uncover cross-cutting concerns [Adams et al., 2010] Uncover design flaws and opportunities for refactoring [D Ambros et al., 2009b] Understand and evaluate software architecture [Zimmermann et al., 2003] Requirements traceability [Ali et al., 2013] Maintain documentation [Kagdi et al., 2006] 23

24 1.1) Structural x Change coupling Overlap between change dependencies and structural dependencies S tr u c tu r a l D ependency C o -c h a n g e s? Gustavo Oliva, PhD candidate lysis of 150K commits of the ASF showed that: % of the change dependencies did not involve structural dependencies % of the structural dependencies did not imply in a change dependency Oliva, G.A., Gerosa, M. A. (2011) On the Interplay between Structural and Logical Dependencies in Free Software. Brazilian Symposium on Software Engineering (SBES 2011) 24

25 1.2) Change coupling origins Manual classification of commits to understand the origins of change coupling Category Refactoring elements that belong to a same semantic class Structural dependencies on a changing semantic class Cross-cutting concerns Overloaded revision Repository operations Structural dependencies on specific elements Other reasons Total Jointchanges Total % % 9 2.2% % 14.7% 5.1% % % Gustavo Oliva, PhD candidate Oliva, G.A., Santana, F., Gerosa, M. A., Souza, C. (2011) Towards a Classification of Logical Dependencies Origins: A Case Study. Proceedings of the 12th International Workshop on Principles of Software Evolution and the 7th annual ERCIM Workshop on Software Evolution (IWPSE-EVOL '11) 25

26 1.3) Preprocessing commits Commit = change? Using the sliding time window approach [Zimmermann & Weißgerber, 2004] to group SVN commits Gustavo Oliva, PhD candidate Evaluation in the Apache code repository showed that the produced grouping corresponded to 4.6% of the number of commits What about commit habits/practices/policies? Social aspects matter! Oliva, G. A., Santana, F., Gerosa, M. A., Souza, C. (2012), Preprocessing Change-Sets to Improve Logical Dependencies Identification, 6th Int. Workshop on Software Quality and Maintainability (SQM 2012) 26

27 How to identify design degradation?

28 2) Design degradation identification Rigidity and fragility [Martin & Martin, 2006] identification based on commit metadata Gustavo Oliva, PhD candidate 5. 3 Rigidity => designs difficult to change due to ripple effects => commit density (number of changed files per commit) Fragility => designs break in different areas when a change is performed => commit dispersion (distance in the directory tree among file paths included in a commit) Oliva, G., Steinmacher, I., Wiese, I.S., Gerosa, M.A. What Can Commit Metadata Tell Us About Design Degradation?, In: 13th International Workshop on Principles on Software Evolution (IWPSE 2013), Saint Petersburg. 28

29 Who are the key developers?

30 3) Key developers characterization Key developers participation Oliva, G., Santana, F.W., da Silva, J. T., Oliveira, K.C.M., Werner, C.M.L., Souza, C.R.B. & Gerosa, M.A., Evolving the System s Core: A Case Study on the Identification and Characterization of Key Developers in Apache Ant, Computing and Informatics [to appear]. 30

31 Do tests characteristics indicate the quality of the code under test?

32 4) Unit tests feedback for code quality Mauricio Aniche, PhD candidate Number of asserts indicate - Cyclomatic complexity? - LOC? - Method calls? - Asserted Objects metric presents better results than number of asserts - Statistically difference in 20% of the projects 22 ASF projects 3 industry projects Aniche, M., Oliva, G.A., Gerosa, M.A., What Do the Asserts in a Unit Test Tell Us about Code Quality? A Study on Open Source and Industrial Projects, 17th European Conference on Software Maintenance and Reengineering (CSMR 2013). 32

33 Does refactoring reduce cyclomatic complexity?

34 5) Refactoring Most part of the documented refactoring does not reduce cyclomatic complexity. However, 23% of the documented refactoring reduce cyclomatic complexity while 12% of the other commits have the same effect. Francisco Sokol, BSc Sokol, F., Aniche, M.F., Gerosa, M.A., Does the Act of Refactoring Really Make Code Simpler? A Preliminary Study,. In: IV Brazilian Workshop of Agile Methods (WBMA 2013). 34

35 Is it possible to predict changes, bugs, and change dependencies?

36 6) Change prediction Using social, process, and architectural metrics to improve change proneness prediction Igor Wiese, PhD candidate 13 projects, 6 classifiers, 11 metrics Social and architectural metrics improved the prediction model Similar results when considering projects grouped by change ratio Wiese, I. S, Nassif Jr, Steinmacher I, Re Reginaldo, Gerosa, M.A Comparing communication and development networks for predicting file change proneness: An exploratory study considering process and social metrics,. In: International Workshop on Software Quality and Maintainability (SQM ). 36

37 Why do newcomers dropout from OSS projects?

38 7) Supporting Newcomers to OSS projects Analysis of newcomers dropout reasons Use of MSR to identify newcomers Igor Steinmacher PhD candidate Systematic review on awareness in DSD [JCSCW 2013] com um + ação A ç ã o d e to rn a r c o m u m C O M U N IC A Ç Ã O dem anda C O O PE RAÇ Ã O c o + o p e ra r + a ç ã o A ç ã o d e o p e ra r e m c o n ju n to g e r a c o m p r o m is s o s g e r e n c ia d o s p e l a A wareness C O O RD ENAÇ ÃO o r g a n iz a a s ta r e f a s p a r a c o + o rd e m + a ç ã o A ç ã o d e o r g a n iz a r e m c o n ju n to Steinmacher, I., Wiese, I.S., Chaves, A.P., Gerosa, M.A., Why do newcomers abandon open source software projects?, 6th Int. Workshop on Cooperative and Human Aspects of Software Engineering (CHASE 2013) 38

39 How to use MSR to support newcomers?

40 7) Supporting Newcomers to OSS projects Systematic Literature Review on barriers for newcomers to OSS Qualitative analysis of interviews Igor Steinmacher PhD candidate Steinmacher, I., Wiese, I., Conte, T., Gerosa, M.A., Redmiles, D.F. The Hard Life of Newcomers to OSS Projects, 7th International Workshop on Cooperative and Human Aspects of Software Engineering Steinmacher, I.; Graciotto Silva, M. A. ; GEROSA, M.A., Barriers faced by newcomers to open source projects: a systematic review. 10th International Conference on Open Source 40 Systems (OSS )

41 41

Towards a Classification of Logical Dependencies Origins: A Case Study

Towards a Classification of Logical Dependencies Origins: A Case Study Towards a Classification of Logical Dependencies Origins: A Case Study Gustavo Oliva, Francisco Santana, Marco Aurelio Gerosa, Cleidson De Souza To cite this version: Gustavo Oliva, Francisco Santana,

More information

Quantifying and Assessing the Merge of Cloned Web-Based System: An Exploratory Study

Quantifying and Assessing the Merge of Cloned Web-Based System: An Exploratory Study Quantifying and Assessing the Merge of Cloned Web-Based System: An Exploratory Study Jadson Santos Department of Informatics and Applied Mathematics Federal University of Rio Grande do Norte, UFRN Natal,

More information

BugMaps-Granger: A Tool for Causality Analysis between Source Code Metrics and Bugs

BugMaps-Granger: A Tool for Causality Analysis between Source Code Metrics and Bugs BugMaps-Granger: A Tool for Causality Analysis between Source Code Metrics and Bugs Cesar Couto, Pedro Pires, Marco Tulio Valente, Roberto Bigonha, Andre Hora, Nicolas Anquetil To cite this version: Cesar

More information

Empirical Study on Impact of Developer Collaboration on Source Code

Empirical Study on Impact of Developer Collaboration on Source Code Empirical Study on Impact of Developer Collaboration on Source Code Akshay Chopra University of Waterloo Waterloo, Ontario a22chopr@uwaterloo.ca Parul Verma University of Waterloo Waterloo, Ontario p7verma@uwaterloo.ca

More information

How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects?

How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects? How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects? Saraj Singh Manes School of Computer Science Carleton University Ottawa, Canada sarajmanes@cmail.carleton.ca Olga

More information

Cross-project defect prediction. Thomas Zimmermann Microsoft Research

Cross-project defect prediction. Thomas Zimmermann Microsoft Research Cross-project defect prediction Thomas Zimmermann Microsoft Research Upcoming Events ICSE 2010: http://www.sbs.co.za/icse2010/ New Ideas and Emerging Results ACM Student Research Competition (SRC) sponsored

More information

Reverse Engineering with Logical Coupling

Reverse Engineering with Logical Coupling Reverse Engineering with Logical Coupling Marco D Ambros, Michele Lanza - Faculty of Informatics - University of Lugano Switzerland 13th Working Conference on Reverse Engineering October 23-27, 2006, Benevento,

More information

Click to edit Master title. style. Software Development. Tesseract: Interactive. Visual Exploration of. Relationships in.

Click to edit Master title. style. Software Development. Tesseract: Interactive. Visual Exploration of. Relationships in. Click to edit Master title Tesseract: Interactive Visual Exploration of style Socio-Technical Relationships in Software Development Anita Sarma, Larry Maccherone, Patrick Wagstrom, and James Herbsleb Institute

More information

Impact of Dependency Graph in Software Testing

Impact of Dependency Graph in Software Testing Impact of Dependency Graph in Software Testing Pardeep Kaur 1, Er. Rupinder Singh 2 1 Computer Science Department, Chandigarh University, Gharuan, Punjab 2 Assistant Professor, Computer Science Department,

More information

Reverse Engineering with Logical Coupling

Reverse Engineering with Logical Coupling Reverse Engineering with Logical Coupling Marco D Ambros, Michele Lanza - Faculty of Informatics -! University of Lugano! Switzerland! 13th Working Conference on Reverse Engineering October 23-27, 2006,

More information

A Study on Inappropriately Partitioned Commits How Much and What Kinds of IP Commits in Java Projects?

A Study on Inappropriately Partitioned Commits How Much and What Kinds of IP Commits in Java Projects? How Much and What Kinds of IP Commits in Java Projects? Ryo Arima r-arima@ist.osaka-u.ac.jp Yoshiki Higo higo@ist.osaka-u.ac.jp Shinji Kusumoto kusumoto@ist.osaka-u.ac.jp ABSTRACT When we use code repositories,

More information

Empirical Software Engineering. Empirical Software Engineering with Examples. Classification. Software Quality. precision = TP/(TP + FP)

Empirical Software Engineering. Empirical Software Engineering with Examples. Classification. Software Quality. precision = TP/(TP + FP) Empirical Software Engineering Empirical Software Engineering with Examples a sub-domain of software engineering focusing on experiments on software systems devise experiments on software, in collecting

More information

Churrasco: Supporting Collaborative Software Evolution Analysis

Churrasco: Supporting Collaborative Software Evolution Analysis Churrasco: Supporting Collaborative Software Evolution Analysis Marco D Ambros a, Michele Lanza a a REVEAL @ Faculty of Informatics - University of Lugano, Switzerland Abstract Analyzing the evolution

More information

Empirical Software Engineering. Empirical Software Engineering with Examples! is not a topic for examination. Classification.

Empirical Software Engineering. Empirical Software Engineering with Examples! is not a topic for examination. Classification. Empirical Software Engineering Empirical Software Engineering with Examples is not a topic for examination a sub-domain of software engineering focusing on experiments on software systems devise experiments

More information

3 Analysing Software Repositories to Understand Software Evolution

3 Analysing Software Repositories to Understand Software Evolution 3 Analysing Software Repositories to Understand Software Evolution Marco D Ambros 1, Harald C. Gall 2, Michele Lanza 1, and Martin Pinzger 2 1 Faculty of Informatics, University of Lugano, Switzerland

More information

3 Prioritization of Code Anomalies

3 Prioritization of Code Anomalies 32 3 Prioritization of Code Anomalies By implementing a mechanism for detecting architecturally relevant code anomalies, we are already able to outline to developers which anomalies should be dealt with

More information

Lecture 3 Guiding Software Development

Lecture 3 Guiding Software Development Lecture 3 Guiding Software Development Where do you go next? Mentoring! Evolutionay Coupling Research Hypothesis Evolutionay Coupling erose Reengineering of Software Evolution Tom Zimmermann Saarland University

More information

Improving Evolvability through Refactoring

Improving Evolvability through Refactoring Improving Evolvability through Refactoring Jacek Ratzinger, Michael Fischer Vienna University of Technology Institute of Information Systems A-1040 Vienna, Austria {ratzinger,fischer}@infosys.tuwien.ac.at

More information

Predicting Bugs. by Analyzing History. Sunghun Kim Research On Program Analysis System Seoul National University

Predicting Bugs. by Analyzing History. Sunghun Kim Research On Program Analysis System Seoul National University Predicting Bugs by Analyzing History Sunghun Kim Research On Program Analysis System Seoul National University Around the World in 80 days Around the World in 8 years Predicting Bugs Severe consequences

More information

Towards a Taxonomy of Approaches for Mining of Source Code Repositories

Towards a Taxonomy of Approaches for Mining of Source Code Repositories Towards a Taxonomy of Approaches for Mining of Source Code Repositories Huzefa Kagdi, Michael L. Collard, Jonathan I. Maletic Department of Computer Science Kent State University Kent Ohio 44242 {hkagdi,

More information

IMPACT OF DEPENDENCY GRAPH IN SOFTWARE TESTING

IMPACT OF DEPENDENCY GRAPH IN SOFTWARE TESTING IMPACT OF DEPENDENCY GRAPH IN SOFTWARE TESTING Pardeep kaur 1 and Er. Rupinder Singh 2 1 Research Scholar, Dept. of Computer Science and Engineering, Chandigarh University, Gharuan, India (Email: Pardeepdharni664@gmail.com)

More information

On the automatic classification of app reviews

On the automatic classification of app reviews Requirements Eng (2016) 21:311 331 DOI 10.1007/s00766-016-0251-9 RE 2015 On the automatic classification of app reviews Walid Maalej 1 Zijad Kurtanović 1 Hadeer Nabil 2 Christoph Stanik 1 Walid: please

More information

Empirical Study on Impact of Developer Collaboration on Source Code

Empirical Study on Impact of Developer Collaboration on Source Code Empirical Study on Impact of Developer Collaboration on Source Code Akshay Chopra, Sahil Puri and Parul Verma 03 April 2018 Outline Introduction Research Questions Methodology Data Characteristics Analysis

More information

Bug Inducing Analysis to Prevent Fault Prone Bug Fixes

Bug Inducing Analysis to Prevent Fault Prone Bug Fixes Bug Inducing Analysis to Prevent Fault Prone Bug Fixes Haoyu Yang, Chen Wang, Qingkai Shi, Yang Feng, Zhenyu Chen State Key Laboratory for ovel Software Technology, anjing University, anjing, China Corresponding

More information

24 th Annual Research Review

24 th Annual Research Review 24 th Annual Research Review April 4-6 2017 Towards Better Understanding of Software Quality Evolution Through Commit-Impact Analysis Pooyan Behnamghader USC CSSE pbehnamg@usc.edu Commit-Impact Analysis

More information

A Design Structure Matrix Approach for Measuring Co-Change-Modularity of Software Products

A Design Structure Matrix Approach for Measuring Co-Change-Modularity of Software Products A Design Structure Matrix Approach for Measuring Co-Change-Modularity of Software Products Robert Benkoczi, Daya Gaur, Shahadat Hossain, Muhammad A. Khan Department of Mathematics and Computer Science,

More information

Software Bugs and Evolution: A Visual Approach to Uncover Their Relationship

Software Bugs and Evolution: A Visual Approach to Uncover Their Relationship Software Bugs and Evolution: A Visual Approach to Uncover Their Relationship Marco D Ambros and Michele Lanza Faculty of Informatics University of Lugano, Switzerland Abstract Versioning systems such as

More information

Measuring fine-grained change in software: towards modification-aware change metrics

Measuring fine-grained change in software: towards modification-aware change metrics Measuring fine-grained change in software: towards modification-aware change metrics Daniel M. German Abram Hindle Software Engineering Group Department fo Computer Science University of Victoria Abstract

More information

SOFTWARE ENGINEERING SOFTWARE EVOLUTION. Saulius Ragaišis.

SOFTWARE ENGINEERING SOFTWARE EVOLUTION. Saulius Ragaišis. SOFTWARE ENGINEERING SOFTWARE EVOLUTION Saulius Ragaišis saulius.ragaisis@mif.vu.lt CSC2008 SE Software Evolution Learning Objectives: Identify the principal issues associated with software evolution and

More information

Theme Identification in RDF Graphs

Theme Identification in RDF Graphs Theme Identification in RDF Graphs Hanane Ouksili PRiSM, Univ. Versailles St Quentin, UMR CNRS 8144, Versailles France hanane.ouksili@prism.uvsq.fr Abstract. An increasing number of RDF datasets is published

More information

Continuous the research Software Engineering. Cycle through Test-Driven Search the teaching

Continuous the research Software Engineering. Cycle through Test-Driven Search the teaching Overview 1 1. Find out why software engineering is important see some software engineering failures 2. Integrating Get acquainted with Reuse into the Rapid, the Chair of Software Engineering Continuous

More information

Are Refactorings Less Error-prone Than Other Changes?

Are Refactorings Less Error-prone Than Other Changes? Are Refactorings Less Error-prone Than Other Changes? Peter Weißgerber University of Trier Computer Science Department 54286 Trier, Germany weissger@uni-trier.de Stephan Diehl University of Trier Computer

More information

Mining Software Repositories for Software Change Impact Analysis: A Case Study

Mining Software Repositories for Software Change Impact Analysis: A Case Study Mining Software Repositories for Software Change Impact Analysis: A Case Study Lile Hattori 1, Gilson dos Santos Jr. 2, Fernando Cardoso 2, Marcus Sampaio 2 1 Faculty of Informatics University of Lugano

More information

Imagine you ve written a piece of code but then accidentally deleted and lost it.

Imagine you ve written a piece of code but then accidentally deleted and lost it. Why Refactor? Imagine you ve written a piece of code but then accidentally deleted and lost it. Questions: How much time would it take you to reconstruct from scratch what you had the same amount, or more,

More information

SNS College of Technology, Coimbatore, India

SNS College of Technology, Coimbatore, India Support Vector Machine: An efficient classifier for Method Level Bug Prediction using Information Gain 1 M.Vaijayanthi and 2 M. Nithya, 1,2 Assistant Professor, Department of Computer Science and Engineering,

More information

IDENTIFICATION OF PROMOTED ECLIPSE UNSTABLE INTERFACES USING CLONE DETECTION TECHNIQUE

IDENTIFICATION OF PROMOTED ECLIPSE UNSTABLE INTERFACES USING CLONE DETECTION TECHNIQUE International Journal of Software Engineering & Applications (IJSEA), Vol.9, No.5, September 2018 IDENTIFICATION OF PROMOTED ECLIPSE UNSTABLE INTERFACES USING CLONE DETECTION TECHNIQUE Simon Kawuma 1 and

More information

Automatic Identification of Important Clones for Refactoring and Tracking

Automatic Identification of Important Clones for Refactoring and Tracking Automatic Identification of Important Clones for Refactoring and Tracking Manishankar Mondal Chanchal K. Roy Kevin A. Schneider Department of Computer Science, University of Saskatchewan, Canada {mshankar.mondal,

More information

Mining Sequences of Changed-files from Version Histories

Mining Sequences of Changed-files from Version Histories Mining Sequences of Changed-files from Version Histories Huzefa Kagdi, Shehnaaz Yusuf, Jonathan I. Maletic Department of Computer Science Kent State University Kent Ohio 44242 {hkagdi, sdawoodi, jmaletic}@cs.kent.edu

More information

Mining Frequent Bug-Fix Code Changes

Mining Frequent Bug-Fix Code Changes Mining Frequent Bug-Fix Code Changes Haidar Osman, Mircea Lungu, Oscar Nierstrasz Software Composition Group University of Bern Bern, Switzerland {osman, lungu, oscar@iam.unibe.ch Abstract Detecting bugs

More information

HOW AND WHEN TO FLATTEN JAVA CLASSES?

HOW AND WHEN TO FLATTEN JAVA CLASSES? HOW AND WHEN TO FLATTEN JAVA CLASSES? Jehad Al Dallal Department of Information Science, P.O. Box 5969, Safat 13060, Kuwait ABSTRACT Improving modularity and reusability are two key objectives in object-oriented

More information

Model-based Mining of Software Repositories. Markus Scheidgen

Model-based Mining of Software Repositories. Markus Scheidgen Model-based Mining of Software Repositories Markus Scheidgen Agenda Mining Software Repositories (MSR) and current approaches srcrepo a model-based MSR system srcrepo components and analysis process a

More information

The goal of this project is to enhance the identification of code duplication which can result in high cost reductions for a minimal price.

The goal of this project is to enhance the identification of code duplication which can result in high cost reductions for a minimal price. Code Duplication New Proposal Dolores Zage, Wayne Zage Ball State University June 1, 2017 July 31, 2018 Long Term Goals The goal of this project is to enhance the identification of code duplication which

More information

Configuration Management

Configuration Management Configuration Management VIMIMA11 Design and integration of embedded systems Budapest University of Technology and Economics Department of Measurement and Information Systems BME-MIT 2017 Configuration

More information

Filtering Bug Reports for Fix-Time Analysis

Filtering Bug Reports for Fix-Time Analysis Filtering Bug Reports for Fix-Time Analysis Ahmed Lamkanfi, Serge Demeyer LORE - Lab On Reengineering University of Antwerp, Belgium Abstract Several studies have experimented with data mining algorithms

More information

Towards Better Understanding of Software Quality Evolution Through Commit Impact Analysis

Towards Better Understanding of Software Quality Evolution Through Commit Impact Analysis Towards Better Understanding of Software Quality Evolution Through Commit Impact Analysis Sponsor: DASD(SE) By Mr. Pooyan Behnamghader 5 th Annual SERC Doctoral Students Forum November 7, 2017 FHI 360

More information

A Case Study on the Similarity Between Source Code and Bug Reports Vocabularies

A Case Study on the Similarity Between Source Code and Bug Reports Vocabularies A Case Study on the Similarity Between Source Code and Bug Reports Vocabularies Diego Cavalcanti 1, Dalton Guerrero 1, Jorge Figueiredo 1 1 Software Practices Laboratory (SPLab) Federal University of Campina

More information

Agenda. Tool-Supported Detection of Code Smells in Software Aspectization. Motivation. Mistakes in Software Aspectization. Advice-related mistakes

Agenda. Tool-Supported Detection of Code Smells in Software Aspectization. Motivation. Mistakes in Software Aspectization. Advice-related mistakes Agenda Tool-Supported Detection of Code Smells in Software Aspectization Péricles Alves and Diogo Santana Recurring Mistakes Code Smells ConcernReCS Tool ConcernReCS Extension 1 Motivation AOP is about

More information

Azure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region

Azure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region Azure DevOps Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region What is DevOps? People. Process. Products. Build & Test Deploy DevOps is the union of people, process, and products to

More information

Understanding When to Adopt a Library: A Case Study on ASF Projects

Understanding When to Adopt a Library: A Case Study on ASF Projects Understanding When to Adopt a Library: A Case Study on ASF Projects Akinori Ihara 1(B), Daiki Fujibayashi 1, Hirohiko Suwa 1, Raula Gaikovina Kula 2, and Kenichi Matsumoto 1 1 Nara Institute of Science

More information

Microservices Smaller is Better? Eberhard Wolff Freelance consultant & trainer

Microservices Smaller is Better? Eberhard Wolff Freelance consultant & trainer Microservices Smaller is Better? Eberhard Wolff Freelance consultant & trainer http://ewolff.com Why Microservices? Why Microservices? Strong modularization Replaceability Small units Sustainable Development

More information

Appendix to The Health of Software Engineering Research

Appendix to The Health of Software Engineering Research Appendix to The Health of Software Engineering Research David Lo School of Information Systems Singapore Management University Singapore davidlo@smu.edu.sg Nachiappan Nagappan and Thomas Zimmermann Research

More information

EmpAnADa Project. Christian Lange. June 4 th, Eindhoven University of Technology, The Netherlands.

EmpAnADa Project. Christian Lange. June 4 th, Eindhoven University of Technology, The Netherlands. EmpAnADa Project C.F.J.Lange@tue.nl June 4 th, 2004 Eindhoven University of Technology, The Netherlands Outline EmpAnADa introduction Part I Completeness and consistency in detail Part II Background UML

More information

Systematic Source Code Transformations. Gustavo SANTOS

Systematic Source Code Transformations. Gustavo SANTOS Systematic Source Code Transformations Gustavo SANTOS Presentation Bachelor in Computer Science Master in Computer Science Short visit to RMoD team Funded by CAPES (Brazil) Science Without Borders program

More information

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Dipak J Kakade, Nilesh P Sable Department of Computer Engineering, JSPM S Imperial College of Engg. And Research,

More information

Visualizing and Characterizing the Evolution of Class Hierarchies

Visualizing and Characterizing the Evolution of Class Hierarchies Visualizing and Characterizing the Evolution of Class Hierarchies Tudor Gîrba and Michele Lanza Software Composition Group University of Berne Switzerland {girba, lanza}@iam.unibe.ch Abstract Analyzing

More information

CoxR: Open Source Development History Search System

CoxR: Open Source Development History Search System CoxR: Open Source Development History Search System Makoto Matsushita, Kei Sasaki and Katsuro Inoue Graduate School of Information Science and Technology, Osaka University 1-3, Machikaneyama-cho, Toyonaka-shi,

More information

Code Analysis Via Version Control History

Code Analysis Via Version Control History Code Analysis Via Version Control History Justin Mclean Class Software Email: justin@classsoftware.com Twitter: @justinmclean Blog: http://blog.classsoftware.com Who am I? Freelance Developer Programming

More information

Visualizing the evolution of software using softchange

Visualizing the evolution of software using softchange Visualizing the evolution of software using softchange Daniel M. German, Abram Hindle and Norman Jordan Software Engineering Group Department of Computer Science University of Victoria dmgerman,abez,njordan

More information

Clustering Analysis based on Data Mining Applications Xuedong Fan

Clustering Analysis based on Data Mining Applications Xuedong Fan Applied Mechanics and Materials Online: 203-02-3 ISSN: 662-7482, Vols. 303-306, pp 026-029 doi:0.4028/www.scientific.net/amm.303-306.026 203 Trans Tech Publications, Switzerland Clustering Analysis based

More information

Bug Triaging: Profile Oriented Developer Recommendation

Bug Triaging: Profile Oriented Developer Recommendation Bug Triaging: Profile Oriented Developer Recommendation Anjali Sandeep Kumar Singh Department of Computer Science and Engineering, Jaypee Institute of Information Technology Abstract Software bugs are

More information

Pliny and Fixr Meeting. September 15, 2014

Pliny and Fixr Meeting. September 15, 2014 Pliny and Fixr Meeting September 15, 2014 Fixr: Mining and Understanding Bug Fixes for App-Framework Protocol Defects (TA2) University of Colorado Boulder September 15, 2014 Fixr: Mining and Understanding

More information

White Paper. Incorporating Usability Experts with Your Software Development Lifecycle: Benefits and ROI Situated Research All Rights Reserved

White Paper. Incorporating Usability Experts with Your Software Development Lifecycle: Benefits and ROI Situated Research All Rights Reserved White Paper Incorporating Usability Experts with Your Software Development Lifecycle: Benefits and ROI 2018 Situated Research All Rights Reserved Learnability, efficiency, safety, effectiveness, memorability

More information

Evolizer A Platform for Software Evolution Analysis and Research

Evolizer A Platform for Software Evolution Analysis and Research Evolizer A Platform for Software Evolution Analysis and Research Michael Würsch, Harald C. Gall University of Zurich Department of Informatics software evolution & architecture lab Friday, April 23, 200

More information

Change-based Software Evolution

Change-based Software Evolution Change-based Software Evolution Romain Robbes and Michele Lanza Faculty of Informatics University of Lugano, Switzerland Abstract Software evolution research is limited by the amount of information available

More information

What Do Large Commits Tell Us? A taxonomical study of large commits

What Do Large Commits Tell Us? A taxonomical study of large commits What Do Large Commits Tell Us? A taxonomical study of large commits Abram Hindle David Cheriton School of Computer Science University of Waterloo Waterloo, Ontario Canada ahindle@cs.uwaterloo.ca Daniel

More information

Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha

Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha 2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha Physics Institute,

More information

Software Maintainability Ontology in Open Source Software. Celia Chen ARR 2018, USC

Software Maintainability Ontology in Open Source Software. Celia Chen ARR 2018, USC Software Maintainability Ontology in Open Source Software Celia Chen qianqiac@usc.edu ARR 2018, USC How do others measure software maintainability? Most popular methods: Automated analysis of the code

More information

Blogging and Project Management Survey: Preliminary Findings

Blogging and Project Management Survey: Preliminary Findings Blogging and Project Management Survey: Preliminary Findings Dennis D. McDonald, Ph.D. Email: ddmcd@yahoo.com Web: http://www.ddmcd.com December 7, 2007 Contents copyright 2007 by Dennis D. McDonald Background

More information

The Power of Unit Testing and it s impact on your business. Ashish Kumar Vice President, Engineering

The Power of Unit Testing and it s impact on your business. Ashish Kumar Vice President, Engineering The Power of Unit Testing and it s impact on your business Ashish Kumar Vice President, Engineering Agitar Software, 2006 1 The Power of Unit Testing Why Unit Test? The Practical Reality Where do we go

More information

Investigation of Metrics for Object-Oriented Design Logical Stability

Investigation of Metrics for Object-Oriented Design Logical Stability Investigation of Metrics for Object-Oriented Design Logical Stability Mahmoud O. Elish Department of Computer Science George Mason University Fairfax, VA 22030-4400, USA melish@gmu.edu Abstract As changes

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

Abstract: Refactorings are used to improve the internal structure of software without changing its external behavior.

Abstract: Refactorings are used to improve the internal structure of software without changing its external behavior. Refactoring: Risks and its Effects on Software Quality Attribute Ramesh Kumar, Dr.Rajesh Verma Research Scholar, Department of Computer Science, Singhania University, Rajasthan. Asst. Professor, Department

More information

Creating a Recommender System. An Elasticsearch & Apache Spark approach

Creating a Recommender System. An Elasticsearch & Apache Spark approach Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused

More information

Who Can Help Me with this Source Code Change?

Who Can Help Me with this Source Code Change? Who Can Help Me with this Source Code Change? Huzefa Kagdi 1, Maen Hammad 2, and Jonathan I. Maletic 2 1 Department of Computer Science Missouri University of Science and Technology Rolla Missouri 65409

More information

Experience-based Refactoring for Goal- oriented Software Quality Improvement

Experience-based Refactoring for Goal- oriented Software Quality Improvement Experience-based Refactoring for Goal- oriented Software Quality Improvement International Workshop on Software Quality (SOQUA 2004) Erfurt,, Germany, September 30, 2004 Fraunhofer IESE Institut Experimentelles

More information

Social Network Mining An Introduction

Social Network Mining An Introduction Social Network Mining An Introduction Jiawei Zhang Assistant Professor Florida State University Big Data A Questionnaire Please raise your hands, if you (1) use Facebook (2) use Instagram (3) use Snapchat

More information

Versioning. Jurriaan Hage homepage: Slides stolen from Eelco Dolstra.

Versioning. Jurriaan Hage   homepage:  Slides stolen from Eelco Dolstra. Versioning Jurriaan Hage e-mail: jur@cs.uu.nl homepage: http://www.cs.uu.nl/people/jur/ Slides stolen from Eelco Dolstra Department of Information and Computing Sciences, Universiteit Utrecht August 24,

More information

MINING SOURCE CODE REPOSITORIES AT MASSIVE SCALE USING LANGUAGE MODELING COMP 5900 X ERIC TORUNSKI DEC 1, 2016

MINING SOURCE CODE REPOSITORIES AT MASSIVE SCALE USING LANGUAGE MODELING COMP 5900 X ERIC TORUNSKI DEC 1, 2016 MINING SOURCE CODE REPOSITORIES AT MASSIVE SCALE USING LANGUAGE MODELING COMP 5900 X ERIC TORUNSKI DEC 1, 2016 OVERVIEW Miltiadis Allamanis, Mining Source Code Repositories at Massive Scale using Language

More information

0-1 Programming Model-Based Method for Planning Code Review using Bug Fix History

0-1 Programming Model-Based Method for Planning Code Review using Bug Fix History 0-1 Programming Model-Based Method for Planning Code Review using Bug Fix History Hirohisa Aman Center for Information Technology Ehime University Matsuyama, Japan 790 8577 Email: aman@ehime-u.ac.jp Abstract

More information

Software Quality Assurance. David Janzen

Software Quality Assurance. David Janzen Software Quality Assurance David Janzen What is quality? Crosby: Conformance to requirements Issues: who establishes requirements? implicit requirements Juran: Fitness for intended use Issues: Who defines

More information

Deliver robust products at reduced cost by linking model-driven software testing to quality management.

Deliver robust products at reduced cost by linking model-driven software testing to quality management. Quality management White paper September 2009 Deliver robust products at reduced cost by linking model-driven software testing to quality management. Page 2 Contents 2 Closing the productivity gap between

More information

Applying Auto-Data Classification Techniques for Large Data Sets

Applying Auto-Data Classification Techniques for Large Data Sets SESSION ID: PDAC-W02 Applying Auto-Data Classification Techniques for Large Data Sets Anchit Arora Program Manager InfoSec, Cisco The proliferation of data and increase in complexity 1995 2006 2014 2020

More information

SHINOBI: A Real-Time Code Clone Detection Tool for Software Maintenance

SHINOBI: A Real-Time Code Clone Detection Tool for Software Maintenance : A Real-Time Code Clone Detection Tool for Software Maintenance Takanobu Yamashina Hidetake Uwano Kyohei Fushida Yasutaka Kamei Masataka Nagura Shinji Kawaguchi Hajimu Iida Nara Institute of Science and

More information

The Strength of Social Coding Collaboration on GitHub

The Strength of Social Coding Collaboration on GitHub The Strength of Social Coding Collaboration on GitHub 2 Who?! Gabriela Brant Alves Graduate Student in Sistemas de Informação at UFMG gabrielabrant@dcc.ufmg.br 3 Team Michele A. Brandão Diogo M. Santana

More information

Project Automation. If it hurts, automate it! Jan Pool NioCAD University of Stellenbosch 19 March 2008

Project Automation. If it hurts, automate it! Jan Pool NioCAD University of Stellenbosch 19 March 2008 Project Automation If it hurts, automate it! Jan Pool NioCAD University of Stellenbosch 19 March 2008 Introduction Purpose: Introduce various aspects of project automation. Why, when, what, and how to

More information

A Comparative Study on Different Version Control System

A Comparative Study on Different Version Control System e-issn 2455 1392 Volume 2 Issue 6, June 2016 pp. 449 455 Scientific Journal Impact Factor : 3.468 http://www.ijcter.com A Comparative Study on Different Version Control System Monika Nehete 1, Sagar Bhomkar

More information

How Does the Shift to GitHub Impact Project Collaboration?

How Does the Shift to GitHub Impact Project Collaboration? 216 IEEE International Conference on Software Maintenance and Evolution How Does the Shift to GitHub Impact Project Collaboration? Luiz Felipe Dias 1 Igor Steinmacher 1 Gustavo Pinto 2 Daniel Alencar da

More information

Optimize tomorrow today.

Optimize tomorrow today. Applying Agile Practices to Improve Software Quality Name: Arlene Minkiewicz Chief Scientist 17000 Commerce Parkway Mt. Laurel, NJ 08054 arlene.minkiewicz@pricesystems.com Phone: 856 608-7222 Agenda Introduction

More information

Commit Guru: Analytics and Risk Prediction of Software Commits

Commit Guru: Analytics and Risk Prediction of Software Commits Commit Guru: Analytics and Risk Prediction of Software Commits Christoffer Rosen, Ben Grawi Department of Software Engineering Rochester Institute of Technology Rochester, NY, USA {cbr4830, bjg1568}@rit.edu

More information

COSC 3351 Software Design. An Introduction to UML (I)

COSC 3351 Software Design. An Introduction to UML (I) COSC 3351 Software Design An Introduction to UML (I) This lecture contains material from: http://wps.prenhall.com/esm_pfleeger_softengtp_2 http://sunset.usc.edu/classes/cs577a_2000/lectures/05/ec-05.ppt

More information

Table Identification and Information extraction in Spreadsheets

Table Identification and Information extraction in Spreadsheets Table Identification and Information extraction in Spreadsheets Elvis Koci 1,2, Maik Thiele 1, Oscar Romero 2, and Wolfgang Lehner 1 1 Technische Universität Dresden, Germany 2 Universitat Politècnica

More information

Using Concept Analysis to Detect Co-Change Patterns

Using Concept Analysis to Detect Co-Change Patterns Using Concept Analysis to Detect Co-Change Patterns In Proceedings of International Workshop on Principles of Software Evolution (IWPSE 2007) Tudor Gîrba Software Composition Group University of Bern Radu

More information

Managing Open Bug Repositories through Bug Report Prioritization Using SVMs

Managing Open Bug Repositories through Bug Report Prioritization Using SVMs Managing Open Bug Repositories through Bug Report Prioritization Using SVMs Jaweria Kanwal Quaid-i-Azam University, Islamabad kjaweria09@yahoo.com Onaiza Maqbool Quaid-i-Azam University, Islamabad onaiza@qau.edu.pk

More information

The Goal of this Document. Where to Start?

The Goal of this Document. Where to Start? A QUICK INTRODUCTION TO THE SEMILAR APPLICATION Mihai Lintean, Rajendra Banjade, and Vasile Rus vrus@memphis.edu linteam@gmail.com rbanjade@memphis.edu The Goal of this Document This document introduce

More information

Patent documents usecases with MyIntelliPatent. Alberto Ciaramella IntelliSemantic 25/11/2012

Patent documents usecases with MyIntelliPatent. Alberto Ciaramella IntelliSemantic 25/11/2012 Patent documents usecases with MyIntelliPatent Alberto Ciaramella IntelliSemantic 25/11/2012 Objectives and contents of this presentation This presentation: identifies and motivates the most significant

More information

LTAT : Software Analytics. Ezequiel Scott

LTAT : Software Analytics. Ezequiel Scott LTAT.05.008: Software Analytics Ezequiel Scott ezequiel.scott@ut.ee Motivation Motivation How to take advantage of the data? How to support decisions? Data analytics Some definitions Analytic Maturity

More information

Evolution of Open Source Software Networks

Evolution of Open Source Software Networks Evolution of Open Source Software Networks Matthew Van Antwerp 1 University of Notre Dame mvanantw@cse.nd.edu Abstract. The work presented in this paper is focused on the Open Source Software (OSS) community

More information

Cse634 DATA MINING TEST REVIEW. Professor Anita Wasilewska Computer Science Department Stony Brook University

Cse634 DATA MINING TEST REVIEW. Professor Anita Wasilewska Computer Science Department Stony Brook University Cse634 DATA MINING TEST REVIEW Professor Anita Wasilewska Computer Science Department Stony Brook University Preprocessing stage Preprocessing: includes all the operations that have to be performed before

More information

Effective Bug Triage and Recommendation System

Effective Bug Triage and Recommendation System Effective Bug Triage and Recommendation System Manisha Bedmutha ¹, Megha Sawant ², Sushmitha Ghan³ Department of Computer Engineering, P.E.S. Modern College of Engineering, Pune University (MH), India.

More information

ADM Commitment to No Deforestation, No Peat and No Exploitation

ADM Commitment to No Deforestation, No Peat and No Exploitation ADM Commitment to No Deforestation, No Peat and No Exploitation Policy Implementation Q3 2016 Soy Progress Report In Partnership with The Forest Trust Introduction ADM committed to Our No Deforestation,

More information