Pliny and Fixr Meeting. September 15, 2014

Size: px
Start display at page:

Download "Pliny and Fixr Meeting. September 15, 2014"

Transcription

1 Pliny and Fixr Meeting September 15, 2014

2 Fixr: Mining and Understanding Bug Fixes for App-Framework Protocol Defects (TA2) University of Colorado Boulder September 15, 2014

3 Fixr: Mining and Understanding Bug Fixes for App-Framework Protocol Defects (TA2) Bor-Yuh Evan Chang Ken Anderson Pavol Cerny Sriram Sankaranarayanan Tom Yeh University of Colorado Boulder University of Colorado Boulder September 15, 2014

4 A bug that manifests spectacularly

5 A bug that manifests spectacularly

6 A bug that manifests spectacularly

7 A bug that manifests spectacularly Crash

8 A bug that manifests spectacularly Crash caused by an app-created memory leak

9 Framework Dev s say...

10 Framework Dev s say...

11 Framework Dev s say... Do not keep long-lived references to a context-activity

12 Framework Dev s say... Do not keep long-lived references to a context-activity I don t know how I created a long-lived reference to an Activity!

13 Framework Dev s say... Do not keep long-lived references to a context-activity I don t know how I created a long-lived reference to an Activity! Often: A misunderstanding of a library causes the library to keep the Activity reference.

14 Framework Dev s say... Do not keep long-lived references to a context-activity Bug from violating I don t know how I created a long-lived reference to an Activity! Often: A misunderstanding of a library causes the library to keep (implicit) framework protocol rules the Activity reference.

15 Imagining a post-muse scenario... I don t know how I created a long-lived reference to an Activity! for xxxxxxxxxxxx

16 Elsewhere, following the state of practice for debugging leaks...

17 Elsewhere, following the state of practice for debugging leaks Run the app

18 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage

19 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage 3. Dump the heap. Dig around and finally find the culprit!

20 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage 3. Dump the heap. Dig around and finally find the culprit!

21 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage 3. Dump the heap. Dig around and finally find the culprit! 4. Commit a bugfix

22 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage 3. Dump the heap. Dig around and finally find the culprit! 4. Commit a bugfix 5. Bugfix is picked up by Fixr Fixr

23 A Fixr-enabled IDE responds... I don t know how I created a long-lived reference to an Activity!

24 A Fixr-enabled IDE responds... I don t know how I created a long-lived reference to an Activity!

25 A Fixr-enabled IDE responds... It looks like you ve created a memory leak like and 100,000 others. Would you like to apply? I don t know how I created a long-lived reference to an Activity!

26 A Fixr-enabled IDE responds... It looks like you ve created a memory leak like and 100,000 others. Would you like to apply? the bugfix is transferred I don t know how I created a long-lived reference to an Activity!

27 Summary: Mine specifications of framework rules (indirectly) from bugfixes Leverage volume and variety of bugfixes made by the crowd of client app developers

28 Summary: Mine specifications of framework rules (indirectly) from bugfixes Leverage volume and variety of bugfixes made by the crowd of client app developers

29 Summary: Mine specifications of framework rules (indirectly) from bugfixes Leverage volume and variety of bugfixes made by the crowd of client app developers toolify stackoverflow

30 Fixr: Proposed System MUSE

31 Fixr: Proposed System fix MUSE

32 Fixr: Proposed System E.g., Two successive versions of source code fix MUSE

33 Fixr: Proposed System fix MUSE

34 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications fix MUSE

35 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications semantic delta fix MUSE

36 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications E.g., Diff in relevant flow-insensitive summary semantic delta fix MUSE

37 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications semantic delta fix MUSE

38 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification semantic delta fix MUSE

39 Fixr: Proposed System E.g., framework Deltar: Inferring Semantic Deltas and Repair Specifications semantic delta invariant + app pre for bug + fix repair specification fix MUSE

40 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification semantic delta fix MUSE

41 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE

42 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification

43 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications fix semantic delta MUSE probabilistic repair specification E.g., generalized repair spec with confidence measure

44 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification

45 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification Patchr: Detecting Potential Bugs and Synthesizing Patches

46 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification patch Patchr: Detecting Potential Bugs and Synthesizing Patches

47 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification patch E.g., bug evidence and patch Patchr: Detecting Potential Bugs and Synthesizing Patches

48 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification patch Patchr: Detecting Potential Bugs and Synthesizing Patches

49 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification Harvestr: Social Validation and Mining of Fixes patch Patchr: Detecting Potential Bugs and Synthesizing Patches

50 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification social delta Harvestr: Social Validation and Mining of Fixes patch Patchr: Detecting Potential Bugs and Synthesizing Patches

51 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification E.g., bug fix confirmation social delta Harvestr: Social Validation and Mining of Fixes patch Patchr: Detecting Potential Bugs and Synthesizing Patches

52 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction E.g., bug fix confirmation Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches

53 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches

54 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches

55 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code

56 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code

57 Fixr: Proposed System semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code

58 Fixr: Proposed System semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Goal: Create a positive feedback loop commit Patches to Code derive high-quality repair specifications

59 semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code

60 symbolic program analysis Bor-Yuh Evan Chang Team Sriram Sankaranarayanan numerical-probabilistic program analysis semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications software engineering for big data fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes Ken Anderson social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code user-centered big data analytics program synthesis Tom Yeh Pavol Cerny

61 Evaluation Questions

62 Evaluation Questions Iterative and incremental design and evaluation of the Fixr loop

63 Evaluation Questions Iterative and incremental design and evaluation of the Fixr loop Effectiveness of Bugfix Transfer: Given an isolated bugfix, can we derive high-quality repair specifications to lead to useful patches?

64 Evaluation Questions Iterative and incremental design and evaluation of the Fixr loop Effectiveness of Bugfix Transfer: Given an isolated bugfix, can we derive high-quality repair specifications to lead to useful patches? Effectiveness of Bugfix Seeding: Can we isolate likely bugfixes from source repositories?

65 Bor-Yuh Evan Chang Sriram Sankaranarayanan semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes Ken Anderson social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code Tom Yeh Pavol Cerny

66 Deltar Deltar: Inferring Semantic Deltas and Repair Specifications repair specification semantic delta fix

67 Bug: On Android <4 aview.settag(..., anobject)

68 Bug: On Android <4 aview.settag(..., anobject) if anobject can reach aview

69 Bug: On Android <4 aview.settag(..., anobject) if anobject can reach aview Goal: Produce the above ( bug pre ) with framework invariant and fix

70 Subtask 1.1: Summarizing App Commits

71 Subtask 1.1: Summarizing App Commits Diff coarse semantic summaries E.g., points-to graphs abstracted to appframework crossings

72 Subtask 1.1: Summarizing App Commits Diff coarse semantic summaries E.g., points-to graphs abstracted to appframework crossings Refine diffs

73 Subtask 1.2: Approximating Framework Properties

74 Subtask 1.2: Approximating Framework Properties Need 1: Scalable verification of framework invariants (on the bugfixed version) Fissile Types: intertwined invariant-based and operational-based reasoning ( almost everywhere dependent-refinement types)

75 Subtask 1.2: Approximating Framework Properties Need 1: Scalable verification of framework invariants (on the bugfixed version) Fissile Types: intertwined invariant-based and operational-based reasoning ( almost everywhere dependent-refinement types) Need 2: Ming framework invariants Refine semantic diff to a framework-only specification

76 Urepair repair specification Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification

77 Traditional Program Analysis Program (Fragment) Program Analyzer Annotations

78 Traditional Program Analysis Program (Fragment) Program Analyzer Annotations Program Database

79 Connection with Repair Specifications Repair Spec #1 Repair Spec #2 Probabilistic Repair Synthesis Repair Spec #N Goal: Synthesize multiple repair specifications into a likely candidates.

80 Bayesian Reasoning Prior Hypothesis Bayesian Update Posterior Hypothesis Observational Data

81 Bayesian Program Analysis class GadgetStore{ List<Gadget> lst; void addtolist( Gadget x ){ // Can you guess what the function does? pre (x!= null); } } post( len(x_0) <= len(x_p) <= len(x_0) +1 );

82 Approach There are lots of soft information sources that program analysis tools can use. However, we do not use them Fear of unsoundness. Integrate multiple sources into prior annotations. Allow weights for prior facts to signify degree of belief.

83 Prior Beliefs class GadgetStore{ List<Gadget> lst; void addtolist( Gadget x ){ // Can you guess what the function does? pre (x!= null); Prior Weight: 0.9 } } post( len(x_0) <= len(x_p) <= len(x_0) +1 );

84 Program Analysis: Bayesian Updating Program analysis is operates over distributions of static analysis facts. Assume a level of belief in the correctness of a function. Pr(Assertion FunctionCorrect) = Pr(FunctionCorrect Assertion) Pr(Assertion) Pr(FunctionCorrect)

85 Current Progress Ongoing investigations into probabilistic program analysis. Sankaranarayanan et al. PLDI 2013 Chakarov et al. CAV 2013, SAS Goals: Support Bayesian interpretation. Design program analysis tools for Bayesian Update. Integrate into repair specification synthesis.

86 Patchr probabilistic repair specification patch Patchr: Detecting Potential Bugs and Synthesizing Patches

87 Patchr How do we validate repair specifications? We synthesize appropriate patches [Patchr]. and validate the patches using human input (mturk, pull requests) [part of Harvestr] Motivation for developers to provide feedback: 1. Patches easier to understand than repair specs 2. Patches, if correct, immediately useful to developers

88 Subtask 3.1: Applying and concretizing repair specifications Subtask 3.2: Synthesis using MUSE queries

89 Subtask 3.1: Applying and concretizing repair specifications 1. Finding where to apply a repair spec 2. Finding how to apply it (concretizing repair specs): Example: suppose repair spec says always call methods A and B before C suppose the program-under-repair already calls A patch: call B after A and before C

90 Subtask 3.2: Synthesis using MUSE queries While producing patches, we need to understand the program-under-repair Develop techniques for synthesis with rich queries: pre- and post-conditions of relevant methods invariants Studied for finite-state systems as synthesis from libraries components

91 Team: Pavol Cerny Vaibhav Singh (MS) Currently recruiting PhD students and postdocs

92 Harvestr fix interaction Harvestr: Social Validation and Mining of Fixes social delta patch

93 Harvestr Runtime Input: A possible bug fix. Output: Yes! It is a bug fix. (or not) Training: Input: Lots and lots of apps and repos Output: Social proof

94 Social Validation Scenario 1: Developer releases an app. Users leave comments about a bug. Developer releases an update containing a bug fix. Users no longer leave comments about the bug.

95 Social Validation Scenario 2: Developer releases an app. After 1 day Developer releases an update.

96 Social Validation Scenario 3: Developer releases an app. After 30 days. Developer releases an update.

97 Social Validation Scenario 4: Developer A releases an update. After 1 day Developer B releases an update. Two updates are similar.

98 Big data problem Volume Variety Velocity Veracity

99 Volume Number of apps (1 millions) x Number of versions (10) x Number of source files (100) Current: 200K apps, 2-7 versions/app.

100 Velocity New app (10/day) New update (100/day) New review (1000/day) New repo (1/day) New commit (1/repo/day)

101 Variety user ratings download counts version increments update intervals app s description user reviews ( many more)

102 Veracity Biases Low-level vs. high-level Uncertainties Precision vs. recall Abnormality Data entry, Outliers

103 Fixr Big Code Architecture MUSE commit Code

104 Scaling Fixr Up Scalability requirements to make Fixr operational Target: Collect at least 50% of public Android apps on Google Play and Github > 500K apps => ~100M files Handle updates to revision histories 10 updates on Google Play; 50 commits on Github Velocity: 10 new apps, 100 updates, 1000 user reviews per day; 1 new repo + 1 new commit per day

105 Data Profile Data is Text: Source code, user reviews, bug reports Binary: git repos, application assets, Meta: ratings, download counts, version numbers More importantly, Data is read-only highly interconnected

106 Proposed Architecture MUSE Front End Application Layer DataStax Enterprise Pig Hadoop Solr Redis Service Layer Cassandra Storage Layer

107 Discussion (I) Cassandra used to store all read-only data very fast at streaming results of key-based queries Solr used to index the important metadata and to perform complex queries Pig/Hadoop used to process applications in batch Redis used to cache results of frequent queries

108 Discussion (II) The artifacts generated by Deltar, Urepair, Patchr, and Harvestr also stored in Cassandra and tagged by type Allows Hadoop jobs to find them and process them at scale Not clear, currently, whether we will incorporate a scalable graph technology such as Titan, MapD, or Neo4j May be okay initially to handle interconnections between artifacts via row-key references

109 Scaling App/Repo Collection Have scripts to download Android apps and their associated artifacts from Google Play and Github Need to scale the capabilities of these scripts by one order of magnitude Need to automate the collection to happen on daily basis Will explore the use of Apache Nutch for this purpose Enables massively parallel web crawling Each crawling thread can be engineered to store retrieved assets in Cassandra

110 Computing Infrastructure Currently have access to a cluster of 7 machines 4 dedicated Cassandra nodes 1 analytics machine with 128 GB of memory 1 compute server used to host VMs 1 storage server used to perform backups Acquired 5 new servers last week Goal is to combine all 12 machines into a single OpenStack cluster and then provision servers as needed

MUSE Site Visit: Planned Agenda

MUSE Site Visit: Planned Agenda MUSE Site Visit: Planned Agenda Welcome DARPA and AFRL Visitors! MUSE Meeting in DLC 170 1:00pm-1:10pm Welcome 1:10pm-1:45pm Overview of the Fixr Project, Evan Chang 1:45pm-3:00pm Demo and Discussion:

More information

Mining and Understanding Software Enclaves (MUSE)

Mining and Understanding Software Enclaves (MUSE) Mining and Understanding Software Enclaves (MUSE) Suresh Jagannathan Information Innovation Office DARPA http://www.darpa.mil/our_work/i2o/programs/mining_and_understanding_software_enclaves_(muse).aspx

More information

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver Using the SDACK Architecture to Build a Big Data Product Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver Outline A Threat Analytic Big Data product The SDACK Architecture Akka Streams and data

More information

Programming Languages Research at the University of Colorado, Boulder

Programming Languages Research at the University of Colorado, Boulder Amer Diwan Jeremy Siek Bor-Yuh Evan Chang Sriram Sankaranarayanan Programming Languages Research at the University of Colorado, Boulder PL research at CU has breadth! How do we effectively express computation?

More information

Building Microservices with the 12 Factor App Pattern

Building Microservices with the 12 Factor App Pattern Building Microservices with the 12 Factor App Pattern Context This documentation will help introduce Developers to implementing MICROSERVICES by applying the TWELVE- FACTOR PRINCIPLES, a set of best practices

More information

BIG DATA TECHNOLOGIES: WHAT EVERY MANAGER NEEDS TO KNOW ANALYTICS AND FINANCIAL INNOVATION CONFERENCE JUNE 26-29,

BIG DATA TECHNOLOGIES: WHAT EVERY MANAGER NEEDS TO KNOW ANALYTICS AND FINANCIAL INNOVATION CONFERENCE JUNE 26-29, BIG DATA TECHNOLOGIES: WHAT EVERY MANAGER NEEDS TO KNOW ANALYTICS AND FINANCIAL INNOVATION CONFERENCE JUNE 26-29, 2016 1 OBJECTIVES ANALYTICS AND FINANCIAL INNOVATION CONFERENCE JUNE 26-29, 2016 2 WHAT

More information

Azure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region

Azure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region Azure DevOps Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region What is DevOps? People. Process. Products. Build & Test Deploy DevOps is the union of people, process, and products to

More information

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples Hadoop Introduction 1 Topics Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples 2 Big Data Analytics What is Big Data?

More information

Differential program verification

Differential program verification Differential program verification Shuvendu K. Lahiri Research in Software Engineering (RiSE), Microsoft Research Redmond, WA Involved in building static assertion checkers HAVOC [POPL 06, 08, 09, CAV 09,

More information

Azure Day Application Development. Randy Pagels Sr. Developer Technology Specialist US DX Developer Tools - Central Region

Azure Day Application Development. Randy Pagels Sr. Developer Technology Specialist US DX Developer Tools - Central Region Azure Day Application Development Randy Pagels Sr. Developer Technology Specialist US DX Developer Tools - Central Region Azure App Service.NET, Java, Node.js, PHP, Python Auto patching Auto scale Integration

More information

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype? Big data hype? Big Data: Hype or Hallelujah? Data Base and Data Mining Group of 2 Google Flu trends On the Internet February 2010 detected flu outbreak two weeks ahead of CDC data Nowcasting http://www.internetlivestats.com/

More information

Ian Choy. Technology Solutions Professional

Ian Choy. Technology Solutions Professional Ian Choy Technology Solutions Professional XML KPIs SQL Server 2000 Management Studio Mirroring SQL Server 2005 Compression Policy-Based Mgmt Programmability SQL Server 2008 PowerPivot SharePoint Integration

More information

Automatic Scaling Iterative Computations. Aug. 7 th, 2012

Automatic Scaling Iterative Computations. Aug. 7 th, 2012 Automatic Scaling Iterative Computations Guozhang Wang Cornell University Aug. 7 th, 2012 1 What are Non-Iterative Computations? Non-iterative computation flow Directed Acyclic Examples Batch style analytics

More information

At a high level, the current OPNFV CI pipeline can be summarized as follows:

At a high level, the current OPNFV CI pipeline can be summarized as follows: Cross Community Continuous Integration (XCI) Empowers Innovation by Increasing Collaboration Between and Upstream Communities With XCI, regularly integrates the latest from each supported branch of select

More information

Appendix to The Health of Software Engineering Research

Appendix to The Health of Software Engineering Research Appendix to The Health of Software Engineering Research David Lo School of Information Systems Singapore Management University Singapore davidlo@smu.edu.sg Nachiappan Nagappan and Thomas Zimmermann Research

More information

Calling Context Abstraction with Shapes

Calling Context Abstraction with Shapes Calling Context Abstraction with Shapes Xavier Rival INRIA/ENS Paris Bor-Yuh Evan Chang 張博聿 U of Colorado, Boulder National Taiwan University December 17, 2010 Work to Appear in POPL 2011 Programming Languages

More information

Data Science and Open Source Software. Iraklis Varlamis Assistant Professor Harokopio University of Athens

Data Science and Open Source Software. Iraklis Varlamis Assistant Professor Harokopio University of Athens Data Science and Open Source Software Iraklis Varlamis Assistant Professor Harokopio University of Athens varlamis@hua.gr What is data science? 2 Why data science is important? More data (volume, variety,...)

More information

Microsoft Exam

Microsoft Exam Volume: 42 Questions Case Study: 1 Relecloud General Overview Relecloud is a social media company that processes hundreds of millions of social media posts per day and sells advertisements to several hundred

More information

CS 553: Algorithmic Language Compilers (PLDI) Graduate Students and Super Undergraduates... Logistics. Plan for Today

CS 553: Algorithmic Language Compilers (PLDI) Graduate Students and Super Undergraduates... Logistics. Plan for Today Graduate Students and Super Undergraduates... CS 553: Algorithmic Language Compilers (PLDI) look for other sources of information make decisions, because all research problems are under-specified evaluate

More information

Cloud-Native Applications. Copyright 2017 Pivotal Software, Inc. All rights Reserved. Version 1.0

Cloud-Native Applications. Copyright 2017 Pivotal Software, Inc. All rights Reserved. Version 1.0 Cloud-Native Applications Copyright 2017 Pivotal Software, Inc. All rights Reserved. Version 1.0 Cloud-Native Characteristics Lean Form a hypothesis, build just enough to validate or disprove it. Learn

More information

Big Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018

Big Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018 Big Data com Hadoop Impala, Hive e Spark VIII Sessão - SQL Bahia 03/03/2018 Diógenes Pires Connect with PASS Sign up for a free membership today at: pass.org #sqlpass Internet Live http://www.internetlivestats.com/

More information

Laboratorio di Programmazione. Prof. Marco Bertini

Laboratorio di Programmazione. Prof. Marco Bertini Laboratorio di Programmazione Prof. Marco Bertini marco.bertini@unifi.it http://www.micc.unifi.it/bertini/ Code versioning: techniques and tools Software versions All software has multiple versions: Each

More information

A Gentle Introduction to Program Analysis

A Gentle Introduction to Program Analysis A Gentle Introduction to Program Analysis Işıl Dillig University of Texas, Austin January 21, 2014 Programming Languages Mentoring Workshop 1 / 24 What is Program Analysis? Very broad topic, but generally

More information

CS 520 Theory and Practice of Software Engineering Fall 2018

CS 520 Theory and Practice of Software Engineering Fall 2018 CS 520 Theory and Practice of Software Engineering Fall 2018 Nediyana Daskalova Monday, 4PM CS 151 Debugging October 30, 2018 Personalized Behavior-Powered Systems for Guiding Self-Experiments Help me

More information

Interactive Debugging for Big Data Analytics

Interactive Debugging for Big Data Analytics Interactive Debugging for Big Data Analytics Muhammad Ali Gulzar, Xueyuan Han, Matteo Interlandi, Shaghayegh Mardani, Sai Deep Tetali, Tyson Condie, Todd Millstein, Miryung Kim University of California,

More information

Application Management Webinar. Daniela Field

Application Management Webinar. Daniela Field Application Management Webinar Daniela Field Agenda } Agile Deployment } Project vs Node Security } Deployment } Cloud Administration } Monitoring } Logging } Alerting Cloud Overview Cloud Overview Project

More information

DATA SCIENCE USING SPARK: AN INTRODUCTION

DATA SCIENCE USING SPARK: AN INTRODUCTION DATA SCIENCE USING SPARK: AN INTRODUCTION TOPICS COVERED Introduction to Spark Getting Started with Spark Programming in Spark Data Science with Spark What next? 2 DATA SCIENCE PROCESS Exploratory Data

More information

Working in Teams CS 520 Theory and Practice of Software Engineering Fall 2018

Working in Teams CS 520 Theory and Practice of Software Engineering Fall 2018 Working in Teams CS 520 Theory and Practice of Software Engineering Fall 2018 Version Control September 18, 2018 Thursday (September 20) First in-class exercise On using git (today is a prelude with useful

More information

Steps for project success. git status. Milestones. Deliverables. Homework 1 submitted Homework 2 will be posted October 26.

Steps for project success. git status. Milestones. Deliverables. Homework 1 submitted Homework 2 will be posted October 26. git status Steps for project success Homework 1 submitted Homework 2 will be posted October 26 due November 16, 9AM Projects underway project status check-in meetings November 9 System-building project

More information

Dynamic Graph Query Support for SDN Management. Ramya Raghavendra IBM TJ Watson Research Center

Dynamic Graph Query Support for SDN Management. Ramya Raghavendra IBM TJ Watson Research Center Dynamic Graph Query Support for SDN Management Ramya Raghavendra IBM TJ Watson Research Center Roadmap SDN scenario 1: Cloud provisioning Management/Analytics primitives Current Cloud Offerings Limited

More information

BigDataBench-MT: Multi-tenancy version of BigDataBench

BigDataBench-MT: Multi-tenancy version of BigDataBench BigDataBench-MT: Multi-tenancy version of BigDataBench Gang Lu Beijing Academy of Frontier Science and Technology BigDataBench Tutorial, ASPLOS 2016 Atlanta, GA, USA n Software perspective Multi-tenancy

More information

Using Internet as a Data Source for Official Statistics: a Comparative Analysis of Web Scraping Technologies

Using Internet as a Data Source for Official Statistics: a Comparative Analysis of Web Scraping Technologies Using Internet as a Data Source for Official Statistics: a Comparative Analysis of Web Scraping Technologies Giulio Barcaroli 1 (barcarol@istat.it), Monica Scannapieco 1 (scannapi@istat.it), Donato Summa

More information

Spark 2. Alexey Zinovyev, Java/BigData Trainer in EPAM

Spark 2. Alexey Zinovyev, Java/BigData Trainer in EPAM Spark 2 Alexey Zinovyev, Java/BigData Trainer in EPAM With IT since 2007 With Java since 2009 With Hadoop since 2012 With EPAM since 2015 About Secret Word from EPAM itsubbotnik Big Data Training 3 Contacts

More information

@joerg_schad Nightmares of a Container Orchestration System

@joerg_schad Nightmares of a Container Orchestration System @joerg_schad Nightmares of a Container Orchestration System 2017 Mesosphere, Inc. All Rights Reserved. 1 Jörg Schad Distributed Systems Engineer @joerg_schad Jan Repnak Support Engineer/ Solution Architect

More information

GIT TUTORIAL. Creative Software Architectures for Collaborative Projects CS 130 Donald J. Patterson

GIT TUTORIAL. Creative Software Architectures for Collaborative Projects CS 130 Donald J. Patterson GIT TUTORIAL Creative Software Architectures for Collaborative Projects CS 130 Donald J. Patterson SCM SOFTWARE CONFIGURATION MANAGEMENT SOURCE CODE MANAGEMENT Generic term for the ability to manage multiple

More information

Tools for Scalable Data Mining

Tools for Scalable Data Mining Tools for Scalable Data Mining XANDA SCHOFIELD CS 6410 11/13/2014 1. Astrolabe Large, eventuallyconsistent distributed system ROBERT VAN RENESSE, KEN BIRMAN, WERNER VOGELS [Source: Wikipedia] The Problem

More information

Why Quality Depends on Big Data

Why Quality Depends on Big Data Why Quality Depends on Big Data Korea Test Conference Michael Schuldenfrei, CTO Who are Optimal+? 2 Company Overview Optimal+ provides Manufacturing Intelligence software that delivers realtime, big data

More information

Slice Intelligence!

Slice Intelligence! Intern @ Slice Intelligence! Wei1an(Wu( September(8,(2014( Outline!! Details about the job!! Skills required and learned!! My thoughts regarding the internship! About the company!! Slice, which we call

More information

Beyond 1001 Dedicated Data Service Instances

Beyond 1001 Dedicated Data Service Instances Beyond 1001 Dedicated Data Service Instances Introduction The Challenge Given: Application platform based on Cloud Foundry to serve thousands of apps Application Runtime Many platform users - who don

More information

Science-as-a-Service

Science-as-a-Service Science-as-a-Service The iplant Foundation Rion Dooley Edwin Skidmore Dan Stanzione Steve Terry Matthew Vaughn Outline Why, why, why! When duct tape isn t enough Building an API for the web Core services

More information

Next-Generation Cloud Platform

Next-Generation Cloud Platform Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology

More information

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case

More information

Distributed CI: Scaling Jenkins on Mesos and Marathon. Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA

Distributed CI: Scaling Jenkins on Mesos and Marathon. Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA Distributed CI: Scaling Jenkins on Mesos and Marathon Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA About Me Roger Ignazio QE Automation Engineer Puppet Labs, Inc. @rogerignazio Mesos In Action

More information

Firefox Crash Reporting.

Firefox Crash Reporting. Firefox Crash Reporting laura@ mozilla.com @lxt Webtools @ Mozilla Crash reporting Localization Performance measurement Code search and static analysis Other stuff: product delivery and updates, plugins

More information

Version Control. CSC207 Fall 2014

Version Control. CSC207 Fall 2014 Version Control CSC207 Fall 2014 Problem 1: Working Solo How do you keep track of changes to your program? Option 1: Don t bother Hope you get it right the first time Hope you can remember what changes

More information

Apache Solr A Practical Approach To Enterprise Search

Apache Solr A Practical Approach To Enterprise Search We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with apache solr a practical

More information

The Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou

The Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou The Hadoop Ecosystem EECS 4415 Big Data Systems Tilemachos Pechlivanoglou tipech@eecs.yorku.ca A lot of tools designed to work with Hadoop 2 HDFS, MapReduce Hadoop Distributed File System Core Hadoop component

More information

Creating a World Class Build System

Creating a World Class Build System Creating a World Class Build System Scott Estes Release Management and Ops Citrix Systems, Inc. 10 June 2011 AGENDA Getting Started Planning and Design Implementation and Execution Optimizing the System

More information

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info START DATE : TIMINGS : DURATION : TYPE OF BATCH : FEE : FACULTY NAME : LAB TIMINGS : PH NO: 9963799240, 040-40025423

More information

Parallel Programming Principle and Practice. Lecture 10 Big Data Processing with MapReduce

Parallel Programming Principle and Practice. Lecture 10 Big Data Processing with MapReduce Parallel Programming Principle and Practice Lecture 10 Big Data Processing with MapReduce Outline MapReduce Programming Model MapReduce Examples Hadoop 2 Incredible Things That Happen Every Minute On The

More information

Apache HAWQ (incubating)

Apache HAWQ (incubating) HADOOP NATIVE SQL What is HAWQ? Apache HAWQ (incubating) Is an elastic parallel processing SQL engine that runs native in Apache Hadoop to directly access data for advanced analytics. Why HAWQ? Hadoop

More information

Lambda Architecture for Batch and Stream Processing. October 2018

Lambda Architecture for Batch and Stream Processing. October 2018 Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.

More information

Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS

Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS ContainerCon @ Open Source Summit North America 2017 Elizabeth K. Joseph @pleia2 1 Elizabeth K. Joseph, Developer Advocate

More information

Faculté Polytechnique

Faculté Polytechnique Faculté Polytechnique INFORMATIQUE PARALLÈLE ET DISTRIBUÉE CHAPTER 7 : CLOUD COMPUTING Sidi Ahmed Mahmoudi sidi.mahmoudi@umons.ac.be 13 December 2017 PLAN Introduction I. History of Cloud Computing and

More information

Open Source Search. Andreas Pesenhofer. max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria

Open Source Search. Andreas Pesenhofer. max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria Open Source Search Andreas Pesenhofer max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria max.recall information systems max.recall is a software and consulting company enabling

More information

Wearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life

Wearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life Wearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life Ch.Srilakshmi Asst Professor,Department of Information Technology R.M.D Engineering College, Kavaraipettai,

More information

The Seven Steps to Implement DataOps

The Seven Steps to Implement DataOps The Seven Steps to Implement Ops ABSTRACT analytics teams challenged by inflexibility and poor quality have found that Ops can address these and many other obstacles. Ops includes tools and process improvements

More information

Embedded Technosolutions

Embedded Technosolutions Hadoop Big Data An Important technology in IT Sector Hadoop - Big Data Oerie 90% of the worlds data was generated in the last few years. Due to the advent of new technologies, devices, and communication

More information

Getting started with GitHub

Getting started with GitHub Getting started with GitHub A beginner s guide. (There s no code in this slide deck!) Presented by Quinn Supplee https://github.com/quinns What is GitHub? GitHub is a code hosting platform for version

More information

70-532: Developing Microsoft Azure Solutions

70-532: Developing Microsoft Azure Solutions 70-532: Developing Microsoft Azure Solutions Exam Design Target Audience Candidates of this exam are experienced in designing, programming, implementing, automating, and monitoring Microsoft Azure solutions.

More information

Databases 2 (VU) ( / )

Databases 2 (VU) ( / ) Databases 2 (VU) (706.711 / 707.030) MapReduce (Part 3) Mark Kröll ISDS, TU Graz Nov. 27, 2017 Mark Kröll (ISDS, TU Graz) MapReduce Nov. 27, 2017 1 / 42 Outline 1 Problems Suited for Map-Reduce 2 MapReduce:

More information

Rubix Documentation. Release Qubole

Rubix Documentation. Release Qubole Rubix Documentation Release 0.2.12 Qubole Jul 02, 2018 Contents: 1 RubiX 3 1.1 Usecase.................................................. 3 1.2 Supported Engines and Cloud Stores..................................

More information

GNU Radio Technical Update

GNU Radio Technical Update GNU Radio Technical Update Johnathan Corgan GRCON17 GRCON17 GNU GNU Radio Radio Technical Technical Update Update September September 2017 2017 Topics Release 3.8 Features and capabilities Milestones and

More information

5/24/ MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992

5/24/ MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992 2014-05-20 MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992 @SoQooL http://blog.mssqlserver.se Mattias.Lind@Sogeti.se 1 The evolution of the Microsoft data platform

More information

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018 Cloud Computing 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning

More information

Vishesh Oberoi Seth Reid Technical Evangelist, Microsoft Software Developer, Intergen

Vishesh Oberoi Seth Reid Technical Evangelist, Microsoft Software Developer, Intergen Vishesh Oberoi Technical Evangelist, Microsoft VishO@microsoft.com @ovishesh Seth Reid Software Developer, Intergen contact@sethreid.co.nz @sethreidnz Vishesh Oberoi Technical Evangelist, Microsoft VishO@microsoft.com

More information

Multi-tenancy version of BigDataBench

Multi-tenancy version of BigDataBench Multi-tenancy version of BigDataBench Gang Lu Institute of Computing Technology, Chinese Academy of Sciences BigDataBench Tutorial MICRO 2014 Cambridge, UK INSTITUTE OF COMPUTING TECHNOLOGY 1 Multi-tenancy

More information

Moving Fast with High Reliability: Program Analysis at Uber

Moving Fast with High Reliability: Program Analysis at Uber Moving Fast with High Reliability: Program Analysis at Uber Manu Sridharan Software Reliability Workshop ETH Zurich OCTOBER 14, 2017 Uber Apps Rider Driver Eats ios and Android Uber Apps Rider Driver Eats

More information

DevOps Anti-Patterns. Have the Ops team deal with it. Time to fire the Ops team! Let s hire a DevOps unit! COPYRIGHT 2019 MANICODE SECURITY

DevOps Anti-Patterns. Have the Ops team deal with it. Time to fire the Ops team! Let s hire a DevOps unit! COPYRIGHT 2019 MANICODE SECURITY DevOps Anti-Patterns Have the Ops team deal with it. Time to fire the Ops team! Let s hire a DevOps unit! 31 Anti-Pattern: Throw it Over the Wall Development Operations 32 Anti-Pattern: DevOps Team Silo

More information

Using DC/OS for Continuous Delivery

Using DC/OS for Continuous Delivery Using DC/OS for Continuous Delivery DevPulseCon 2017 Elizabeth K. Joseph, @pleia2 Mesosphere 1 Elizabeth K. Joseph, Developer Advocate, Mesosphere 15+ years working in open source communities 10+ years

More information

FROM VSTS TO AZURE DEVOPS

FROM VSTS TO AZURE DEVOPS #DOH18 FROM VSTS TO AZURE DEVOPS People. Process. Products. Gaetano Paternò @tanopaterno info@gaetanopaterno.it 2 VSTS #DOH18 3 Azure DevOps Azure Boards (ex Work) Deliver value to your users faster using

More information

Software Engineering 2 A practical course in software engineering. Ekkart Kindler

Software Engineering 2 A practical course in software engineering. Ekkart Kindler Software Engineering 2 A practical course in software engineering V. Working Together Working together Management Process Models Version Management Systems Collaborative Development Environments 3 Parts

More information

Why Git and GitHub? Ming Chow Senior Lecturer, Tufts University Department of Computer Science SIGCSE 2018

Why Git and GitHub? Ming Chow Senior Lecturer, Tufts University Department of Computer Science SIGCSE 2018 Why Git and GitHub? Ming Chow Senior Lecturer, Tufts University Department of Computer Science mchow@cs.tufts.edu Twitter: @0xmchow SIGCSE 2018 Why Git? Keep track of changes and work Documenting changes

More information

Webinar Series TMIP VISION

Webinar Series TMIP VISION Webinar Series TMIP VISION TMIP provides technical support and promotes knowledge and information exchange in the transportation planning and modeling community. Today s Goals To Consider: Parallel Processing

More information

Tour Trak Project Plan

Tour Trak Project Plan Tour Trak Project Plan The Cycologists Project Plan 1. Project Overview 1.1 Scope Statement This project is a continuation of TourTrak, developed by previous senior project teams Centri Pedal, Dev Cycle,

More information

Prototyping Data Intensive Apps: TrendingTopics.org

Prototyping Data Intensive Apps: TrendingTopics.org Prototyping Data Intensive Apps: TrendingTopics.org Pete Skomoroch Research Scientist at LinkedIn Consultant at Data Wrangling @peteskomoroch 09/29/09 1 Talk Outline TrendingTopics Overview Wikipedia Page

More information

X.Org and XFree86 Version Numbering Schemes

X.Org and XFree86 Version Numbering Schemes The XFree86 Project, Inc Updated for X11R7.5 by Keith Packard and Kevin E. Martin 22 May 2006 X.Org has adopted the same basic numbering scheme used by the XFree86 Project, Inc. for their releases. The

More information

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)? What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)? What is Amazon Machine Image (AMI)? Amazon Elastic Compute Cloud (EC2)?

More information

Your Complete Guide to Backup and Recovery for MongoDB

Your Complete Guide to Backup and Recovery for MongoDB Your Complete Guide to Backup and Recovery for MongoDB EBOOK Your Complete Guide to Backup and Recovery for MongoDB Table of Contents Part I: Backup and Recovery for MongoDB Part II: Customer Case Study

More information

A Software Architecture for Progressive Scanning of On-line Communities

A Software Architecture for Progressive Scanning of On-line Communities A Software Architecture for Progressive Scanning of On-line Communities Roberto Baldoni, Fabrizio d Amore, Massimo Mecella, Daniele Ucci Sapienza Università di Roma, Italy Motivations On-line communities

More information

How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects?

How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects? How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects? Saraj Singh Manes School of Computer Science Carleton University Ottawa, Canada sarajmanes@cmail.carleton.ca Olga

More information

Fusion Registry 9 SDMX Data and Metadata Management System

Fusion Registry 9 SDMX Data and Metadata Management System Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance

More information

Software Security: Vulnerability Analysis

Software Security: Vulnerability Analysis Computer Security Course. Software Security: Vulnerability Analysis Program Verification Program Verification How to prove a program free of buffer overflows? Precondition Postcondition Loop invariants

More information

Real-time Fraud Detection with Innovative Big Graph Feature. Gaurav Deshpande, VP Marketing, TigerGraph; Mingxi Wu, VP Engineering, TigerGraph

Real-time Fraud Detection with Innovative Big Graph Feature. Gaurav Deshpande, VP Marketing, TigerGraph; Mingxi Wu, VP Engineering, TigerGraph Real-time Fraud Detection with Innovative Big Graph Feature Gaurav Deshpande, VP Marketing, TigerGraph; Mingxi Wu, VP Engineering, TigerGraph Speaking Today Gaurav Deshpande VP Marketing, TigerGraph gaurav@tigergraph.com

More information

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools SAP Technical Brief Data Warehousing SAP HANA Data Warehousing Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools A data warehouse for the modern age Data warehouses have been

More information

NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre

NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre NoSQL systems: introduction and data models Riccardo Torlone Università Roma Tre Leveraging the NoSQL boom 2 Why NoSQL? In the last fourty years relational databases have been the default choice for serious

More information

Distributed computing: index building and use

Distributed computing: index building and use Distributed computing: index building and use Distributed computing Goals Distributing computation across several machines to Do one computation faster - latency Do more computations in given time - throughput

More information

Automated Netezza Migration to Big Data Open Source

Automated Netezza Migration to Big Data Open Source Automated Netezza Migration to Big Data Open Source CASE STUDY Client Overview Our client is one of the largest cable companies in the world*, offering a wide range of services including basic cable, digital

More information

A Review Paper on Big data & Hadoop

A Review Paper on Big data & Hadoop A Review Paper on Big data & Hadoop Rupali Jagadale MCA Department, Modern College of Engg. Modern College of Engginering Pune,India rupalijagadale02@gmail.com Pratibha Adkar MCA Department, Modern College

More information

CESSDA Expert Seminar 13 & 14 September 2016 Prague, Czech Republic

CESSDA Expert Seminar 13 & 14 September 2016 Prague, Czech Republic CESSDA Expert Seminar 13 & 14 September 2016 Prague, Czech Republic - basics Matthäus Zloch GESIS Outline for this session Git introduction and some theory Git command basics (plus some little advanced)

More information

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES 1 THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES Vincent Garonne, Mario Lassnig, Martin Barisits, Thomas Beermann, Ralph Vigne, Cedric Serfon Vincent.Garonne@cern.ch ph-adp-ddm-lab@cern.ch XLDB

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

Static Analysis of C++ Projects with CodeSonar

Static Analysis of C++ Projects with CodeSonar Static Analysis of C++ Projects with CodeSonar John Plaice, Senior Scientist, GrammaTech jplaice@grammatech.com 25 July 2017, Meetup C++ de Montréal Abstract Static program analysis consists of the analysis

More information

Bringing Data to Life

Bringing Data to Life Bringing Data to Life Data management and Visualization Techniques Benika Hall Rob Harrison Corporate Model Risk March 16, 2018 Introduction Benika Hall Analytic Consultant Wells Fargo - Corporate Model

More information

CPSC 491. Lecture 19 & 20: Source Code Version Control. VCS = Version Control Software SCM = Source Code Management

CPSC 491. Lecture 19 & 20: Source Code Version Control. VCS = Version Control Software SCM = Source Code Management CPSC 491 Lecture 19 & 20: Source Code Version Control VCS = Version Control Software SCM = Source Code Management Exercise: Source Code (Version) Control 1. Pretend like you don t have a version control

More information

Confidence Intervals. Dennis Sun Data 301

Confidence Intervals. Dennis Sun Data 301 Dennis Sun Data 301 Statistical Inference probability Population / Box Sample / Data statistics The goal of statistics is to infer the unknown population from the sample. We ve already seen one mode of

More information

Composer Best Practices Nils Private Packagist

Composer Best Practices Nils Private Packagist Composer Best Practices 2018 Private Packagist https://packagist.com 2018? Delete your lock files 2018? Delete your lock files Composer Ecosystem Reality Update 2018 Best Practices? Deployment Improving

More information

Flash Storage Complementing a Data Lake for Real-Time Insight

Flash Storage Complementing a Data Lake for Real-Time Insight Flash Storage Complementing a Data Lake for Real-Time Insight Dr. Sanhita Sarkar Global Director, Analytics Software Development August 7, 2018 Agenda 1 2 3 4 5 Delivering insight along the entire spectrum

More information

Submitting your Work using GIT

Submitting your Work using GIT Submitting your Work using GIT You will be using the git distributed source control system in order to manage and submit your assignments. Why? allows you to take snapshots of your project at safe points

More information

Department of Information Technology, St. Joseph s College (Autonomous), Trichy, TamilNadu, India

Department of Information Technology, St. Joseph s College (Autonomous), Trichy, TamilNadu, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 5 ISSN : 2456-3307 A Survey on Big Data and Hadoop Ecosystem Components

More information