Pliny and Fixr Meeting. September 15, 2014
|
|
- Nigel Ford
- 5 years ago
- Views:
Transcription
1 Pliny and Fixr Meeting September 15, 2014
2 Fixr: Mining and Understanding Bug Fixes for App-Framework Protocol Defects (TA2) University of Colorado Boulder September 15, 2014
3 Fixr: Mining and Understanding Bug Fixes for App-Framework Protocol Defects (TA2) Bor-Yuh Evan Chang Ken Anderson Pavol Cerny Sriram Sankaranarayanan Tom Yeh University of Colorado Boulder University of Colorado Boulder September 15, 2014
4 A bug that manifests spectacularly
5 A bug that manifests spectacularly
6 A bug that manifests spectacularly
7 A bug that manifests spectacularly Crash
8 A bug that manifests spectacularly Crash caused by an app-created memory leak
9 Framework Dev s say...
10 Framework Dev s say...
11 Framework Dev s say... Do not keep long-lived references to a context-activity
12 Framework Dev s say... Do not keep long-lived references to a context-activity I don t know how I created a long-lived reference to an Activity!
13 Framework Dev s say... Do not keep long-lived references to a context-activity I don t know how I created a long-lived reference to an Activity! Often: A misunderstanding of a library causes the library to keep the Activity reference.
14 Framework Dev s say... Do not keep long-lived references to a context-activity Bug from violating I don t know how I created a long-lived reference to an Activity! Often: A misunderstanding of a library causes the library to keep (implicit) framework protocol rules the Activity reference.
15 Imagining a post-muse scenario... I don t know how I created a long-lived reference to an Activity! for xxxxxxxxxxxx
16 Elsewhere, following the state of practice for debugging leaks...
17 Elsewhere, following the state of practice for debugging leaks Run the app
18 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage
19 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage 3. Dump the heap. Dig around and finally find the culprit!
20 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage 3. Dump the heap. Dig around and finally find the culprit!
21 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage 3. Dump the heap. Dig around and finally find the culprit! 4. Commit a bugfix
22 Elsewhere, following the state of practice for debugging leaks Run the app 2. Watch the heap usage 3. Dump the heap. Dig around and finally find the culprit! 4. Commit a bugfix 5. Bugfix is picked up by Fixr Fixr
23 A Fixr-enabled IDE responds... I don t know how I created a long-lived reference to an Activity!
24 A Fixr-enabled IDE responds... I don t know how I created a long-lived reference to an Activity!
25 A Fixr-enabled IDE responds... It looks like you ve created a memory leak like and 100,000 others. Would you like to apply? I don t know how I created a long-lived reference to an Activity!
26 A Fixr-enabled IDE responds... It looks like you ve created a memory leak like and 100,000 others. Would you like to apply? the bugfix is transferred I don t know how I created a long-lived reference to an Activity!
27 Summary: Mine specifications of framework rules (indirectly) from bugfixes Leverage volume and variety of bugfixes made by the crowd of client app developers
28 Summary: Mine specifications of framework rules (indirectly) from bugfixes Leverage volume and variety of bugfixes made by the crowd of client app developers
29 Summary: Mine specifications of framework rules (indirectly) from bugfixes Leverage volume and variety of bugfixes made by the crowd of client app developers toolify stackoverflow
30 Fixr: Proposed System MUSE
31 Fixr: Proposed System fix MUSE
32 Fixr: Proposed System E.g., Two successive versions of source code fix MUSE
33 Fixr: Proposed System fix MUSE
34 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications fix MUSE
35 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications semantic delta fix MUSE
36 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications E.g., Diff in relevant flow-insensitive summary semantic delta fix MUSE
37 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications semantic delta fix MUSE
38 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification semantic delta fix MUSE
39 Fixr: Proposed System E.g., framework Deltar: Inferring Semantic Deltas and Repair Specifications semantic delta invariant + app pre for bug + fix repair specification fix MUSE
40 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification semantic delta fix MUSE
41 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE
42 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification
43 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications fix semantic delta MUSE probabilistic repair specification E.g., generalized repair spec with confidence measure
44 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification
45 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification Patchr: Detecting Potential Bugs and Synthesizing Patches
46 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification patch Patchr: Detecting Potential Bugs and Synthesizing Patches
47 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification patch E.g., bug evidence and patch Patchr: Detecting Potential Bugs and Synthesizing Patches
48 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification patch Patchr: Detecting Potential Bugs and Synthesizing Patches
49 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification Harvestr: Social Validation and Mining of Fixes patch Patchr: Detecting Potential Bugs and Synthesizing Patches
50 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification social delta Harvestr: Social Validation and Mining of Fixes patch Patchr: Detecting Potential Bugs and Synthesizing Patches
51 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification E.g., bug fix confirmation social delta Harvestr: Social Validation and Mining of Fixes patch Patchr: Detecting Potential Bugs and Synthesizing Patches
52 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction E.g., bug fix confirmation Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches
53 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches
54 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches
55 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code
56 Fixr: Proposed System Deltar: Inferring Semantic Deltas and Repair Specifications repair specification Urepair: Deriving Probabilistic Repair Specifications semantic delta fix MUSE probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code
57 Fixr: Proposed System semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code
58 Fixr: Proposed System semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Goal: Create a positive feedback loop commit Patches to Code derive high-quality repair specifications
59 semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code
60 symbolic program analysis Bor-Yuh Evan Chang Team Sriram Sankaranarayanan numerical-probabilistic program analysis semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications software engineering for big data fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes Ken Anderson social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code user-centered big data analytics program synthesis Tom Yeh Pavol Cerny
61 Evaluation Questions
62 Evaluation Questions Iterative and incremental design and evaluation of the Fixr loop
63 Evaluation Questions Iterative and incremental design and evaluation of the Fixr loop Effectiveness of Bugfix Transfer: Given an isolated bugfix, can we derive high-quality repair specifications to lead to useful patches?
64 Evaluation Questions Iterative and incremental design and evaluation of the Fixr loop Effectiveness of Bugfix Transfer: Given an isolated bugfix, can we derive high-quality repair specifications to lead to useful patches? Effectiveness of Bugfix Seeding: Can we isolate likely bugfixes from source repositories?
65 Bor-Yuh Evan Chang Sriram Sankaranarayanan semantic statistical-semantic syntactic social Deltar: Inferring Semantic Deltas and Repair Specifications fix semantic delta repair specification MUSE Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification interaction commit Harvestr: Social Validation and Mining of Fixes Ken Anderson social delta patch Patchr: Detecting Potential Bugs and Synthesizing Patches Code Tom Yeh Pavol Cerny
66 Deltar Deltar: Inferring Semantic Deltas and Repair Specifications repair specification semantic delta fix
67 Bug: On Android <4 aview.settag(..., anobject)
68 Bug: On Android <4 aview.settag(..., anobject) if anobject can reach aview
69 Bug: On Android <4 aview.settag(..., anobject) if anobject can reach aview Goal: Produce the above ( bug pre ) with framework invariant and fix
70 Subtask 1.1: Summarizing App Commits
71 Subtask 1.1: Summarizing App Commits Diff coarse semantic summaries E.g., points-to graphs abstracted to appframework crossings
72 Subtask 1.1: Summarizing App Commits Diff coarse semantic summaries E.g., points-to graphs abstracted to appframework crossings Refine diffs
73 Subtask 1.2: Approximating Framework Properties
74 Subtask 1.2: Approximating Framework Properties Need 1: Scalable verification of framework invariants (on the bugfixed version) Fissile Types: intertwined invariant-based and operational-based reasoning ( almost everywhere dependent-refinement types)
75 Subtask 1.2: Approximating Framework Properties Need 1: Scalable verification of framework invariants (on the bugfixed version) Fissile Types: intertwined invariant-based and operational-based reasoning ( almost everywhere dependent-refinement types) Need 2: Ming framework invariants Refine semantic diff to a framework-only specification
76 Urepair repair specification Urepair: Deriving Probabilistic Repair Specifications probabilistic repair specification
77 Traditional Program Analysis Program (Fragment) Program Analyzer Annotations
78 Traditional Program Analysis Program (Fragment) Program Analyzer Annotations Program Database
79 Connection with Repair Specifications Repair Spec #1 Repair Spec #2 Probabilistic Repair Synthesis Repair Spec #N Goal: Synthesize multiple repair specifications into a likely candidates.
80 Bayesian Reasoning Prior Hypothesis Bayesian Update Posterior Hypothesis Observational Data
81 Bayesian Program Analysis class GadgetStore{ List<Gadget> lst; void addtolist( Gadget x ){ // Can you guess what the function does? pre (x!= null); } } post( len(x_0) <= len(x_p) <= len(x_0) +1 );
82 Approach There are lots of soft information sources that program analysis tools can use. However, we do not use them Fear of unsoundness. Integrate multiple sources into prior annotations. Allow weights for prior facts to signify degree of belief.
83 Prior Beliefs class GadgetStore{ List<Gadget> lst; void addtolist( Gadget x ){ // Can you guess what the function does? pre (x!= null); Prior Weight: 0.9 } } post( len(x_0) <= len(x_p) <= len(x_0) +1 );
84 Program Analysis: Bayesian Updating Program analysis is operates over distributions of static analysis facts. Assume a level of belief in the correctness of a function. Pr(Assertion FunctionCorrect) = Pr(FunctionCorrect Assertion) Pr(Assertion) Pr(FunctionCorrect)
85 Current Progress Ongoing investigations into probabilistic program analysis. Sankaranarayanan et al. PLDI 2013 Chakarov et al. CAV 2013, SAS Goals: Support Bayesian interpretation. Design program analysis tools for Bayesian Update. Integrate into repair specification synthesis.
86 Patchr probabilistic repair specification patch Patchr: Detecting Potential Bugs and Synthesizing Patches
87 Patchr How do we validate repair specifications? We synthesize appropriate patches [Patchr]. and validate the patches using human input (mturk, pull requests) [part of Harvestr] Motivation for developers to provide feedback: 1. Patches easier to understand than repair specs 2. Patches, if correct, immediately useful to developers
88 Subtask 3.1: Applying and concretizing repair specifications Subtask 3.2: Synthesis using MUSE queries
89 Subtask 3.1: Applying and concretizing repair specifications 1. Finding where to apply a repair spec 2. Finding how to apply it (concretizing repair specs): Example: suppose repair spec says always call methods A and B before C suppose the program-under-repair already calls A patch: call B after A and before C
90 Subtask 3.2: Synthesis using MUSE queries While producing patches, we need to understand the program-under-repair Develop techniques for synthesis with rich queries: pre- and post-conditions of relevant methods invariants Studied for finite-state systems as synthesis from libraries components
91 Team: Pavol Cerny Vaibhav Singh (MS) Currently recruiting PhD students and postdocs
92 Harvestr fix interaction Harvestr: Social Validation and Mining of Fixes social delta patch
93 Harvestr Runtime Input: A possible bug fix. Output: Yes! It is a bug fix. (or not) Training: Input: Lots and lots of apps and repos Output: Social proof
94 Social Validation Scenario 1: Developer releases an app. Users leave comments about a bug. Developer releases an update containing a bug fix. Users no longer leave comments about the bug.
95 Social Validation Scenario 2: Developer releases an app. After 1 day Developer releases an update.
96 Social Validation Scenario 3: Developer releases an app. After 30 days. Developer releases an update.
97 Social Validation Scenario 4: Developer A releases an update. After 1 day Developer B releases an update. Two updates are similar.
98 Big data problem Volume Variety Velocity Veracity
99 Volume Number of apps (1 millions) x Number of versions (10) x Number of source files (100) Current: 200K apps, 2-7 versions/app.
100 Velocity New app (10/day) New update (100/day) New review (1000/day) New repo (1/day) New commit (1/repo/day)
101 Variety user ratings download counts version increments update intervals app s description user reviews ( many more)
102 Veracity Biases Low-level vs. high-level Uncertainties Precision vs. recall Abnormality Data entry, Outliers
103 Fixr Big Code Architecture MUSE commit Code
104 Scaling Fixr Up Scalability requirements to make Fixr operational Target: Collect at least 50% of public Android apps on Google Play and Github > 500K apps => ~100M files Handle updates to revision histories 10 updates on Google Play; 50 commits on Github Velocity: 10 new apps, 100 updates, 1000 user reviews per day; 1 new repo + 1 new commit per day
105 Data Profile Data is Text: Source code, user reviews, bug reports Binary: git repos, application assets, Meta: ratings, download counts, version numbers More importantly, Data is read-only highly interconnected
106 Proposed Architecture MUSE Front End Application Layer DataStax Enterprise Pig Hadoop Solr Redis Service Layer Cassandra Storage Layer
107 Discussion (I) Cassandra used to store all read-only data very fast at streaming results of key-based queries Solr used to index the important metadata and to perform complex queries Pig/Hadoop used to process applications in batch Redis used to cache results of frequent queries
108 Discussion (II) The artifacts generated by Deltar, Urepair, Patchr, and Harvestr also stored in Cassandra and tagged by type Allows Hadoop jobs to find them and process them at scale Not clear, currently, whether we will incorporate a scalable graph technology such as Titan, MapD, or Neo4j May be okay initially to handle interconnections between artifacts via row-key references
109 Scaling App/Repo Collection Have scripts to download Android apps and their associated artifacts from Google Play and Github Need to scale the capabilities of these scripts by one order of magnitude Need to automate the collection to happen on daily basis Will explore the use of Apache Nutch for this purpose Enables massively parallel web crawling Each crawling thread can be engineered to store retrieved assets in Cassandra
110 Computing Infrastructure Currently have access to a cluster of 7 machines 4 dedicated Cassandra nodes 1 analytics machine with 128 GB of memory 1 compute server used to host VMs 1 storage server used to perform backups Acquired 5 new servers last week Goal is to combine all 12 machines into a single OpenStack cluster and then provision servers as needed
MUSE Site Visit: Planned Agenda
MUSE Site Visit: Planned Agenda Welcome DARPA and AFRL Visitors! MUSE Meeting in DLC 170 1:00pm-1:10pm Welcome 1:10pm-1:45pm Overview of the Fixr Project, Evan Chang 1:45pm-3:00pm Demo and Discussion:
More informationMining and Understanding Software Enclaves (MUSE)
Mining and Understanding Software Enclaves (MUSE) Suresh Jagannathan Information Innovation Office DARPA http://www.darpa.mil/our_work/i2o/programs/mining_and_understanding_software_enclaves_(muse).aspx
More informationUsing the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver
Using the SDACK Architecture to Build a Big Data Product Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver Outline A Threat Analytic Big Data product The SDACK Architecture Akka Streams and data
More informationProgramming Languages Research at the University of Colorado, Boulder
Amer Diwan Jeremy Siek Bor-Yuh Evan Chang Sriram Sankaranarayanan Programming Languages Research at the University of Colorado, Boulder PL research at CU has breadth! How do we effectively express computation?
More informationBuilding Microservices with the 12 Factor App Pattern
Building Microservices with the 12 Factor App Pattern Context This documentation will help introduce Developers to implementing MICROSERVICES by applying the TWELVE- FACTOR PRINCIPLES, a set of best practices
More informationBIG DATA TECHNOLOGIES: WHAT EVERY MANAGER NEEDS TO KNOW ANALYTICS AND FINANCIAL INNOVATION CONFERENCE JUNE 26-29,
BIG DATA TECHNOLOGIES: WHAT EVERY MANAGER NEEDS TO KNOW ANALYTICS AND FINANCIAL INNOVATION CONFERENCE JUNE 26-29, 2016 1 OBJECTIVES ANALYTICS AND FINANCIAL INNOVATION CONFERENCE JUNE 26-29, 2016 2 WHAT
More informationAzure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region
Azure DevOps Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region What is DevOps? People. Process. Products. Build & Test Deploy DevOps is the union of people, process, and products to
More informationTopics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples
Hadoop Introduction 1 Topics Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples 2 Big Data Analytics What is Big Data?
More informationDifferential program verification
Differential program verification Shuvendu K. Lahiri Research in Software Engineering (RiSE), Microsoft Research Redmond, WA Involved in building static assertion checkers HAVOC [POPL 06, 08, 09, CAV 09,
More informationAzure Day Application Development. Randy Pagels Sr. Developer Technology Specialist US DX Developer Tools - Central Region
Azure Day Application Development Randy Pagels Sr. Developer Technology Specialist US DX Developer Tools - Central Region Azure App Service.NET, Java, Node.js, PHP, Python Auto patching Auto scale Integration
More informationNowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?
Big data hype? Big Data: Hype or Hallelujah? Data Base and Data Mining Group of 2 Google Flu trends On the Internet February 2010 detected flu outbreak two weeks ahead of CDC data Nowcasting http://www.internetlivestats.com/
More informationIan Choy. Technology Solutions Professional
Ian Choy Technology Solutions Professional XML KPIs SQL Server 2000 Management Studio Mirroring SQL Server 2005 Compression Policy-Based Mgmt Programmability SQL Server 2008 PowerPivot SharePoint Integration
More informationAutomatic Scaling Iterative Computations. Aug. 7 th, 2012
Automatic Scaling Iterative Computations Guozhang Wang Cornell University Aug. 7 th, 2012 1 What are Non-Iterative Computations? Non-iterative computation flow Directed Acyclic Examples Batch style analytics
More informationAt a high level, the current OPNFV CI pipeline can be summarized as follows:
Cross Community Continuous Integration (XCI) Empowers Innovation by Increasing Collaboration Between and Upstream Communities With XCI, regularly integrates the latest from each supported branch of select
More informationAppendix to The Health of Software Engineering Research
Appendix to The Health of Software Engineering Research David Lo School of Information Systems Singapore Management University Singapore davidlo@smu.edu.sg Nachiappan Nagappan and Thomas Zimmermann Research
More informationCalling Context Abstraction with Shapes
Calling Context Abstraction with Shapes Xavier Rival INRIA/ENS Paris Bor-Yuh Evan Chang 張博聿 U of Colorado, Boulder National Taiwan University December 17, 2010 Work to Appear in POPL 2011 Programming Languages
More informationData Science and Open Source Software. Iraklis Varlamis Assistant Professor Harokopio University of Athens
Data Science and Open Source Software Iraklis Varlamis Assistant Professor Harokopio University of Athens varlamis@hua.gr What is data science? 2 Why data science is important? More data (volume, variety,...)
More informationMicrosoft Exam
Volume: 42 Questions Case Study: 1 Relecloud General Overview Relecloud is a social media company that processes hundreds of millions of social media posts per day and sells advertisements to several hundred
More informationCS 553: Algorithmic Language Compilers (PLDI) Graduate Students and Super Undergraduates... Logistics. Plan for Today
Graduate Students and Super Undergraduates... CS 553: Algorithmic Language Compilers (PLDI) look for other sources of information make decisions, because all research problems are under-specified evaluate
More informationCloud-Native Applications. Copyright 2017 Pivotal Software, Inc. All rights Reserved. Version 1.0
Cloud-Native Applications Copyright 2017 Pivotal Software, Inc. All rights Reserved. Version 1.0 Cloud-Native Characteristics Lean Form a hypothesis, build just enough to validate or disprove it. Learn
More informationBig Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018
Big Data com Hadoop Impala, Hive e Spark VIII Sessão - SQL Bahia 03/03/2018 Diógenes Pires Connect with PASS Sign up for a free membership today at: pass.org #sqlpass Internet Live http://www.internetlivestats.com/
More informationLaboratorio di Programmazione. Prof. Marco Bertini
Laboratorio di Programmazione Prof. Marco Bertini marco.bertini@unifi.it http://www.micc.unifi.it/bertini/ Code versioning: techniques and tools Software versions All software has multiple versions: Each
More informationA Gentle Introduction to Program Analysis
A Gentle Introduction to Program Analysis Işıl Dillig University of Texas, Austin January 21, 2014 Programming Languages Mentoring Workshop 1 / 24 What is Program Analysis? Very broad topic, but generally
More informationCS 520 Theory and Practice of Software Engineering Fall 2018
CS 520 Theory and Practice of Software Engineering Fall 2018 Nediyana Daskalova Monday, 4PM CS 151 Debugging October 30, 2018 Personalized Behavior-Powered Systems for Guiding Self-Experiments Help me
More informationInteractive Debugging for Big Data Analytics
Interactive Debugging for Big Data Analytics Muhammad Ali Gulzar, Xueyuan Han, Matteo Interlandi, Shaghayegh Mardani, Sai Deep Tetali, Tyson Condie, Todd Millstein, Miryung Kim University of California,
More informationApplication Management Webinar. Daniela Field
Application Management Webinar Daniela Field Agenda } Agile Deployment } Project vs Node Security } Deployment } Cloud Administration } Monitoring } Logging } Alerting Cloud Overview Cloud Overview Project
More informationDATA SCIENCE USING SPARK: AN INTRODUCTION
DATA SCIENCE USING SPARK: AN INTRODUCTION TOPICS COVERED Introduction to Spark Getting Started with Spark Programming in Spark Data Science with Spark What next? 2 DATA SCIENCE PROCESS Exploratory Data
More informationWorking in Teams CS 520 Theory and Practice of Software Engineering Fall 2018
Working in Teams CS 520 Theory and Practice of Software Engineering Fall 2018 Version Control September 18, 2018 Thursday (September 20) First in-class exercise On using git (today is a prelude with useful
More informationSteps for project success. git status. Milestones. Deliverables. Homework 1 submitted Homework 2 will be posted October 26.
git status Steps for project success Homework 1 submitted Homework 2 will be posted October 26 due November 16, 9AM Projects underway project status check-in meetings November 9 System-building project
More informationDynamic Graph Query Support for SDN Management. Ramya Raghavendra IBM TJ Watson Research Center
Dynamic Graph Query Support for SDN Management Ramya Raghavendra IBM TJ Watson Research Center Roadmap SDN scenario 1: Cloud provisioning Management/Analytics primitives Current Cloud Offerings Limited
More informationBigDataBench-MT: Multi-tenancy version of BigDataBench
BigDataBench-MT: Multi-tenancy version of BigDataBench Gang Lu Beijing Academy of Frontier Science and Technology BigDataBench Tutorial, ASPLOS 2016 Atlanta, GA, USA n Software perspective Multi-tenancy
More informationUsing Internet as a Data Source for Official Statistics: a Comparative Analysis of Web Scraping Technologies
Using Internet as a Data Source for Official Statistics: a Comparative Analysis of Web Scraping Technologies Giulio Barcaroli 1 (barcarol@istat.it), Monica Scannapieco 1 (scannapi@istat.it), Donato Summa
More informationSpark 2. Alexey Zinovyev, Java/BigData Trainer in EPAM
Spark 2 Alexey Zinovyev, Java/BigData Trainer in EPAM With IT since 2007 With Java since 2009 With Hadoop since 2012 With EPAM since 2015 About Secret Word from EPAM itsubbotnik Big Data Training 3 Contacts
More information@joerg_schad Nightmares of a Container Orchestration System
@joerg_schad Nightmares of a Container Orchestration System 2017 Mesosphere, Inc. All Rights Reserved. 1 Jörg Schad Distributed Systems Engineer @joerg_schad Jan Repnak Support Engineer/ Solution Architect
More informationGIT TUTORIAL. Creative Software Architectures for Collaborative Projects CS 130 Donald J. Patterson
GIT TUTORIAL Creative Software Architectures for Collaborative Projects CS 130 Donald J. Patterson SCM SOFTWARE CONFIGURATION MANAGEMENT SOURCE CODE MANAGEMENT Generic term for the ability to manage multiple
More informationTools for Scalable Data Mining
Tools for Scalable Data Mining XANDA SCHOFIELD CS 6410 11/13/2014 1. Astrolabe Large, eventuallyconsistent distributed system ROBERT VAN RENESSE, KEN BIRMAN, WERNER VOGELS [Source: Wikipedia] The Problem
More informationWhy Quality Depends on Big Data
Why Quality Depends on Big Data Korea Test Conference Michael Schuldenfrei, CTO Who are Optimal+? 2 Company Overview Optimal+ provides Manufacturing Intelligence software that delivers realtime, big data
More informationSlice Intelligence!
Intern @ Slice Intelligence! Wei1an(Wu( September(8,(2014( Outline!! Details about the job!! Skills required and learned!! My thoughts regarding the internship! About the company!! Slice, which we call
More informationBeyond 1001 Dedicated Data Service Instances
Beyond 1001 Dedicated Data Service Instances Introduction The Challenge Given: Application platform based on Cloud Foundry to serve thousands of apps Application Runtime Many platform users - who don
More informationScience-as-a-Service
Science-as-a-Service The iplant Foundation Rion Dooley Edwin Skidmore Dan Stanzione Steve Terry Matthew Vaughn Outline Why, why, why! When duct tape isn t enough Building an API for the web Core services
More informationNext-Generation Cloud Platform
Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology
More informationBig Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara
Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case
More informationDistributed CI: Scaling Jenkins on Mesos and Marathon. Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA
Distributed CI: Scaling Jenkins on Mesos and Marathon Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA About Me Roger Ignazio QE Automation Engineer Puppet Labs, Inc. @rogerignazio Mesos In Action
More informationFirefox Crash Reporting.
Firefox Crash Reporting laura@ mozilla.com @lxt Webtools @ Mozilla Crash reporting Localization Performance measurement Code search and static analysis Other stuff: product delivery and updates, plugins
More informationVersion Control. CSC207 Fall 2014
Version Control CSC207 Fall 2014 Problem 1: Working Solo How do you keep track of changes to your program? Option 1: Don t bother Hope you get it right the first time Hope you can remember what changes
More informationApache Solr A Practical Approach To Enterprise Search
We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with apache solr a practical
More informationThe Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou
The Hadoop Ecosystem EECS 4415 Big Data Systems Tilemachos Pechlivanoglou tipech@eecs.yorku.ca A lot of tools designed to work with Hadoop 2 HDFS, MapReduce Hadoop Distributed File System Core Hadoop component
More informationCreating a World Class Build System
Creating a World Class Build System Scott Estes Release Management and Ops Citrix Systems, Inc. 10 June 2011 AGENDA Getting Started Planning and Design Implementation and Execution Optimizing the System
More informationWe are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info
We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info START DATE : TIMINGS : DURATION : TYPE OF BATCH : FEE : FACULTY NAME : LAB TIMINGS : PH NO: 9963799240, 040-40025423
More informationParallel Programming Principle and Practice. Lecture 10 Big Data Processing with MapReduce
Parallel Programming Principle and Practice Lecture 10 Big Data Processing with MapReduce Outline MapReduce Programming Model MapReduce Examples Hadoop 2 Incredible Things That Happen Every Minute On The
More informationApache HAWQ (incubating)
HADOOP NATIVE SQL What is HAWQ? Apache HAWQ (incubating) Is an elastic parallel processing SQL engine that runs native in Apache Hadoop to directly access data for advanced analytics. Why HAWQ? Hadoop
More informationLambda Architecture for Batch and Stream Processing. October 2018
Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.
More informationAdvanced Continuous Delivery Strategies for Containerized Applications Using DC/OS
Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS ContainerCon @ Open Source Summit North America 2017 Elizabeth K. Joseph @pleia2 1 Elizabeth K. Joseph, Developer Advocate
More informationFaculté Polytechnique
Faculté Polytechnique INFORMATIQUE PARALLÈLE ET DISTRIBUÉE CHAPTER 7 : CLOUD COMPUTING Sidi Ahmed Mahmoudi sidi.mahmoudi@umons.ac.be 13 December 2017 PLAN Introduction I. History of Cloud Computing and
More informationOpen Source Search. Andreas Pesenhofer. max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria
Open Source Search Andreas Pesenhofer max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria max.recall information systems max.recall is a software and consulting company enabling
More informationWearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life
Wearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life Ch.Srilakshmi Asst Professor,Department of Information Technology R.M.D Engineering College, Kavaraipettai,
More informationThe Seven Steps to Implement DataOps
The Seven Steps to Implement Ops ABSTRACT analytics teams challenged by inflexibility and poor quality have found that Ops can address these and many other obstacles. Ops includes tools and process improvements
More informationEmbedded Technosolutions
Hadoop Big Data An Important technology in IT Sector Hadoop - Big Data Oerie 90% of the worlds data was generated in the last few years. Due to the advent of new technologies, devices, and communication
More informationGetting started with GitHub
Getting started with GitHub A beginner s guide. (There s no code in this slide deck!) Presented by Quinn Supplee https://github.com/quinns What is GitHub? GitHub is a code hosting platform for version
More information70-532: Developing Microsoft Azure Solutions
70-532: Developing Microsoft Azure Solutions Exam Design Target Audience Candidates of this exam are experienced in designing, programming, implementing, automating, and monitoring Microsoft Azure solutions.
More informationDatabases 2 (VU) ( / )
Databases 2 (VU) (706.711 / 707.030) MapReduce (Part 3) Mark Kröll ISDS, TU Graz Nov. 27, 2017 Mark Kröll (ISDS, TU Graz) MapReduce Nov. 27, 2017 1 / 42 Outline 1 Problems Suited for Map-Reduce 2 MapReduce:
More informationRubix Documentation. Release Qubole
Rubix Documentation Release 0.2.12 Qubole Jul 02, 2018 Contents: 1 RubiX 3 1.1 Usecase.................................................. 3 1.2 Supported Engines and Cloud Stores..................................
More informationGNU Radio Technical Update
GNU Radio Technical Update Johnathan Corgan GRCON17 GRCON17 GNU GNU Radio Radio Technical Technical Update Update September September 2017 2017 Topics Release 3.8 Features and capabilities Milestones and
More information5/24/ MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992
2014-05-20 MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992 @SoQooL http://blog.mssqlserver.se Mattias.Lind@Sogeti.se 1 The evolution of the Microsoft data platform
More informationCloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018
Cloud Computing 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning
More informationVishesh Oberoi Seth Reid Technical Evangelist, Microsoft Software Developer, Intergen
Vishesh Oberoi Technical Evangelist, Microsoft VishO@microsoft.com @ovishesh Seth Reid Software Developer, Intergen contact@sethreid.co.nz @sethreidnz Vishesh Oberoi Technical Evangelist, Microsoft VishO@microsoft.com
More informationMulti-tenancy version of BigDataBench
Multi-tenancy version of BigDataBench Gang Lu Institute of Computing Technology, Chinese Academy of Sciences BigDataBench Tutorial MICRO 2014 Cambridge, UK INSTITUTE OF COMPUTING TECHNOLOGY 1 Multi-tenancy
More informationMoving Fast with High Reliability: Program Analysis at Uber
Moving Fast with High Reliability: Program Analysis at Uber Manu Sridharan Software Reliability Workshop ETH Zurich OCTOBER 14, 2017 Uber Apps Rider Driver Eats ios and Android Uber Apps Rider Driver Eats
More informationDevOps Anti-Patterns. Have the Ops team deal with it. Time to fire the Ops team! Let s hire a DevOps unit! COPYRIGHT 2019 MANICODE SECURITY
DevOps Anti-Patterns Have the Ops team deal with it. Time to fire the Ops team! Let s hire a DevOps unit! 31 Anti-Pattern: Throw it Over the Wall Development Operations 32 Anti-Pattern: DevOps Team Silo
More informationUsing DC/OS for Continuous Delivery
Using DC/OS for Continuous Delivery DevPulseCon 2017 Elizabeth K. Joseph, @pleia2 Mesosphere 1 Elizabeth K. Joseph, Developer Advocate, Mesosphere 15+ years working in open source communities 10+ years
More informationFROM VSTS TO AZURE DEVOPS
#DOH18 FROM VSTS TO AZURE DEVOPS People. Process. Products. Gaetano Paternò @tanopaterno info@gaetanopaterno.it 2 VSTS #DOH18 3 Azure DevOps Azure Boards (ex Work) Deliver value to your users faster using
More informationSoftware Engineering 2 A practical course in software engineering. Ekkart Kindler
Software Engineering 2 A practical course in software engineering V. Working Together Working together Management Process Models Version Management Systems Collaborative Development Environments 3 Parts
More informationWhy Git and GitHub? Ming Chow Senior Lecturer, Tufts University Department of Computer Science SIGCSE 2018
Why Git and GitHub? Ming Chow Senior Lecturer, Tufts University Department of Computer Science mchow@cs.tufts.edu Twitter: @0xmchow SIGCSE 2018 Why Git? Keep track of changes and work Documenting changes
More informationWebinar Series TMIP VISION
Webinar Series TMIP VISION TMIP provides technical support and promotes knowledge and information exchange in the transportation planning and modeling community. Today s Goals To Consider: Parallel Processing
More informationTour Trak Project Plan
Tour Trak Project Plan The Cycologists Project Plan 1. Project Overview 1.1 Scope Statement This project is a continuation of TourTrak, developed by previous senior project teams Centri Pedal, Dev Cycle,
More informationPrototyping Data Intensive Apps: TrendingTopics.org
Prototyping Data Intensive Apps: TrendingTopics.org Pete Skomoroch Research Scientist at LinkedIn Consultant at Data Wrangling @peteskomoroch 09/29/09 1 Talk Outline TrendingTopics Overview Wikipedia Page
More informationX.Org and XFree86 Version Numbering Schemes
The XFree86 Project, Inc Updated for X11R7.5 by Keith Packard and Kevin E. Martin 22 May 2006 X.Org has adopted the same basic numbering scheme used by the XFree86 Project, Inc. for their releases. The
More informationWhat is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?
What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)? What is Amazon Machine Image (AMI)? Amazon Elastic Compute Cloud (EC2)?
More informationYour Complete Guide to Backup and Recovery for MongoDB
Your Complete Guide to Backup and Recovery for MongoDB EBOOK Your Complete Guide to Backup and Recovery for MongoDB Table of Contents Part I: Backup and Recovery for MongoDB Part II: Customer Case Study
More informationA Software Architecture for Progressive Scanning of On-line Communities
A Software Architecture for Progressive Scanning of On-line Communities Roberto Baldoni, Fabrizio d Amore, Massimo Mecella, Daniele Ucci Sapienza Università di Roma, Italy Motivations On-line communities
More informationHow Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects?
How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects? Saraj Singh Manes School of Computer Science Carleton University Ottawa, Canada sarajmanes@cmail.carleton.ca Olga
More informationFusion Registry 9 SDMX Data and Metadata Management System
Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance
More informationSoftware Security: Vulnerability Analysis
Computer Security Course. Software Security: Vulnerability Analysis Program Verification Program Verification How to prove a program free of buffer overflows? Precondition Postcondition Loop invariants
More informationReal-time Fraud Detection with Innovative Big Graph Feature. Gaurav Deshpande, VP Marketing, TigerGraph; Mingxi Wu, VP Engineering, TigerGraph
Real-time Fraud Detection with Innovative Big Graph Feature Gaurav Deshpande, VP Marketing, TigerGraph; Mingxi Wu, VP Engineering, TigerGraph Speaking Today Gaurav Deshpande VP Marketing, TigerGraph gaurav@tigergraph.com
More informationCombine Native SQL Flexibility with SAP HANA Platform Performance and Tools
SAP Technical Brief Data Warehousing SAP HANA Data Warehousing Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools A data warehouse for the modern age Data warehouses have been
More informationNoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre
NoSQL systems: introduction and data models Riccardo Torlone Università Roma Tre Leveraging the NoSQL boom 2 Why NoSQL? In the last fourty years relational databases have been the default choice for serious
More informationDistributed computing: index building and use
Distributed computing: index building and use Distributed computing Goals Distributing computation across several machines to Do one computation faster - latency Do more computations in given time - throughput
More informationAutomated Netezza Migration to Big Data Open Source
Automated Netezza Migration to Big Data Open Source CASE STUDY Client Overview Our client is one of the largest cable companies in the world*, offering a wide range of services including basic cable, digital
More informationA Review Paper on Big data & Hadoop
A Review Paper on Big data & Hadoop Rupali Jagadale MCA Department, Modern College of Engg. Modern College of Engginering Pune,India rupalijagadale02@gmail.com Pratibha Adkar MCA Department, Modern College
More informationCESSDA Expert Seminar 13 & 14 September 2016 Prague, Czech Republic
CESSDA Expert Seminar 13 & 14 September 2016 Prague, Czech Republic - basics Matthäus Zloch GESIS Outline for this session Git introduction and some theory Git command basics (plus some little advanced)
More informationTHE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES
1 THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES Vincent Garonne, Mario Lassnig, Martin Barisits, Thomas Beermann, Ralph Vigne, Cedric Serfon Vincent.Garonne@cern.ch ph-adp-ddm-lab@cern.ch XLDB
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationStatic Analysis of C++ Projects with CodeSonar
Static Analysis of C++ Projects with CodeSonar John Plaice, Senior Scientist, GrammaTech jplaice@grammatech.com 25 July 2017, Meetup C++ de Montréal Abstract Static program analysis consists of the analysis
More informationBringing Data to Life
Bringing Data to Life Data management and Visualization Techniques Benika Hall Rob Harrison Corporate Model Risk March 16, 2018 Introduction Benika Hall Analytic Consultant Wells Fargo - Corporate Model
More informationCPSC 491. Lecture 19 & 20: Source Code Version Control. VCS = Version Control Software SCM = Source Code Management
CPSC 491 Lecture 19 & 20: Source Code Version Control VCS = Version Control Software SCM = Source Code Management Exercise: Source Code (Version) Control 1. Pretend like you don t have a version control
More informationConfidence Intervals. Dennis Sun Data 301
Dennis Sun Data 301 Statistical Inference probability Population / Box Sample / Data statistics The goal of statistics is to infer the unknown population from the sample. We ve already seen one mode of
More informationComposer Best Practices Nils Private Packagist
Composer Best Practices 2018 Private Packagist https://packagist.com 2018? Delete your lock files 2018? Delete your lock files Composer Ecosystem Reality Update 2018 Best Practices? Deployment Improving
More informationFlash Storage Complementing a Data Lake for Real-Time Insight
Flash Storage Complementing a Data Lake for Real-Time Insight Dr. Sanhita Sarkar Global Director, Analytics Software Development August 7, 2018 Agenda 1 2 3 4 5 Delivering insight along the entire spectrum
More informationSubmitting your Work using GIT
Submitting your Work using GIT You will be using the git distributed source control system in order to manage and submit your assignments. Why? allows you to take snapshots of your project at safe points
More informationDepartment of Information Technology, St. Joseph s College (Autonomous), Trichy, TamilNadu, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 5 ISSN : 2456-3307 A Survey on Big Data and Hadoop Ecosystem Components
More information