Project II. argument/reasoning based on the dataset)

Size: px
Start display at page:

Download "Project II. argument/reasoning based on the dataset)"

Transcription

1 Project II Hive: Simple queries (join, aggregation, group by) Hive: Advanced queries (text extraction, link prediction and graph analysis) Tableau: Visualizations (mutidimensional, interactive, support an argument/reasoning based on the dataset) 60

2 Formalizing Design Choosing Visual Encoding Expressiveness Effectiveness 61

3 Choosing visual encodings Challenge: large design space Assume 8 visual encodings and n data attributes. We would like to pick the best encoding among a combinatorial set of possibilities with size (n+1) 8 Principle of Consistency: The properties of the image (visual variables) should match the properties of the data. Principle of Importance Ordering: Encode the most important information in the most effective way 62

4 Design Criteria 1 Expressiveness A set of facts is expressible in a visual language if the sentences (i.e. the visualizations) in the language express all the facts in the set of data, and only the facts in the data. 63

5 Cannot express the facts Color/Shape -> student count? Blood pressure? A one-to-many (1 N) relation cannot be expressed in a single horizontal dot plot because multiple tuples are mapped to the same position 64

6 Expresses facts not in the data A length is interpreted as a quantitative value; So, length of bar says something untrue about N data 65

7 Design Criteria 2 Effectiveness A visualization is more effective than another visualization if the information conveyed by one visualization is more readily perceived than the information in the other visualization. 66

8 Mackinlay s Ranking Conjectured effectiveness of the encoding 67

9 Mackinlay s Visualization Design Algorithm User formally specifies data model and type Additional input: ordered list of data variables to show Algorithm searches over design space Tests expressiveness of each visual encoding Generates specification for encodings that pass test Tests perceptual effectiveness of resulting image Outputs the most effective visualization 68

10 Limitations Does not cover many visualization techniques Bertin and others discuss networks, maps, diagrams Does not consider 3D, animation, illustration, photography, Does not model interaction Does not consider semantic data types / conventions 69

11 Tableau 70

12 Tableau 71

13 Tableau: Interactive Mashups 72

14 Image Model Visual language is a sign system Images perceived as a set of signs Sender encodes information in signs Receiver decodes information from signs Sémiologie Graphique,

15 Design Considerations 1 Expressiveness Prioritizes important information / Avoids false inferences Consistent visual mappings (e.g., respect color mappings) Make encodings meaningful rather than arbitrary Effectiveness Facilitates accurate decoding / Minimizes cognitive overhead Highlight elements of primary interest Grouping / Sorting Data Transformation Non-Data Elements Descriptive: Title, Label, Caption, Data Source, Annotations Reference: Gridlines, Legend 74

16 Design Considerations 2 Don t distract: faint gridlines, pastel highlights/fills The elimination diet approach start minimal Support comparison and pattern perception Between elements, to a reference line, or to totals Reduce cognitive overhead Minimize visual search, minimize ambiguity -> Avoid legend lookups if direct labeling works -> Avoid color mappings with indiscernible colors 75

17 Bertin s Antibiotic Data Set Effectiveness of three antibiotics on sixteen strains of bacteria. antibiotic dataset Genus of Bacteria: String Species of Bacteria: String Antibiotic Applied: String Gram-Staining: Pos / Neg Min. Inhibitory Concent. (g): Number Collected prior to Different Visualizations: Bar Chart, Line Chart, Dot Plots, Tables/Heat Maps, Others, e.g., Pie chart 76

18 Bertin s Antibiotic Data Set MIC is the lowest concentration of an antimicrobial that will inhibit the visible growth of a microorganism after overnight incubation. 77

19 Different Types of Visualizations Bar Charts MIC for 3 medicine 16 Bacteria 78

20 Different Types of Visualizations Bar Chart 79

21 Different Types of Visualizations Bar Chart 80

22 Different Types of Visualizations Bar Chart 81

23 Different Types of Visualizations Bar Chart 82

24 Different Types of Visualizations Bar Chart 83

25 Different Types of Visualizations Line Chart 84

26 Different Types of Visualizations Dot Plot 85

27 Different Types of Visualizations Table/Heap Map 86

28 Data Quality and Usability Hurdles Missing Data: no measurements Erroneous Values: misspelling, outliers Type Conversion: e.g., zip code to latlon Entity Resolution: diff. values for the same thing? Data Integration: effort/errors when combining data LESSON: Anticipate problems with your data. 87

29 Data Wrangling One often needs to manipulate data prior to analysis. Tasks include reformatting, cleaning, quality assessment, and integration. Some approaches include: Writing custom scripts Manual manipulation in spreadsheets Data Wrangler: Google Refine: 88

30 Exploratory Data Analysis Exploratory Process Construct graphics to address questions Inspect answer and assess new questions Repeat! Transform the data appropriately (e.g., invert, log) Show data variation, not design variation -Tufte 89

31 Exploratory Data analysis How do the drugs compare? Critique? 90

32 Exploratory Data Analysis How do the bacteria group with respect to antibiotic resistance? 91

33 Exploratory Data Analysis How do the bacteria group w.r.t. resistance? Do different drugs correlate? 92

34 Incorporating Model Hypothesis testing: What is the probability that the pattern might have arisen by chance? Prediction: How well do one (or more) data variables predict another? Abstract description: With what parameters does the data best fit a given function? What is the goodness of fit? Scientific theory: Which model explains reality? 93

35 Summary Value of Visualization Data And Image Models Visualization Design Exploratory Data Analysis Interactive Visualization 94

Last Time: Data and Image Models

Last Time: Data and Image Models CS448B :: 2 Oct 2012 Visualization Design Last Time: Data and Image Models Jeffrey Heer Stanford University The Big Picture Nominal, Ordinal and Quantitative task questions & hypotheses intended audience

More information

Visualization Re-Design

Visualization Re-Design CS448B :: 28 Sep 2010 Visualization Re-Design Last Time: Data and Image Models Jeffrey Heer Stanford University The Big Picture Taxonomy task data physical type int, float, etc. abstract type nominal,

More information

Data and Image Models

Data and Image Models CSE 442 - Data Visualization Data and Image Models Jeffrey Heer University of Washington Last Week: Value of Visualization The Value of Visualization Record information Blueprints, photographs, seismographs,

More information

Data and Image Models

Data and Image Models CSE 512 - Data Visualization Data and Image Models Jeffrey Heer University of Washington Last Time: Value of Visualization The Value of Visualization Record information Blueprints, photographs, seismographs,

More information

Last Time: Value of Visualization

Last Time: Value of Visualization CS448B :: 29 Sep 2011 Data and Image Models Last Time: Value of Visualization Jeffrey Heer Stanford University The Value of Visualization Record information Blueprints, photographs, seismographs, Analyze

More information

We will start at 2:05 pm! Thanks for coming early!

We will start at 2:05 pm! Thanks for coming early! We will start at 2:05 pm! Thanks for coming early! Yesterday Fundamental 1. Value of visualization 2. Design principles 3. Graphical perception Record Information Support Analytical Reasoning Communicate

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

Visual Encoding Design

Visual Encoding Design CSE 442 - Data Visualization Visual Encoding Design Jeffrey Heer University of Washington Review: Expressiveness & Effectiveness / APT Choosing Visual Encodings Assume k visual encodings and n data attributes.

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

Visual Encoding Design

Visual Encoding Design CSE 442 - Data Visualization Visual Encoding Design Jeffrey Heer University of Washington Last Time: Data & Image Models The Big Picture task questions, goals assumptions data physical data type conceptual

More information

Data and Image Models

Data and Image Models CSE 442 - Data Visualization Data and Image Models Jeffrey Heer University of Washington Last Time: Value of Visualization The Value of Visualization Record information Blueprints, photographs, seismographs,

More information

MODELS AND FRAMEWORKS. Information Visualization Fall 2009 Jinwook Seo SNU CSE

MODELS AND FRAMEWORKS. Information Visualization Fall 2009 Jinwook Seo SNU CSE MODELS AND FRAMEWORKS Information Visualization Fall 2009 Jinwook Seo SNU CSE Wednesday Prof. Hee-Joon Bae, Seoul National University Bundang Hostpital blood pressure and END (early neurologic deterioration)

More information

Business Analytics Nanodegree Syllabus

Business Analytics Nanodegree Syllabus Business Analytics Nanodegree Syllabus Master data fundamentals applicable to any industry Before You Start There are no prerequisites for this program, aside from basic computer skills. You should be

More information

Multi-Dimensional Vis

Multi-Dimensional Vis CSE512 :: 21 Jan 2014 Multi-Dimensional Vis Jeffrey Heer University of Washington 1 Last Time: Exploratory Data Analysis 2 Exposure, the effective laying open of the data to display the unanticipated,

More information

To study the application of Data Visualization and Analysis tools

To study the application of Data Visualization and Analysis tools To study the application of Data Visualization and Analysis tools Mrs. Shibani Kulkarni, Department of Computer Science, Dr. D. Y. Patil ACS College, Pimpri, Pune-18 Ms. Neeta Takawale, Department of Computer

More information

Why Should We Care? More importantly, it is easy to lie or deceive people with bad plots

Why Should We Care? More importantly, it is easy to lie or deceive people with bad plots Plots & Graphs Why Should We Care? Everyone uses plots and/or graphs But most people ignore or are unaware of simple principles Default plotting tools (or default settings) are not always the best More

More information

LTER Schoolyard Ecology Program

LTER Schoolyard Ecology Program LTER Schoolyard Ecology Program Title: How to Make Awesome Graphs Teacher/Author: Nora Murphy School: Concord Carslisle High School Date: March 30, 2017 1 Name Block Date How to Make Awesome Graphs Note,

More information

GRAPHING BAYOUSIDE CLASSROOM DATA

GRAPHING BAYOUSIDE CLASSROOM DATA LUMCON S BAYOUSIDE CLASSROOM GRAPHING BAYOUSIDE CLASSROOM DATA Focus/Overview This activity allows students to answer questions about their environment using data collected during water sampling. Learning

More information

CSE Data Visualization. Multidimensional Vis. Jeffrey Heer University of Washington

CSE Data Visualization. Multidimensional Vis. Jeffrey Heer University of Washington CSE 512 - Data Visualization Multidimensional Vis Jeffrey Heer University of Washington Last Time: Exploratory Data Analysis Exposure, the effective laying open of the data to display the unanticipated,

More information

The Semiology of Graphics Pat Hanrahan Stanford University Representations

The Semiology of Graphics Pat Hanrahan Stanford University Representations The Semiology of Graphics 2 Pat Hanrahan Stanford University Representations Page 1 Number Scrabble [Simon] Given: The numbers 1 through 9 Goal: Pick three numbers that sum to 15 Number Scrabble [Simon]

More information

Perception Maneesh Agrawala CS 448B: Visualization Fall 2017 Last Time: Exploratory Data Analysis

Perception Maneesh Agrawala CS 448B: Visualization Fall 2017 Last Time: Exploratory Data Analysis Perception Maneesh Agrawala CS 448B: Visualization Fall 2017 Last Time: Exploratory Data Analysis 1 Will Burtin, 1951 How do the drugs compare? How do the bacteria group with respect to antibiotic resistance?

More information

Six Core Data Wrangling Activities. An introductory guide to data wrangling with Trifacta

Six Core Data Wrangling Activities. An introductory guide to data wrangling with Trifacta Six Core Data Wrangling Activities An introductory guide to data wrangling with Trifacta Today s Data Driven Culture Are you inundated with data? Today, most organizations are collecting as much data in

More information

Construction Change Order analysis CPSC 533C Analysis Project

Construction Change Order analysis CPSC 533C Analysis Project Construction Change Order analysis CPSC 533C Analysis Project Presented by Chiu, Chao-Ying Department of Civil Engineering University of British Columbia Problems of Using Construction Data Hybrid of physical

More information

Making Science Graphs and Interpreting Data

Making Science Graphs and Interpreting Data Making Science Graphs and Interpreting Data Eye Opener: 5 mins What do you see? What do you think? Look up terms you don t know What do Graphs Tell You? A graph is a way of expressing a relationship between

More information

Lab Activity #2- Statistics and Graphing

Lab Activity #2- Statistics and Graphing Lab Activity #2- Statistics and Graphing Graphical Representation of Data and the Use of Google Sheets : Scientists answer posed questions by performing experiments which provide information about a given

More information

CSE Data Visualization. Multidimensional Vis. Jeffrey Heer University of Washington

CSE Data Visualization. Multidimensional Vis. Jeffrey Heer University of Washington CSE 512 - Data Visualization Multidimensional Vis Jeffrey Heer University of Washington Last Time: Exploratory Data Analysis Exposure, the effective laying open of the data to display the unanticipated,

More information

DATA CLEANING & DATA MANIPULATION

DATA CLEANING & DATA MANIPULATION DATA CLEANING & DATA MANIPULATION WESLEY WILLETT INFO VISUAL 340 ANALYTICS D 13 FEB 2014 1 OCT 2014 WHAT IS DIRTY DATA? BEFORE WE CAN TALK ABOUT CLEANING,WE NEED TO KNOW ABOUT TYPES OF ERROR AND WHERE

More information

Problems With Using Microsoft Excel for Statistics

Problems With Using Microsoft Excel for Statistics Problems With Using Microsoft Excel for Statistics Jonathan D. Cryer (Jon-Cryer@uiowa.edu) Department of Statistics and Actuarial Science University of Iowa, Iowa City, Iowa Joint Statistical Meetings

More information

Visualization Tools. Interaction. How do people create visualizations? Jeffrey Heer Stanford University

Visualization Tools. Interaction. How do people create visualizations? Jeffrey Heer Stanford University CS448B :: 20 Oct 2011 Interaction Visualization Tools Jeffrey Heer Stanford University How do people create visualizations? Today's first task is not to invent wholly new [graphical] techniques, though

More information

Visualization Tools. Interaction. How do people create visualizations? Jeffrey Heer Stanford University

Visualization Tools. Interaction. How do people create visualizations? Jeffrey Heer Stanford University CS448B :: 23 Oct 2012 Interaction Visualization Tools Jeffrey Heer Stanford University How do people create visualizations? Today's first task is not to invent wholly new [graphical] techniques, though

More information

CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing. University of Florida, CISE Department Prof.

CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing. University of Florida, CISE Department Prof. CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing University of Florida, CISE Department Prof. Daisy Zhe Wang Data Visualization Value of Visualization Data And Image Models

More information

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

Interactive Data Analysis

Interactive Data Analysis domain specific languages for Interactive Data Analysis Jeffrey Heer Stanford University 1 HTML / CSS Table SELECT customer_id, customer_name, COUNT(order_id) as total FROM customers INNER JOIN orders

More information

Automating the Design of Graphical Presentations of Relational Information. By Jock Mackinlay, Tableau (then Stanford)

Automating the Design of Graphical Presentations of Relational Information. By Jock Mackinlay, Tableau (then Stanford) Automating the Design of Graphical Presentations of Relational Information By Jock Mackinlay, Tableau (then Stanford) Overview Automate the design of 2D graphical presentations for relational data Why

More information

Y8 IT in Science Using Forces

Y8 IT in Science Using Forces Y8 IT in Science Using Forces IT in Science Information Technology can be used in all subjects but is very useful for Science. We can use Information Technology in Science to help us with all these things:

More information

Practicing on word processor, spreadsheet, search engines, citation and referencing, and creating zipped files.

Practicing on word processor, spreadsheet, search engines, citation and referencing,  and creating zipped files. Practicing on word processor, spreadsheet, search engines, citation and referencing, email and creating zipped files. 1 Exercise 1: Creating MS Word document 1. Create a new folder in My Documents and

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Exploratory Data Analysis Dr. David Koop What is Exploratory Data Analysis? "Detective work" to summarize and explore datasets Includes: - Data acquisition and input

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) SAS Analytics:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic

More information

Visualization? Information Visualization. Information Visualization? Ceci n est pas une visualization! So why two disciplines? So why two disciplines?

Visualization? Information Visualization. Information Visualization? Ceci n est pas une visualization! So why two disciplines? So why two disciplines? Visualization? New Oxford Dictionary of English, 1999 Information Visualization Matt Cooper visualize - verb [with obj.] 1. form a mental image of; imagine: it is not easy to visualize the future. 2. make

More information

Predictive Analysis: Evaluation and Experimentation. Heejun Kim

Predictive Analysis: Evaluation and Experimentation. Heejun Kim Predictive Analysis: Evaluation and Experimentation Heejun Kim June 19, 2018 Evaluation and Experimentation Evaluation Metrics Cross-Validation Significance Tests Evaluation Predictive analysis: training

More information

Glyphs. Presentation Overview. What is a Glyph!? Cont. What is a Glyph!? Glyph Fundamentals. Goal of Paper. Presented by Bertrand Low

Glyphs. Presentation Overview. What is a Glyph!? Cont. What is a Glyph!? Glyph Fundamentals. Goal of Paper. Presented by Bertrand Low Presentation Overview Glyphs Presented by Bertrand Low A Taxonomy of Glyph Placement Strategies for Multidimensional Data Visualization Matthew O. Ward, Information Visualization Journal, Palmgrave,, Volume

More information

Course on Database Design Carlo Batini University of Milano Bicocca

Course on Database Design Carlo Batini University of Milano Bicocca Course on Database Design Carlo Batini University of Milano Bicocca 1 Carlo Batini, 2015 This work is licensed under the Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License.

More information

Knowledge Discovery and Data Mining 1 (VO) ( )

Knowledge Discovery and Data Mining 1 (VO) ( ) Knowledge Discovery and Data Mining 1 (VO) (707.003) Data Matrices and Vector Space Model Denis Helic KTI, TU Graz Nov 6, 2014 Denis Helic (KTI, TU Graz) KDDM1 Nov 6, 2014 1 / 55 Big picture: KDDM Probability

More information

Excel Tips and FAQs - MS 2010

Excel Tips and FAQs - MS 2010 BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 2 Summarizing and Graphing Data 2-1 Review and Preview 2-2 Frequency Distributions 2-3 Histograms

More information

Microsoft Excel 2010 Training. Excel 2010 Basics

Microsoft Excel 2010 Training. Excel 2010 Basics Microsoft Excel 2010 Training Excel 2010 Basics Overview Excel is a spreadsheet, a grid made from columns and rows. It is a software program that can make number manipulation easy and somewhat painless.

More information

Module 4: Creating Content Lesson 4: Creating Charts Learn

Module 4: Creating Content Lesson 4: Creating Charts Learn Module 4: Creating Content Lesson 4: Creating Charts Learn Welcome to Module 4, Lesson 4. In this lesson, you will learn how to create, edit, and save new charts based on your uploaded data using InfoAssist+.

More information

STAT1010 cautions on graphics

STAT1010 cautions on graphics 3.4 Cautions with Graphics Graphics can be misleading. How do we perceive the graphic? Is the graphic portrayed honestly? Are they comparing apples to apples? Or apples to oranges? Is it a percentage change

More information

INDEX UNIT 4 PPT SLIDES

INDEX UNIT 4 PPT SLIDES INDEX UNIT 4 PPT SLIDES S.NO. TOPIC 1. 2. Screen designing Screen planning and purpose arganizing screen elements 3. 4. screen navigation and flow Visually pleasing composition 5. 6. 7. 8. focus and emphasis

More information

IAT 355 Visual Analytics. Animation 2. Lyn Bartram. Many of these slides were borrowed from M. Hearst and J. Heer

IAT 355 Visual Analytics. Animation 2. Lyn Bartram. Many of these slides were borrowed from M. Hearst and J. Heer IAT 355 Visual Analytics Animation 2 Lyn Bartram Many of these slides were borrowed from M. Hearst and J. Heer Today A Primer! Things to do when designing your visualization Project concerns Animation

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) SAS Adv. Analytics or Predictive Modelling:- Class Room: Training Fee & Duration : 30K & 3 Months Online Training Fee & Duration : 33K & 3 Months Learning SAS:

More information

Tableau Advanced Training. Student Guide April x. For Evaluation Only

Tableau Advanced Training. Student Guide April x. For Evaluation Only Tableau Advanced Training Student Guide www.datarevelations.com 914.945.0567 April 2017 10.x Contents A. Warm Up 1 Bar Chart Colored by Profit 1 Salary Curve 2 2015 v s. 2014 Sales 3 VII. Programmatic

More information

Spreadsheet and Graphing Exercise Biology 210 Introduction to Research

Spreadsheet and Graphing Exercise Biology 210 Introduction to Research 1 Spreadsheet and Graphing Exercise Biology 210 Introduction to Research There are many good spreadsheet programs for analyzing data. In this class we will use MS Excel. Below are a series of examples

More information

Excel Tutorial. Look at this simple spreadsheet and then answer the questions below.

Excel Tutorial. Look at this simple spreadsheet and then answer the questions below. Excel Tutorial During this introduction you will learn the basics of how to use Excel, which means making a spreadsheet, creating a graph from that spreadsheet, and then interpreting the data to answer

More information

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student Organizing data Learning Outcome 1. make an array 2. divide the array into class intervals 3. describe the characteristics of a table 4. construct a frequency distribution table 5. constructing a composite

More information

Data Visualization Principles for Scientific Communication

Data Visualization Principles for Scientific Communication Data Visualization Principles for Scientific Communication 8-888 Introduction to Linguistic Data Analysis Using R Jerzy Wieczorek 11//15 Follow along These slides and a summary checklist are at http://www.stat.cmu.edu/~jwieczor/

More information

Hours Assignments:

Hours Assignments: IT103 Database Management Course Objectives Upon completion of this course, the student will be able to: 1. Demonstrate database concepts and terminology. 2. Demonstrate how to create and use a basic database.

More information

Interaction Techniques. SWE 432, Fall 2017 Design and Implementation of Software for the Web

Interaction Techniques. SWE 432, Fall 2017 Design and Implementation of Software for the Web Interaction Techniques SWE 432, Fall 2017 Design and Implementation of Software for the Web Today What principles guide the design of usable interaction techniques? How can interaction designs help support

More information

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office) SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon

More information

Data Analysis Guidelines

Data Analysis Guidelines Data Analysis Guidelines DESCRIPTIVE STATISTICS Standard Deviation Standard deviation is a calculated value that describes the variation (or spread) of values in a data set. It is calculated using a formula

More information

Lecture 5: DATA MAPPING & VISUALIZATION. November 3 rd, Presented by: Anum Masood (TA)

Lecture 5: DATA MAPPING & VISUALIZATION. November 3 rd, Presented by: Anum Masood (TA) 1/59 Lecture 5: DATA MAPPING & VISUALIZATION November 3 rd, 2017 Presented by: Anum Masood (TA) 2/59 Recap: Data What is Data Visualization? Data Attributes Visual Attributes Mapping What are data attributes?

More information

A Formalism for Visual Query Interface Design

A Formalism for Visual Query Interface Design A Formalism for Visual Query Interface Design by Jiwen Huo A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Doctor of Philosophy in Computer Science

More information

EZY Intellect Pte. Ltd., #1 Changi North Street 1, Singapore

EZY Intellect Pte. Ltd., #1 Changi North Street 1, Singapore Tableau in Business Intelligence Duration: 6 Days Tableau Desktop Tableau Introduction Tableau Introduction. Overview of Tableau workbook, worksheets. Dimension & Measures Discrete and Continuous Install

More information

Data Science. Data Analyst. Data Scientist. Data Architect

Data Science. Data Analyst. Data Scientist. Data Architect Data Science Data Analyst Data Analysis in Excel Programming in R Introduction to Python/SQL/Tableau Data Visualization in R / Tableau Exploratory Data Analysis Data Scientist Inferential Statistics &

More information

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES Chapter 2 2.1 Objectives 2.1 What Are the Types of Data? www.managementscientist.org 1. Know the definitions of a. Variable b. Categorical versus quantitative

More information

Applying Machine Learning to Real Problems: Why is it Difficult? How Research Can Help?

Applying Machine Learning to Real Problems: Why is it Difficult? How Research Can Help? Applying Machine Learning to Real Problems: Why is it Difficult? How Research Can Help? Olivier Bousquet, Google, Zürich, obousquet@google.com June 4th, 2007 Outline 1 Introduction 2 Features 3 Minimax

More information

The Trustworthiness of Digital Records

The Trustworthiness of Digital Records The Trustworthiness of Digital Records International Congress on Digital Records Preservation Beijing, China 16 April 2010 1 The Concept of Record Record: any document made or received by a physical or

More information

Making Tables and Graphs with Excel. The Basics

Making Tables and Graphs with Excel. The Basics Making Tables and Graphs with Excel The Basics Where do my IV and DV go? Just like you would create a data table on paper, your IV goes in the leftmost column and your DV goes to the right of the IV Enter

More information

Sorted bar plot with 45 degree labels in R

Sorted bar plot with 45 degree labels in R Sorted bar plot with 45 degree labels in R In this exercise we ll plot a bar graph, sort it in decreasing order (big to small from left to right) and place long labels under the bars. The labels will be

More information

COPYRIGHT & TRADEMARKS... I

COPYRIGHT & TRADEMARKS... I Microsoft Excel XP Level 3 ABOUT VICON... I COPYRIGHT & TRADEMARKS... I DISCLAIMER... I LESSON 1 - WORKING WITH DATABASES...1 Using a Database...2 Creating a Database...2 Modifying a Database...4 Sorting

More information

General Program Description

General Program Description General Program Description This program is designed to interpret the results of a sampling inspection, for the purpose of judging compliance with chosen limits. It may also be used to identify outlying

More information

New Approach to Graph Databases

New Approach to Graph Databases Paper PP05 New Approach to Graph Databases Anna Berg, Capish, Malmö, Sweden Henrik Drews, Capish, Malmö, Sweden Catharina Dahlbo, Capish, Malmö, Sweden ABSTRACT Graph databases have, during the past few

More information

MHPE 494: Data Analysis. Welcome! The Analytic Process

MHPE 494: Data Analysis. Welcome! The Analytic Process MHPE 494: Data Analysis Alan Schwartz, PhD Department of Medical Education Memoona Hasnain,, MD, PhD, MHPE Department of Family Medicine College of Medicine University of Illinois at Chicago Welcome! Your

More information

AND NUMERICAL SUMMARIES. Chapter 2

AND NUMERICAL SUMMARIES. Chapter 2 EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES Chapter 2 2.1 What Are the Types of Data? 2.1 Objectives www.managementscientist.org 1. Know the definitions of a. Variable b. Categorical versus quantitative

More information

Information Visualization. SWE 432, Fall 2016 Design and Implementation of Software for the Web

Information Visualization. SWE 432, Fall 2016 Design and Implementation of Software for the Web Information Visualization SWE 432, Fall 2016 Design and Implementation of Software for the Web Today What types of information visualization are there? Which one should you choose? What does usability

More information

Visual Tools & Methods for Data Cleaning

Visual Tools & Methods for Data Cleaning Visual Tools & Methods for Data Cleaning DaQuaTa International Workshop 2017, Lyon, FR http://romain.vuillemot.net/ @romsson Reality * Time series * Geo-spatial data Select, filter, sort, zoom,.. Need

More information

Math 227 EXCEL / MEGASTAT Guide

Math 227 EXCEL / MEGASTAT Guide Math 227 EXCEL / MEGASTAT Guide Introduction Introduction: Ch2: Frequency Distributions and Graphs Construct Frequency Distributions and various types of graphs: Histograms, Polygons, Pie Charts, Stem-and-Leaf

More information

D360: Unlock the value of your scientific data Solving Informatics Problems for Translational Research

D360: Unlock the value of your scientific data Solving Informatics Problems for Translational Research D360: Unlock the value of your scientific data Solving Informatics Problems for Translational Research Dr. Fabian Bös, Senior Application Scientist Certara Spain SL Martin-Kollar-Str. 17, 81829 Munich

More information

Statistical Graphs & Charts

Statistical Graphs & Charts Statistical Graphs & Charts CS 4460 Intro. to Information Visualization August 30, 2017 John Stasko Learning Objectives Learn different statistical data graphs Line graph, Bar Graph, Scatterplot, Trellis,

More information

Interaction Techniques. SWE 432, Fall 2016 Design and Implementation of Software for the Web

Interaction Techniques. SWE 432, Fall 2016 Design and Implementation of Software for the Web Interaction Techniques SWE 432, Fall 2016 Design and Implementation of Software for the Web Today What principles guide the design of usable interaction techniques? How can interaction designs help support

More information

Unit 1 Lesson 4 Representing Data. Copyright Houghton Mifflin Harcourt Publishing Company

Unit 1 Lesson 4 Representing Data. Copyright Houghton Mifflin Harcourt Publishing Company Florida Benchmarks SC.6.N.1.1 Define a problem from the sixth grade curriculum, use appropriate reference materials to support scientific understanding, plan and carry out scientific investigation of various

More information

Unit Assessment Guide

Unit Assessment Guide Unit Assessment Guide Unit Details Unit code Unit name Unit purpose/application ICTWEB425 Apply structured query language to extract and manipulate data This unit describes the skills and knowledge required

More information

Semantic Web Technologies. Topic: Data Cleaning

Semantic Web Technologies. Topic: Data Cleaning Semantic Web Technologies Topic: Data Cleaning olaf.hartig@liu.se Terminology and Methodologies Data cleaning (data cleansing, data scrubbing) deals with detecting and removing errors and inconsistencies

More information

Parallel Coordinates CS 6630 Scientific Visualization

Parallel Coordinates CS 6630 Scientific Visualization Parallel Coordinates CS 6630 Scientific Visualization Yang Shen u0879466 Jason Davies Example Jason s parallel coordinates is fairly effective when it comes to interaction. I like the way creating filter

More information

Introduction to BEST Viewpoints

Introduction to BEST Viewpoints Introduction to BEST Viewpoints This is not all but just one of the documentation files included in BEST Viewpoints. Introduction BEST Viewpoints is a user friendly data manipulation and analysis application

More information

The main topics that we will learn about during the class are

The main topics that we will learn about during the class are STA 141 Syllabus This class is about scientific and statistical computing. It is intended to provide you with a strong foundation in computing skills that are increasingly necessary for a practicing statistician

More information

Data Analytics Training Program

Data Analytics Training Program Data Analytics Training Program In exclusive association with 1200+ Trainings 20,000+ Participants 10,000+ Brands 45+ Countries [Since 2009] Training partner for Who Is This Course For? Programers Willing

More information

Figures, Graphs, and Tables. ChEn 475

Figures, Graphs, and Tables. ChEn 475 Figures, Graphs, and Tables ChEn 475 General Tips Tables have HEADINGS, figures have CAPTIONS!! Footnotes under tables are okay. Headings and captions should be concise and precise, not long narratives.

More information

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. + What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and

More information

Data Preprocessing. Why Data Preprocessing? MIT-652 Data Mining Applications. Chapter 3: Data Preprocessing. Multi-Dimensional Measure of Data Quality

Data Preprocessing. Why Data Preprocessing? MIT-652 Data Mining Applications. Chapter 3: Data Preprocessing. Multi-Dimensional Measure of Data Quality Why Data Preprocessing? Data in the real world is dirty incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate data e.g., occupation = noisy: containing

More information

VAPOR Product Roadmap. Visualization and Analysis Software Team October 2017

VAPOR Product Roadmap. Visualization and Analysis Software Team October 2017 VAPOR Product Roadmap Visualization and Analysis Software Team October 2017 VAPOR Introduction In 2015 the VAPOR team began a major refactoring of the VAPOR codebase aimed at addressing a myriad of limitations

More information

3. Visual Analytics (Part 1: Visual Encoding) Jacobs University Visualization and Computer Graphics Lab

3. Visual Analytics (Part 1: Visual Encoding) Jacobs University Visualization and Computer Graphics Lab 3. Visual Analytics (Part 1: Visual Encoding) 3.1 Introduction Motivation Big Data cannot be analyzed anymore without the help of computers. Computers are good in quickly processing large amounts of data.

More information

Perception Maneesh Agrawala CS : Visualization Fall 2013 Multidimensional Visualization

Perception Maneesh Agrawala CS : Visualization Fall 2013 Multidimensional Visualization Perception Maneesh Agrawala CS 294-10: Visualization Fall 2013 Multidimensional Visualization 1 Visual Encoding Variables Position Length Area Volume Value Texture Color Orientation Shape ~8 dimensions?

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data Using Excel for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters. Graphs are

More information

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Graphical Analysis of Data using Microsoft Excel [2016 Version] Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

Step 10 Visualisation Carlos Moura

Step 10 Visualisation Carlos Moura Step 10 Visualisation Carlos Moura COIN 2017-15th JRC Annual Training on Composite Indicators & Scoreboards 06-08/11/2017, Ispra (IT) Effective communication through visualization Why investing on visual

More information

Algebra 1, 4th 4.5 weeks

Algebra 1, 4th 4.5 weeks The following practice standards will be used throughout 4.5 weeks:. Make sense of problems and persevere in solving them.. Reason abstractly and quantitatively. 3. Construct viable arguments and critique

More information

Excel for Data Visualization

Excel for Data Visualization Introduction to Excel for Data Visualization CHEAT SHEET CONTENT Basic data cleaning troubleshooting... 04 Three useful excel formulas... 05 INDEX MATCH VLOOKUP COUNTIF Why use INDEX MATCH?... 06 Why use

More information

Information Retrieval. CS630 Representing and Accessing Digital Information. What is a Retrieval Model? Basic IR Processes

Information Retrieval. CS630 Representing and Accessing Digital Information. What is a Retrieval Model? Basic IR Processes CS630 Representing and Accessing Digital Information Information Retrieval: Retrieval Models Information Retrieval Basics Data Structures and Access Indexing and Preprocessing Retrieval Models Thorsten

More information

TNM093 Tillämpad visualisering och virtuell verklighet. Jimmy Johansson C-Research, Linköping University

TNM093 Tillämpad visualisering och virtuell verklighet. Jimmy Johansson C-Research, Linköping University TNM093 Tillämpad visualisering och virtuell verklighet Jimmy Johansson C-Research, Linköping University Introduction to Visualization New Oxford Dictionary of English, 1999 visualize - verb [with obj.]

More information