Alyssa Grieco. Data Wrangling Final Project Report Fall 2016 Dangerous Dogs and Off-leash Areas in Austin Housing Market Zip Codes.

Size: px
Start display at page:

Download "Alyssa Grieco. Data Wrangling Final Project Report Fall 2016 Dangerous Dogs and Off-leash Areas in Austin Housing Market Zip Codes."

Transcription

1 Alyssa Grieco Data Wrangling Final Project Report Fall 2016 Dangerous Dogs and Off-leash Areas in Austin Housing Market Zip Codes Workflow

2 Datasets Data was taken from three sources on data.austintexas.gov. One was data about the housing market in Austin, Texas in 2014 incorporating median household income, median rent and median home value for every zip code across the city:

3 Another dataset showed the address, zip code and owner of dangerous dogs throughout Austin:

4 The third dataset had the name and address, along with a lot of extraneous detail of off-leash areas across Austin: Database After finding and looking at these datasets and seeing that they could all be exported as CSVs, which I was able to easily export to the server, I created a database ER diagram for a database linking all of these datasets and created that database on phpmyadmin.

5 Importing Data to Database Once the data was created and all three tables were ready to recieve data, I wrote python code to clean up the data in the housing market csv so that dollar signs and commas were no longer in the median income, rent and home value data, wrote python code to pull out the zip code from the table full of description of the off leash areas, and wrote python code to import the cleaned up data from all three datasets to their respective tables in the database using a mysql query within the code. The code is as follows: (and is also on the server, here sftp://agrieco:@holden.ischool.utexas.edu//export/home/u16/agrieco/project/project_code.py) import pymysql #allows me to use mysql within the python code import pprint import csv import re connection = pymysql.connect(host="localhost", # your host, usually localhost user="grieco_a", # your username passwd="arg985", # your password db="grieco_a_project_dangerous_dogs", # name of the db autocommit=true, # removes a step in queries cursorclass=pymysql.cursors.dictcursor) cursor = connection.cursor() cursor.execute("truncate dangerous_dogs; TRUNCATE housing_market; TRUNCATE off_leash_area") with open('housing_market.csv') as csvfile: mycsvreader = csv.dictreader(csvfile, delimiter=",", quotechar='"') for row in mycsvreader: raw_income = row["median_household_income"] no_sign_income = re.sub('\$', '', raw_income) #gets rid of dollar signs no_comma_income = re.sub(',', '', no_sign_income) #gets rid of commas raw_rent = row["median_rent"] no_sign_rent = re.sub('\$', '', raw_rent) #gets rid of dollar signs

6 no_comma_rent = re.sub(',', '', no_sign_rent) #gets rid of commas raw_value = row["median_home_value"] no_sign_value = re.sub('\$', '', raw_value) #gets rid of dollar signs no_comma_value = re.sub(',', '', no_sign_value) #gets rid of commas # pprint.pprint(row) sql_placeholder = """ INSERT INTO housing_market(zip_code,median_household_income,median_rent,median_home_value) VALUE (%(Zip_Code)s,%(Median_household_income)s,%(Median_rent)s,%(Median_home_value)s) """ #mysql to insert the columns from the housing market csv into the appropriate #columns in the correct table in the database param_dict = {'Zip_Code': row['zip_code'], 'Median_household_income': no_comma_income, 'Median_rent': no_comma_rent, 'Median_home_value': no_comma_value} #allows me to import the cleaned data #print(param_dict) cursor.execute(sql_placeholder, param_dict) import pymysql import pprint import csv connection = pymysql.connect(host="localhost", # your host, usually localhost user="grieco_a", # your username passwd="arg985", # your password db="grieco_a_project_dangerous_dogs", # name of the db autocommit=true, # removes a step in queries cursorclass=pymysql.cursors.dictcursor) cursor = connection.cursor()

7 with open('map_of_declared_dangerous_dogs.csv') as csvfile: mycsvreader = csv.dictreader(csvfile, delimiter=",", quotechar='"') for row in mycsvreader: # pprint.pprint(row) sql_placeholder = "INSERT INTO dangerous_dogs (First_Name,Address,Zip_Code) VALUES (%(First_Name)s,%(Address)s,%(Zip_Code)s)"; #this one needed no cleaning so I could insert the data straight into the table without creating a paramdict cursor.execute(sql_placeholder, row) import pymysql import pprint import csv import re connection = pymysql.connect(host="localhost", # your host, usually localhost user="grieco_a", # your username passwd="arg985", # your password db="grieco_a_project_dangerous_dogs", # name of the db autocommit=true, # removes a step in queries cursorclass=pymysql.cursors.dictcursor) cursor = connection.cursor() with open('off_leash_areas.csv') as csvfile: mycsvreader = csv.dictreader(csvfile, delimiter=",", quotechar='"') for row in mycsvreader:

8 #pprint.pprint(row) print("-"*20) raw_address = row["address"] print(raw_address) #remove tags <.+> no_tags_address = re.sub('<[^>]+>',' ', raw_address) no_tags_address = re.sub('<.*$','', no_tags_address) print(no_tags_address) matches = re.search('(\d\d\d\d\d)',no_tags_address) #finds only 5 numbers in a row to pull out just the zip codes myzip = matches.group(1) #tells it to only print the matches of 5 numbers in a row print(myzip) param_dict = {'Name': row['name'], 'Address': no_tags_address, 'Zip_Code': myzip} sql_placeholder = "INSERT INTO off_leash_area (Name,Address,Zip_Code) VALUES (%(Name)s,%(Address)s,%(Zip_Code)s)" #print(param_dict) cursor.execute(sql_placeholder, param_dict) #inserts cleaned up data Analysis After all the data was successfully cleaned up and put into the database, I needed to find a way to link it and analyze something about it. I wanted to see if the number of dangerous dogs was in any way linked to off-leash areas or median income across Austin zip codes. Therefore, I created a mysql query that counted all the dangerous dogs in each zip code and all the off leash areas in each zip code and put those zip codes in order from lowest to highest median income. This involved COUNT(DISTINCT), grouping by zip code and ordering by income but when I did this with a regular inner join of all the tables it would not give me values where the count was zero. Therefore, I was only getting a portion of Austin zip codes, not all Austin zip codes. I then had to do an outer join of the off leash area table to the housing market table in order to get all values. This resulted in the following mysql query:

9 SELECT housing_market.zip_code, housing_market.median_household_income, COUNT ( DISTINCT dangerous_dogs.id ) AS dangerous_dogs_in_zip, COUNT ( DISTINCT off_leash_area.id ) AS off_leash_area_in_zip FROM housing_market LEFT OUTER JOIN off_leash_area ON housing_market.zip_code = off_leash_area.zip_code LEFT OUTER JOIN dangerous_dogs ON housing_market.zip_code = dangerous_dogs.zip_code GROUP BY housing_market.zip_code ORDER BY housing_market.median_household_income ASC I then wrote python script to make this query in the database and then write the results into a csv on the server that I could use to create a visual analysis of the results. This code is also on the server in the same file as the import code (sftp://agrieco:@holden.ischool.utexas.edu//export/home/u16/agrieco/project/project_code.py) import pymysql import csv import pprint # First set up the connection to the server connection = pymysql.connect(host="localhost", # your host, usually localhost user="grieco_a", # your username passwd="arg985", # your password db="grieco_a_project_dangerous_dogs", # name of the data base autocommit=true, cursorclass=pymysql.cursors.dictcursor) # as with opening a file we can use with to open the connection # the cursor is the object through which we talk to the sql server. with connection.cursor() as cursor: sql = "SELECT housing_market.zip_code, housing_market.median_household_income, COUNT( DISTINCT dangerous_dogs.id ) AS dangerous_dogs_in_zip, COUNT( DISTINCT off_leash_area.id ) AS off_leash_area_in_zip FROM housing_market LEFT OUTER JOIN off_leash_area ON housing_market.zip_code = off_leash_area.zip_code LEFT OUTER JOIN dangerous_dogs ON housing_market.zip_code = dangerous_dogs.zip_code GROUP BY housing_market.zip_code ORDER BY housing_market.median_household_income ASC " # SQL queries are just a string. #mysql query for my analysis cursor.execute(sql)

10 results = cursor.fetchall() # here we get the column names from the keys of the first item. csv_column_order = list(results[0].keys()) with open('project_query.csv', 'w', newline='') as csvfile: mycsvwriter = csv.dictwriter(csvfile, delimiter=',', quotechar='"', fieldnames = csv_column_order) # write the header row (it gets those from the fieldnames) mycsvwriter.writeheader() # and then each of the other results, row by row. for row in results: mycsvwriter.writerow(row) #writes results to the csv specified above: project_query.csv

11 Results The results of this query were written to the following csv:

12 I then decided to create a clustered column chart in excel with columns for count of dangerous dogs and off leash areas for each zip code and have those columns and zip codes be in order from lowest to highest median income for each zip code. From this analysis, it would seem that there is no correlation between number of dangerous dogs and median income because the count of dangerous dogs is higher and lower across the zip codes for all income levels. There are also a scattered number of off leash areas throughout zip codes of all income levels. Also, the number of dangerous dogs does not seem to increase or decrease depending on number of off leash areas in the same zip code. Therefore, it would seem that none of these datasets have any sort of immediate ties to each other. Dangerous dog count must be tied to something other than income or number of off-leash areas. Analysis Tool Learning To create my analytical bar graph from my query results, I chose to use Excel. I have used Excel many times for simple tables and for using functions on certain columns, rows and cells, such as SUM, but I have not created charts using it and not charts as complicated as the one I wanted for this project. My analysis got me a chart with every Austin zip code in order of lowest to highest median household income and the count of dangerous dogs and off leash areas for each zip code. I wanted the count to be in the form of two bars for each zip code and I wanted those zip codes to be in the median income order. I opened the csv file with these results in Excel and tried to highlight and create a bar graph from all the data. However, the bar graph made the

13 bars horizontal and I wanted them vertical, so I changed it to a clustered column graph. However, by simply highlighting all the data and putting it into a chart, it put everything into the bars instead of putting the zip code and median income along the x-axis. Because of this problem, I thought that I needed to create a pivot table with the data because I know you can designate columns and rows there. So, I tried that and made all the data come up as product so that I would get the same data and it wouldn t be a sum. However, there was no way to get zip code into median income order or even have them really be together on the same axis on the clustered column chart. Therefore, I went back to the original data in the original results table and found that in the select data section of the chart formatting menu, you can choose which data you want on the x-axis. I put all the data from the zip code and median household value columns into this and it came up exactly the way that I wanted with every zip code and count in median income order. I then deleted the zip code and median income bars and added labels to the axis. All of this I figured out from trial and error and didn t have to look anything up online. Challenge When working on this project, a challenge that I faced was getting the database design correct and cleaning up the CSVs enough to import them to the database. In class, we learned how to join tables through foreign keys and ids, so I tried to create the database tables with id foreign keys. However, that wasn t really working, so I realized that because the zip code for the housing market table is unique, I could use that to join the other tables through their zip codes. The csv files that I got for the housing market data and the off-leash area data had a lot of issues with them that made them difficult to import to the database. The housing market csv had dollar signs and commas in the columns relating to median household income, median rent and median home value that the database would not let me import into tables. I had to use re.sub() in python in order to get the import without the dollar signs and commas. The off-leash area csv column for address had a bunch of description and symbols that were not necessary and I needed to pull out just the address and zip code to but into column in the table on the database. I used re.sub() to get rid of the symbols and then had the code find five numbers together and count that as the zip code since that was the only item in the column with five numbers together. I used \d\d\d\d\d for this. Also with both of these, I had to create a param dict and have the param dict import into the fields in the database instead of rows since I messed around with the data in the rows.

OVERVIEW OF RELATIONAL DATABASES: KEYS

OVERVIEW OF RELATIONAL DATABASES: KEYS OVERVIEW OF RELATIONAL DATABASES: KEYS Keys (typically called ID s in the Sierra Database) come in two varieties, and they define the relationship between tables. Primary Key Foreign Key OVERVIEW OF DATABASE

More information

How to import text files to Microsoft Excel 2016:

How to import text files to Microsoft Excel 2016: How to import text files to Microsoft Excel 2016: You would use these directions if you get a delimited text file from a government agency (or some other source). This might be tab-delimited, comma-delimited

More information

TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL

TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL We have spent the first part of the course learning Excel: importing files, cleaning, sorting, filtering, pivot tables and exporting

More information

Filter and PivotTables in Excel

Filter and PivotTables in Excel Filter and PivotTables in Excel FILTERING With filters in Excel you can quickly collapse your spreadsheet to find records meeting specific criteria. A lot of reporters use filter to cut their data down

More information

MASSTRANSIT DATABASE ANALYSIS. Using Microsoft Excel And ODBC To Analyze MySQL Data

MASSTRANSIT DATABASE ANALYSIS. Using Microsoft Excel And ODBC To Analyze MySQL Data MASSTRANSIT DATABASE ANALYSIS Using Microsoft Excel And ODBC To Analyze MySQL Data AUTHOR: PETE CODY petecody@grouplogic.com 31 May 2007 Document Revision History: Date: Author: Comment: 19 Apr 2007 PMC

More information

TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL

TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL We have spent the first part of the course learning Excel: importing files, cleaning, sorting, filtering, pivot tables and exporting

More information

Become strong in Excel (2.0) - 5 Tips To Rock A Spreadsheet!

Become strong in Excel (2.0) - 5 Tips To Rock A Spreadsheet! Become strong in Excel (2.0) - 5 Tips To Rock A Spreadsheet! Hi folks! Before beginning the article, I just wanted to thank Brian Allan for starting an interesting discussion on what Strong at Excel means

More information

Key concepts through Excel Basic videos 01 to 25

Key concepts through Excel Basic videos 01 to 25 Key concepts through Excel Basic videos 01 to 25 1) Row and Colum make up Cell 2) All Cells = Worksheet = Sheet 3) Name of Sheet is in Sheet Tab 4) All Worksheets = Workbook File 5) Default Alignment In

More information

Enterprise Reporting -- APEX

Enterprise Reporting -- APEX Quick Reference Enterprise Reporting -- APEX This Quick Reference Guide documents Oracle Application Express (APEX) as it relates to Enterprise Reporting (ER). This is not an exhaustive APEX documentation

More information

Texas Death Row. Last Statements. Data Warehousing and Data Mart. By Group 16. Irving Rodriguez Joseph Lai Joe Martinez

Texas Death Row. Last Statements. Data Warehousing and Data Mart. By Group 16. Irving Rodriguez Joseph Lai Joe Martinez Texas Death Row Last Statements Data Warehousing and Data Mart By Group 16 Irving Rodriguez Joseph Lai Joe Martinez Introduction For our data warehousing and data mart project we chose to use the Texas

More information

EXCELLING WITH ANALYSIS AND VISUALIZATION

EXCELLING WITH ANALYSIS AND VISUALIZATION EXCELLING WITH ANALYSIS AND VISUALIZATION A PRACTICAL GUIDE FOR DEALING WITH DATA Prepared by Ann K. Emery July 2016 Ann K. Emery 1 Welcome Hello there! In July 2016, I led two workshops Excel Basics for

More information

MicroStrategy Desktop

MicroStrategy Desktop MicroStrategy Desktop Quick Start Guide MicroStrategy Desktop is designed to enable business professionals like you to explore data, simply and without needing direct support from IT. 1 Import data from

More information

Stat Wk 3. Stat 342 Notes. Week 3, Page 1 / 71

Stat Wk 3. Stat 342 Notes. Week 3, Page 1 / 71 Stat 342 - Wk 3 What is SQL Proc SQL 'Select' command and 'from' clause 'group by' clause 'order by' clause 'where' clause 'create table' command 'inner join' (as time permits) Stat 342 Notes. Week 3,

More information

How To Clone, Backup & Move Your WordPress Blog! Step By Step Guide by Marian Krajcovic

How To Clone, Backup & Move Your WordPress Blog! Step By Step Guide by Marian Krajcovic How To Clone, Backup & Move Your WordPress Blog! Step By Step Guide by Marian Krajcovic 2010 Marian Krajcovic You may NOT resell or giveaway this ebook! 1 If you have many WordPress blogs and especially

More information

comma separated values .csv extension. "save as" CSV (Comma Delimited)

comma separated values .csv extension. save as CSV (Comma Delimited) What is a CSV and how do I import it? A CSV is a comma separated values file which allows data to be saved in a table structured format. CSVs look like normal spreadsheet but with a.csv extension. Traditionally

More information

BaSICS OF excel By: Steven 10.1

BaSICS OF excel By: Steven 10.1 BaSICS OF excel By: Steven 10.1 Workbook 1 workbook is made out of spreadsheet files. You can add it by going to (File > New Workbook). Cell Each & every rectangular box in a spreadsheet is referred as

More information

Connecting SQL Data Sources to Excel Using Windward Studios Report Designer

Connecting SQL Data Sources to Excel Using Windward Studios Report Designer Connecting SQL Data Sources to Excel Using Windward Studios Report Designer Welcome to Windward Studios Report Designer Windward Studios takes a unique approach to reporting. Our Report Designer sits directly

More information

CS 2316 Exam 4 Fall 2011

CS 2316 Exam 4 Fall 2011 CS 2316 Exam 4 Fall 2011 Name : Grading TA: Integrity: By taking this exam, you pledge that this is your work and you have neither given nor received inappropriate help during the taking of this exam in

More information

MIS 0855 Data Science (Section 006) Fall 2017 In-Class Exercise (Day 18) Finding Bad Data in Excel

MIS 0855 Data Science (Section 006) Fall 2017 In-Class Exercise (Day 18) Finding Bad Data in Excel MIS 0855 Data Science (Section 006) Fall 2017 In-Class Exercise (Day 18) Finding Bad Data in Excel Objective: Find and fix a data set with incorrect values Learning Outcomes: Use Excel to identify incorrect

More information

Microsoft Power Tools for Data Analysis #7 Power Query 6 Types of Merges/ Joins 9 Examples Notes from Video:

Microsoft Power Tools for Data Analysis #7 Power Query 6 Types of Merges/ Joins 9 Examples Notes from Video: Table of Contents: Microsoft Power Tools for Data Analysis #7 Power Query 6 Types of Merges/ Joins 9 Examples Notes from Video: 1. Power Query Has Six Types of Merges / Joins... 2 2. What is a Merge /

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) SAS Analytics:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic

More information

Management Reports Centre. User Guide. Emmanuel Amekuedi

Management Reports Centre. User Guide. Emmanuel Amekuedi Management Reports Centre User Guide Emmanuel Amekuedi Table of Contents Introduction... 3 Overview... 3 Key features... 4 Authentication methods... 4 System requirements... 5 Deployment options... 5 Getting

More information

We re going to start with two.csv files that need to be imported to SQL Lite housing2000.csv and housing2013.csv

We re going to start with two.csv files that need to be imported to SQL Lite housing2000.csv and housing2013.csv Basic SQL joining exercise using SQL Lite Using Census data on housing units, by place Created by @MaryJoWebster January 2017 The goal of this exercise is to introduce how joining tables works in SQL.

More information

Light Speed with Excel

Light Speed with Excel Work @ Light Speed with Excel 2018 Excel University, Inc. All Rights Reserved. http://beacon.by/magazine/v4/94012/pdf?type=print 1/64 Table of Contents Cover Table of Contents PivotTable from Many CSV

More information

HOW TO USE THE EXPORT FEATURE IN LCL

HOW TO USE THE EXPORT FEATURE IN LCL HOW TO USE THE EXPORT FEATURE IN LCL In LCL go to the Go To menu and select Export. Select the items that you would like to have exported to the file. To select them you will click the item in the left

More information

Homework 1 Excel Basics

Homework 1 Excel Basics Homework 1 Excel Basics Excel is a software program that is used to organize information, perform calculations, and create visual displays of the information. When you start up Excel, you will see the

More information

1 Introduction to Using Excel Spreadsheets

1 Introduction to Using Excel Spreadsheets Survey of Math: Excel Spreadsheet Guide (for Excel 2007) Page 1 of 6 1 Introduction to Using Excel Spreadsheets This section of the guide is based on the file (a faux grade sheet created for messing with)

More information

More on MS Access queries

More on MS Access queries More on MS Access queries BSAD 141 Dave Novak Topics Covered MS Access query capabilities Aggregate queries Different joins Review: AND and OR Parameter query Exact match criteria versus range Formatting

More information

SQLite vs. MongoDB for Big Data

SQLite vs. MongoDB for Big Data SQLite vs. MongoDB for Big Data In my latest tutorial I walked readers through a Python script designed to download tweets by a set of Twitter users and insert them into an SQLite database. In this post

More information

Word: Print Address Labels Using Mail Merge

Word: Print Address Labels Using Mail Merge Word: Print Address Labels Using Mail Merge No Typing! The Quick and Easy Way to Print Sheets of Address Labels Here at PC Knowledge for Seniors we re often asked how to print sticky address labels in

More information

CenterStone. Reports User Guide. Manhattan Software Inc. World Leading Real Estate, Asset & Facilities Management Software.

CenterStone. Reports User Guide. Manhattan Software Inc. World Leading Real Estate, Asset & Facilities Management Software. CenterStone Reports User Guide Version 1 Manhattan Software Inc. World Leading Real Estate, Asset & Facilities Management Software The information contained herein is the property of Manhattan Software,

More information

How to use Pivot table macro

How to use Pivot table macro How to use Pivot table macro Managing Pivot Tables Table Filter and Charts for Confluence add-on allows you to summarize your table data and produce its aggregated view in the form of a pivot table. You

More information

Biocomputing II Coursework guidance

Biocomputing II Coursework guidance Biocomputing II Coursework guidance I refer to the database layer as DB, the middle (business logic) layer as BL and the front end graphical interface with CGI scripts as (FE). Standardized file headers

More information

Advanced Excel Reporting

Advanced Excel Reporting SedonaOffice Users Conference San Francisco, CA January 21 24, 2018 Advanced Excel Reporting Presented by: Matt Howe This Page Intentionally Left Blank Page 2 of 20 Table of Contents Overview... 4 Making

More information

SPREADSHEETS. (Data for this tutorial at

SPREADSHEETS. (Data for this tutorial at SPREADSHEETS (Data for this tutorial at www.peteraldhous.com/data) Spreadsheets are great tools for sorting, filtering and running calculations on tables of data. Journalists who know the basics can interview

More information

ND_CLASSIFICATION WORKING NOTES. groups table: group_id, homepage url, project unix_name, license type, registration date

ND_CLASSIFICATION WORKING NOTES. groups table: group_id, homepage url, project unix_name, license type, registration date Please note that the text below only includes notes for the 2009 Notre Dame Initiation Stage classification. Due to a server crash, the Growth Stage notes are temporarily unavailable. Bob English ND_CLASSIFICATION

More information

Instructions for Using the Databases

Instructions for Using the Databases Appendix D Instructions for Using the Databases Two sets of databases have been created for you if you choose to use the Documenting Our Work forms. One set is in Access and one set is in Excel. They are

More information

Workshop. Import Workshop

Workshop. Import Workshop Import Overview This workshop will help participants understand the tools and techniques used in importing a variety of different types of data. It will also showcase a couple of the new import features

More information

ITS Training Class Charts and PivotTables Using Excel 2007

ITS Training Class Charts and PivotTables Using Excel 2007 When you have a large amount of data and you need to get summary information and graph it, the PivotTable and PivotChart tools in Microsoft Excel will be the answer. The data does not need to be in one

More information

GiftWorks Import Guide Page 2

GiftWorks Import Guide Page 2 Import Guide Introduction... 2 GiftWorks Import Services... 3 Import Sources... 4 Preparing for Import... 9 Importing and Matching to Existing Donors... 11 Handling Receipting of Imported Donations...

More information

Introduction to the workbook and spreadsheet

Introduction to the workbook and spreadsheet Excel Tutorial To make the most of this tutorial I suggest you follow through it while sitting in front of a computer with Microsoft Excel running. This will allow you to try things out as you follow along.

More information

Importing a txt or csv file into ArcGIS Online [AGO]

Importing a txt or csv file into ArcGIS Online [AGO] Importing a txt or csv file into ArcGIS Online [AGO] Spring 2014 This is how to find addresses online and copy them into an Excel spreadsheet, save them in a format that ArcGIS can use, and then import

More information

Tableau Tutorial Using Canadian Arms Sales Data

Tableau Tutorial Using Canadian Arms Sales Data Tableau Tutorial Using Canadian Arms Sales Data 1) Your data comes from Industry Canada s Trade site. 2) If you don t want to download the data yourself, use this file. You can also download it from the

More information

Excel Tables and Pivot Tables

Excel Tables and Pivot Tables A) Why use a table in the first place a. Easy to filter and sort if you only sort or filter by one item b. Automatically fills formulas down c. Can easily add a totals row d. Easy formatting with preformatted

More information

Importing Local Contacts from Thunderbird

Importing Local Contacts from Thunderbird 1 Importing Local Contacts from Thunderbird Step 1, Export Contacts from Thunderbird In Thunderbird, select Address Book. In the Address Book, click on Personal Address Book and then select Export from

More information

GENIE CLOUD- HOW TO USE MANUAL

GENIE CLOUD- HOW TO USE MANUAL Genie Cloud User Manual GENIE CLOUD- HOW TO USE MANUAL Appraiser Genie How to Use the Genie Manual 3a June 2017 June 14, 2017 APPRAISER GENIE LLC info@appraisergenie.com www.appraisergenie.com 2 Thank

More information

T-SQL Training: T-SQL for SQL Server for Developers

T-SQL Training: T-SQL for SQL Server for Developers Duration: 3 days T-SQL Training Overview T-SQL for SQL Server for Developers training teaches developers all the Transact-SQL skills they need to develop queries and views, and manipulate data in a SQL

More information

Building Self-Service BI Solutions with Power Query. Written By: Devin

Building Self-Service BI Solutions with Power Query. Written By: Devin Building Self-Service BI Solutions with Power Query Written By: Devin Knight DKnight@PragmaticWorks.com @Knight_Devin CONTENTS PAGE 3 PAGE 4 PAGE 5 PAGE 6 PAGE 7 PAGE 8 PAGE 9 PAGE 11 PAGE 17 PAGE 20 PAGE

More information

MS Excel Advanced Level

MS Excel Advanced Level MS Excel Advanced Level Trainer : Etech Global Solution Contents Conditional Formatting... 1 Remove Duplicates... 4 Sorting... 5 Filtering... 6 Charts Column... 7 Charts Line... 10 Charts Bar... 10 Charts

More information

Office 2016 Excel Basics 25 Video/Class Project #37 Excel Basics 25: Power Query (Get & Transform Data) to Convert Bad Data into Proper Data Set

Office 2016 Excel Basics 25 Video/Class Project #37 Excel Basics 25: Power Query (Get & Transform Data) to Convert Bad Data into Proper Data Set Office 2016 Excel Basics 25 Video/Class Project #37 Excel Basics 25: Power Query (Get & Transform Data) to Convert Bad Data into Proper Data Set Goal in video # 25: Learn about how to use the Get & Transform

More information

Business Analytics Nanodegree Syllabus

Business Analytics Nanodegree Syllabus Business Analytics Nanodegree Syllabus Master data fundamentals applicable to any industry Before You Start There are no prerequisites for this program, aside from basic computer skills. You should be

More information

IELM 511 Information Systems Design Labs 5 and 6. DB creation and Population

IELM 511 Information Systems Design Labs 5 and 6. DB creation and Population IELM 511 Information Systems Design Labs 5 and 6. DB creation and Population In this lab, your objective is to learn the basics of creating and managing a DB system. One way to interact with the DBMS (MySQL)

More information

OneView. User s Guide

OneView. User s Guide OneView User s Guide Welcome to OneView. This user guide will show you everything you need to know to access and utilize the wealth of information available from OneView. The OneView program is an Internet-based

More information

Depending on the computer you find yourself in front of, here s what you ll need to do to open SPSS.

Depending on the computer you find yourself in front of, here s what you ll need to do to open SPSS. 1 SPSS 11.5 for Windows Introductory Assignment Material covered: Opening an existing SPSS data file, creating new data files, generating frequency distributions and descriptive statistics, obtaining printouts

More information

Excel Tips and FAQs - MS 2010

Excel Tips and FAQs - MS 2010 BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my

More information

My Query Builder Function

My Query Builder Function My Query Builder Function The My Query Builder function is used to build custom SQL queries for reporting information out of the TEAMS system. Query results can be exported to a comma-separated value file,

More information

How to Export Data from LIS and analyze in with an Excel Pivot table

How to Export Data from LIS and analyze in with an Excel Pivot table How to Export Data from LIS and analyze in with an Excel Pivot table All machine stops are captured within LIS and are displayed in the TPM interface screen. This TPM Interface screen shows the machine

More information

INFO 1103 Homework Project 2

INFO 1103 Homework Project 2 INFO 1103 Homework Project 2 February 15, 2019 Due March 13, 2019, at the end of the lecture period. 1 Introduction In this project, you will design and create the appropriate tables for a version of the

More information

Oracle Compare Two Database Tables Sql Query Join

Oracle Compare Two Database Tables Sql Query Join Oracle Compare Two Database Tables Sql Query Join data types. Namely, it assumes that the two tables payments and How to use SQL PIVOT to Compare Two Tables in Your Database. This can (not that using the

More information

Introduction to relational databases and MySQL

Introduction to relational databases and MySQL Chapter 3 Introduction to relational databases and MySQL A products table Columns 2017, Mike Murach & Associates, Inc. C3, Slide 1 2017, Mike Murach & Associates, Inc. C3, Slide 4 Objectives Applied 1.

More information

Coding & Data Skills for Communicators Dr. Cindy Royal Texas State University - San Marcos School of Journalism and Mass Communication

Coding & Data Skills for Communicators Dr. Cindy Royal Texas State University - San Marcos School of Journalism and Mass Communication Coding & Data Skills for Communicators Dr. Cindy Royal Texas State University - San Marcos School of Journalism and Mass Communication Spreadsheet Basics Excel is a powerful productivity tool. It s a spreadsheet

More information

PyMySQL Documentation

PyMySQL Documentation PyMySQL Documentation Release 0.7.2 Yutaka Matsubara and GitHub contributors Mar 22, 2017 Contents 1 User Guide 1 1.1 Installation................................................ 1 1.2 Examples.................................................

More information

QUICK EXCEL TUTORIAL. The Very Basics

QUICK EXCEL TUTORIAL. The Very Basics QUICK EXCEL TUTORIAL The Very Basics You Are Here. Titles & Column Headers Merging Cells Text Alignment When we work on spread sheets we often need to have a title and/or header clearly visible. Merge

More information

Evolution Query Builder Manual

Evolution Query Builder Manual Evolution Query Builder Manual PayData A Vermont Company Working for You! Page 1 of 37 Report Writer Introduction... 3 Creating Customized Reports... 4 Go to Client RW Reports... 4 Reports Tab... 4 Details

More information

MIS 0855 Data Science (Section 006) Fall 2017 In-Class Exercise (Day 15) Creating Interactive Dashboards

MIS 0855 Data Science (Section 006) Fall 2017 In-Class Exercise (Day 15) Creating Interactive Dashboards MIS 0855 Data Science (Section 006) Fall 2017 In-Class Exercise (Day 15) Creating Interactive Dashboards Objective: Create a dashboard with interactive data filtering using Tableau Learning Outcomes: Understand

More information

SHOW ME THE NUMBERS: DESIGNING YOUR OWN DATA VISUALIZATIONS PEPFAR Applied Learning Summit September 2017 A. Chafetz

SHOW ME THE NUMBERS: DESIGNING YOUR OWN DATA VISUALIZATIONS PEPFAR Applied Learning Summit September 2017 A. Chafetz SHOW ME THE NUMBERS: DESIGNING YOUR OWN DATA VISUALIZATIONS PEPFAR Applied Learning Summit September 2017 A. Chafetz Overview In order to prepare for the upcoming POART, you need to look into testing as

More information

Instructor: Craig Duckett. Lecture 03: Tuesday, April 3, 2018 SQL Sorting, Aggregates and Joining Tables

Instructor: Craig Duckett. Lecture 03: Tuesday, April 3, 2018 SQL Sorting, Aggregates and Joining Tables Instructor: Craig Duckett Lecture 03: Tuesday, April 3, 2018 SQL Sorting, Aggregates and Joining Tables 1 Assignment 1 is due LECTURE 5, Tuesday, April 10 th, 2018 in StudentTracker by MIDNIGHT MID-TERM

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Data Merging Dr. David Koop Data Wrangling Data wrangling: transform raw data to a more meaningful format that can be better analyzed Data cleaning: getting rid of

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) Clinical SAS:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic

More information

MicroStrategy Desktop Quick Start Guide

MicroStrategy Desktop Quick Start Guide MicroStrategy Desktop Quick Start Guide Version: 10.4 10.4, December 2017 Copyright 2017 by MicroStrategy Incorporated. All rights reserved. Trademark Information The following are either trademarks or

More information

DATA STRUCTURE AND ALGORITHM USING PYTHON

DATA STRUCTURE AND ALGORITHM USING PYTHON DATA STRUCTURE AND ALGORITHM USING PYTHON Common Use Python Module II Peter Lo Pandas Data Structures and Data Analysis tools 2 What is Pandas? Pandas is an open-source Python library providing highperformance,

More information

Excel and Banner. Presented by: Annelle Colevins University of West Georgia Friday, Sept 30, :30 a.m.

Excel and Banner. Presented by: Annelle Colevins University of West Georgia Friday, Sept 30, :30 a.m. Excel and Banner Presented by: Annelle Colevins University of West Georgia Friday, Sept 30, 2011 8:30 a.m. 1 Introduction This session will cover how to get data from Banner, how to move it into Excel,

More information

NWA Quality Analyst Version 6.1 Patch Maintenance Release Update Notes December 2007

NWA Quality Analyst Version 6.1 Patch Maintenance Release Update Notes December 2007 NWA Quality Analyst Version 6.1 Patch Maintenance Release Update Notes December 2007 This patch updates releases 2.3.157, 2.3.175, 2.3.185, 2.3.237, 2.3.245, and 2.3.278 of NWA Quality Analyst Version

More information

Sql Server Syllabus. Overview

Sql Server Syllabus. Overview Sql Server Syllabus Overview This SQL Server training teaches developers all the Transact-SQL skills they need to create database objects like Tables, Views, Stored procedures & Functions and triggers

More information

Intermediate Excel Training Course Content

Intermediate Excel Training Course Content Intermediate Excel Training Course Content Lesson Page 1 Absolute Cell Addressing 2 Using Absolute References 2 Naming Cells and Ranges 2 Using the Create Method to Name Cells 3 Data Consolidation 3 Consolidating

More information

TUTORIAL Get Started with Tableau Desktop

TUTORIAL Get Started with Tableau Desktop TUTORIAL Get Started with Tableau Desktop Learn how to connect to data, create data visualizations, present your findings, and share your insights with others. http://onlinehelp.tableau.com/current/guides/get-started-tutorial/en-us/get-started-tutorialhome.html

More information

MIS0855: Data Science In-Class Exercise for Mar Creating Interactive Dashboards

MIS0855: Data Science In-Class Exercise for Mar Creating Interactive Dashboards MIS0855: Data Science In-Class Exercise for Mar 25-27 Creating Interactive Dashboards Objective: Create a dashboard with interactive data filtering using Tableau Learning Outcomes: Understand how to create

More information

Mysql Workbench 5.2 Ce

Mysql Workbench 5.2 Ce How To Create Database Schema Diagram In Mysql Workbench 5.2 Ce Connect to your Database with MySQL Workbench! Default Schema: This can be left blank. Click Test Connection. 10:17 am. Is there any way

More information

MovieRec - CS 410 Project Report

MovieRec - CS 410 Project Report MovieRec - CS 410 Project Report Team : Pattanee Chutipongpattanakul - chutipo2 Swapnil Shah - sshah219 Abstract MovieRec is a unique movie search engine that allows users to search for any type of the

More information

3/31/2016. Spreadsheets. Spreadsheets. Spreadsheets and Data Management. Unit 3. Can be used to automatically

3/31/2016. Spreadsheets. Spreadsheets. Spreadsheets and Data Management. Unit 3. Can be used to automatically MICROSOFT EXCEL and Data Management Unit 3 Thursday March 31, 2016 Allow users to perform simple and complex sorting Allow users to perform calculations quickly Organizes and presents figures that can

More information

POSTGRESQL - PYTHON INTERFACE

POSTGRESQL - PYTHON INTERFACE POSTGRESQL - PYTHON INTERFACE http://www.tutorialspoint.com/postgresql/postgresql_python.htm Copyright tutorialspoint.com Installation The PostgreSQL can be integrated with Python using psycopg2 module.

More information

Final Project. Blair Gemmer. CSCI 444 Data Visualization. Fall 2011

Final Project. Blair Gemmer. CSCI 444 Data Visualization. Fall 2011 Final Project Blair Gemmer CSCI 444 Data Visualization Fall 2011 Hypothesis Most earthquakes occur between the 36N and -36N lines of Latitude, with an exceptional number of occurrences between the -180W

More information

Excel Tips and Tricks. Andrew J. Wright (2014);

Excel Tips and Tricks. Andrew J. Wright (2014); Excel Tips and Tricks Andrew J. Wright (2014); marinebrit@gmail.com Some useful shortcuts There are a few simple tricks that can save you a lot of time in Excel. Here are four of the most simple and useful.

More information

Manual Speedy Report. Copyright 2013 Im Softly. All rights reserved.

Manual Speedy Report. Copyright 2013 Im Softly. All rights reserved. 1 Manual Speedy Report 2 Table of Contents Manual Speedy Report... 1 Welcome!... 4 Preparations... 5 Technical Structure... 5 Main Window... 6 Create Report... 7 Overview... 7 Tree View... 8 Query Settings

More information

CSSCR Excel Intermediate 4/13/06 GH Page 1 of 23 INTERMEDIATE EXCEL

CSSCR Excel Intermediate 4/13/06 GH Page 1 of 23 INTERMEDIATE EXCEL CSSCR Excel Intermediate 4/13/06 GH Page 1 of 23 INTERMEDIATE EXCEL This document is for those who already know the basics of spreadsheets and have worked with either Excel for Windows or Excel for Macintosh.

More information

» How do I Integrate Excel information and objects in Word documents? How Do I... Page 2 of 10 How do I Integrate Excel information and objects in Word documents? Date: July 16th, 2007 Blogger: Scott Lowe

More information

DATA 301 Introduction to Data Analytics Visualization. Dr. Ramon Lawrence University of British Columbia Okanagan

DATA 301 Introduction to Data Analytics Visualization. Dr. Ramon Lawrence University of British Columbia Okanagan DATA 301 Introduction to Data Analytics Visualization Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca DATA 301: Data Analytics (2) Why learn Visualization? Visualization

More information

Workbooks (File) and Worksheet Handling

Workbooks (File) and Worksheet Handling Workbooks (File) and Worksheet Handling Excel Limitation Excel shortcut use and benefits Excel setting and custom list creation Excel Template and File location system Advanced Paste Special Calculation

More information

PivotTables & Charts for Health

PivotTables & Charts for Health PivotTables & Charts for Health Data Inputs PivotTables Pivot Charts Global Strategic Information UCSF Global Health Sciences Version Malaria 1.0 1 Table of Contents 1.1. Introduction... 3 1.1.1. Software

More information

Data Science. Data Analyst. Data Scientist. Data Architect

Data Science. Data Analyst. Data Scientist. Data Architect Data Science Data Analyst Data Analysis in Excel Programming in R Introduction to Python/SQL/Tableau Data Visualization in R / Tableau Exploratory Data Analysis Data Scientist Inferential Statistics &

More information

2. Click File and then select Import from the menu above the toolbar. 3. From the Import window click the Create File to Import button:

2. Click File and then select Import from the menu above the toolbar. 3. From the Import window click the Create File to Import button: Totality 4 Import How to Import data into Totality 4. Totality 4 will allow you to import data from an Excel spreadsheet or CSV (comma separated values). You must have Microsoft Excel installed in order

More information

Chapter 3. Introduction to relational databases and MySQL. 2010, Mike Murach & Associates, Inc. Murach's PHP and MySQL, C3

Chapter 3. Introduction to relational databases and MySQL. 2010, Mike Murach & Associates, Inc. Murach's PHP and MySQL, C3 1 Chapter 3 Introduction to relational databases and MySQL Slide 2 Objectives Applied 1. Use phpmyadmin to review the data and structure of the tables in a database, to import and run SQL scripts that

More information

Tutorial for downloading and analyzing data from the Atlantic Canada Opportunities Agency

Tutorial for downloading and analyzing data from the Atlantic Canada Opportunities Agency Tutorial for downloading and analyzing data from the Atlantic Canada Opportunities Agency The agency, which goes by the acronym ACOA, is one of many federal institutions that uploads data to the federal

More information

ESIF Open Data User guide to platform functionalities V

ESIF Open Data User guide to platform functionalities V EUROPEAN COMMISSION DIRECTORATE-GENERAL REGIONAL AND URBAN POLICY Policy ESIF Open Data User guide to platform functionalities V 2018-10 Table of Contents 1. What is "Open Data"... 2 2. How to access the

More information

STIDistrict Query (Basic)

STIDistrict Query (Basic) STIDistrict Query (Basic) Creating a Basic Query To create a basic query in the Query Builder, open the STIDistrict workstation and click on Utilities Query Builder. When the program opens, database objects

More information

Importing in Offertory Donations from Spreadsheets into Connect Now

Importing in Offertory Donations from Spreadsheets into Connect Now Importing in Offertory Donations from Spreadsheets into Connect Now When you have an excel spreadsheet that has donations in it, if you have a key identifier, such as an envelope number, then the spreadsheet

More information

CCRS Quick Start Guide for Program Administrators. September Bank Handlowy w Warszawie S.A.

CCRS Quick Start Guide for Program Administrators. September Bank Handlowy w Warszawie S.A. CCRS Quick Start Guide for Program Administrators September 2017 www.citihandlowy.pl Bank Handlowy w Warszawie S.A. CitiManager Quick Start Guide for Program Administrators Table of Contents Table of Contents

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

Chapter 2 Assignment (due Thursday, April 19)

Chapter 2 Assignment (due Thursday, April 19) (due Thursday, April 19) Introduction: The purpose of this assignment is to analyze data sets by creating histograms and scatterplots. You will use the STATDISK program for both. Therefore, you should

More information

Introduction to Python and PyMARC

Introduction to Python and PyMARC Introduction to Python and PyMARC Session II: Using PyMARC Lauren Magnuson California State University, San Marcos Hosted by ALCTS: Association for Library Collections & Technical Services Quick Recap

More information