Texas Death Row. Last Statements. Data Warehousing and Data Mart. By Group 16. Irving Rodriguez Joseph Lai Joe Martinez


Introduction

For our data warehousing and data mart project we chose the Texas death row data set. The data sparked our curiosity, and building a data warehouse from it would give us more insight into the data and therefore into the minds of the inmates who were on death row. We wanted to find the most common words they used in their last statements and to use visualization tools to see the data in a new light. The data set consists of 536 inmates from the Texas Department of Criminal Justice who were on death row. To build our data warehouse we used a star schema consisting of five tables and a LAMP stack back end driven by PHP to query the tables and output the appropriate data. This querying tool can be seen on our website for the CSC 177 class under the Data Mart link.

Gathering and Cleaning Our Data

To gather our data we had to crawl the Texas Department of Criminal Justice website. Luckily, we found a crawler written in Python that could obtain the list of all inmates. To get the more detailed information, however, we had to add a lot to it and test it thoroughly. This brought up a few challenges, most notably directing the crawler to follow the appropriate link for each inmate and getting it to grab the correct HTML element. Another big issue was that many of the links for the more detailed information led only to a PDF, so that information could not be gathered, which left holes in our data.

Eventually we got it all together in a CSV file and tried to load it into Weka. This did not go well at all. We naively thought that our data would not need to be cleaned, and we were very wrong. There were quite a few issues: there were non-ASCII characters, the pound sign threw off RapidMiner, and there were many quotation marks that were also unacceptable. Finally we got our data cleaned and RapidMiner would accept it.
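The cleaning problems listed above (non-ASCII characters, pound signs, and stray quotation marks) can be handled in a few lines of Python. This is only a minimal sketch of that kind of cleanup, not the script we actually ran; the function names `clean_field` and `clean_csv` and the sample row are made up for illustration.

```python
import csv
import io


def clean_field(value: str) -> str:
    """Drop non-ASCII characters, pound signs, and double quotes from one field."""
    # Encoding with errors="ignore" silently discards anything outside ASCII.
    ascii_only = value.encode("ascii", errors="ignore").decode("ascii")
    for unwanted in ("#", '"'):
        ascii_only = ascii_only.replace(unwanted, "")
    # Collapse runs of whitespace left behind by the removals.
    return " ".join(ascii_only.split())


def clean_csv(raw_text: str) -> str:
    """Return a cleaned copy of CSV text, field by field."""
    reader = csv.reader(io.StringIO(raw_text))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for row in reader:
        writer.writerow([clean_field(field) for field in row])
    return out.getvalue()


if __name__ == "__main__":
    # Hypothetical sample row, not taken from the real data set.
    sample = 'name,statement\nJosé,"I am #1"\n'
    print(clean_csv(sample))
```

Note that this is a blunt approach: accented letters are dropped rather than transliterated, and the CSV writer will still add its own quoting around any field that contains a comma.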
Data Mart

The design of our data mart was primarily a MySQL database hosted on a virtual private cloud. The database was an Amazon RDS instance located in the US West 2 (Oregon) region. We used the Command Line Interface to create our database, which we decided to name TexasDB.

[Figure: Our database up and running]

The tables in our star schema were built on the attributes produced with RapidMiner: the most common words from the inmate's last statement, occupation, the summary of the incident, and the victim's information. The main table has an execution number, and this number is used to reference the most used words in the other four tables. The picture below shows the four tables and many, if not all, of their attributes.

[Figure: Our star schema]
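The layout described above, one main table keyed by execution number with four word tables pointing back to it, can be sketched as follows. This is an illustration only: it uses Python's built-in sqlite3 module as a stand-in for our MySQL database, and every table and column name other than TexasData is a hypothetical placeholder, as is the sample row.

```python
import sqlite3

# Hypothetical names for the four word tables; the real names may differ.
WORD_TABLES = ("LastStatementWords", "OccupationWords", "SummaryWords", "VictimWords")


def build_star_schema(conn: sqlite3.Connection) -> None:
    """Create a main table and four word tables that reference it."""
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite does not enforce FKs by default
    conn.execute(
        """
        CREATE TABLE TexasData (
            execution_no INTEGER PRIMARY KEY,
            last_name    TEXT,
            county       TEXT
        )
        """
    )
    for table in WORD_TABLES:
        conn.execute(
            f"""
            CREATE TABLE {table} (
                execution_no INTEGER NOT NULL REFERENCES TexasData(execution_no),
                word         TEXT    NOT NULL,
                occurrences  INTEGER NOT NULL,
                PRIMARY KEY (execution_no, word)
            )
            """
        )


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    build_star_schema(conn)
    # Placeholder rows, not real records.
    conn.execute("INSERT INTO TexasData VALUES (1, 'Example', 'Harris')")
    conn.execute("INSERT INTO LastStatementWords VALUES (1, 'love', 3)")
    rows = conn.execute(
        """
        SELECT t.county, w.word, w.occurrences
        FROM TexasData AS t
        JOIN LastStatementWords AS w ON w.execution_no = t.execution_no
        """
    ).fetchall()
    print(rows)
```

Keeping the execution number as the only link between tables means any word table can be joined back to the main table with the same one-column join.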

To use this data mart we created a web page that allows the user to display and sort the data based on 17 different attributes in our main table. We wanted to add a lot more functionality, but we ran out of time, mostly because we did not include the data mart in the original scope of our project. Luckily our professor gave us one more week to build our data mart and create this reporting tool. This way anyone will be able to explore our data and find more interesting facts that we may have missed.

[Figure: The querying tool written in PHP]

The querying tool is hosted on Athena's Apache server via CSUS and was built using PHP and some JavaScript. This was a bit of a challenge because PHP is not always the easiest language to deal with, but after connecting to the database it was simply a matter of getting our checkboxes and drop-down menus to correlate with the query we were sending. This took quite a bit of time and was all done within one week, for the reason stated previously.

The database was loaded successfully from a variety of CSV files that were created either by a handmade web scraper or by RapidMiner. Our fact table, TexasData, contains the information gathered via the web scraper. It comes directly from the Texas Department of Criminal Justice website. The other four tables were generated by RapidMiner and contain the most commonly used words from their corresponding attribute in the fact table.
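Mapping checkboxes and a drop-down menu onto a query, as the querying tool does, comes down to building a SELECT statement from the chosen columns. The sketch below shows the idea in Python rather than in our PHP, so it is not the code behind the website. The column names in `ALLOWED_COLUMNS` are hypothetical stand-ins for a few of the 17 attributes, and `build_query` is a made-up helper name.

```python
# Hypothetical subset of the attributes in the main table.
ALLOWED_COLUMNS = ("execution_no", "last_name", "first_name", "age", "race", "county")


def build_query(selected, sort_by=None, descending=False, table="TexasData"):
    """Turn checkbox and drop-down choices into a SELECT statement.

    Column names cannot be passed as bound parameters, so anything that is
    not in the whitelist is dropped instead of being pasted into the SQL.
    """
    columns = [name for name in selected if name in ALLOWED_COLUMNS]
    if not columns:
        raise ValueError("no valid columns selected")
    sql = f"SELECT {', '.join(columns)} FROM {table}"
    if sort_by is not None:
        if sort_by not in ALLOWED_COLUMNS:
            raise ValueError(f"cannot sort by {sort_by!r}")
        direction = "DESC" if descending else "ASC"
        sql += f" ORDER BY {sort_by} {direction}"
    return sql


if __name__ == "__main__":
    print(build_query(["last_name", "county"], sort_by="county"))
```

Checking each name against a fixed list is one simple way to keep form input from changing the shape of the query.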

[Figure: The five tables in our database]

The creation of each table was a bit challenging because we had to clean the data and make sure it would be read appropriately by MySQL Workbench. Setting up the primary key and foreign keys was not a difficult task, but it was very time consuming. Once all the data was imported and the schema was created, we were able to query the database and connect the front end of our website to get some crude results, which we fine-tuned into what you see now on the website.

Learning Experience

The data warehousing part of our project was a lot more challenging and therefore more educational. Other than setting up a basic database, we had no prior experience, so turning that simple database and data set into something valuable that could be mined was a bit of a challenge. We did a lot of hands-on learning. Much of it involved being frustrated and running into multiple problems, but those are the things that stick with you and help you do it again in the future.

Summary

Overall, the most difficult part of the project was the data warehousing. We had to come up with a schema in a short amount of time and build it. Loading the database also posed somewhat of a challenge: we had to clean up the data and make sure it would be accepted correctly into its respective rows and tables. Some characters are not accepted by MySQL; spaces and, often, quotes caused problems. Setting up a MySQL server was also a bit of a challenge, since getting it running and connecting to it sometimes caused problems. Overall this was a great learning experience and a great way to dive into some of the concepts we learned in class.

Bibliography

Data Source:
Web Scraper Base:
Our website:

Texas Death Row. Last Statements. Data Classification and Data Mining. By Group 16. Irving Rodriguez Joseph Lai Joe Martinez

Introduction

When we first sat down and met as a group, we went exploring to find a data set that would be interesting. We searched for quite a while and nothing really piqued our interest. Then we found the Texas Department of Criminal Justice website and could not find anything else that seemed more interesting. This posed a problem, though, because we would not necessarily be solving a problem, so we decided to solve a virtual problem. What intrigued us most was the last statements. We thought we could find the most common words, and therefore themes, within those statements, which could give us some insight into the inmates' minds and their experience. We ended up with 536 rows and about 20 columns, and then generated quite a few more columns and tables from that. We wanted to come at this in a technical sense, but we also wanted to let the data guide us and reveal interesting correlations between the inmates. That is why we have included so many visuals in our project: we wanted the data mining to be interesting because the subject matter is innately interesting.

Data Mining and Classification Results

When we ran KNN, we realized that our data set was smaller than most, which meant our accuracy was not going to be that great. However, even with our small data set, the KNN algorithm in RapidMiner was still 68% accurate. This can be seen in the pie chart to the right. The two pie charts below give a better representation of our KNN classification. We were trying to predict the race of the inmate, and this is the result that RapidMiner gave us. You can see that the Hispanic prediction matches up almost exactly with our actual data set, but the White and Black predictions were not as good. It predicted many more White inmates than are in the actual set, which has a few more Black inmates. Our actual data is on the left and our predictions are on the right.

[Figure: KNN actual vs predicted]

We also performed Naive Bayes classification on our data. We wanted to classify, or predict, the age at offense based on education level. This would show the correlation between education level and the age at which the crime was committed. We were also able to break this down by race. You can see the results below, but generally the inmates had a tenth grade education level and committed their crime at age 26.

Results

In general, we wanted to find any interesting or insightful correlations within our data. One interesting thing came up immediately after loading our data into RapidMiner, which gives you the minimum, maximum, and average values for each attribute. So immediately we got the average inmate who had been executed. The average offender would be a white male named James Johnson with black hair and brown eyes from Harris County. He would be 39 years old with a 9th grade education level, standing 5'6" tall and weighing 186 pounds.

The next step was to get the most used words from the inmate's last statement, occupation, the summary of the incident, and the victim's information. We used some text mining modules within RapidMiner to do this. First you select the attribute to use, then convert it to text, tokenize it, transform it to lower case, and lastly filter stop words. This worked surprisingly well and was relatively easy. You can see in the image below the top five most common words as well as a larger subset in word cloud form.

[Figure: Last statements common words]

[Figure: Last statements word cloud]
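The text mining steps described above (tokenize, transform to lower case, filter stop words, then count) can be sketched in plain Python. This is not the RapidMiner process we used, only an illustration of the same sequence. The stop word list is a tiny hypothetical one, and the sample sentences are invented placeholders rather than quotes from the data set.

```python
import re
from collections import Counter

# A tiny hypothetical stop word list; real lists are much longer.
STOP_WORDS = frozenset({"i", "my", "to", "the", "and", "you", "a", "of", "for", "is"})


def top_words(statements, n=5):
    """Return the n most common non-stop words across all statements."""
    counts = Counter()
    for text in statements:
        tokens = re.findall(r"[A-Za-z]+", text)       # tokenize on runs of letters
        lowered = [token.lower() for token in tokens]  # transform to lower case
        counts.update(word for word in lowered if word not in STOP_WORDS)
    return counts.most_common(n)


if __name__ == "__main__":
    # Invented placeholder sentences, not real last statements.
    sample = ["I love you", "Love to my family", "I love my family"]
    print(top_words(sample, n=2))
```

The same function can be pointed at any text column, such as the occupation or summary fields, by passing in that column's values.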

We performed this same text mining on the summary of offense, occupations, and victim information. We also extracted the most common phrases from the last statements; however, that did not yield a very interesting result. We were able to do some other visualizations as well. For instance, we plotted the number of executions by year from 1982 until today. The other one below is the number of inmates based on the county they were from.

[Figure: Executions by year]

[Figure: Number of inmates by county]

Learning Experience

The first major speed bump we hit was gathering all of the data. We had a main webpage that contained links to individual profiles and statements of the dead inmates. There were no pre-cleaned, downloadable CSV files to use. Our group overcame this by creating a web crawler with Python and exporting all the data into a CSV. A tip we would give to future classes is to find a data set that is exportable to CSV, because it would allow for a more complete set of data. Expanding on that, our data did not fully represent what was online because some of the entries online were PDFs, which could not be exported via the web crawler.

A major resource that contributed to our success is RapidMinerTutorial's channel on YouTube. His KNN and Naive Bayes videos were the compass that gave us direction when we were lost. None of our team had experience with RapidMiner, and we had only limited knowledge of Weka. In fact, nothing helped us with Weka, not even the volunteer tutor. We spent a few hours trying to load our data set into Weka, but it kept giving us an error along the lines of the number of columns not matching the number of data points. After loading it into RapidMiner and having it work on the second try, we decided to stick with it. It also has a lot of functionality, more than we were able to explore for this project.

Summary

The results of our project were better than we expected. We managed to create nearly 70% accurate predictions with our KNN algorithm, we were able to predict age at offense given highest education level using the Naive Bayes algorithm, and we produced a generic profile of a typical executed inmate given our semi-comprehensive data set. We also successfully mined the text and accomplished our original goal, which was to find the most common words. This has been an overall success in that we were able to apply classroom concepts to real-life data. Because we did not use previously gathered data, our group was able to experience data mining on a lower and deeper level, despite it being more problematic and sometimes more frustrating.

Bibliography

Data Source:
Text Mining Walkthrough:
RapidMiner Tutorials:
RapidMiner:
Tableau:
Word Cloud Creation:
Our website:

Data Marting Crime Correlations Using San Francisco Crime Open Data

Data Marting Crime Correlations Using San Francisco Crime Open Data Data Marting Crime Correlations Using San Francisco Crime Open Data Kiel Gordon Matt Pymm John Tuazon California State University Sacramento CSC 177 Data Warehousing and Data Mining Dr. Lu May 16, 2016

More information

Codify: Code Search Engine

Codify: Code Search Engine Codify: Code Search Engine Dimitriy Zavelevich (zavelev2) Kirill Varhavskiy (varshav2) Abstract: Codify is a vertical search engine focusing on searching code and coding problems due to it s ability to

More information

MovieRec - CS 410 Project Report

MovieRec - CS 410 Project Report MovieRec - CS 410 Project Report Team : Pattanee Chutipongpattanakul - chutipo2 Swapnil Shah - sshah219 Abstract MovieRec is a unique movie search engine that allows users to search for any type of the

More information

Alyssa Grieco. Data Wrangling Final Project Report Fall 2016 Dangerous Dogs and Off-leash Areas in Austin Housing Market Zip Codes.

Alyssa Grieco. Data Wrangling Final Project Report Fall 2016 Dangerous Dogs and Off-leash Areas in Austin Housing Market Zip Codes. Alyssa Grieco Data Wrangling Final Project Report Fall 2016 Dangerous Dogs and Off-leash Areas in Austin Housing Market Zip Codes Workflow Datasets Data was taken from three sources on data.austintexas.gov.

More information

Known Visual Bug with UBC CLF Theme Publishing Surveys Deploying Survey Customizing the Survey URL Embedding Surveys on to

Known Visual Bug with UBC CLF Theme Publishing Surveys Deploying Survey Customizing the Survey URL Embedding Surveys on to Contents Accounts... 3 Logging In... 3 Note about CWL user accounts... 4 Updating Your Account Details... 4 Adding/Inviting Users... 5 Surveys... 5 Creating a Survey from a Template... 5 Creating a Survey

More information

Filter and PivotTables in Excel

Filter and PivotTables in Excel Filter and PivotTables in Excel FILTERING With filters in Excel you can quickly collapse your spreadsheet to find records meeting specific criteria. A lot of reporters use filter to cut their data down

More information

If Statements, For Loops, Functions

If Statements, For Loops, Functions Fundamentals of Programming If Statements, For Loops, Functions Table of Contents Hello World Types of Variables Integers and Floats String Boolean Relational Operators Lists Conditionals If and Else Statements

More information

Lehigh Walking Wizard Final Report Steven Costa & Zhi Huang

Lehigh Walking Wizard Final Report Steven Costa & Zhi Huang Lehigh Walking Wizard Final Report Steven Costa & Zhi Huang Table of Contents I. Executive Summary II. Introduction & Motivation a. What is the Problem? b. Why is it interesting/important? c. How do you

More information

Process Book. Website Github Repo. By Claudia Huang, Raul Jordan and Jacques van Rhyn

Process Book. Website Github Repo. By Claudia Huang, Raul Jordan and Jacques van Rhyn PyCoin: Process Book Website Github Repo By Claudia Huang, Raul Jordan and Jacques van Rhyn Our Project We live in a period of unprecedented change, a time where viable challenges to some of core belief

More information

One SAS To Rule Them All

One SAS To Rule Them All SAS Global Forum 2017 ABSTRACT Paper 1042 One SAS To Rule Them All William Gui Zupko II, Federal Law Enforcement Training Centers In order to display data visually, our audience preferred Excel s compared

More information

Incluvie: Actor Data Collection Ada Gok, Dana Hochman, Lucy Zhan

Incluvie: Actor Data Collection Ada Gok, Dana Hochman, Lucy Zhan Incluvie: Actor Data Collection Ada Gok, Dana Hochman, Lucy Zhan {goka,danarh,lucyzh}@bu.edu Figure 0. Our partner company: Incluvie. 1. Project Task Incluvie is a platform that promotes and celebrates

More information

Improving the ROI of Your Data Warehouse

Improving the ROI of Your Data Warehouse Improving the ROI of Your Data Warehouse Many organizations are struggling with a straightforward but challenging problem: their data warehouse can t affordably house all of their data and simultaneously

More information

Read & Download (PDF Kindle) Data Structures And Other Objects Using Java (4th Edition)

Read & Download (PDF Kindle) Data Structures And Other Objects Using Java (4th Edition) Read & Download (PDF Kindle) Data Structures And Other Objects Using Java (4th Edition) Data Structures and Other Objects Using Java is a gradual, "just-in-time" introduction to Data Structures for a CS2

More information

PHP & MySQL For Dummies, 4th Edition PDF

PHP & MySQL For Dummies, 4th Edition PDF PHP & MySQL For Dummies, 4th Edition PDF Here's what Web designers need to know to create dynamic, database-driven Web sites To be on the cutting edge, Web sites need to serve up HTML, CSS, and products

More information

Quick Reference for the FloridaCHARTS Fetal Death Query

Quick Reference for the FloridaCHARTS Fetal Death Query Quick Reference for the FloridaCHARTS Fetal Death Query 1. Toolbar Functions 2. Reports 3. Frequently Asked Questions This application is set up in sections. To use it, you do not have to follow any particular

More information

5 R1 The one green in the same place so either of these could be green.

5 R1 The one green in the same place so either of these could be green. Page: 1 of 20 1 R1 Now. Maybe what we should do is write out the cases that work. We wrote out one of them really very clearly here. [R1 takes out some papers.] Right? You did the one here um where you

More information

BEGINNER PHP Table of Contents

BEGINNER PHP Table of Contents Table of Contents 4 5 6 7 8 9 0 Introduction Getting Setup Your first PHP webpage Working with text Talking to the user Comparison & If statements If & Else Cleaning up the game Remembering values Finishing

More information

Title: Episode 11 - Walking through the Rapid Business Warehouse at TOMS Shoes (Duration: 18:10)

Title: Episode 11 - Walking through the Rapid Business Warehouse at TOMS Shoes (Duration: 18:10) SAP HANA EFFECT Title: Episode 11 - Walking through the Rapid Business Warehouse at (Duration: 18:10) Publish Date: April 6, 2015 Description: Rita Lefler walks us through how has revolutionized their

More information

understanding media metrics WEB METRICS Basics for Journalists FIRST IN A SERIES

understanding media metrics WEB METRICS Basics for Journalists FIRST IN A SERIES understanding media metrics WEB METRICS Basics for Journalists FIRST IN A SERIES Contents p 1 p 3 p 3 Introduction Basic Questions about Your Website Getting Started: Overall, how is our website doing?

More information

Adding content to your Blackboard 9.1 class

Adding content to your Blackboard 9.1 class Adding content to your Blackboard 9.1 class There are quite a few options listed when you click the Build Content button in your class, but you ll probably only use a couple of them most of the time. Note

More information

TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL

TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL We have spent the first part of the course learning Excel: importing files, cleaning, sorting, filtering, pivot tables and exporting

More information

Improving Stack Overflow Tag Prediction Using Eye Tracking Alina Lazar Youngstown State University Bonita Sharif, Youngstown State University

Improving Stack Overflow Tag Prediction Using Eye Tracking Alina Lazar Youngstown State University Bonita Sharif, Youngstown State University Improving Stack Overflow Tag Prediction Using Eye Tracking Alina Lazar, Youngstown State University Bonita Sharif, Youngstown State University Jenna Wise, Youngstown State University Alyssa Pawluk, Youngstown

More information

Personal Health Assistant: Final Report Prepared by K. Morillo, J. Redway, and I. Smyrnow Version Date April 29, 2010 Personal Health Assistant

Personal Health Assistant: Final Report Prepared by K. Morillo, J. Redway, and I. Smyrnow Version Date April 29, 2010 Personal Health Assistant Personal Health Assistant Ishmael Smyrnow Kevin Morillo James Redway CSE 293 Final Report Table of Contents 0... 3 1...General Overview... 3 1.1 Introduction... 3 1.2 Goal...3 1.3 Overview... 3 2... Server

More information

Chapter 2: Understanding Data Distributions with Tables and Graphs

Chapter 2: Understanding Data Distributions with Tables and Graphs Test Bank Chapter 2: Understanding Data with Tables and Graphs Multiple Choice 1. Which of the following would best depict nominal level data? a. pie chart b. line graph c. histogram d. polygon Ans: A

More information

Parcel QA/QC: Video Script. 1. Introduction 1

Parcel QA/QC: Video Script. 1. Introduction 1 1. Introduction 1 Hi! It s been a while since you heard from us. I am Ara Erickson with the University of Washington. We are taking this opportunity to do a quick introduction and explain a few things

More information

(Refer Slide Time: 02.06)

(Refer Slide Time: 02.06) Data Structures and Algorithms Dr. Naveen Garg Department of Computer Science and Engineering Indian Institute of Technology, Delhi Lecture 27 Depth First Search (DFS) Today we are going to be talking

More information

Ruby on Rails Welcome. Using the exercise files

Ruby on Rails Welcome. Using the exercise files Ruby on Rails Welcome Welcome to Ruby on Rails Essential Training. In this course, we're going to learn the popular open source web development framework. We will walk through each part of the framework,

More information

How Do I Choose Which Type of Graph to Use?

How Do I Choose Which Type of Graph to Use? How Do I Choose Which Type of Graph to Use? When to Use...... a Line graph. Line graphs are used to track changes over short and long periods of time. When smaller changes exist, line graphs are better

More information

I started off with a quick nmap scan, which showed both port 80 and 443 open.

I started off with a quick nmap scan, which showed both port 80 and 443 open. Mr-Robot: 1 Walkthrough Author: mrb3n Download location: https://download.vulnhub.com/mrrobot/mrrobot.ova Goal: Find 3 keys hidden in different locations -----------------------------------------------------------------------------------------------------------------

More information

Oracle Database 11g & MySQL 5.6 Developer Handbook (Oracle Press) PDF

Oracle Database 11g & MySQL 5.6 Developer Handbook (Oracle Press) PDF Oracle Database 11g & MySQL 5.6 Developer Handbook (Oracle Press) PDF Master Application Development in a Mixed-Platform Environment Build powerful database applications in a mixed environment using the

More information

CSI5387: Data Mining Project

CSI5387: Data Mining Project CSI5387: Data Mining Project Terri Oda April 14, 2008 1 Introduction Web pages have become more like applications that documents. Not only do they provide dynamic content, they also allow users to play

More information

Participation Status Report STUDIO ELEMENTS I KATE SOHNG

Participation Status Report STUDIO ELEMENTS I KATE SOHNG 2015 Participation Status Report STUDIO ELEMENTS I KATE SOHNG Table of Contents I. Wataru... 1 A. JQuery discussion... 1 B. Other JavaScript libraries... 1 C. CakePHP framework... 2 D. Webpage feedback...

More information

The compiler is spewing error messages.

The compiler is spewing error messages. Appendix B Debugging There are a few different kinds of errors that can occur in a program, and it is useful to distinguish between them in order to track them down more quickly. Compile-time errors are

More information

Seen here are four film frames between frame 307 and

Seen here are four film frames between frame 307 and Bigfoot Insights Just for the Record Christopher L. Murphy.9 87...7 6.8 FEET Seen here are four film frames between frame 07 and frame inclusive; so there were 6 frames all told. The time for all of these

More information

Day Health Planner. Team 4YourHealth. Zachary Nielson Daniel Hudy Peter Butler. Karen Snavely Eric Majchrzak Calvin D. Rosario.

Day Health Planner. Team 4YourHealth. Zachary Nielson Daniel Hudy Peter Butler. Karen Snavely Eric Majchrzak Calvin D. Rosario. Day Health Planner Team 4YourHealth Zachary Nielson Daniel Hudy Peter Butler Project Overview Karen Snavely Eric Majchrzak Calvin D. Rosario Trillium Health AJ Faculty Coach Professor Hawker Trillium Health

More information

Chameleon Metadata s Data Science Basics Tutorial Series. DSB-2: Information Gain (IG) By Eric Thornton,

Chameleon Metadata s Data Science Basics Tutorial Series. DSB-2: Information Gain (IG) By Eric Thornton, Chameleon Metadata s Data Science Basics Tutorial Series Data Science Basics Syllabus for DSB-2 (DSB-2-Infographic-1) Download PDF version here: DSB-2-Information-Gain-V10.pdf DSB-2: Information Gain (IG)

More information

Effective MySQL Optimizing SQL Statements (Oracle Press) PDF

Effective MySQL Optimizing SQL Statements (Oracle Press) PDF Effective MySQL Optimizing SQL Statements (Oracle Press) PDF The Essential Guide to SQL Statement Optimization Written by Oracle ACE Director and MySQL expert Ronald Bradford, Effective MySQL: Optimizing

More information

JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights Copyright Metric insights, Inc.

JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights Copyright Metric insights, Inc. JAVASCRIPT CHARTING Scaling for the Enterprise with Metric Insights 2013 Copyright Metric insights, Inc. A REVOLUTION IS HAPPENING... 3! Challenges... 3! Borrowing From The Enterprise BI Stack... 4! Visualization

More information

DB2: Data Warehousing. by Andrea Piermarteri & Matteo Micheletti

DB2: Data Warehousing. by Andrea Piermarteri & Matteo Micheletti DB2: Data Warehousing by Andrea Piermarteri & Matteo Micheletti Introduction to Data Warehousing What is Data Warehousing? A collection of methods, technologies and tools to assist the knowledge workers

More information

Programming: Computer Programming For Beginners: Learn The Basics Of Java, SQL & C Edition (Coding, C Programming, Java Programming, SQL

Programming: Computer Programming For Beginners: Learn The Basics Of Java, SQL & C Edition (Coding, C Programming, Java Programming, SQL Programming: Computer Programming For Beginners: Learn The Basics Of Java, SQL & C++ - 3. Edition (Coding, C Programming, Java Programming, SQL Programming, JavaScript, Python, PHP) PDF PLEASE NOTE: You

More information

Electronic Gateway Functional Team Website Usability Working Group Usability Test September 2005

Electronic Gateway Functional Team Website Usability Working Group Usability Test September 2005 Electronic Gateway Functional Team Website Usability Working Group Usability Test September 2005 Queen s University Library website http://library.queensu.ca/ SUMMARY There were 11 testers, drawn from

More information

Data Structures And Other Objects Using Java Download Free (EPUB, PDF)

Data Structures And Other Objects Using Java Download Free (EPUB, PDF) Data Structures And Other Objects Using Java Download Free (EPUB, PDF) This is the ebook of the printed book and may not include any media, website access codes, or print supplements that may come packaged

More information

Data Analysis and Data Science

Data Analysis and Data Science Data Analysis and Data Science CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/29/15 Agenda Check-in Online Analytical Processing Data Science Homework 8 Check-in Online Analytical

More information

Free Downloads Delivering Business Intelligence With Microsoft SQL Server 2008

Free Downloads Delivering Business Intelligence With Microsoft SQL Server 2008 Free Downloads Delivering Business Intelligence With Microsoft SQL Server 2008 Maximize the Business Intelligence Tools in Microsoft SQL Server 2008Manage, analyze, and distribute enterprise data with

More information

News Article Categorization Team Members: Himay Jesal Desai, Bharat Thatavarti, Aditi Satish Mhapsekar

News Article Categorization Team Members: Himay Jesal Desai, Bharat Thatavarti, Aditi Satish Mhapsekar CS 410 PROJECT REPORT News Article Categorization Team Members: Himay Jesal Desai, Bharat Thatavarti, Aditi Satish Mhapsekar Overview: Our project, News Explorer, is a system that categorizes news articles

More information

Google Docs Tipsheet. ABEL Summer Institute 2009

Google Docs Tipsheet. ABEL Summer Institute 2009 Google Docs Tipsheet ABEL Summer Institute 2009 Contents Logging in to Google Apps for CollaborativeSchools.net for the First Time... 2 Text Documents Creating a New Text Document in Google Docs... 5 Uploading

More information

TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL

TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL TUTORIAL FOR IMPORTING OTTAWA FIRE HYDRANT PARKING VIOLATION DATA INTO MYSQL We have spent the first part of the course learning Excel: importing files, cleaning, sorting, filtering, pivot tables and exporting

More information

Data mining: concepts and algorithms
