Data Exploration. The table below lists each of the files available for analysis with a short description of what is found in each one.

Size: px
Start display at page:

Download "Data Exploration. The table below lists each of the files available for analysis with a short description of what is found in each one."

Transcription

1 Data Exploration Data Set Overview The table below lists each of the files available for analysis with a short description of what is found in each one. File Name Description Fields ad-clicks.csv This csv file corresponds to ERD table Adclicks. A line is added to this file when a player clicks on an advertisement in the Flamingo app timestamp: when the click occurred txid: a unique id (within adclicks.log) for the click usersessionid: the id of the user session for the user who made the click teamid: the current team id of the user who made the click userid: the user id of the user who made the click adid: the id of the ad clicked on adcategory: the category/type of ad clicked on buy-clicks.csv This csv file corresponds to InAppPurchases ERD table. A line is added to this file when a player makes an in-app purchase in the Flamingo app. timestamp: when the purchase was made txid: a unique id (within buyclicks.log) for the purchase usersessionid: the id of the user session for the user who made the purchase team: the current team id of the user who made the purchase userid: the user id of the user who made the purchase buyid: the id of the item purchased price: the price of the item purchased users.csv This csv file corresponds to User ERD table. This file contains a line for each user playing the game. timestamp: when user first played the game id: the user id assigned to the user nick: the nickname chosen by the user twitter: the twitter handle of the

2 user dob: the date of birth of the user country: the two-letter country code where the user lives team.csv This csv file corresponds to Team ERD table. This file contains a line for each team terminated in the game. teamid: the id of the team name: the name of the team teamcreationtime: the timestamp when the team was created teamendtime: the timestamp when the last member left the team strength: a measure of team strength, roughly corresponding to the success of a team currentlevel: the current level of the team teamassignments.csv This csv file corresponds to TeamAssignment ERD table. A line is added to this file each time a user joins a team. A user can be in at most a single team at a time. time: when the user joined the team team: the id of the team userid: the id of the user assignmentid: a unique id for this assignment level-events.csv This csv file corresponds to LevelEvent ERD table. time: when the event occurred. eventid: a unique id for the event teamid: the id of the team level: the level started or completed eventtype: the type of event, either start or end user-session.csv This csv file corresponds to User_Sessions ERD table. Each line in this file describes a user session, which denotes when a user starts and stops playing the game. Additionally, when a team goes to the next level in the game, the session is ended for each user in the team and a new one started. timestamp: a timestamp denoting when the event occurred usersessionid: a unique id for the session userid: the current user's ID teamid: the current user's team assignmentid: the team assignment id for the user to the team sessiontype: whether the event is the start or end of a

3 session teamlevel: the level of the team during this session platformtype: the type of platform of the user during this session game-clicks.csv This csv file corresponds to GameClicks ERD table. A line is added to this file each time a user performs a click in the game. time: when the click occurred clickid: a unique id for the click userid: the id of the user performing the click usersessionid: the id of the session of the user when the click is performed ishit: denotes if the click was on a flamingo (value is 1) or missed the flamingo (value is 0) teamid: the id of the team of the user teamlevel: the current level of the team of the user Aggregation Amount spent buying items buyid Sum [source="buy-clicks.csv" stats sum(price) by buyid] # Unique items available to be purchased 30 [source="ad-clicks.csv" stats count by adid stats count]

4 A histogram showing how many times each item is purchased: A histogram showing how much money was made from each item:

5 Filtering A histogram showing total amount of money spent by the top ten users (ranked by how much money they spent). source="buy-clicks.csv" stats sum(price) by userid sort -sum(price) head 10 The following table shows the user id, platform, and hit-ratio percentage for the top three buying users: Rank User Id Platform Hit-Ratio (%) iphone 61/526*100=11.6% 2 12 iphone 92/704*100=13.1% iphone 76/524*100=14.5% Sample query used to find Platform: source="user-session.csv" userid=2229 stats count by platformtype Sample query used to find Hit-Ratio(%) source="game-clicks.csv" userid=2229 stats sum(ishit) count

6 Analysis of combined_data.csv Data Preparation Sample Selection Item Amount # of Samples 4619 # of Samples with Purchases 1411 Attribute Creation A new categorical attribute was created to enable analysis of players as broken into 2 categories (HighRollers and PennyPinchers). A screenshot of the attribute follows: <Fill In: Describe the design of your attribute in 1-3 sentences.> Used Rule Engine Knime node. Logic as below: $avg_price$ > 5.0 => "HighRollers" $avg_price$ <= 5.0 => "PennyPinchers" The creation of this new categorical attribute was necessary because we want to aggregate the users into small number of groups. Avg_price being continuous value does not enable this.

7 Attribute Selection The following attributes were filtered from the dataset for the following reasons: Attribute Avg_price UserId usersessionid Rationale for Filtering The type of customer (PennyPinchers or HighRollers) is derived from avg_price. This attribute cannot give any prediction about customer s spending. This is a random number. It is realistically impossible to have any pattern between userid to customer s spending. Same as above. This does not influence customer s spending.

8 Data Partitioning and Modeling The data was partitioned into train and test datasets. The train data set was used to create the decision tree model. The trained model was then applied to the test dataset. This is important because train data enables to define the model and this model can be verified on test data. The accuracy on test data will provide an reasonable measure of performance for actual predictions. When partitioning the data using sampling, it is important to set the random seed because it ensures partitioning node produces same partition each time it is executed thereby producing the same result. A screenshot of the resulting decision tree can be seen below:

9 Evaluation A screenshot of the confusion matrix can be seen below: As seen in the screenshot above, the overall accuracy of the model is % <Fill In: Write one sentence for each of the values of the confusion matrix indicating what has been correctly or incorrectly predicted.> 308 PennyPinchers and 192 HighRollers have been correctly classified. However, 38 HighRollers have been wrongly classified as PennyPinchers. Similarly, 27 PennyPinchers have been wrongly classified as HighRollers. Overall accuracy is good (88.496%).

10 Analysis Conclusions The final KNIME workflow is shown below: What makes a HighRoller vs. a PennyPincher? iphone platformtypes users are most likely to be HighRollers whereas linux, windows and android are most likely to to PennyPinchers. Mac platform users are heterogeneous (high entropy) with more tilted towards PennyPinchers. Specific Recommendations to Increase Revenue 1. Eglence should firmly focus on iphone platform as users of this are more likely to purchase. 2. Incentivise non-iphone users to join iphone platform.

11 Clustering Analysis Attribute Selection Summary: The idea of creating this cluster is to find whether any pattern exists between user game clicks, ad-clicks and revenue generated by a user. Attribute Total game clicks Total ad-clicks Revenue per user userid of user Rationale for Selection Total number of clicks by each user (both hit and misses) Total number of clicks on ads. Amount of money earned from each user s spending on their purchase of apps key source of revenue This is the key to bind the data

12 Training Data Set Creation The training data set used for this analysis is shown below (first 10 lines): Dimensions of the training data set (rows x columns) : 543 x 3 # of clusters created: 3 (checked with a few others it looks like the data is more meaningful when we create two different clusters).

13 Cluster Centers Cluster # Cluster Center 1 (gameclicks, adclicks, revenue)=( , 32.26, 41.89) 2 (gameclicks, adclicks, revenue)=(959,52, 36.37, 46.03) 3 (gameclicks, adclicks, revenue)=(363.42, 25.17, 35.36) These clusters can be differentiated from each other as follows: Cluster 1 is different from the others in that its gameclicks is much higher compared to others. Cluster 2 is different from the others it has the highest revenue per user. It also has highest adclicks per user. Cluster 3 has least gameclicks and least ad clicks per user. Cluster 2 is desired type for user best revenue fuelled by high ad-clicks and gameclicks (underpinned by in-game purchase).

14 Recommended Actions Action Recommended Negotiate better rates for ad clicks with advertiser Do further study to understand background and rationale of the users who generate highest revenue. Rationale for the action Ad clicks is a key source for revenue. Eglence should negotiate better deals with advertisers. Users in cluster #2 are the preferred for Eglence Inc. They are moderate users (gameclicks) but generate the best revenue. Eglence should do further analytics to understand who these users are (i.e. whether they belong to any country, age group, platform).

15 Graph Analytics Modelling Chat Data using a Graph Data Model Chat data created by Eglence Inc. users is represented as a graph data model. There are 4 different types of nodes: User, Team, TeamChatSession and ChatItem. An instance of TeamChatSession is created when a User creates a new chat session for a team. ChatItem is individual instance of chat. Below are different relationships: CreateSession from User to TeamChatSession node CreateChat from User to ChatItem node ResponseTo ChatItem to ChatItem node (during a chat conversation) OwnedBy TeamChatSession to Team PartOf ChatItem to TeamChatSession Mentioned ChatItem to User Joins User to TeamChatSession Leaves User to TeamChatSession Creation of the Graph Database for Chats Describe the steps you took for creating the graph database. As part of these steps i) Write the schema of the 6 CSV files Schema of the 6 CSV files: chat_create_team_chat.csv UserId, TeamId, TeamChatSessionId, CreatedTimestamp chat_join_team_chat.csv UserId, TeamChatSessionId and JoinsEdgeTimestamp chat_leave_team_chat.csv UserId, TeamChatSessionId and LeaveTimestamp chat_leave_team_chat.csv UserId, TeamChatSessionId, ChatItemId, CreateChatTimestamp chat_mention_team_chat.csv ChatItemId, UserId, MentionedTimestamp chat_respond_team_chat.csv ChatItemId, ResponseChatItemId, ResponseTimestamp ii) Explain the loading process and include a sample LOAD command Loading process

16 Extracted the csv files and placed in the directory location Neo4j referred to. In my case, Neo4j refers to a specific location in D: drive. Used the sample scripts and made necessary amendments. Below is a sample Load command: LOAD CSV FROM "file:///d:/chat_item_team_chat.csv" AS row MERGE (u:user {id: toint(row[0])}) MERGE (c:teamchatsession {id: toint(row[1])}) MERGE (i:chatitem {id: toint(row[2])}) MERGE (i)-[:partof{timestamp: row[3]}]->(c) MERGE (u)-[:createchat{timestamp: row[3]}]->(i) iii) Present a screenshot of some part of the graph you have generated. The graphs must include clearly visible examples of most node and edge types. Below are two acceptable examples. The first example is a rendered in the default Neo4j distribution, the second has had some nodes moved to expose the edges more clearly. Both include examples of most node and edge types. Screen of graph generated. Used match ()-[r]->() return r limit 70 command and adjusted scale to 0.8. match ()-[r]->() return r limit 25

17

18 Finding the longest conversation chain and its participants Report the results including the length of the conversation (path length) and how many unique users were part of the conversation chain. Describe your steps. Write the query that produces the correct answer. To find the number of chats in longest conversation chain: match p=(a)-[:responseto*]->(c) return size(nodes(p)) order by size(nodes(p)) desc limit 1 Above returns 10. To find unique users of the conversation chain: match p=(i:chatitem)-[:responseto*]->(j:chatitem) where size(nodes(p)) = 10 with p match (u:user)-[:createchat]->(i:chatitem) where i in nodes(p) return distinct u 5 unique users in the chain. Ids are 1192, 1978, 1153, 853, 1514.

19 Analyzing the relationship between top 10 chattiest users and top 10 chattiest teams Describe your steps from Question 2. In the process, create the following two tables. You only need to include the top 3 for each table. Identify and report whether any of the chattiest users were part of any of the chattiest teams. Chattiest Users match q=(a:user)-[r:createchat]->(b:chatitem) return a.id,count(a) order by count(a) desc limit 10 Users Number of Chats Chattiest Teams match p=(c:chatitem)-[r1:partof]->(s:teamchatsession)-[r2:ownedby]->(t:team) return t.id, count(c) order by count(c) desc limit 10 Teams Number of Chats Finally, present your answer, i.e. whether or not any of the chattiest users are part of any of the chattiest teams. match (u:user)-[:createchat]->(i:chatitem)-[:partof]->(s:teamchatsession)-[:ownedby]->(t) where t.id in [82, 185, 112, 18, 194, 129, 52, 136, 146, 81] and u.id in [394, 2067, 209, 1087, 554, 516, 1627, 999, 668, 461] return distinct u.id, t.id Above query returns user 999 is the only one of the top 10 chattiest users belong to one of the top 10 chattiest teams (52). None among the top 10 chattiest users belongs to the top 10 chattiest teams.

20 How Active Are Groups of Users? Describe your steps for performing this analysis. Be as clear, concise, and as brief as possible. Finally, report the top 3 most active users in the table below. Created the edge InteractsWith: Match (u1:user)-[:createchat]->(i:chatitem)-[:mentioned]->(u2:user) create (u1)- [:InteractsWith]->(u2) Match (u1:user)-[:createchat]->(i:chatitem)-[:responseto]->(i2:chatitem)<-[:createchat]- (u2:user) create (u1)-[:interactswith]->(u2) Deleted self loops: Match (u1)-[r:interactswith]->(u1) delete r To get number of nodes and the number of edges between their neighbour s nodes, and to get clustering coefficients by using this formula: (number of neighbour nodes)/[number of edges*(number of edges 1)]: Match (u)-[cc:createchat]->() With u,count(u) as chatcount order by chatcount desc limit 10 With collect(u.id) as users match(u:user)-[:interactswith]-(u1:user) where u.id in users With u,count(distinct u1) as k, collect(distinct u1.id) as neighbours Match (n:user)-[:interactswith]-(n1:user),(m:user)-[:interactswith]-(m1:user) where n.id=u.id and n1.id in neighbours and m1.id in neighbours With distinct u,n1,m1,k With u,sum(case when (n1)--(m1) then 1 else 0 end) as edges, k return u.id,edges,k,k*(k-1), tofloat(edges)/tofloat(k*(k-1)) as coefficient order by coefficient desc Above query returns the list below: u.id edges k k*(k-1) coefficient

21 Most Active Users (based on Cluster Coefficients) User ID Coefficient

Acquiring, Exploring and Preparing the Data

Acquiring, Exploring and Preparing the Data Technical Appendix Catch the Pink Flamingo Analysis Produced by: Prabhat Tripathi Acquiring, Exploring and Preparing the Data Data Exploration Data Set Overview The table below lists each of the files

More information

Graph Analytics. Modeling Chat Data using a Graph Data Model. Creation of the Graph Database for Chats

Graph Analytics. Modeling Chat Data using a Graph Data Model. Creation of the Graph Database for Chats Graph Analytics Modeling Chat Data using a Graph Data Model The Pink Flamingo graph model includes users, teams, chat sessions, and chat item nodes with relationships or edges of a) creating sessions,

More information

I) write schema of the six files.

I) write schema of the six files. Graph Analytics Modeling Chat Data using a Graph Data Model (Describe the graph model for chats in a few sentences. Try to be clear and complete.) Creation of the Graph Database for Chats Describe the

More information

Graph Analytics. Modeling Chat Data using a Graph Data Model. Creation of the Graph Database for Chats

Graph Analytics. Modeling Chat Data using a Graph Data Model. Creation of the Graph Database for Chats Graph Analytics Modeling Chat Data using a Graph Data Model This we will be using a graph analytics approach to chat data from the Catch the Pink Flamingo game. Currently this chat data is purely numeric,

More information

2. (a) Briefly discuss the forms of Data preprocessing with neat diagram. (b) Explain about concept hierarchy generation for categorical data.

2. (a) Briefly discuss the forms of Data preprocessing with neat diagram. (b) Explain about concept hierarchy generation for categorical data. Code No: M0502/R05 Set No. 1 1. (a) Explain data mining as a step in the process of knowledge discovery. (b) Differentiate operational database systems and data warehousing. [8+8] 2. (a) Briefly discuss

More information

EXAM - FM Developer Essentials for FileMaker 12 Exam. Buy Full Product.

EXAM - FM Developer Essentials for FileMaker 12 Exam. Buy Full Product. FileMaker EXAM - FM0-307 Developer Essentials for FileMaker 12 Exam Buy Full Product http://www.examskey.com/fm0-307.html Examskey FileMaker FM0-307 exam demo product is here for you to test the quality

More information

Business Club. Decision Trees

Business Club. Decision Trees Business Club Decision Trees Business Club Analytics Team December 2017 Index 1. Motivation- A Case Study 2. The Trees a. What is a decision tree b. Representation 3. Regression v/s Classification 4. Building

More information

Advanced Multidimensional Reporting

Advanced Multidimensional Reporting Guideline Advanced Multidimensional Reporting Product(s): IBM Cognos 8 Report Studio Area of Interest: Report Design Advanced Multidimensional Reporting 2 Copyright Copyright 2008 Cognos ULC (formerly

More information

Oracle Compare Two Database Tables Sql Query Join

Oracle Compare Two Database Tables Sql Query Join Oracle Compare Two Database Tables Sql Query Join data types. Namely, it assumes that the two tables payments and How to use SQL PIVOT to Compare Two Tables in Your Database. This can (not that using the

More information

John Biancamano Inbound Digital LLC InboundDigital.net

John Biancamano Inbound Digital LLC InboundDigital.net John Biancamano Inbound Digital LLC 609.865.7994 InboundDigital.net About Me Owner of Inbound Digital, LLC digital marketing consulting and training: websites, SEO, advertising, and social media. Senior

More information

Learn SQL by Calculating Customer Lifetime Value

Learn SQL by Calculating Customer Lifetime Value Learn SQL Learn SQL by Calculating Customer Lifetime Value Setup, Counting and Filtering 1 Learn SQL CONTENTS Getting Started Scenario Setup Sorting with ORDER BY FilteringwithWHERE FilteringandSorting

More information

Qlik Sense Performance Benchmark

Qlik Sense Performance Benchmark Technical Brief Qlik Sense Performance Benchmark This technical brief outlines performance benchmarks for Qlik Sense and is based on a testing methodology called the Qlik Capacity Benchmark. This series

More information

T-SQL Training: T-SQL for SQL Server for Developers

T-SQL Training: T-SQL for SQL Server for Developers Duration: 3 days T-SQL Training Overview T-SQL for SQL Server for Developers training teaches developers all the Transact-SQL skills they need to develop queries and views, and manipulate data in a SQL

More information

Chapter 5: Summary and Conclusion CHAPTER 5 SUMMARY AND CONCLUSION. Chapter 1: Introduction

Chapter 5: Summary and Conclusion CHAPTER 5 SUMMARY AND CONCLUSION. Chapter 1: Introduction CHAPTER 5 SUMMARY AND CONCLUSION Chapter 1: Introduction Data mining is used to extract the hidden, potential, useful and valuable information from very large amount of data. Data mining tools can handle

More information

What is KNIME? workflows nodes standard data mining, data analysis data manipulation

What is KNIME? workflows nodes standard data mining, data analysis data manipulation KNIME TUTORIAL What is KNIME? KNIME = Konstanz Information Miner Developed at University of Konstanz in Germany Desktop version available free of charge (Open Source) Modular platform for building and

More information

TURN DATA INTO ACTIONABLE INSIGHTS. Google Analytics Workshop

TURN DATA INTO ACTIONABLE INSIGHTS. Google Analytics Workshop TURN DATA INTO ACTIONABLE INSIGHTS Google Analytics Workshop The Value of Analytics Google Analytics is more than just numbers and stats. It tells the story of how people are interacting with your brand

More information

Weighted Powers Ranking Method

Weighted Powers Ranking Method Weighted Powers Ranking Method Introduction The Weighted Powers Ranking Method is a method for ranking sports teams utilizing both number of teams, and strength of the schedule (i.e. how good are the teams

More information

Vendor: IBM. Exam Code: C Exam Name: IBM Cognos 10 BI Author. Version: Demo

Vendor: IBM. Exam Code: C Exam Name: IBM Cognos 10 BI Author. Version: Demo Vendor: IBM Exam Code: C2020-612 Exam Name: IBM Cognos 10 BI Author Version: Demo Question No : 1 In Report Studio, an author creates a list report containing columns for Staff Name, Order number, and

More information

Oracle Database: Introduction to SQL

Oracle Database: Introduction to SQL Oracle Database: Introduction to SQL What you will learn Understanding the basic concepts of relational databases ensure refined code by developers. This course helps the participants to write subqueries,

More information

Introduction to BEST Viewpoints

Introduction to BEST Viewpoints Introduction to BEST Viewpoints This is not all but just one of the documentation files included in BEST Viewpoints. Introduction BEST Viewpoints is a user friendly data manipulation and analysis application

More information

normalization are being violated o Apply the rule of Third Normal Form to resolve a violation in the model

normalization are being violated o Apply the rule of Third Normal Form to resolve a violation in the model Database Design Section1 - Introduction 1-1 Introduction to the Oracle Academy o Give examples of jobs, salaries, and opportunities that are possible by participating in the Academy. o Explain how your

More information

Hustle Documentation. Release 0.1. Tim Spurway

Hustle Documentation. Release 0.1. Tim Spurway Hustle Documentation Release 0.1 Tim Spurway February 26, 2014 Contents 1 Features 3 2 Getting started 5 2.1 Installing Hustle............................................. 5 2.2 Hustle Tutorial..............................................

More information

Dr. Barbara Morgan Quantitative Methods

Dr. Barbara Morgan Quantitative Methods Dr. Barbara Morgan Quantitative Methods 195.650 Basic Stata This is a brief guide to using the most basic operations in Stata. Stata also has an on-line tutorial. At the initial prompt type tutorial. In

More information

Data Mining By IK Unit 4. Unit 4

Data Mining By IK Unit 4. Unit 4 Unit 4 Data mining can be classified into two categories 1) Descriptive mining: describes concepts or task-relevant data sets in concise, summarative, informative, discriminative forms 2) Predictive mining:

More information

University of California, Berkeley. (2 points for each row; 1 point given if part of the change in the row was correct)

University of California, Berkeley. (2 points for each row; 1 point given if part of the change in the row was correct) University of California, Berkeley CS 186 Intro to Database Systems, Fall 2012, Prof. Michael J. Franklin MIDTERM II - Questions This is a closed book examination but you are allowed one 8.5 x 11 sheet

More information

1

1 1 2 3 6 7 8 9 10 Storage & IO Benchmarking Primer Running sysbench and preparing data Use the prepare option to generate the data. Experiments Run sysbench with different storage systems and instance

More information

EECE 418. Fantasy Exchange Pass 2 Portfolio

EECE 418. Fantasy Exchange Pass 2 Portfolio EECE 418 Fantasy Exchange Pass 2 Portfolio Team Ovthree Tommy Tsz Kiu Wan Yawei Li Nenghao Lin 41720103 56674104 77222099 Table of Contents Section 1: Prototype and Re-Design Rationale... 1 1.1 Redesign

More information

TRANSACTIONAL CLUSTERING. Anna Monreale University of Pisa

TRANSACTIONAL CLUSTERING. Anna Monreale University of Pisa TRANSACTIONAL CLUSTERING Anna Monreale University of Pisa Clustering Clustering : Grouping of objects into different sets, or more precisely, the partitioning of a data set into subsets (clusters), so

More information

Topic 1, Volume A QUESTION NO: 1 In your ETL application design you have found several areas of common processing requirements in the mapping specific

Topic 1, Volume A QUESTION NO: 1 In your ETL application design you have found several areas of common processing requirements in the mapping specific Vendor: IBM Exam Code: C2090-303 Exam Name: IBM InfoSphere DataStage v9.1 Version: Demo Topic 1, Volume A QUESTION NO: 1 In your ETL application design you have found several areas of common processing

More information

University of Waterloo Midterm Examination Sample Solution

University of Waterloo Midterm Examination Sample Solution 1. (4 total marks) University of Waterloo Midterm Examination Sample Solution Winter, 2012 Suppose that a relational database contains the following large relation: Track(ReleaseID, TrackNum, Title, Length,

More information

Aster Data Basics Class Outline

Aster Data Basics Class Outline Aster Data Basics Class Outline CoffingDW education has been customized for every customer for the past 20 years. Our classes can be taught either on site or remotely via the internet. Education Contact:

More information

Oracle Database 10g: Introduction to SQL

Oracle Database 10g: Introduction to SQL ORACLE UNIVERSITY CONTACT US: 00 9714 390 9000 Oracle Database 10g: Introduction to SQL Duration: 5 Days What you will learn This course offers students an introduction to Oracle Database 10g database

More information

Knowingo + Dashboard Manual

Knowingo + Dashboard Manual Knowingo + Dashboard Manual Introduction The purpose of this manual is to provide current Knowingo + users with references and instructions on how to utilize the Knowingo + Dashboard. The contents of this

More information

Data warehouses Decision support The multidimensional model OLAP queries

Data warehouses Decision support The multidimensional model OLAP queries Data warehouses Decision support The multidimensional model OLAP queries Traditional DBMSs are used by organizations for maintaining data to record day to day operations On-line Transaction Processing

More information

1 Topic. Image classification using Knime.

1 Topic. Image classification using Knime. 1 Topic Image classification using Knime. The aim of image mining is to extract valuable knowledge from image data. In the context of supervised image classification, we want to assign automatically a

More information

DB2 SQL Class Outline

DB2 SQL Class Outline DB2 SQL Class Outline The Basics of SQL Introduction Finding Your Current Schema Setting Your Default SCHEMA SELECT * (All Columns) in a Table SELECT Specific Columns in a Table Commas in the Front or

More information

BUYER S GUIDE WEBSITE DEVELOPMENT

BUYER S GUIDE WEBSITE DEVELOPMENT BUYER S GUIDE WEBSITE DEVELOPMENT At Curzon we understand the importance of user focused design. EXECUTIVE SUMMARY This document is designed to provide prospective clients with a short guide to website

More information

Outbound Engine Manual 1

Outbound Engine Manual 1 1 Contents Outbound Engine... 3 Login... 4 Main View...5 Filter... 6 Edit... 6 Delete... 6 Contacts Menu... 7 Imports... 7 Field Mapping... 10 Field Information...12 Start import... 20 Import Formats...21

More information

Bridgemate App. Information for players. Version 2. Bridge Systems BV

Bridgemate App. Information for players. Version 2. Bridge Systems BV Bridgemate App Information for players Version 2 Bridge Systems BV Bridgemate App Information for players Page 2 Contents Introduction... 3 Downloading the app... 3 Starting the app and creating an account...

More information

Unit Assessment Guide

Unit Assessment Guide Unit Assessment Guide Unit Details Unit code Unit name Unit purpose/application ICTWEB425 Apply structured query language to extract and manipulate data This unit describes the skills and knowledge required

More information

Redshift Queries Playbook

Redshift Queries Playbook scalable analytics built for growth Redshift Queries Playbook SQL queries for understanding user behavior Updated June 23, 2015 This playbook shows you how you can use Amplitude's Amazon Redshift database

More information

How to Increase Sales With Personalized Website Visitor Engagement. How to Increase Sales With Personalized Website Visitor Engagement

How to Increase Sales With Personalized Website Visitor Engagement. How to Increase Sales With Personalized Website Visitor Engagement How to Increase Sales With Personalized Website Visitor Engagement 1 What is personalized website visitor engagement? The first impression visitors have about products or services is through your company

More information

The Now Lifestyle Marketing Dictionary

The Now Lifestyle Marketing Dictionary The following are some of the basic terms you might hear about when you are going through the training videos in the member area or ones I might create. They are not all inclusive of everything you need

More information

Server Side Scripting Report

Server Side Scripting Report Server Side Scripting Report David Nelson 205CDE Developing the Modern Web Assignment 2 Student ID: 3622926 Computing BSc 29 th March, 2013 http://creative.coventry.ac.uk/~nelsond/ - Page 1 - Contents

More information

Abstract. Problem Statement. Objective. Benefits

Abstract. Problem Statement. Objective. Benefits Abstract The purpose of this final year project is to create an Android mobile application that can automatically extract relevant information from pictures of receipts. Users can also load their own images

More information

Introduction to Computer Science and Business

Introduction to Computer Science and Business Introduction to Computer Science and Business This is the second portion of the Database Design and Programming with SQL course. In this portion, students implement their database design by creating a

More information

A MODEL FOR COMPARATIVE ANALYSIS OF THE SIMILARITY BETWEEN ANDROID AND IOS OPERATING SYSTEMS

A MODEL FOR COMPARATIVE ANALYSIS OF THE SIMILARITY BETWEEN ANDROID AND IOS OPERATING SYSTEMS Bulletin of the Transilvania University of Braşov Series V: Economic Sciences Vol. 7 (56) No. 2-2014 A MODEL FOR COMPARATIVE ANALYSIS OF THE SIMILARITY BETWEEN ANDROID AND IOS OPERATING SYSTEMS R. LIXĂNDROIU

More information

Cisco Unified Contact Center Express Historical Reporting Guide, Release 10.6(1)

Cisco Unified Contact Center Express Historical Reporting Guide, Release 10.6(1) Cisco Unified Contact Center Express Historical Reporting Guide, Release 10.6(1) First Published: December 15, 2014 Americas Headquarters Cisco Systems, Inc. 170 West Tasman Drive San Jose, CA 95134-1706

More information

Sitecore Experience Platform 8.0 Rev: September 13, Sitecore Experience Platform 8.0

Sitecore Experience Platform 8.0 Rev: September 13, Sitecore Experience Platform 8.0 Sitecore Experience Platform 8.0 Rev: September 13, 2018 Sitecore Experience Platform 8.0 All the official Sitecore documentation. Page 1 of 455 Experience Analytics glossary This topic contains a glossary

More information

Extra readings beyond the lecture slides are important:

Extra readings beyond the lecture slides are important: 1 Notes To preview next lecture: Check the lecture notes, if slides are not available: http://web.cse.ohio-state.edu/~sun.397/courses/au2017/cse5243-new.html Check UIUC course on the same topic. All their

More information

E-Companion: On Styles in Product Design: An Analysis of US. Design Patents

E-Companion: On Styles in Product Design: An Analysis of US. Design Patents E-Companion: On Styles in Product Design: An Analysis of US Design Patents 1 PART A: FORMALIZING THE DEFINITION OF STYLES A.1 Styles as categories of designs of similar form Our task involves categorizing

More information

FileMaker Exam FM0-306 Developer Essential for FileMaker 12 Version: 6.0 [ Total Questions: 198 ]

FileMaker Exam FM0-306 Developer Essential for FileMaker 12 Version: 6.0 [ Total Questions: 198 ] s@lm@n FileMaker Exam FM0-306 Developer Essential for FileMaker 12 Version: 6.0 [ Total Questions: 198 ] Question No : 1 DRAG DROP Match the FileMaker Server 12 hosting technology with the maximum number

More information

Aster Data SQL and MapReduce Class Outline

Aster Data SQL and MapReduce Class Outline Aster Data SQL and MapReduce Class Outline CoffingDW education has been customized for every customer for the past 20 years. Our classes can be taught either on site or remotely via the internet. Education

More information

XVIII Open Cup named after E.V. Pankratiev Stage 1: Grand Prix of Romania, Sunday, September 17, 2017

XVIII Open Cup named after E.V. Pankratiev Stage 1: Grand Prix of Romania, Sunday, September 17, 2017 Problem A. Balance file: 1 second 512 mebibytes We say that a matrix A of size N N is balanced if A[i][j] + A[i + 1][j + 1] = A[i + 1][j] + A[i][j + 1] for all 1 i, j N 1. You are given a matrix A of size

More information

IBM emessage Version 9 Release 1 February 13, User's Guide

IBM emessage Version 9 Release 1 February 13, User's Guide IBM emessage Version 9 Release 1 February 13, 2015 User's Guide Note Before using this information and the product it supports, read the information in Notices on page 471. This edition applies to version

More information

DoConference Web Conferencing: DoMore DoConference

DoConference Web Conferencing: DoMore DoConference DoConference Web Conferencing: DoMore DoConference Use DoConference Conferencing for: Training and E-Learning Deliver interactive online classes, E-Learning sessions and educate easily and affordably.

More information

REDCap Indiana University Release Notes

REDCap Indiana University Release Notes New Features: REDCap 7.4.x @ Indiana University Release Notes New feature: Custom Record Status Dashboards Users can build and save custom versions of the Record Status Dashboard to customize the dashboard

More information

Data Explorer: User Guide 1. Data Explorer User Guide

Data Explorer: User Guide 1. Data Explorer User Guide Data Explorer: User Guide 1 Data Explorer User Guide Data Explorer: User Guide 2 Contents About this User Guide.. 4 System Requirements. 4 Browser Requirements... 4 Important Terminology.. 5 Getting Started

More information

X-AFFILIATE module for X-Cart 4.0.x

X-AFFILIATE module for X-Cart 4.0.x X-AFFILIATE module for X-Cart 4.0.x Partner Interface Reference Manual Revision Date: 2004-11-22 Copyright 2001-2004 Ruslan R. Fazliev. All rights reserved. TABLE OF CONTENTS GENERAL INFORMATION...3 REGISTRATION...4

More information

Stat Wk 3. Stat 342 Notes. Week 3, Page 1 / 71

Stat Wk 3. Stat 342 Notes. Week 3, Page 1 / 71 Stat 342 - Wk 3 What is SQL Proc SQL 'Select' command and 'from' clause 'group by' clause 'order by' clause 'where' clause 'create table' command 'inner join' (as time permits) Stat 342 Notes. Week 3,

More information

Fundamentals of Database Systems

Fundamentals of Database Systems Fundamentals of Database Systems Assignment: 2 Due Date: 18th August, 2017 Instructions This question paper contains 10 questions in 6 pages. Q1: Consider the following schema for an office payroll system,

More information

EXAMINATIONS 2013 MID-YEAR SWEN 432 ADVANCED DATABASE DESIGN AND IMPLEMENTATION

EXAMINATIONS 2013 MID-YEAR SWEN 432 ADVANCED DATABASE DESIGN AND IMPLEMENTATION EXAMINATIONS 2013 MID-YEAR ADVANCED DATABASE DESIGN AND IMPLEMENTATION Time Allowed: 3 Hours (180 minutes) Instructions: Attempt all questions. There are 180 possible marks on the exam. Make sure your

More information

Data Mining. ❷Chapter 2 Basic Statistics. Asso.Prof.Dr. Xiao-dong Zhu. Business School, University of Shanghai for Science & Technology

Data Mining. ❷Chapter 2 Basic Statistics. Asso.Prof.Dr. Xiao-dong Zhu. Business School, University of Shanghai for Science & Technology ❷Chapter 2 Basic Statistics Business School, University of Shanghai for Science & Technology 2016-2017 2nd Semester, Spring2017 Contents of chapter 1 1 recording data using computers 2 3 4 5 6 some famous

More information

VISITOR SEGMENTATION

VISITOR SEGMENTATION support@magestore.com sales@magestore.com Phone: 084.4.8585.4587 VISITOR SEGMENTATION USER GUIDE Version 1.0.0 Table of Contents 1. INTRODUCTION... 3 Create unlimited visitor segments... 3 Show targeted

More information

CHAPTER 4 K-MEANS AND UCAM CLUSTERING ALGORITHM

CHAPTER 4 K-MEANS AND UCAM CLUSTERING ALGORITHM CHAPTER 4 K-MEANS AND UCAM CLUSTERING 4.1 Introduction ALGORITHM Clustering has been used in a number of applications such as engineering, biology, medicine and data mining. The most popular clustering

More information

Actual4Test. Actual4test - actual test exam dumps-pass for IT exams

Actual4Test.   Actual4test - actual test exam dumps-pass for IT exams Actual4Test http://www.actual4test.com Actual4test - actual test exam dumps-pass for IT exams Exam : FM0-306 Title : Developer Essentials for FileMaker 12 Vendors : FileMaker Version : DEMO Get Latest

More information

Dataflow Editor User Guide

Dataflow Editor User Guide - Cisco EFF, Release 1.0.1 Cisco (EFF) 1.0.1 Revised: August 25, 2017 Conventions This document uses the following conventions. Convention bold font italic font string courier font Indication Menu options,

More information

Cisco Jabber Features and Options

Cisco Jabber Features and Options Cisco Jabber 10.6 Features, page 1 Cisco Jabber Features for Windows, Mac, ios and Android, page 3 Cisco Jabber Features for Windows, page 15 Cisco Jabber Features for Mac, page 36 Cisco Jabber for Android

More information

Making MongoDB Accessible to All. Brody Messmer Product Owner DataDirect On-Premise Drivers Progress Software

Making MongoDB Accessible to All. Brody Messmer Product Owner DataDirect On-Premise Drivers Progress Software Making MongoDB Accessible to All Brody Messmer Product Owner DataDirect On-Premise Drivers Progress Software Agenda Intro to MongoDB What is MongoDB? Benefits Challenges and Common Criticisms Schema Design

More information

Monitor Qlik Sense sites. Qlik Sense Copyright QlikTech International AB. All rights reserved.

Monitor Qlik Sense sites. Qlik Sense Copyright QlikTech International AB. All rights reserved. Monitor Qlik Sense sites Qlik Sense 2.1.2 Copyright 1993-2015 QlikTech International AB. All rights reserved. Copyright 1993-2015 QlikTech International AB. All rights reserved. Qlik, QlikTech, Qlik Sense,

More information

Decision Making Procedure: Applications of IBM SPSS Cluster Analysis and Decision Tree

Decision Making Procedure: Applications of IBM SPSS Cluster Analysis and Decision Tree World Applied Sciences Journal 21 (8): 1207-1212, 2013 ISSN 1818-4952 IDOSI Publications, 2013 DOI: 10.5829/idosi.wasj.2013.21.8.2913 Decision Making Procedure: Applications of IBM SPSS Cluster Analysis

More information

Analytics: Server Architect (Siebel 7.7)

Analytics: Server Architect (Siebel 7.7) Analytics: Server Architect (Siebel 7.7) Student Guide June 2005 Part # 10PO2-ASAS-07710 D44608GC10 Edition 1.0 D44917 Copyright 2005, 2006, Oracle. All rights reserved. Disclaimer This document contains

More information

Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005

Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005 Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005 Abstract Deciding on which algorithm to use, in terms of which is the most effective and accurate

More information

IT Best Practices Audit TCS offers a wide range of IT Best Practices Audit content covering 15 subjects and over 2200 topics, including:

IT Best Practices Audit TCS offers a wide range of IT Best Practices Audit content covering 15 subjects and over 2200 topics, including: IT Best Practices Audit TCS offers a wide range of IT Best Practices Audit content covering 15 subjects and over 2200 topics, including: 1. IT Cost Containment 84 topics 2. Cloud Computing Readiness 225

More information

Play Guide. Download. 3 Introduction. 4 Claim Your Name. 6 Projections. 7 Trading. 17 Ticker. 24 Community

Play Guide. Download. 3 Introduction. 4 Claim Your Name. 6 Projections. 7 Trading. 17 Ticker. 24 Community Play Guide Download 3 Introduction 4 Claim Your Name 6 Projections 7 Trading 17 Ticker 24 Community 25 Download Protoblock App Mobile Apps Google Play App Store on-the-go convenient light clients Desktop

More information

Data Mining. Part 2. Data Understanding and Preparation. 2.4 Data Transformation. Spring Instructor: Dr. Masoud Yaghini. Data Transformation

Data Mining. Part 2. Data Understanding and Preparation. 2.4 Data Transformation. Spring Instructor: Dr. Masoud Yaghini. Data Transformation Data Mining Part 2. Data Understanding and Preparation 2.4 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Introduction Normalization Attribute Construction Aggregation Attribute Subset Selection Discretization

More information

15-110: Principles of Computing, Spring 2018

15-110: Principles of Computing, Spring 2018 5-: Principles of Computing, Spring 28 Problem Set 8 (PS8) Due: Friday, March 3 by 2:3PM via Gradescope Hand-in HANDIN INSTRUCTIONS Download a copy of this PDF file. You have two ways to fill in your answers:.

More information

ABC Business Magazines Reporting Standards (UK)

ABC Business Magazines Reporting Standards (UK) July 2017 ABC Business Magazines Reporting Standards (UK) Changes have been agreed to the ABC Business Magazine Reporting Standards. We have updated the latest Reporting Standards to incorporate these

More information

PostgreSQL what's new

PostgreSQL what's new PostgreSQL 9.1 - what's new PGDay.IT 2011 Prato, Italy Magnus Hagander magnus@hagander.net @magnushagander PRODUCTS CONSULTING APPLICATION MANAGEMENT IT OPERATIONS SUPPORT TRAINING PostgreSQL 9.1 Released

More information

Approaching the Petabyte Analytic Database: What I learned

Approaching the Petabyte Analytic Database: What I learned Disclaimer This document is for informational purposes only and is subject to change at any time without notice. The information in this document is proprietary to Actian and no part of this document may

More information

Midterm 1: CS186, Spring 2012

Midterm 1: CS186, Spring 2012 Midterm 1: CS186, Spring 2012 Prof. J. Hellerstein You should receive a double- sided answer sheet and a 7- page exam. Mark your name and login on both sides of the answer sheet. For each question, place

More information

Grid Computing Systems: A Survey and Taxonomy

Grid Computing Systems: A Survey and Taxonomy Grid Computing Systems: A Survey and Taxonomy Material for this lecture from: A Survey and Taxonomy of Resource Management Systems for Grid Computing Systems, K. Krauter, R. Buyya, M. Maheswaran, CS Technical

More information

1. [1 pt] What is the solution to the recurrence T(n) = 2T(n-1) + 1, T(1) = 1

1. [1 pt] What is the solution to the recurrence T(n) = 2T(n-1) + 1, T(1) = 1 Asymptotics, Recurrence and Basic Algorithms 1. [1 pt] What is the solution to the recurrence T(n) = 2T(n-1) + 1, T(1) = 1 2. O(n) 2. [1 pt] What is the solution to the recurrence T(n) = T(n/2) + n, T(1)

More information

Oracle Database 11g: SQL and PL/SQL Fundamentals

Oracle Database 11g: SQL and PL/SQL Fundamentals Oracle University Contact Us: +33 (0) 1 57 60 20 81 Oracle Database 11g: SQL and PL/SQL Fundamentals Duration: 5 Days What you will learn In this course, students learn the fundamentals of SQL and PL/SQL

More information

Fall 2018 CSE 482 Big Data Analysis: Exam 1 Total: 36 (+3 bonus points)

Fall 2018 CSE 482 Big Data Analysis: Exam 1 Total: 36 (+3 bonus points) Fall 2018 CSE 482 Big Data Analysis: Exam 1 Total: 36 (+3 bonus points) Name: This exam is open book and notes. You can use a calculator but no laptops, cell phones, nor other electronic devices are allowed.

More information

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska Preprocessing Short Lecture Notes cse352 Professor Anita Wasilewska Data Preprocessing Why preprocess the data? Data cleaning Data integration and transformation Data reduction Discretization and concept

More information

Stream Processing Platforms Storm, Spark,.. Batch Processing Platforms MapReduce, SparkSQL, BigQuery, Hive, Cypher,...

Stream Processing Platforms Storm, Spark,.. Batch Processing Platforms MapReduce, SparkSQL, BigQuery, Hive, Cypher,... Data Ingestion ETL, Distcp, Kafka, OpenRefine, Query & Exploration SQL, Search, Cypher, Stream Processing Platforms Storm, Spark,.. Batch Processing Platforms MapReduce, SparkSQL, BigQuery, Hive, Cypher,...

More information

Pig A language for data processing in Hadoop

Pig A language for data processing in Hadoop Pig A language for data processing in Hadoop Antonino Virgillito THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION Apache Pig: Introduction Tool for querying data on Hadoop

More information

kjhf MIS 510 sharethisdeal - Final Project Report Team Members:- Damini Akash Krittika Karan Khurana Agrawal Patil Dhingra

kjhf MIS 510 sharethisdeal - Final Project Report Team Members:- Damini Akash Krittika Karan Khurana Agrawal Patil Dhingra kjhf MIS 510 sharethisdeal - Final Project Report Team Members:- Damini Akash Krittika Karan Khurana Agrawal Patil Dhingra 5/14/2014 Introduction Shoppers often come across offers that require them to

More information

Data Analysis and Data Science

Data Analysis and Data Science Data Analysis and Data Science CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/29/15 Agenda Check-in Online Analytical Processing Data Science Homework 8 Check-in Online Analytical

More information

Supplementary text S6 Comparison studies on simulated data

Supplementary text S6 Comparison studies on simulated data Supplementary text S Comparison studies on simulated data Peter Langfelder, Rui Luo, Michael C. Oldham, and Steve Horvath Corresponding author: shorvath@mednet.ucla.edu Overview In this document we illustrate

More information

Building Self-Service BI Solutions with Power Query. Written By: Devin

Building Self-Service BI Solutions with Power Query. Written By: Devin Building Self-Service BI Solutions with Power Query Written By: Devin Knight DKnight@PragmaticWorks.com @Knight_Devin CONTENTS PAGE 3 PAGE 4 PAGE 5 PAGE 6 PAGE 7 PAGE 8 PAGE 9 PAGE 11 PAGE 17 PAGE 20 PAGE

More information

Decision Manager Help. Version 7.1.7

Decision Manager Help. Version 7.1.7 Version 7.1.7 This document describes products and services of Pegasystems Inc. It may contain trade secrets and proprietary information. The document and product are protected by copyright and distributed

More information

Query Processing with Indexes. Announcements (February 24) Review. CPS 216 Advanced Database Systems

Query Processing with Indexes. Announcements (February 24) Review. CPS 216 Advanced Database Systems Query Processing with Indexes CPS 216 Advanced Database Systems Announcements (February 24) 2 More reading assignment for next week Buffer management (due next Wednesday) Homework #2 due next Thursday

More information

Inbound Marketing Glossary

Inbound Marketing Glossary Inbound Marketing Glossary A quick guide to essential imbuecreative.com Inbound Marketing Glossary A/B Testing Above the Fold Alt Tag Blog Buyer s Journey Buyer Persona Call to Action (CTA) Click-through

More information

Cisco Unified Contact Center Express Historical Reporting Guide, Release 10.5(1)

Cisco Unified Contact Center Express Historical Reporting Guide, Release 10.5(1) Cisco Unified Contact Center Express Historical Reporting Guide, Release 10.5(1) First Published: June 11, 2014 Americas Headquarters Cisco Systems, Inc. 170 West Tasman Drive San Jose, CA 95134-1706 USA

More information

3. Cluster analysis Overview

3. Cluster analysis Overview Université Laval Multivariate analysis - February 2006 1 3.1. Overview 3. Cluster analysis Clustering requires the recognition of discontinuous subsets in an environment that is sometimes discrete (as

More information

DIGITAL IDENTITY MANAGEMENT Temple Community Platform

DIGITAL IDENTITY MANAGEMENT Temple Community Platform DIGITAL IDENTITY MANAGEMENT Temple Community Platform Digital identity is what uniquely describes a person and their relationships online. Most of us today are experts in social identity management (e.g.,

More information

Guest Lecture. Daniel Dao & Nick Buroojy

Guest Lecture. Daniel Dao & Nick Buroojy Guest Lecture Daniel Dao & Nick Buroojy OVERVIEW What is Civitas Learning What We Do Mission Statement Demo What I Do How I Use Databases Nick Buroojy WHAT IS CIVITAS LEARNING Civitas Learning Mid-sized

More information

Setup Google Analytics

Setup Google Analytics Setup Google Analytics 1.1 Sign Up Google Analytics 1. Once you have a Google account, you can go to Google Analytics (https://analytics.google.com) and click the Sign into Google Analytics button. You

More information