PREDICTION OF POPULAR SMARTPHONE COMPANIES IN THE SOCIETY

Similar documents
Iteration Reduction K Means Clustering Algorithm

Correlation Based Feature Selection with Irrelevant Feature Removal

International Journal of Software and Web Sciences (IJSWS)

Uncertain Data Classification Using Decision Tree Classification Tool With Probability Density Function Modeling Technique

KEYWORDS: Clustering, RFPCM Algorithm, Ranking Method, Query Redirection Method.

Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques

ISSN: (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies

Credit card Fraud Detection using Predictive Modeling: a Review

Development of Prediction Model for Linked Data based on the Decision Tree for Track A, Task A1

Application of Data Mining in Manufacturing Industry

Dynamic Optimization of Generalized SQL Queries with Horizontal Aggregations Using K-Means Clustering

Classifying Twitter Data in Multiple Classes Based On Sentiment Class Labels

Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005

A Cloud Framework for Big Data Analytics Workflows on Azure

Comparison of Online Record Linkage Techniques

Web Page Classification using FP Growth Algorithm Akansha Garg,Computer Science Department Swami Vivekanad Subharti University,Meerut, India

A Comparative Study of Data Mining Process Models (KDD, CRISP-DM and SEMMA)

International Journal of Advance Engineering and Research Development. A Survey on Data Mining Methods and its Applications

Introduction to Data Mining

Role of Fuzzy Set in Students Performance Prediction

Preparation of Data Set for Data Mining Analysis using Horizontal Aggregation in SQL

Detection and Deletion of Outliers from Large Datasets

Keywords: clustering algorithms, unsupervised learning, cluster validity

Mira Shapiro, Analytic Designers LLC, Bethesda, MD

CHAPTER 4 METHODOLOGY AND TOOLS

INFREQUENT WEIGHTED ITEM SET MINING USING NODE SET BASED ALGORITHM

Decision Tree Learning

Algorithms: Decision Trees

Data Mining. 3.2 Decision Tree Classifier. Fall Instructor: Dr. Masoud Yaghini. Chapter 5: Decision Tree Classifier

Improving Quality of Products in Hard Drive Manufacturing by Decision Tree Technique

Performance Analysis of Data Mining Classification Techniques

CLASSIFICATION OF C4.5 AND CART ALGORITHMS USING DECISION TREE METHOD

STUDY PAPER ON CLASSIFICATION TECHIQUE IN DATA MINING

Decision Trees Dr. G. Bharadwaja Kumar VIT Chennai

Comparative Study of Clustering Algorithms using R

Extended R-Tree Indexing Structure for Ensemble Stream Data Classification

DESIGN AND IMPLEMENTATION OF BUILDING DECISION TREE USING C4.5 ALGORITHM

Data Mining. Part 2. Data Understanding and Preparation. 2.4 Data Transformation. Spring Instructor: Dr. Masoud Yaghini. Data Transformation

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT

AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE

Basic Data Mining Technique

Analysis of a Population of Diabetic Patients Databases in Weka Tool P.Yasodha, M. Kannan

An Improved Apriori Algorithm for Association Rules

An Efficient Clustering for Crime Analysis

Frequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management

GUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV

Building Data Mining Application for Customer Relationship Management

TIM 50 - Business Information Systems

Dynamic Clustering of Data with Modified K-Means Algorithm

A Program demonstrating Gini Index Classification

A Two Stage Zone Regression Method for Global Characterization of a Project Database

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X

Marco Porta Betim Çiço Peter Kaczmarski Neki Frasheri Virginio Cantoni. Fernand Vandamme (BIKEMA)

Classification with Decision Tree Induction

Research Scholar, Sree Saraswathi Thyagaraja College, Pollachi, Tamil Nadu, India. Pollachi, Tamil Nadu, India. 1. Introduction

Index Terms Data Mining, Classification, Rapid Miner. Fig.1. RapidMiner User Interface

An Improved Document Clustering Approach Using Weighted K-Means Algorithm

Clustering of Data with Mixed Attributes based on Unified Similarity Metric

Overview of Web Mining Techniques and its Application towards Web

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

APRIORI ALGORITHM FOR MINING FREQUENT ITEMSETS A REVIEW

Cse634 DATA MINING TEST REVIEW. Professor Anita Wasilewska Computer Science Department Stony Brook University

MySQL Data Mining: Extending MySQL to support data mining primitives (demo)

A study of classification algorithms using Rapidminer

Data Mining Techniques Methods Algorithms and Tools

COMP 465: Data Mining Classification Basics

Extra readings beyond the lecture slides are important:

A SURVEY ON AUTOMOBILE INDUSTRIES USING DATA MINING TECHNIQUES

CATEGORIZATION OF THE DOCUMENTS BY USING MACHINE LEARNING

Enhanced Bug Detection by Data Mining Techniques

IMPLEMENTATION AND COMPARATIVE STUDY OF IMPROVED APRIORI ALGORITHM FOR ASSOCIATION PATTERN MINING

A Systematic Overview of Data Mining Algorithms

Improving Classifier Performance by Imputing Missing Values using Discretization Method

Chapter 3. Foundations of Business Intelligence: Databases and Information Management

Chapter 6 VIDEO CASES

Hybrid Approach for Classification using Support Vector Machine and Decision Tree

Comparative Study of Subspace Clustering Algorithms

Logging Reservoir Evaluation Based on Spark. Meng-xin SONG*, Hong-ping MIAO and Yao SUN

CS229 Lecture notes. Raphael John Lamarre Townshend

Improving Quality of Products in Hard Drive Manufacturing by Decision Tree Technique

Data Mining: An experimental approach with WEKA on UCI Dataset

Carmela Comito and Domenico Talia. University of Calabria & ICAR-CNR

The Comparison of CBA Algorithm and CBS Algorithm for Meteorological Data Classification Mohammad Iqbal, Imam Mukhlash, Hanim Maria Astuti

A Comparative Study of Classification Techniques for Fire Data Set

International Journal of Advanced Research in Computer Science and Software Engineering

Data Preprocessing. Slides by: Shree Jaswal

Random Forest A. Fornaser

Data Preprocessing Yudho Giri Sucahyo y, Ph.D , CISA

Improved Frequent Pattern Mining Algorithm with Indexing

Research on outlier intrusion detection technologybased on data mining

Study on Classifiers using Genetic Algorithm and Class based Rules Generation

ANALYSIS COMPUTER SCIENCE Discovery Science, Volume 9, Number 20, April 3, Comparative Study of Classification Algorithms Using Data Mining

DISCOVERING INFORMATIVE KNOWLEDGE FROM HETEROGENEOUS DATA SOURCES TO DEVELOP EFFECTIVE DATA MINING

CLASSIFICATION OF WEB LOG DATA TO IDENTIFY INTERESTED USERS USING DECISION TREES

Classification and Regression Trees

International Journal of Advanced Research in Computer Science and Software Engineering

Decision Trees Oct

Research and Application of Mobile Geographic Information Service Technology Based on JSP Chengtong GUO1, a, Yan YAO1,b

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

Automated Test Case Generation using Data Mining

Transcription:

PREDICTION OF POPULAR SMARTPHONE COMPANIES IN THE SOCIETY T.Ramya 1, A.Mithra 2, J.Sathiya 3, T.Abirami 4 1 Assistant Professor, 2,3,4 Nadar Saraswathi college of Arts and Science, Theni, Tamil Nadu (India) ABSTRACT This Paper Mainly focus on the impact of Smartphone on business, education.the ability to predict the peoples selection for Smartphone s on business persons, employees and home maker is to identify their opinion on what characteristic they are selecting the Smartphone which Smartphone s they are willing to choose and based upon which company are find out for that questionnaire has been designed and circulated among different kinds of peoples around two district and the collected dataset are tested using Rapid Miner 6.0 by implementing ID3 algorithm. Keywords: ID3, Rapid Miner, Smart Phones. I. Introduction 1.1 Data Mining Data mining is an analytic process designed to explore data in search of consistent pattern and/or systematic relationships between variables, and then to validate the finding by applying the detected pattern. Data mining is the process of analyzing data from different perspective and summarizing it into useful information-information that can be used to increase revenue. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases. 1.2 Id3 Algorithm ID3 (Iterative Dichotomiser 3) is an algorithm used to generate a decision tree invented by Ross Quinlan. ID3 is the precursor to the C4.5 algorithm. Very simply, ID3 builds a decision tree from a fixed set of examples. The resulting tree is used to classify future samples. The examples of the given ExampleSet have several attributes and every example belongs to a class (like yes or no). The leaf nodes of the decision tree contain the class name whereas a non-leaf node is a decision node. The decision node is an attribute test with each branch (to another decision tree) being a possible value of the attribute. ID3 uses feature selection heuristic to help it decide which attribute goes into a decision node. The required heuristic can be selected by the criterion parameter. The ID3 algorithm can be summarized as follows: Take all unused attributes and calculate their selection criterion (e.g. information gain) Choose the attribute for which the selection criterion has the best value (e.g. minimum entropy or maximum information gain) Make node containing that attribute 100 P a g e

ID3 searches through the attributes of the training instances and extracts the attribute that best separates the given examples. If the attribute perfectly classifies the training sets then ID3 stops; otherwise it recursively operates on the n (where n = number of possible values of an attribute) partitioned subsets to get their best attribute. The algorithm uses a greedy search, meaning it picks the best attribute and never looks back to reconsider earlier choices. Some major benefits of ID3 are: Understandable prediction rules are created from the training data. Builds a short tree in relatively small time. It only needs to test enough attributes until all data is classified. Finding leaf nodes enables test data to be pruned, reducing the number of tests. 1.3 Mobile Application s Effect in Society from the Ethical Perspective Not only individuals or business, the mobile application also has a great effect in society. The whole society can be facilitate using mobile application. Some issues of social effect describe bellow. Quick communication Save time and increase productivity Increase Job vacancy Less computer use less power consumption Considerable Cost Saving Entertainment 1.4 Limitation of Mobile Application One of the big challenges of mobile application is its platform capability and limitation. Beside the interesting usability of mobile application they have some more interesting platform problems and limitation. We are trying to discuss the limitation in below. 1. Small Screen Size: In mobile platform it is difficult or impossible to view text and graphics like a desktop computer screen. 2. Lack of windows: In desktop we can see many windows at a time. But in mobile platform it is difficult. 3. Navigation: Most mobile devices do not have mouse like pointer, so it has limited flexibility in navigation. 4. Types of pages accessible: The mobile platform does not support all type of file format. 5. Speed: The speed of processing and speed of connectivity of mobile platform is slow. II. METHODOLOGY 2.1 Objective The main objective of the Prediction of popular Smartphone companies in the society is to use data mining methodologies and to predict peoples/citizen selection of Smartphone s. In this research, the classification task is used To Determine the peoples/citizen choice of smartphone characteristic. 101 P a g e

2.2 Data Collection and Data Set Preparation For the Prediction of Popular Smart Phone Company on the Society, Questionnaire has been designed and circulated among 137 various kinds of peoples around Theni and Dindigul district in Tamil Nadu. Finally the Collected data sets are entered in Ms-Excel and imported at Rapid Miner6.0 and ID3 Algorithm is implemented. Table 1.0: data Structure of the basic data. S.NO ATTRIBUTE PEOPLES 1. Business 46 Persons 2. Employees 46 3. Housewives 45 2.3 Rapid Miner Rapid Miner is an open source data mining tool that provides data mining and machine learning procedures including data loading and transformation, data preprocessing and visualization, Modelling, evaluation, and deployment. It is written in the Java programming language. We have used Rapid Miner to generate decision trees of ID3 algorithms. 2.4 Decision Tree In decision tree learning, ID3 (Iterative Dichotomiser 3) is an algorithm invented by Ross Quinlan used to generate a decision tree from the dataset. ID3 is typically used in the machine learning and natural language processing domains. The decision tree technique involves constructing a tree to model the classification process. Once a tree is built, it is applied to each tuple in the database and results in classification for that tuple. The following issues are faced by most decision tree algorithms: Choosing splitting attributes Ordering of splitting attributes Number of splits to take Balance of tree structure and pruning Stopping criteria 2.5 Classification and Prediction There are two forms of data analysis that can be used for extract models describing important classes or predict future data trends. These two forms are as follows: Classification Prediction These data analysis help us to provide a better understanding of large data. Classification predicts categorical and predictions models predicts continuous valued function. 102 P a g e

III. RESULT AND DISCUSSION The Original Dataset is applied in Rapid Miner tool to produce the following Result. In this survey the number of business peoples are 46, employees are 46 and housewives are 45 the graphical representation of various kinds of peoples details are displayed below using Rapid Miner Tool by using Histogram Plot View. Figure 1.1 Usage of Smartphones In Society The Most Popular Smart Phone Companies are Predicted for that dataset are implemented using Rapid Miner Tool and the result are show as pie chart according to the result most people in the society choose Samsung SmartPhones when compared with others FIGURE 1.2 Popular SmartPhone Company 103 P a g e

IV. CONLUSION The Prediction of Popular Smart Phone Company in the Society are identified and the impact of Smart Phone towards society are predicted Finally the result was generated by using various graph and decision tree using Rapid Miner Tool. REFERENCES [1]. Social Data Mining On Smartphones;Ajita Gupta; International Journal Of Computer Science, Engineering And Information Technology (Ijcseit ) Vol,5, Issue 3, July 7, 2011 [2]. Smartphone app usage log mining ;woan-rou tseng and kuo-wei hsu, international journal of computer and electrical engineering, vol. 6, issue 2, [3]. Mining personal data using smartphones and earable devices:a survey ; muhammad habib urrehman, chee sun liew, teh ying wah, junaidshuja and babakdaghighi,journal of faculty of computer science and information technology,vol.3, issue. 4,13 feb2015 [4]. Mobile Marketing Association, 2008, USA 1670 Broadway, Suite 850, Denver, CO 80202. [5]. T. Hsu, "Real-time risk control system for CNP (card not present)", 17th ACM SIGKDD [6]. international conference on Knowledge discovery and data mining, pp.783-783, 2011. [7]. IBM Survey: IT Professionals Predict Mobile and Cloud Technologies Will Dominate Enterprise Computing By 2015, Posted October 17, 2010, URL:http://www.fiercemobilecontent.com/pressreleases/ibm-survey-it-professionals-predict-mobile-and-cloud-technologies-will-dominate-enter. [8]. Markus Hofmann, Ralf Klinkenberg, RapidMiner: Data Mining Use Cases and Business Analytics Applications (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series), CRC Press, October 25, 2013. [9]. WWW.DATAMININGTOOLS.NET [10]. http://learning-ml.blogspot.nl/2014/09/best-programming-language-for-machine.html [11]. Data Mining: Concepts and Techniques (The Morgan Kaufmann Series in Data Management Systems), Jiawei Han, Morgan Kaufmann, 2011. [12]. Data Mining Techniques: For Marketing, Sales, and Customer Support, Michael J. A. Berry, Gordon S. Linoff, Wiley, 1997. 104 P a g e