Network Devices Data Visualization Using Weka
|
|
- Lillian Norris
- 5 years ago
- Views:
Transcription
1 Volume 118 No , ISSN: (printed version); ISSN: (on-line version) url: ijpam.eu Network Devices Data Visualization Using Weka B.krishna Sagar 1, E. Madhusudhana Reddy 2, S.Ramakrishna 3 1 Research Scholar,Computer Science and Engineering, J.N.T.U.K, Kakinada, India. 2 Professor, Computer Science & Engineering, SRM DRK College of Engineering & Technology,Hyderabad,India. 3 Professor, Computer Science, Sri Venkateswara University,Tirupati, India 1 krishna.sagar521@gmail.com, 2 e mreddy@yahoo.com, 3 drsramakrishna@yahoo.com January 29, 2018 Abstract Big Data brings a new era for research scholars. Different tools are used to process, analyze and visualization of big data. These can help scholars to find problems and study theoretically. Different articles are published in various areas in big data technologies. thus we studied various published research articles to find better approach to process the big data and analyzing it. In particular, we propose an idea to process a big XML data and visualize through Weka tool. Weka tool accepts the CSV or ARFF file for analysis and visualization. Here we proposed one algorithm to convert router generated XML data to csv format to process and analyze the data through Weka. Key Words: XML Data, Weka Tool, ARFF, CSV
2 1 Introduction In recent years processing and analyzing the customer data becomes a crucial part for organizations and institutions. But this data become very large to handle and analyze. For example, Social Media, e-commerce, Logistics, Intuitions Data etc., To process these data, various tools are used like Hadoop, Hive, and Spark etc. and for visualization we have Tableau, Qlikview, Datawrappers, Microsoft Power BI, Oracle Visual Analyzer. The acquired big data are largescale, heterogeneous, and generated at high speed, thus potentially complex to cope with [1]. For example, to analyze the data which are generated from different sensors connected to patient in hospital to provide better treatment. To handle the big data analysis, some data mining algorithms provides better solutions. In recent years scholars are interested in supervised and unsupervised algorithms. For example, classification algorithms [2], and clustering algorithms [3]. For instance, hadoop is used for processing data and Mahout, MLlib for analysis of data. Distributed systems provide an environment that can allow big data processing. Such systems, made up of organized collections of commodity hardware, process big data in a distributed manner. The main difficulty in big data mining to propose an algorithm in such a way to include in big data mining toolkits. The organizations aim to analysis of big data to extract knowledge and better to understand the customer needs. Recent big data technologies provide a user friendly environment to include the algorithm to analysis of big data. such developments require a basic understanding of distributed big data technologies for implementation of algorithms. Weka is a data mining tool that contains different algorithms to analyze the data. In recent years Weka tool is using to analyze the big data in distributed environment. For example DistributedWekaSpark [2], the Distributed Weka for Spark Package has been available in Weka for several years[3]. The success of Weka is to have a wide range of well data analysis algorithms and model evaluation procedures and metrics. Weka also provides an API (application programming interface) for integration of its algorithms and procedures resulting in visual environments such as Tableau and Qlikview. DistributedWekaSpark, is an in-memory processing of Big Data. DistributedWekaSpark contains various 2 600
3 machine learning data analysis algorithms with this spark integration. DistributedWekaSpark toolkit includes packages to provide map-reduce procedures and data processing, classification and execute tasks in Big Data. For example, Classification contains SVM (Support Vector Machine), Nave Bayes, Mapreduce, Decision Tree. These algorithms are implemented in this DistributedWekaSpark toolkit to reduce the processing speed, effective visualization and Load Balancing. In a distributed computing system, a process is used to coordinate many tasks. It is not a problem which systems are doing the work, but there should be a coordinator that will work at any time. So, electing a coordinator or a leader is very essential problem in distributed environment and there are several algorithms that are used in this process. Leader, Ring and Bully algorithms are some of them. In a group of communicating protocols, the elect a new coordinator is essential when a leader is crashes or away from the group. In this scenario in distributed environment, elections are conduct in appropriate situations. Election algorithm has a variety of applications such as key distribution, routing coordination, sensor coordination, and general control. When nodes are mobile, topologies can change and nodes may dynamically join/leave a network. In such networks, election algorithm can play a major role frequently, making it a particularly critical component of system operation. In this paper, we are used this novel election algorithm for delivering messages in absentia of destination system by choosing a nearby system or friend system through its past communication history. The traditional statement of the election problem is to finally elect a distinct coordinator from a set of nodes from many sources. 2 Implementation of Python Program to convert xml data to csv 2.1 Python Program to Convert the XML Data to CSV Data File import xml.etree.elementtree as etree tree = etree.parse( /root/omi12.xml ) 3 601
4 root = tree.getroot() def normalize(name): if name[0] == : uri, tag = name[1:].split( } ) return tag else: return name def finds child names attrib(i):... for c1 in root[i]:... yield(normalize(c1.tag),c1.attrib,c1.text)... for c2 in c1:... yield(normalize(c2.tag),c2.attrib,c2.text)... for c3 in c2:... yield(normalize(c3.tag),c3.attrib,c3.text) i=finds child names attrib(1) l=[] for j in i:... l.append(j) 2.2 Python Program to Convert the XML Data to CSV Data File This XML Router Data Taken by HPE, Software Company For Research
5 3 Data Transformation This Module contains of 3 phases, data selection, data collection, and data normalization. This section is important to improve the data quality for mining. The data selection phase is handled using Python programming statements. 4 Architecture for XML Data Visualization Fig 1.Architecture for XML Data Visualization 5 Data Visualization This module consists of data inputs to the Weka Tool and Data Visualization. To start the data visualization, the input data in Comma-separated values (CSV) was loaded into the main WEKA GUI which is Explorer menu. WEKA has four main menus which are Explorer, Experimenter, KnowledgeFow, and Simple CLI. To start the data exploration for mining, the input data in the CSV are loaded into the Explorer menu. Fig. 2 shows the WEKA main GUI[4]
6 Fig. 2. Entering in to the Weka GUI Inside the Explorer menu, it has six different panels appear on the tabs at the top, that corresponds to the various DM available tasks which are preprocess, classify, cluster, associate, select attribute, and visualize. To begin the data analysis, CSV file are input on the Open file button in preprocess panel. For this research, the file named output.csv were imported on the preprocess panel and 154 instances, 3 Attributes were displayed [4]. As show in fig 3, fig 4 and fig 5. Fig. 3. Weka Explorer -Preprocess tab 6 604
7 Fig. 4. Choose the CSV File Which is Generated by Python Program Fig. 5. Weka Explorer -Preprocess view for Network Routers Data In fig 5, we can see the complete XML Data. Here we can delete the attributes and visualize a particular attribute. in fig 6, we are visualizing a particular attribute and its instances. for example, the attribute device ID:828baa70-1e8e-71e7-1d25-0fda have Last Modified Time. As shown in fig 6 and fig
8 Fig. 6. Weka Visualization for Network Routers Data Figure.7. Weka Visualization for Instances 8 606
9 6 CONCLUSION This paper presents visualization of complex XML DATA generated by net-work devices. Here we used python libraries to convert xml data to csv and inputs into the Weka GUI tool for visualization. Future work will reduce the noise data from csv and providing distributed environment for analyzing network device data. Acknowledgment In this paper we collected XML DATA from HPE Company for Research. References [1] D. Agrawal, S. Das, and A. El Abbadi, Big data and cloud computing: Current state and future opportunities, in Proceedings of the 14th International Conference on Extending Database Technology, ser. EDBT/ICDT 11.New York, NY, USA: ACM, 2011, pp [2] Aris-Kyriakos Koliopoulos, Paraskevas Yiapanis, Firat Tekiner, Goran Nenadic, John Keane, A Parallel DistributedWeka Framework for Big Data Mining using Spark, BigDataCongress , 2015, pp.916. [3] [4] Nur Hafieza Ismail, Fadhilah Ahmad, and Azwa Abdul Aziz Implementing WEKA as a Data Mining Tool to Analyze Students Academic Performances Using Nave Bayes Classifier
10 608
Twitter data Analytics using Distributed Computing
Twitter data Analytics using Distributed Computing Uma Narayanan Athrira Unnikrishnan Dr. Varghese Paul Dr. Shelbi Joseph Research Scholar M.tech Student Professor Assistant Professor Dept. of IT, SOE
More informationSCALABLE KNOWLEDGE BASED AGGREGATION OF COLLECTIVE BEHAVIOR
SCALABLE KNOWLEDGE BASED AGGREGATION OF COLLECTIVE BEHAVIOR P.SHENBAGAVALLI M.E., Research Scholar, Assistant professor/cse MPNMJ Engineering college Sspshenba2@gmail.com J.SARAVANAKUMAR B.Tech(IT)., PG
More informationInternational Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X
Analysis about Classification Techniques on Categorical Data in Data Mining Assistant Professor P. Meena Department of Computer Science Adhiyaman Arts and Science College for Women Uthangarai, Krishnagiri,
More informationClassifying Twitter Data in Multiple Classes Based On Sentiment Class Labels
Classifying Twitter Data in Multiple Classes Based On Sentiment Class Labels Richa Jain 1, Namrata Sharma 2 1M.Tech Scholar, Department of CSE, Sushila Devi Bansal College of Engineering, Indore (M.P.),
More informationIMPLEMENTATION OF CLASSIFICATION ALGORITHMS USING WEKA NAÏVE BAYES CLASSIFIER
IMPLEMENTATION OF CLASSIFICATION ALGORITHMS USING WEKA NAÏVE BAYES CLASSIFIER N. Suresh Kumar, Dr. M. Thangamani 1 Assistant Professor, Sri Ramakrishna Engineering College, Coimbatore, India 2 Assistant
More informationApache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context
1 Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes
More informationDealing with Data Especially Big Data
Dealing with Data Especially Big Data INFO-GB-2346.01 Fall 2017 Professor Norman White nwhite@stern.nyu.edu normwhite@twitter Teaching Assistant: Frenil Sanghavi fps241@stern.nyu.edu Administrative Assistant:
More informationApache Spark and Hadoop Based Big Data Processing System for Clinical Research
Apache Spark and Hadoop Based Big Data Processing System for Clinical Research Sreekanth Rallapalli 1,*, Gondkar R R 2 1 Research Scholar, R&D Centre, Bharathiyar University, Coimbatore, Tamilnadu, India.
More informationCloud Computing 3. CSCI 4850/5850 High-Performance Computing Spring 2018
Cloud Computing 3 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning
More informationChapter 5: Summary and Conclusion CHAPTER 5 SUMMARY AND CONCLUSION. Chapter 1: Introduction
CHAPTER 5 SUMMARY AND CONCLUSION Chapter 1: Introduction Data mining is used to extract the hidden, potential, useful and valuable information from very large amount of data. Data mining tools can handle
More informationResearch Article Apriori Association Rule Algorithms using VMware Environment
Research Journal of Applied Sciences, Engineering and Technology 8(2): 16-166, 214 DOI:1.1926/rjaset.8.955 ISSN: 24-7459; e-issn: 24-7467 214 Maxwell Scientific Publication Corp. Submitted: January 2,
More informationData Clustering on the Parallel Hadoop MapReduce Model. Dimitrios Verraros
Data Clustering on the Parallel Hadoop MapReduce Model Dimitrios Verraros Overview The purpose of this thesis is to implement and benchmark the performance of a parallel K- means clustering algorithm on
More informationChapter 1, Introduction
CSI 4352, Introduction to Data Mining Chapter 1, Introduction Young-Rae Cho Associate Professor Department of Computer Science Baylor University What is Data Mining? Definition Knowledge Discovery from
More informationCloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018
Cloud Computing 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning
More informationAn Effective Performance of Feature Selection with Classification of Data Mining Using SVM Algorithm
Proceedings of the National Conference on Recent Trends in Mathematical Computing NCRTMC 13 427 An Effective Performance of Feature Selection with Classification of Data Mining Using SVM Algorithm A.Veeraswamy
More informationPre-Requisites: CS2510. NU Core Designations: AD
DS4100: Data Collection, Integration and Analysis Teaches how to collect data from multiple sources and integrate them into consistent data sets. Explains how to use semi-automated and automated classification
More informationIdentifying Important Communications
Identifying Important Communications Aaron Jaffey ajaffey@stanford.edu Akifumi Kobashi akobashi@stanford.edu Abstract As we move towards a society increasingly dependent on electronic communication, our
More informationIJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online):
IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online): 2321-0613 A Study on Handling Missing Values and Noisy Data using WEKA Tool R. Vinodhini 1 A. Rajalakshmi
More informationSpecialist ICT Learning
Specialist ICT Learning APPLIED DATA SCIENCE AND BIG DATA ANALYTICS GTBD7 Course Description This intensive training course provides theoretical and technical aspects of Data Science and Business Analytics.
More informationISSN: (Online) Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies
ISSN: 2321-7782 (Online) Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Paper / Case Study Available online at: www.ijarcsms.com
More informationA Parallel Distributed Weka Framework for Big Data Mining using Spark
A Parallel Distributed Weka Framework for Big Data Mining using Spark Aris-Kyriakos Koliopoulos, Paraskevas Yiapanis, Firat Tekiner, Goran Nenadic, John Keane School of Computer Science, The University
More informationChapter 1 - The Spark Machine Learning Library
Chapter 1 - The Spark Machine Learning Library Objectives Key objectives of this chapter: The Spark Machine Learning Library (MLlib) MLlib dense and sparse vectors and matrices Types of distributed matrices
More informationNida Afreen Rizvi 1, Anjana Pandey 2, Ratish Agrawal 2 and Mahesh Pawar 2
P r e d i c t i o n m o d e l o f D i a b e t e s D r u g U s i n g H i v e a n d R Prediction model of Diabetes Drug Using Hive and R Nida Afreen Rizvi 1, Anjana Pandey 2, Ratish Agrawal 2 and Mahesh
More informationCOMPARISON OF DIFFERENT CLASSIFICATION TECHNIQUES
COMPARISON OF DIFFERENT CLASSIFICATION TECHNIQUES USING DIFFERENT DATASETS V. Vaithiyanathan 1, K. Rajeswari 2, Kapil Tajane 3, Rahul Pitale 3 1 Associate Dean Research, CTS Chair Professor, SASTRA University,
More informationMining User - Aware Rare Sequential Topic Pattern in Document Streams
Mining User - Aware Rare Sequential Topic Pattern in Document Streams A.Mary Assistant Professor, Department of Computer Science And Engineering Alpha College Of Engineering, Thirumazhisai, Tamil Nadu,
More informationData Science Bootcamp Curriculum. NYC Data Science Academy
Data Science Bootcamp Curriculum NYC Data Science Academy 100+ hours free, self-paced online course. Access to part-time in-person courses hosted at NYC campus Machine Learning with R and Python Foundations
More informationBIG DATA & HADOOP: A Survey
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 6.017 IJCSMC,
More informationOracle Big Data Fundamentals Ed 1
Oracle University Contact Us: +0097143909050 Oracle Big Data Fundamentals Ed 1 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, learn to use Oracle's Integrated Big Data
More informationCorrelation Based Feature Selection with Irrelevant Feature Removal
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,
More informationOracle Big Data Connectors
Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process
More informationUsing Weka for Classification. Preparing a data file
Using Weka for Classification Preparing a data file Prepare a data file in CSV format. It should have the names of the features, which Weka calls attributes, on the first line, with the names separated
More informationA ProM Operational Support Provider for Predictive Monitoring of Business Processes
A ProM Operational Support Provider for Predictive Monitoring of Business Processes Marco Federici 1,2, Williams Rizzi 1,2, Chiara Di Francescomarino 1, Marlon Dumas 3, Chiara Ghidini 1, Fabrizio Maria
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK DISTRIBUTED FRAMEWORK FOR DATA MINING AS A SERVICE ON PRIVATE CLOUD RUCHA V. JAMNEKAR
More informationInfrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset
Infrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset M.Hamsathvani 1, D.Rajeswari 2 M.E, R.Kalaiselvi 3 1 PG Scholar(M.E), Angel College of Engineering and Technology, Tiruppur,
More informationData Mining. Lab 1: Data sets: characteristics, formats, repositories Introduction to Weka. I. Data sets. I.1. Data sets characteristics and formats
Data Mining Lab 1: Data sets: characteristics, formats, repositories Introduction to Weka I. Data sets I.1. Data sets characteristics and formats The data to be processed can be structured (e.g. data matrix,
More information2/26/2017. Originally developed at the University of California - Berkeley's AMPLab
Apache is a fast and general engine for large-scale data processing aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes Low latency: sub-second
More informationWEKA homepage.
WEKA homepage http://www.cs.waikato.ac.nz/ml/weka/ Data mining software written in Java (distributed under the GNU Public License). Used for research, education, and applications. Comprehensive set of
More informationMACHINE LEARNING BASED METHODOLOGY FOR TESTING OBJECT ORIENTED APPLICATIONS
MACHINE LEARNING BASED METHODOLOGY FOR TESTING OBJECT ORIENTED APPLICATIONS N. Kannadhasan and B. Uma Maheswari Department of Master of Computer Applications St. Joseph s College of Engineering, Chennai,
More informationIntroduction to Text Mining. Hongning Wang
Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:
More informationData Mining Concepts & Tasks
Data Mining Concepts & Tasks Duen Horng (Polo) Chau Georgia Tech CSE6242 / CX4242 Jan 16, 2014 Partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko, Christos Faloutsos Last Time
More informationTackling Big Data Using MATLAB
Tackling Big Data Using MATLAB Alka Nair Application Engineer 2015 The MathWorks, Inc. 1 Building Machine Learning Models with Big Data Access Preprocess, Exploration & Model Development Scale up & Integrate
More informationNatural Language Processing on Hospitals: Sentimental Analysis and Feature Extraction #1 Atul Kamat, #2 Snehal Chavan, #3 Neil Bamb, #4 Hiral Athwani,
ISSN 2395-1621 Natural Language Processing on Hospitals: Sentimental Analysis and Feature Extraction #1 Atul Kamat, #2 Snehal Chavan, #3 Neil Bamb, #4 Hiral Athwani, #5 Prof. Shital A. Hande 2 chavansnehal247@gmail.com
More informationBig Data. Big Data Analyst. Big Data Engineer. Big Data Architect
Big Data Big Data Analyst INTRODUCTION TO BIG DATA ANALYTICS ANALYTICS PROCESSING TECHNIQUES DATA TRANSFORMATION & BATCH PROCESSING REAL TIME (STREAM) DATA PROCESSING Big Data Engineer BIG DATA FOUNDATION
More informationPublished by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 1
Cluster Based Speed and Effective Feature Extraction for Efficient Search Engine Manjuparkavi A 1, Arokiamuthu M 2 1 PG Scholar, Computer Science, Dr. Pauls Engineering College, Villupuram, India 2 Assistant
More informationDIGIT.B4 Big Data PoC
DIGIT.B4 Big Data PoC DIGIT 01 Social Media D02.01 PoC Requirements Table of contents 1 Introduction... 5 1.1 Context... 5 1.2 Objective... 5 2 Data SOURCES... 6 2.1 Data sources... 6 2.2 Data fields...
More informationEfficient Algorithm for Frequent Itemset Generation in Big Data
Efficient Algorithm for Frequent Itemset Generation in Big Data Anbumalar Smilin V, Siddique Ibrahim S.P, Dr.M.Sivabalakrishnan P.G. Student, Department of Computer Science and Engineering, Kumaraguru
More informationPACKET HANDLING SCHEDULING IN MULTIPLE ROUTING CONFIGURATIONS FOR FAST IP NETWORK RECOVERY
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 2, Issue. 4, April 2013,
More informationFREQUENT PATTERN MINING IN BIG DATA USING MAVEN PLUGIN. School of Computing, SASTRA University, Thanjavur , India
Volume 115 No. 7 2017, 105-110 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu FREQUENT PATTERN MINING IN BIG DATA USING MAVEN PLUGIN Balaji.N 1,
More informationAN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE
AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE Vandit Agarwal 1, Mandhani Kushal 2 and Preetham Kumar 3
More informationData Mining With Weka A Short Tutorial
Data Mining With Weka A Short Tutorial Dr. Wenjia Wang School of Computing Sciences University of East Anglia (UEA), Norwich, UK Content 1. Introduction to Weka 2. Data Mining Functions and Tools 3. Data
More informationIJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 12, 2017 ISSN (online):
IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 12, 2017 ISSN (online): 2321-0613 S.Subhitsha 1 S.Selvakumar 2 V.P.Sumathi 3 1,2 Student 3 Assistant Professor (SRG) 1,2,3
More informationResearch Article Combining Pre-fetching and Intelligent Caching Technique (SVM) to Predict Attractive Tourist Places
Research Journal of Applied Sciences, Engineering and Technology 9(1): -46, 15 DOI:.1926/rjaset.9.1374 ISSN: -7459; e-issn: -7467 15 Maxwell Scientific Publication Corp. Submitted: July 1, 14 Accepted:
More informationSQL Query Optimization on Cross Nodes for Distributed System
2016 International Conference on Power, Energy Engineering and Management (PEEM 2016) ISBN: 978-1-60595-324-3 SQL Query Optimization on Cross Nodes for Distributed System Feng ZHAO 1, Qiao SUN 1, Yan-bin
More informationPython With Data Science
Course Overview This course covers theoretical and technical aspects of using Python in Applied Data Science projects and Data Logistics use cases. Who Should Attend Data Scientists, Software Developers,
More informationAn Efficient Approach for Color Pattern Matching Using Image Mining
An Efficient Approach for Color Pattern Matching Using Image Mining * Manjot Kaur Navjot Kaur Master of Technology in Computer Science & Engineering, Sri Guru Granth Sahib World University, Fatehgarh Sahib,
More informationSURVEY PAPER ON CLOUD COMPUTING
SURVEY PAPER ON CLOUD COMPUTING Kalpana Tiwari 1, Er. Sachin Chaudhary 2, Er. Kumar Shanu 3 1,2,3 Department of Computer Science and Engineering Bhagwant Institute of Technology, Muzaffarnagar, Uttar Pradesh
More informationKeywords Hadoop, Map Reduce, K-Means, Data Analysis, Storage, Clusters.
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
More informationBEST BIG DATA CERTIFICATIONS
VALIANCE INSIGHTS BIG DATA BEST BIG DATA CERTIFICATIONS email : info@valiancesolutions.com website : www.valiancesolutions.com VALIANCE SOLUTIONS Analytics: Optimizing Certificate Engineer Engineering
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:
IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T
More informationFile Inclusion Vulnerability Analysis using Hadoop and Navie Bayes Classifier
File Inclusion Vulnerability Analysis using Hadoop and Navie Bayes Classifier [1] Vidya Muraleedharan [2] Dr.KSatheesh Kumar [3] Ashok Babu [1] M.Tech Student, School of Computer Sciences, Mahatma Gandhi
More informationA Comparative study of Clustering Algorithms using MapReduce in Hadoop
A Comparative study of Clustering Algorithms using MapReduce in Hadoop Dweepna Garg 1, Khushboo Trivedi 2, B.B.Panchal 3 1 Department of Computer Science and Engineering, Parul Institute of Engineering
More informationA SURVEY ON SIMPLIFIED PARALLEL DATA PROCESSING ON LARGE WEIGHTED ITEMSET USING MAPREDUCE
A SURVEY ON SIMPLIFIED PARALLEL DATA PROCESSING ON LARGE WEIGHTED ITEMSET USING MAPREDUCE Sunitha S 1, Sahanadevi K J 2 1 Mtech Student, Depatrment Of Computer Science and Engineering, EWIT, B lore, India
More informationMahout in Action MANNING ROBIN ANIL SEAN OWEN TED DUNNING ELLEN FRIEDMAN. Shelter Island
Mahout in Action SEAN OWEN ROBIN ANIL TED DUNNING ELLEN FRIEDMAN II MANNING Shelter Island contents preface xvii acknowledgments about this book xx xix about multimedia extras xxiii about the cover illustration
More informationInformation Retrieval System Based on Context-aware in Internet of Things. Ma Junhong 1, a *
Information Retrieval System Based on Context-aware in Internet of Things Ma Junhong 1, a * 1 Xi an International University, Shaanxi, China, 710000 a sufeiya913@qq.com Keywords: Context-aware computing,
More informationBest Combination of Machine Learning Algorithms for Course Recommendation System in E-learning
Best Combination of Machine Learning Algorithms for Course Recommendation System in E-learning Sunita B Aher M.E. (CSE) -II Walchand Institute of Technology Solapur University India Lobo L.M.R.J. Associate
More informationInternational Journal of Software and Web Sciences (IJSWS)
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) ISSN (Print): 2279-0063 ISSN (Online): 2279-0071 International
More informationIntroduction to Big Data
Introduction to Big Data OVERVIEW We are experiencing transformational changes in the computing arena. Data is doubling every 12 to 18 months, accelerating the pace of innovation and time-to-value. The
More informationFraud Detection Using Random Forest Algorithm
Fraud Detection Using Random Forest Algorithm Eesha Goel Computer Science Engineering and Technology, GZSCCET, Bhatinda, India eesha1992@rediffmail.com Abhilasha Computer Science Engineering and Technology,
More informationIntroducing Partial Matching Approach in Association Rules for Better Treatment of Missing Values
Introducing Partial Matching Approach in Association Rules for Better Treatment of Missing Values SHARIQ BASHIR, SAAD RAZZAQ, UMER MAQBOOL, SONYA TAHIR, A. RAUF BAIG Department of Computer Science (Machine
More informationDATA SCIENCE USING SPARK: AN INTRODUCTION
DATA SCIENCE USING SPARK: AN INTRODUCTION TOPICS COVERED Introduction to Spark Getting Started with Spark Programming in Spark Data Science with Spark What next? 2 DATA SCIENCE PROCESS Exploratory Data
More informationInternational Journal of Advance Engineering and Research Development. A Survey on Data Mining Methods and its Applications
Scientific Journal of Impact Factor (SJIF): 4.72 International Journal of Advance Engineering and Research Development Volume 5, Issue 01, January -2018 e-issn (O): 2348-4470 p-issn (P): 2348-6406 A Survey
More informationEnhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm
Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm K.Parimala, Assistant Professor, MCA Department, NMS.S.Vellaichamy Nadar College, Madurai, Dr.V.Palanisamy,
More informationIntroduction to Big-Data
Introduction to Big-Data Ms.N.D.Sonwane 1, Mr.S.P.Taley 2 1 Assistant Professor, Computer Science & Engineering, DBACER, Maharashtra, India 2 Assistant Professor, Information Technology, DBACER, Maharashtra,
More informationEfficient Record De-Duplication Identifying Using Febrl Framework
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 10, Issue 2 (Mar. - Apr. 2013), PP 22-27 Efficient Record De-Duplication Identifying Using Febrl Framework K.Mala
More informationLink Prediction for Social Network
Link Prediction for Social Network Ning Lin Computer Science and Engineering University of California, San Diego Email: nil016@eng.ucsd.edu Abstract Friendship recommendation has become an important issue
More informationProject Requirements
Project Requirements Version 4.0 2 May, 2016 2015-2016 Computer Science Department, Texas Christian University Revision Signatures By signing the following document, the team member is acknowledging that
More informationSensor Based Time Series Classification of Body Movement
Sensor Based Time Series Classification of Body Movement Swapna Philip, Yu Cao*, and Ming Li Department of Computer Science California State University, Fresno Fresno, CA, U.S.A swapna.philip@gmail.com,
More informationPerformance Analysis of Frequent Closed Itemset Mining: PEPP Scalability over CHARM, CLOSET+ and BIDE
Volume 3, No. 1, Jan-Feb 2012 International Journal of Advanced Research in Computer Science RESEARCH PAPER Available Online at www.ijarcs.info ISSN No. 0976-5697 Performance Analysis of Frequent Closed
More informationData Analytics Framework and Methodology for WhatsApp Chats
Data Analytics Framework and Methodology for WhatsApp Chats Transliteration of Thanglish and Short WhatsApp Messages P. Sudhandradevi Department of Computer Applications Bharathiar University Coimbatore,
More informationBig Data Analytics for Host Misbehavior Detection
Big Data Analytics for Host Misbehavior Detection Miguel Pupo Correia joint work with Daniel Gonçalves, João Bota (Vodafone PT) 2016 European Security Conference June 2016 Motivation Networks are complex,
More informationObtaining Rough Set Approximation using MapReduce Technique in Data Mining
Obtaining Rough Set Approximation using MapReduce Technique in Data Mining Varda Dhande 1, Dr. B. K. Sarkar 2 1 M.E II yr student, Dept of Computer Engg, P.V.P.I.T Collage of Engineering Pune, Maharashtra,
More informationBig Data Analytics. Description:
Big Data Analytics Description: With the advance of IT storage, pcoressing, computation, and sensing technologies, Big Data has become a novel norm of life. Only until recently, computers are able to capture
More informationCollective Intelligence in Action
Collective Intelligence in Action SATNAM ALAG II MANNING Greenwich (74 w. long.) contents foreword xv preface xvii acknowledgments xix about this book xxi PART 1 GATHERING DATA FOR INTELLIGENCE 1 "1 Understanding
More informationData Science Course Content
CHAPTER 1: INTRODUCTION TO DATA SCIENCE Data Science Course Content What is the need for Data Scientists Data Science Foundation Business Intelligence Data Analysis Data Mining Machine Learning Difference
More informationKnowledge Discovery in Databases. Databases. date name surname street city account no. payment balance
Databases date name surname street city account no. payment balance 980103 Jan Novak Dlouha 5 Praha 1 9945371 100.00 100.00 980105 Jan Novak Dlouha 5 Praha 1 9945371 1500.00 1600.00 980106 Jan Novak Dlouha
More informationCOLD-START PRODUCT RECOMMENDATION THROUGH SOCIAL NETWORKING SITES USING WEB SERVICE INFORMATION
COLD-START PRODUCT RECOMMENDATION THROUGH SOCIAL NETWORKING SITES USING WEB SERVICE INFORMATION Z. HIMAJA 1, C.SREEDHAR 2 1 PG Scholar, Dept of CSE, G Pulla Reddy Engineering College, Kurnool (District),
More informationHigh Performance Computing on MapReduce Programming Framework
International Journal of Private Cloud Computing Environment and Management Vol. 2, No. 1, (2015), pp. 27-32 http://dx.doi.org/10.21742/ijpccem.2015.2.1.04 High Performance Computing on MapReduce Programming
More informationDATA ANALYSIS WITH WEKA. Author: Nagamani Mutteni Asst.Professor MERI
DATA ANALYSIS WITH WEKA Author: Nagamani Mutteni Asst.Professor MERI Topic: Data Analysis with Weka Course Duration: 2 Months Objective: Everybody talks about Data Mining and Big Data nowadays. Weka is
More informationData Preprocessing Method of Web Usage Mining for Data Cleaning and Identifying User navigational Pattern
Data Preprocessing Method of Web Usage Mining for Data Cleaning and Identifying User navigational Pattern Wasvand Chandrama, Prof. P.R.Devale, Prof. Ravindra Murumkar Department of Information technology,
More informationINFORMATION-THEORETIC OUTLIER DETECTION FOR LARGE-SCALE CATEGORICAL DATA
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 4, April 2015,
More informationRegression Based Cluster Formation for Enhancement of Lifetime of WSN
Regression Based Cluster Formation for Enhancement of Lifetime of WSN K. Lakshmi Joshitha Assistant Professor Sri Sai Ram Engineering College Chennai, India lakshmijoshitha@yahoo.com A. Gangasri PG Scholar
More informationDisease Prediction in Data Mining
RESEARCH ARTICLE Comparative Analysis of Classification Algorithms Used for Disease Prediction in Data Mining Abstract: Amit Tate 1, Bajrangsingh Rajpurohit 2, Jayanand Pawar 3, Ujwala Gavhane 4 1,2,3,4
More informationA Review to the Approach for Transformation of Data from MySQL to NoSQL
A Review to the Approach for Transformation of Data from MySQL to NoSQL Monika 1 and Ashok 2 1 M. Tech. Scholar, Department of Computer Science and Engineering, BITS College of Engineering, Bhiwani, Haryana
More informationIan Choy. Technology Solutions Professional
Ian Choy Technology Solutions Professional XML KPIs SQL Server 2000 Management Studio Mirroring SQL Server 2005 Compression Policy-Based Mgmt Programmability SQL Server 2008 PowerPivot SharePoint Integration
More informationData Science and Open Source Software. Iraklis Varlamis Assistant Professor Harokopio University of Athens
Data Science and Open Source Software Iraklis Varlamis Assistant Professor Harokopio University of Athens varlamis@hua.gr What is data science? 2 Why data science is important? More data (volume, variety,...)
More informationPrognosis of Lung Cancer Using Data Mining Techniques
Prognosis of Lung Cancer Using Data Mining Techniques 1 C. Saranya, M.Phil, Research Scholar, Dr.M.G.R.Chockalingam Arts College, Arni 2 K. R. Dillirani, Associate Professor, Department of Computer Science,
More informationIBM Data Science Experience White paper. SparkR. Transforming R into a tool for big data analytics
IBM Data Science Experience White paper R Transforming R into a tool for big data analytics 2 R Executive summary This white paper introduces R, a package for the R statistical programming language that
More informationOptimization of Query Processing in XML Document Using Association and Path Based Indexing
Optimization of Query Processing in XML Document Using Association and Path Based Indexing D.Karthiga 1, S.Gunasekaran 2 Student,Dept. of CSE, V.S.B Engineering College, TamilNadu, India 1 Assistant Professor,Dept.
More informationApplication of Big Data Technology to Library data:a review
Application of Big Data Technology to Library data:a review A.Kaladhar Research Scholar Dept. of Library and Information Science JNTUK Kakinada (A.P) Email:librarian@svecw.edu.in and B.R. Doraswamy Naick
More informationIntegration of information security and network data mining technology in the era of big data
Acta Technica 62 No. 1A/2017, 157 166 c 2017 Institute of Thermomechanics CAS, v.v.i. Integration of information security and network data mining technology in the era of big data Lu Li 1 Abstract. The
More information