A Study on Various Data Mining Algorithms. Pertaining to VLSI Cell Partitioning

Similar documents
Analysis of Clustering Techniques. in VLSI Cell Partitioning

Effective Clustering Algorithms for VLSI Circuit. Partitioning Problems

An Enhanced Approach for Secure Pattern. Classification in Adversarial Environment

Image Segmentation Based on. Modified Tsallis Entropy

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X

On the Parallel Implementation of Best Fit Decreasing Algorithm in Matlab

Unsupervised Learning : Clustering

6. Dicretization methods 6.1 The purpose of discretization

Fuzzy C-means Clustering with Temporal-based Membership Function

Machine Learning in Biology

Clustering Algorithms for Data Stream

Analyzing Outlier Detection Techniques with Hybrid Method

Implementation on Real Time Public. Transportation Information Using GSM Query. Response System

The k-means Algorithm and Genetic Algorithm

Global Journal of Engineering Science and Research Management

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION

Open Access Research on the Prediction Model of Material Cost Based on Data Mining

A Comparative study of Clustering Algorithms using MapReduce in Hadoop

A New Approach to Evaluate Operations on Multi Granular Nano Topology

Introduction to Clustering

ARTICLE; BIOINFORMATICS Clustering performance comparison using K-means and expectation maximization algorithms

Keywords Clustering, Goals of clustering, clustering techniques, clustering algorithms.

A Texture Extraction Technique for. Cloth Pattern Identification

UNIT 4 INTEGRATED CIRCUIT DESIGN METHODOLOGY E5163

ISSN: (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies

Max- Min Distance method

Efficient FM Algorithm for VLSI Circuit Partitioning

Using Ones Assignment Method and. Robust s Ranking Technique

Novel Cell Array Noise Cancelling Design Scheme. for Stacked Type MRAM. with NAND Structured Cell

Unsupervised Data Mining: Clustering. Izabela Moise, Evangelos Pournaras, Dirk Helbing

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

A Comparative Study on Optimization Techniques for Solving Multi-objective Geometric Programming Problems

Unit 9 : Fundamentals of Parallel Processing

Collaborative Filtering using Euclidean Distance in Recommendation Engine

BBS654 Data Mining. Pinar Duygulu. Slides are adapted from Nazli Ikizler

Keyword Extraction by KNN considering Similarity among Features

One-mode Additive Clustering of Multiway Data

ECLT 5810 Clustering

Mass Classification Method in Mammogram Using Fuzzy K-Nearest Neighbour Equality

CHAPTER 3 A FAST K-MODES CLUSTERING ALGORITHM TO WAREHOUSE VERY LARGE HETEROGENEOUS MEDICAL DATABASES

Conditional Volatility Estimation by. Conditional Quantile Autoregression

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1

CHAPTER VII INDEXED K TWIN NEIGHBOUR CLUSTERING ALGORITHM 7.1 INTRODUCTION

Unsupervised Learning and Clustering

A FAST CLUSTERING-BASED FEATURE SUBSET SELECTION ALGORITHM

CHAPTER 4 FUZZY LOGIC, K-MEANS, FUZZY C-MEANS AND BAYESIAN METHODS

Proximal Manifold Learning via Descriptive Neighbourhood Selection

Generating Topology on Graphs by. Operations on Graphs

Comparative Study of Clustering Algorithms using R

Segmentation of Images

Clustering CS 550: Machine Learning

On Self-complementary Chordal Graphs Defined. by Single Forbidden Induced Subgraph

Graph Sampling Approach for Reducing. Computational Complexity of. Large-Scale Social Network

Unsupervised Learning and Clustering

CS 229 Midterm Review

Fuzzy Segmentation. Chapter Introduction. 4.2 Unsupervised Clustering.

K-Nearest-Neighbours with a Novel Similarity Measure for Intrusion Detection

Complete Bipartite Graphs with No Rainbow Paths

A STUDY OF SOME DATA MINING CLASSIFICATION TECHNIQUES

Particle swarm-based optimal partitioning algorithm for combinational CMOS circuits

Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques

A New Online Clustering Approach for Data in Arbitrary Shaped Clusters

Analysis of Bit Cost for Stacked Type MRAM. with NAND Structured Cell

Enumeration of Minimal Control Sets of Vertices in Oriented Graph

ECLT 5810 Clustering

Evaluation of Ad-hoc Routing Protocols with. Different Mobility Models for Warfield. Scenarios

Conveyor Performance based on Motor DC 12 Volt Eg-530ad-2f using K-Means Clustering

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.

Self-formation, Development and Reproduction of the Artificial System

Topic 1 Classification Alternatives

Efficient Windows Query Processing with. Expanded Grid Cells on Wireless Spatial Data. Broadcasting for Pervasive Computing

Dominator Coloring of Prism Graph

COMPUTATIONAL INTELLIGENCE SEW (INTRODUCTION TO MACHINE LEARNING) SS18. Lecture 6: k-nn Cross-validation Regularization

Lecture 6: Unsupervised Machine Learning Dagmar Gromann International Center For Computational Logic

Chapter 2 Logic Gates and Introduction to Computer Architecture

K-Nearest Neighbour Classifier. Izabela Moise, Evangelos Pournaras, Dirk Helbing

K- Nearest Neighbors(KNN) And Predictive Accuracy

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Comparative analysis of classifier algorithm in data mining Aikjot Kaur Narula#, Dr.Raman Maini*

Stochastic Coalitional Games with Constant Matrix of Transition Probabilities

Unsupervised Learning

Selection of n in K-Means Algorithm

AN EFFICIENT DESIGN OF VLSI ARCHITECTURE FOR FAULT DETECTION USING ORTHOGONAL LATIN SQUARES (OLS) CODES

A New Energy-Aware Routing Protocol for. Improving Path Stability in Ad-hoc Networks

Redefining and Enhancing K-means Algorithm

Design of Low Power Wide Gates used in Register File and Tag Comparator

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

A FUZZY BASED APPROACH TO TEXT MINING AND DOCUMENT CLUSTERING

Solutions of Stochastic Coalitional Games

Generative and discriminative classification techniques

Encoding Words into String Vectors for Word Categorization

Image Compression: An Artificial Neural Network Approach

Global Journal of Engineering Science and Research Management

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

A Note on Vertex Arboricity of Toroidal Graphs without 7-Cycles 1

Customer Clustering using RFM analysis

Fuzzy Entropy based feature selection for classification of hyperspectral data

Data Mining. Part 2. Data Understanding and Preparation. 2.4 Data Transformation. Spring Instructor: Dr. Masoud Yaghini. Data Transformation

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

Blood Microscopic Image Analysis for Acute Leukemia Detection

Transcription:

Contemporary Engineering Sciences, Vol. 7, 2014, no. 8, 363-368 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ces.2014.4220 A Study on Various Data Mining Algorithms Pertaining to VLSI Cell Partitioning R. Sujitha School of Computing SASTRA University, Thanjavur-613401, India R. Manikandan School of Computing SASTRA University, Thanjavur-613401, India Copyright 2014 R. Sujitha and R. Manikandan. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract VLSI Cell Partitioning plays a substantial part in VLSI physical design. The task of designing integrated circuits is multifaceted, as modern circuits have a very huge number of modules. It is very much essential to split the circuit into a smaller and meeker logic blocks. In order to build a complex digital integrated circuit, the multi million transistors must be sub-divided into convenientnumber of pieces.this paper provides a heuristic approach by utilizing data mining algorithms to solve VLSI partitioning problem. A detailed study on several data mining algorithms like K-nearest neighbour, Support Vector Machine (SVM), Fuzzy c-means and K-means algorithms and their implementation in VLSI Cell Partitioning is provided. Keywords: Data mining, Partitioning, K-nearest neighbor, Support Vector Machine, Fuzzy c-means, K-means algorithm 1. INTRODUCTION The most promising outcome of VLSI technology is to develop high-speed, low-power computing structures dedicated for various applications. The design must satisfy numerous constraints on area, power, pins, throughput and noise. One

364 R. Sujitha and R. Manikandan of the major defies in designing a microelectronic VLSI chip is to successfully partition the design into its components[1]. VLSI physical design comprises of three main prospects: Partitioning, Placement, Routing. Division of a complex system design into sub-components,for the purpose of reducing the complexity, is termed as partitioning. Placement involves the process of assigning locations to the components within the die area. The process of interconnecting the placed components, subject to the constraint of minimum wiring, is known as Routing. Partitioning is a domineering area of VLSI Physical design. The main objective of partitioning is to divide the complex circuit into sub-blocks such that designing these sub-blocks individually and then assembling it separately, reduces the complexity level of the entire design to a much greater extent. Data mining algorithms provide a means to analyse the data and extract information from them. Imbibing data mining approaches into VLSI partitioning pave a new way to research domain. Data mining algorithms basically are of two types: Classification algorithms and Clustering algorithms. Both these approaches find their utility in recognising hidden patterns in data.classification involves in forecasting a firm outcome based on a given input. In order to predict the outcome, the algorithm processes a training set that contains a set of attributes and theircorresponding outcome, usually called goal or prediction attribute. The algorithm tries to determinerelations between the attributes that makes it possible to forecast the outcome. Cluster analysis is demarcated as a process of producing groups of objects, or clusters,in a way that alikeobjects are present in one cluster and objects in unlike clustersare divergent [2].The main rewards of cluster techniques is that it can be applied to a specified objective standard consistently to form groups. The speed and consistency of the clustering procedure in organizing the data together form an irresistible reason to employ clustering in partitioning. Section II describes the four data mining algorithms in detail. Section III defines the implementation of the data mining algorithms in VLSI cell partitioning. The results are discussed in the section IV. 2. ALGORITHM DISCUSSION This section briefs about the Classification algorithms: K Nearest Neighbour and Support Vector Machine algorithm and Clustering algorithms: Fuzzy C means and K means algorithm. Though Classification and Clustering algorithms are considered to be ambiguous, their differences are very inimitable. There are predefined set of classes in the classification process and decision has to be made on assigning the already existing class to the new data set. The underlying process in clustering technology is to determine some relation between the data sets and group them based on their relation. 2.1 CLASSIFICATION ALGORITHMS The main idea behind classification algorithm is selection of a hypothetical set from the existing sets that best fits the given input data set. This section invol-

A study on various data mining algorithms 365 ves detailed description of the selected data mining classification algorithms: K Nearest Neighbour Algorithm and Support Vector Machine Algorithm. 2.1.1 K NEAREST NEIGHBOUR ALGORITHM K Nearest Neighbour (KNN) algorithm is a lazy learning algorithm.it is also non-parametric in nature. Assumptions are not done on the underlying data as the practical data does not follow any hypothetical assumptions. There is no overt generalization stage or it is very negligible. On the basis of following presumptions, KNN algorithm is performed. The algorithm presumes that records are in metric space. The data can be scalar type or multidimensional type. Each of the data points has a notion of distance as they are in feature space. The training data set contains a vector set and a class taginterrelatedto the vector. But KNN works well with random number of classes. A single number "k" which denotes the number of neighbours that are defined based on the distances (The distance is calculated in one of the following techniques: Euclidean Distance, Minkowski Distance, Mahalanobis Distance.)[4]. If k=1, then the algorithm is simply called the nearest neighbour algorithm. 2.1.2 SUPPORT VECTOR MACHINE ALGORITHM A Support Vector Machine algorithm(svm) constructs a model that assigns given data set into one category or the other, to which the training data sets belong. Thus we can call this SVM algorithm as a non-probabilistic bilinear classifier. Data is represented in spatial domain, so that training data set of different categories are separated by a distinct distance. Therefore the risk of mismatching the given data set is highly reduced. However the SVM algorithm takes a longer and CPU memory than KNN algorithm. The SVM algorithm suffers a serious setback as it is directly applicable only in two classes and there is an uncaliberated class relationship probability. 2.2 CLUSTERING ALGORITHMS The idea behind clustering algorithms is to group objects into subsets or clusters based on their relationship with each other [6]. Data in the same cluster will have related properties whereas those in different clusters will have unrelated properties[2]. The algorithms discussed under this section are Fuzzy C means algorithm and K means algorithm. 2.2.1 FUZZY C-MEANS Algorithm In fuzzy clustering, each data point in the data set is related with every other cluster using a relationship function. The c-means algorithm allows overlapping of dataset, i.e. the given data set can belong to two or more clusters at the same. Let Y= {y1,y2,..., y M } be a set of numerical data in R N. Let jbe an integer, 1 < j< M. Given X, we say that jfuzzy subsets {a k :Y [0, 1]} are a j- partition of Yif the following conditions are satisfied[ ]: 0 a kl 1 for all k,l. 1.A kl =1 for all l. 1.B

U1A 366 R. Sujitha and R. Manikandan 0 < kl< n for all k...1.c where a kl = a k (y l ), 1 k jand 1 l M. Let the jmvalues a kl filling the above conditions be arrayed as a j Mmatrix A= [a kl ].Then the set of all such matrices are the non-degenerate fuzzy j-partitions of Y: M fcm = {A R N : a kl satisfies conditions (1.A, 1.B and 1.C) k and l}. If all the a kl s are either 0 or 1, we have the subset of hard j-partitions of Y: M cm = {A M FcM : a kl = 0 or 1 kand l}. This Fuzzy j-partitions is termed as Fuzzy C-partitions.Fuzzy clustering is used in mining complex and multi-dimensional datasets.fuzzy-c-means (FCM) algorithm iswidespread, in which a portion of data has restrictedrelation with each of the pre-defined cluster centres [5]. 2.2.2 K MEANS ALGORITHM The K means algorithm is designed to cluster data on the basis of cluster centre called mean. The sum of clusters k is supposed to be static. The algorithm begins for initial k number of clusters, by assigning the residual data to the adjoining clusters and repetitivelyaltering the relationship of the clusters between each other until the membership becomes stable. It has two stages: Initialization and Iteration stages.during the former phase, the data is allotted intok clusters. In the latter phase, the distance between the data setand each cluster is calculated and the data set is assigned to the nearest cluster. However, K means algorithm does not employ overlapping of the data sets. 3. IMPLEMENTATION The Classification and Clustering algorithms are implemented for the below circuit. The gate at every nodule is represented by the gate matrix, GM, and the assembly of gates is represented by an input matrix, IM. The gate matrix is designed one column at a, with each consequent columncontaining the gates. The input of the gates may be from the gatesof previous column or wires; therefore thetotal number of columns in GM, C GM, is equal to the maximumnumber of gates on any path from circuit input to output [3]. Table 1. Gate matrix representation U6A U6B U6C 7410N U2A U4A 74LS04N U5A 7408J NUMBER TYPE NUMBER TYPE 0 NO 5 WIRE GATE 1 AND 6 NAND 2 OR 7 NOR 3 XOR 8 XNOR 4 INV 9 BUF U6D U6E U6F U7A U7B U7C U2B U2C U2D U4B 74LS04N U3A Fig 1. Example circuit U5B 7408J The number of rows in GM, R GM, denotes the cut size of the circuit. The gate in each node is represented by a number in the gate matrix.the corresponding numbers for each gate are itemized in Table 1.The relation between gates in the gate matrix is represented in input matrix, IM, in such a way that the number of

iterations iteration 0 1 2 3 4 5 6 7 8 9 iterations iterations A study on various data mining algorithms 367 columns in input matrix and gate matrix are equal[3].subsequently, the number of rows ininput matrix, R IM, is the product of R GM and the maximum fanin of any gate in the circuit.therefore, the promising values forcolumn X are {1+R GM [X 1], R GM X} [3]. The four algorithms discussed are applied to the example circuit. The matrix is given as input to the classification and clustering algorithms. 4. RESULT AND DISCUSSION The Classification algorithms, K nearest neighbour and support vector machine algorithms yield the results as shown in figure 2 and 3.The K Nearest neighbour algorithm gives better classification result than the SVM algorithm as the VLSI cell partitioning employs multiple classes. The SVM algorithm fails to produce efficient result while operating for multiple classes. The Clustering algorithms, Fuzzy C-means and K means Algorithm yield the results shown in figure 4 and 5 when implemented on the circuit. The object function is greatly minimized in the Fuzzy C-means algorithm with less number of iterations when compared to the K means algorithm. Moreover, permitting overlapping of the data sets makes the fuzzy C-means to be more efficient than K means algorithm. 2.5 2.5 2 2 1.5 1 1.5 0.5 0 1 2 3 4 5 6 7 Fig 2.KNN Algorithm 1 3 3.5 4 4.5 5 5.5 6 6.5 7 Fig 3. SVM Algorithm 16 1000 14 12 10 900 800 700 600 8 6 4 2 0 Fig 4. FCM Algorithm 500 400 300 200 100 0 0 100 200 300 400 500 600 700 800 Fig 5.K Means Algorithm

368 R. Sujitha and R. Manikandan 5. CONCLUSION In this paper, the VLSI cell partitioning which plays a significant role in VLSI physical design is approached with the concept of data mining. The two main data mining approaches, Classification and Clustering are analysed. The Classification algorithms, K Nearest Neighbour and Support Vector Machine algorithms are implemented with the given circuit. It is identified that K Nearest Neighbour gives the optimum solution with regards to the VLSI Cell Partitioning. The Clustering algorithms, Fuzzy C-means and K means algorithms are also implemented on the circuit. It is recognized that Fuzzy C-means algorithm provides optimum solution to VLSI cell partitioning. REFERENCES [1] Brian Von Herzen,VLSI Partitioning of a 2-Gs/s Digital Spectrometer,IEEE journal of solid-state circuits, VOL. 26, NO. 5, MAY 1991 [2] Gan, Guojun, Chaoqun Ma, and Jianhong Wu, Data Clustering: Theory, Algorithms, and Applications, ASA-SIAM Series on Statistics and Applied Probability, SIAM, Philadelphia, ASA, Alexandria, VA, 2007. [3] Ganesh K. Venayagamoorthy et al., Particle swarm-based optimal partitioning algorithm for combinational CMOS circuits,engineering Applications of Artificial Intelligence (2006), doi:10.1016/j.engappai.2006.06.011. [4] InduSaini, Dilbag Singh, ArunKhosla.,Detection of QRS-complex using K- nearestneighbouralgorithm., Int. J. Medical Engineering and Informatics, 2013, 5(1),81-101. [5] SubhagataChattopadhyay, Dilip Kumar Pratihar, Sanjib Chandra De Sarkar, A Comparative study of fuzzy c-means algorithm and entropy-based fuzzy clustering algorithms,computing and Informatics, 2011,30, 701 720. [6] Sumithradevi. K.A, Vijayalakshmi.M.N, Optimization of Circuit Partitioning in VLSI through Classification Algorithms, IEEE 2010 4(10). Received: February 20, 2014