Certain Investigations on Data Clustering Using Hybrid Algorithms for Unlabeled Data Sets

by Komarasamy G

Institution: Anna University
Year: 2015
Keywords: Data mining; K means; K Means Particle Swarm Optimization; Particle Swarm Optimization
Full text PDF: http://shodhganga.inflibnet.ac.in/handle/10603/37796


Data mining is the process of extracting knowledge from a wide variety of datasets. It is mainly used in interdisciplinary subfields of computer science, namely artificial intelligence, machine learning, statistics, and database systems, for discovering original patterns. Clustering is one of the essential processes of data mining. Cluster analysis, or clustering, is the process of combining a set of items into the same group based on their relationships. In the K-Means (KM) algorithm, determining the number of clusters k plays a major role for large datasets. The value of k must be predefined, which is difficult because it is hard to calculate in advance how many clusters are present in the data. There are no competent and universal methods for selecting the best number of clusters, so the value is selected at random. The key challenge is that the clustering process is sensitive to the selection of the initial partition; to overcome this issue, hybrid algorithms are implemented to select the best number of clusters. The Particle Swarm Optimization (PSO) algorithm converges successfully during the initial stages of a global search, but around the global optimum the search process becomes very slow. The KM algorithm can achieve faster convergence to the optimum solution. The K-Means Particle Swarm Optimization (KMPSO) algorithm
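
The abstract's point that K-Means needs k fixed in advance and is sensitive to the initial partition can be illustrated with a minimal sketch. This is not the thesis's KMPSO method: it is a plain Lloyd-style K-Means in pure Python, and the names `make_blobs`, `kmeans`, `sq_dist`, and `mean`, the synthetic 2-D data, and the seed values are all assumptions made for illustration.

```python
import random

def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def mean(cluster):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(coord) / len(cluster) for coord in zip(*cluster))

def make_blobs(centers, per_blob, spread, seed):
    """Hypothetical helper: synthetic 2-D points scattered around the given centers."""
    rng = random.Random(seed)
    return [(rng.gauss(cx, spread), rng.gauss(cy, spread))
            for cx, cy in centers for _ in range(per_blob)]

def kmeans(points, k, seed, iters=100):
    """Lloyd-style K-Means; both k and the seed must be chosen up front."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # the initial partition depends on the seed
    for _ in range(iters):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        new_centroids = [mean(c) if c else centroids[i]
                         for i, c in enumerate(clusters)]
        if new_centroids == centroids:
            break
        centroids = new_centroids
    # Sum of squared errors of every point to its nearest final centroid.
    sse = sum(min(sq_dist(p, c) for c in centroids) for p in points)
    return centroids, sse

points = make_blobs([(0, 0), (10, 0), (5, 9)], per_blob=20, spread=1.0, seed=0)

# Same data and same k, different initial partitions.
sse_by_seed = {seed: kmeans(points, 3, seed)[1] for seed in range(5)}
# Same data, different predefined k.
sse_by_k = {k: kmeans(points, k, seed=0)[1] for k in (1, 2, 3, 4)}

print("SSE per seed (k=3):", sse_by_seed)
print("SSE per k (seed=0):", sse_by_k)
```

Comparing the printed values across seeds and across k shows how much the outcome depends on choices made before the algorithm runs. A global search such as PSO is one way to make those choices less arbitrary, which is the motivation the abstract gives for the hybrid approach.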