Automatic Clustering of Mixed Data Using Genetic Algorithm

Yaghini, Masoud; Vard, Mahdi

International Journal of Industrial Engineering & Production Management

Iran University of Science and Technology



Volume 23, Issue 2 (IJIEPM 2012) 2012, 23(2): 187-197


Yaghini M, Vard M. Automatic Clustering of Mixed Data Using Genetic Algorithm. International Journal of Industrial Engineering & Production Management 2012; 23(2): 187-197
URL: http://ijiepm.iust.ac.ir/article-1-880-en.html

Automatic Clustering of Mixed Data Using Genetic Algorithm

Masoud Yaghini*, Mahdi Vard

* Assistant Professor, School of Railway Engineering, Iran University of Science and Technology, yaghini@iust.ac.ir

Abstract:

In real-world clustering problems, cluster analysis must often be performed on data sets with mixed numeric and categorical values. However, most existing clustering algorithms are efficient only for numeric data, not for mixed data sets. In addition, traditional methods such as the K-means algorithm usually require the user to provide the number of clusters. In this paper, we propose a new method that clusters mixed data and automatically evolves the number of clusters along with the clustering of the data set. The proposed method uses the Davies-Bouldin index as the fitness function and a genetic algorithm to optimize it. We also use a more accurate distance measure for calculating the distance between categorical values. The performance of the algorithm has been studied on real-world and simulated data sets. Comparisons with other clustering algorithms illustrate the effectiveness of this approach.
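To make the fitness function concrete, the following is a minimal sketch of a Davies-Bouldin index computed over mixed records. It is not the authors' implementation. The abstract does not specify the categorical distance measure, so simple matching (a mismatch count) is used here as a stand-in, and the mean/mode prototype, the weight `gamma`, and all function names are assumptions made for illustration. Lower index values indicate more compact, better separated clusters, so a genetic algorithm would look for partitions that minimize it.

```python
from collections import Counter
from math import sqrt


def mixed_distance(a, b, num_idx, cat_idx, gamma=1.0):
    """Euclidean distance on numeric fields plus gamma * mismatches on categorical fields.

    Simple matching is an assumed stand-in for the paper's categorical measure.
    """
    numeric = sqrt(sum((a[i] - b[i]) ** 2 for i in num_idx))
    categorical = sum(1 for i in cat_idx if a[i] != b[i])
    return numeric + gamma * categorical


def prototype(members, num_idx, cat_idx):
    """Assumed cluster representative: mean of numeric fields, mode of categorical fields."""
    proto = list(members[0])
    for i in num_idx:
        proto[i] = sum(m[i] for m in members) / len(members)
    for i in cat_idx:
        proto[i] = Counter(m[i] for m in members).most_common(1)[0][0]
    return tuple(proto)


def davies_bouldin(data, labels, num_idx, cat_idx, gamma=1.0):
    """Davies-Bouldin index of a labelled partition; lower is better."""
    clusters = {}
    for row, label in zip(data, labels):
        clusters.setdefault(label, []).append(row)
    if len(clusters) < 2:
        raise ValueError("the index needs at least two clusters")

    protos = {k: prototype(v, num_idx, cat_idx) for k, v in clusters.items()}
    # Scatter: average distance of a cluster's members to its prototype.
    scatter = {
        k: sum(mixed_distance(r, protos[k], num_idx, cat_idx, gamma) for r in v) / len(v)
        for k, v in clusters.items()
    }

    total = 0.0
    for i in clusters:
        worst = 0.0
        for j in clusters:
            if j == i:
                continue
            separation = mixed_distance(protos[i], protos[j], num_idx, cat_idx, gamma)
            if separation == 0.0:
                return float("inf")  # identical prototypes: treat as worst case
            worst = max(worst, (scatter[i] + scatter[j]) / separation)
        total += worst
    return total / len(clusters)


# Toy records: one numeric field (index 0) and one categorical field (index 1).
data = [(1.0, "a"), (1.2, "a"), (8.0, "b"), (8.4, "b")]
good_score = davies_bouldin(data, [0, 0, 1, 1], num_idx=[0], cat_idx=[1])
bad_score = davies_bouldin(data, [0, 1, 0, 1], num_idx=[0], cat_idx=[1])
print(good_score < bad_score)
```

In a genetic algorithm, each chromosome would encode a candidate partition (including its number of clusters), and a score like this would rank the candidates. The encoding and genetic operators are not described in the abstract, so they are left out of the sketch.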

Keywords: Data mining, Clustering, Mixed data, Genetic algorithm, Davies-Bouldin index


Type of Study: Research | Subject: Other related industrial and production research subjects that are directly related to the state of the art of IE
Received: 2012/07/22 | Published: 2012/08/15