Abstract
A recurring problem in a wide variety of research areas such as pattern recognition, machine learning, data mining and statistics, among others, is characterized as a clustering problem. Such a problem can be described in a simplistic way as: given a set of data (observations, objects, points, etc.), group similar data into clusters (groups). A clustering of a given data set is then characterized as a set of clusters, in which elements belonging to a cluster are similar to each other and elements belonging to distinct clusters are not similar. Clustering algorithms are non-supervised algorithms and, among the many available in the literature, the k-Means, that uses a random initizalization process, can be considered one of the most popular and successful. The performance of the k-Means, however, is highly dependent on a ‘good’ initialization of the
Get full access to this article
View all access options for this article.
