Group for Research in Decision Analysis

G-2016-80

Less is more approach for balanced minimum sum-of-squares clustering


Clustering addresses the problem of finding homogeneous and well-separated subsets, called clusters, in a given set of data points. In many applications, there are also constraints on the size of the clusters to be found. In balanced clustering in particular, these constraints require that the entities be spread equally among the clusters. In this work, we present a basic variable neighborhood search heuristic for balanced minimum sum-of-squares clustering, following the recently proposed "less is more" approach presented in Information Sciences 326, 160–171 (2016). Computational experiments show that the proposed algorithm outperforms the current state-of-the-art algorithm for the problem.
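To make the problem statement concrete, the following minimal sketch evaluates a candidate solution: it computes the sum-of-squares objective and checks the balance constraint. It is an illustration only, not the heuristic of the report. The function names `sum_of_squares` and `is_balanced` are hypothetical, and the sketch assumes the usual reading of "balanced", namely that each of the k clusters holds either floor(n/k) or ceil(n/k) of the n points.

```python
from collections import defaultdict


def sum_of_squares(points, labels):
    """Sum of squared Euclidean distances from each point to its cluster centroid."""
    clusters = defaultdict(list)
    for point, label in zip(points, labels):
        clusters[label].append(point)

    total = 0.0
    for members in clusters.values():
        dim = len(members[0])
        # The centroid is the coordinate-wise mean of the cluster's points.
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        total += sum((p[d] - centroid[d]) ** 2 for p in members for d in range(dim))
    return total


def is_balanced(labels, k):
    """Assumed balance rule: every cluster size is floor(n/k) or ceil(n/k)."""
    n = len(labels)
    sizes = [0] * k
    for label in labels:
        sizes[label] += 1
    floor_size = n // k
    ceil_size = -(-n // k)  # ceiling division on integers
    return min(sizes) >= floor_size and max(sizes) <= ceil_size


# Four points forming two tight pairs, assigned to k = 2 clusters.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
labels = [0, 0, 1, 1]
print(sum_of_squares(points, labels))  # 4.0
print(is_balanced(labels, 2))          # True
```

A heuristic for the balanced problem searches over assignments for which `is_balanced` holds and tries to minimize the value returned by `sum_of_squares`.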

12 pages