Sage Journals: Discover world-class research

Abstract

Ensemble clustering methods have become increasingly important to ease the task of choosing the most appropriate cluster algorithm for a particular data analysis problem. The consensus clustering (CC) algorithm is a recognized ensemble clustering method that uses an artificial intelligence technique to optimize a fitness function. We formally prove the existence of a subspace of the search space for CC, which contains all solutions of maximal fitness and suggests two greedy algorithms to search this subspace. We evaluate the algorithms on two gene expression data sets and one synthetic data set, and compare the result with the results of other ensemble clustering approaches.

Keywords

ensemble clustering fitness function gene expression data greedy search search space restriction

Get full access to this article

View all access options for this article.

Optimal Search Space for Clustering Gene Expression Data via Consensus

Abstract

Keywords

Get full access to this article