Improved algorithms for the random cluster graph model Academic Article uri icon


  • The following probabilistic process models the generation of noisy clustering data: Clusters correspond to disjoint sets of vertices in a graph. Each two vertices from the same set are connected by an edge with probability p, and each two vertices from different sets are connected by an edge with probability r< p. The goal of the clustering problem is to reconstruct the clusters from the graph. We give algorithms that solve this problem with high probability. Compared to previous studies, our algorithms have lower time complexity and wider parameter range of applicability. In particular, our algorithms can handle O (√ n/log n) clusters in an n-vertex graph, while all previous algorithms require that the number of clusters is constant.

publication date

  • July 3, 2002