Local Correlation Clustering with Asymmetric Classification Errors

In the Correlation Clustering problem, we are given a complete weighted graph with its edges labeled as "similar" and "dissimilar" by a noisy binary classifier. For a clustering of graph , a similar edge is in disagreement with , if its endpoints belong to distinct clusters; and a dissimilar edge is in disagreement with if its endpoints belong to the same cluster. The disagreements vector, , is a vector indexed by the vertices of such that the -th coordinate equals the weight of all disagreeing edges incident on . The goal is to produce a clustering that minimizes the norm of the disagreements vector for . We study the objective in Correlation Clustering under the following assumption: Every similar edge has weight in the range of and every dissimilar edge has weight at least (where and is a scaling parameter). We give an approximation algorithm for this problem. Furthermore, we show an almost matching convex programming integrality gap.
View on arXiv