Abstract
We introduce a principled method for the signed clustering problem, where the goal is to partition a weighted undirected graph whose edge weights take both positive and negative values, such that edges within the same cluster are mostly positive, while edges spanning across clusters are mostly negative. Our method relies on a graph-based diffuse interface model formulation utilizing the Ginzburg-Landau functional, based on an adaptation of the classic numerical Merriman-Bence-Osher (MBO) scheme for minimizing such graph-based functionals. The proposed object ive function aims to minimize the total weight of inter-cluster positively-weighted edges, while maximizing the total weight of the inter-cluster negatively-weighted edges. Our method scales to large sparse networks, and can be easily adjusted to incorporate labelled data information, as is often the case in the context of semisupervised learning. We tested our method on a number of both synthetic stochastic block models and real-world data sets (including financial correlation matrices), and obtained promising results that compare favourably against a number of state-of-the-art approaches from the recent literature.
Original language | English |
---|---|
Pages (from-to) | 73-109 |
Number of pages | 37 |
Journal | Communications in Mathematical Sciences |
Volume | 19 |
Issue number | 1 |
DOIs | |
Publication status | Published - 24 Mar 2021 |
Keywords
- clustering
- graph Laplacians
- Merriman—Bence—Osher scheme
- signed networks
- spectral methods
- threshold dynamics
- time series
ASJC Scopus subject areas
- Mathematics(all)
- Applied Mathematics