Abstract
We present a multistage procedure to cluster directed and undirected weighted graphs by finding the block structure of their adjacency matrices. A central part of the process is to scale the adjacency matrix into a doubly-stochastic form, which permits detection of the whole matrix block structure with minimal spectral information (theoretically a single pair of singular vectors suffices).
We present the different stages of our method, namely the impact of the doubly-stochastic scaling on singular vectors, detection of the block structure by means of these vectors, and details such as cluster refinement and a stopping criterion. Then we test the algorithm’s effectiveness by using it on two unsupervised classification tasks: community detection in networks and shape detection in clouds of points in two dimensions. By comparing results of our approach with those of widely used algorithms designed for specific purposes, we observe that our method is competitive (for community detection) if not superior (for shape detection) in comparison with existing methods.
We present the different stages of our method, namely the impact of the doubly-stochastic scaling on singular vectors, detection of the block structure by means of these vectors, and details such as cluster refinement and a stopping criterion. Then we test the algorithm’s effectiveness by using it on two unsupervised classification tasks: community detection in networks and shape detection in clouds of points in two dimensions. By comparing results of our approach with those of widely used algorithms designed for specific purposes, we observe that our method is competitive (for community detection) if not superior (for shape detection) in comparison with existing methods.
Original language | English |
---|---|
Title of host publication | Machine Learning and Knowledge Discovery in Databases |
Subtitle of host publication | European Conference, ECML PKDD 2019, Würzburg, Germany, September 16–20, 2019, Proceedings, Part I |
Editors | Ulf Brefeld, Elisa Fromont, Andreas Hotho, Arno Knobbe, Marloes Maathuis, Céline Robardet |
Place of Publication | Cham, Switzerland |
Publisher | Springer |
Pages | 140-155 |
Number of pages | 16 |
ISBN (Electronic) | 978-3-030-46150-8 |
ISBN (Print) | 978-3-030-46149-2 |
DOIs | |
Publication status | Published - 30 Apr 2020 |
Event | Joint European Conference on Machine Learning and Knowledge Discovery in Databases - Würzburg, Germany Duration: 16 Sept 2019 → 20 Sept 2019 |
Publication series
Name | Lecture Notes in Artificial Intelligence |
---|---|
Publisher | Springer |
Volume | 11906 |
Conference
Conference | Joint European Conference on Machine Learning and Knowledge Discovery in Databases |
---|---|
Abbreviated title | ECML PKDD 2019 |
Country/Territory | Germany |
City | Würzburg |
Period | 16/09/19 → 20/09/19 |
Keywords
- clustering
- unsupervised learning
- spectral clustering
- doubly stochastic scaling
- community detection
- shape detection