This is the repository for the paper: Unbalanced Incomplete Multi-view Clustering via the Scheme of View Evolution: Weak Views are Meat; Strong Views do Eat accepted by IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) by Xiang Fang, Yuchong Hu, Pan Zhou, and Dapeng Oliver Wu. This demo is only used to reproduce our experimental results, please do not use it for other purposes. Both IEEE Version and arXiv Version are available.
Unbalanced Incomplete Multi-view Clustering Multi-view clustering has wide applications in many scenarios. In these scenarios, original data often contain unbalanced incomplete views. In this repository, we implement a novel method Unbalanced Incomplete Multi-view Clustering (UIMC).
We conduct extensive experiments on four real-world datasets, and experimental results demonstrate UIMC's superior advantages over other state-of-the-art clustering algorithms. The codes of the compared methods can be found on the authors' claimed websites.
.
├── run_UIMC.m # DEMO file of UIMC
├── UIMC.p # core function of UIMC
├── Digit.mat # data mat files
├── Init_F.m, constructW.m # variable initialization
├── NormalizeFea.m # regularization of data
├── ClusteringMeasure.m # clustering performance
└── EProjSimplex_new.m, EuDist2.m, gamma_norm.m, and L2_distance_1.m, and readsparse.m # intermediate functions
MATLAB R2020b, Windows 10, 3.30 GHz E3-1225 CPU, and 64 GB main memory.
-
Install the MATLAB. The scripts have been verified in Matlab 2019b.
-
Download this repository via git
git clone https://github.com/ZeusDavide/TETCI_UIMC.git
or download the zip file manually.
-
Get multi-view dataset: the BUAA dataset from paper "The buaa-visnir face database instructions", the 3-Sources dataset from (http://erdos.ucd.ie/datasets/3sources.html), the BBC dataset from (http://mlg.ucd.ie/datasets/segment.html), and the Handwritten digit dataset from (http://archive.ics.uci.edu/ml/datasets.html). We only provide the Handwritten digit dataset "digit.mat" in this repository as an example. For the other datasets in the experiments, please refer to the corresponding links or articles.
-
Add the root folder to the Matlab path before running the scripts.
To reproduce the experimental results in Section "Performance Evaluation" of the paper, we need to run the scripts run_UIMC.m
For nonnegative hyper-parameters (α, β, and η), we set α=10^{-2}, β=10^{5}, and η=10^{-1}. In general, increasing iteration number time will promote the clustering performance and consume more time. We recommend its maximum value is 40.
If you use this code please cite:
@article{fang2021uimc,
title={Unbalanced Incomplete Multi-view Clustering via the Scheme of View Evolution:
Weak Views are Meat; Strong Views do Eat},
author={Fang, Xiang and Hu, Yuchong and Zhou, Pan and Wu, Dapeng Oliver},
journal={IEEE Transactions on Emerging Topics in Computational Intelligence},
year={2021},
publisher={IEEE}
doi={10.1109/TETCI.2021.3077909}}