machine-learning federated-learning optimization rank-analysis

FedOptim

Paper on OpenReview
ArXiv: Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank?
Data: We currently do not have a permanent link to the data (including model parameters, gradient history for generating plots). However, please feel free to reach out to me and we will try to figure a way out to share the data dump with you. The total data size is over 600 GB. If you need a subset of the data dump, it might be easier to share over dropbox or google drive.

Setup

# preferred
conda create --name <env-name> --file conda.txt

# or
python3 -m venv <env-name>
source <env-name>/bin/activate
pip install -r pip.txt

Running Experiments

to check the exact function names please refer to the shell scripts

cd src

# to run centralized experiments
# generates the gradients for preliminary experiments
sh sh/train_central_<cnn|fcn|resnet|svm|unet|vgg>.sh <dataset-name>

# to run federated baselines
sh sh/train_federated_<cnn|fcn|resnet|unet|vgg>.sh <dataset-name> <num-classes-per-node>

# to run distributed baselines
sh sh/train_distributed_<cnn|fcn|resnet|unet|vgg>.sh <dataset-name> <num-classes-per-node>

# to run standalone lbgm
sh sh/train_lbgm_<cnn|fcn|resnet|unet|vgg>.sh <dataset-name> <num-classes-per-node>

# to run plug-n-play baselines
sh sh/train_<signsgd|atomo|topk>.sh <dataset-name> <num-classes-per-node> <algorithm-hyperparams>

# to run plug-n-play lbgm
sh sh/train_<signsgd|atomo|topk>_lbgm.sh <dataset-name> <num-classes-per-node> <algorithm-hyperparams>

Generating plots

to check the exact function names please refer to the shell scripts
standalone plots are generated using script: src/sh/expt_1_standalone_with_acc_comm_curve.sh
plug-n-play plots are generated using script: src/sh/expt_2_plugnplay_with_acc_comm_curve.sh
plots exploring the effect of error threshold use: src/sh/expt_3_threshold_with_acc_comm_curve.sh
distributed plots are generated using script: src/sh/expt_4_distributed_with_acc_comm_curve.sh

# to process gradients for preliminary visualizations
sh sh/process_grads.sh <dataset-name>

# to generate preliminary visualizations
sh sh/prelim_<1|2|3>.sh <dataset-name|svm|seg> f

# to generate plots for exploratory experiments
sh sh/expt_<1|2|3|4>_standalone_with_acc_comm_curve.sh <main|pfl|sampled|architecture-name>_<iid|non_iid> f

Citation

If you find the repository or the paper useful, please cite the following paper

@inproceedings{
  azam2022recycling,
  title={Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank?},
  author={Sheikh Shams Azam and Seyyedali Hosseinalipour and Qiang Qiu and Christopher Brinton},
  booktitle={International Conference on Learning Representations},
  year={2022},
  url={https://openreview.net/forum?id=B7ZbqNLDn-_}
}

About

Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank?

https://openreview.net/forum?id=B7ZbqNLDn-_

machine-learning federated-learning optimization rank-analysis

Languages

Language:Jupyter Notebook 92.9%Language:Python 3.6%Language:Shell 3.5%