FPGA-Federated-Learning

FPGA for Federated Learning Speedup

Knowledge Distillation baseline settings:

  • Running on CUDA; GPU: NVIDIA RTX 2080 Ti; Python 3.6.9

  • Dataset: CIFAR-10

  • Teacher models: ResNet18 (acc = 94.81%), LeNet5 (acc = 61.91%)

  • Training: 60 epochs with SGD (lr=1e-1, momentum=0.9, weight_decay=5e-4) and StepLR(optimizer, step_size=20, gamma=0.1)

  • Distillation settings: temperature T = 6, alpha = 0.5 (a training sketch follows this list)

  • Deep Mutual Learning
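For reference, here is a minimal sketch of how the baseline settings above could be wired up in PyTorch. The `kd_loss` helper, the `train` function, and the model/loader names are illustrative assumptions, not code from this repo; the loss is the standard Hinton-style distillation objective implied by the T/alpha settings.

```python
import torch
import torch.nn.functional as F
from torch.optim import SGD
from torch.optim.lr_scheduler import StepLR

EPOCHS = 60  # from the settings above

def kd_loss(student_logits, teacher_logits, labels, T=6, alpha=0.5):
    # Soft-target KL term, scaled by T^2 to keep gradient magnitudes
    # comparable across temperatures (standard Hinton et al. formulation).
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)  # hard-label term
    return alpha * soft + (1 - alpha) * hard

def train(student, teacher, loader, device="cuda"):
    optimizer = SGD(student.parameters(),
                    lr=1e-1, momentum=0.9, weight_decay=5e-4)
    scheduler = StepLR(optimizer, step_size=20, gamma=0.1)
    teacher.eval()
    student.train()
    for _ in range(EPOCHS):
        for x, y in loader:
            x, y = x.to(device), y.to(device)
            with torch.no_grad():        # teacher runs forward only
                t_logits = teacher(x)
            loss = kd_loss(student(x), t_logits, y)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
        scheduler.step()                 # decay lr every 20 epochs
```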

Results

  • Total training time (mm:ss) and final accuracy:

    | Student \ Teacher | ResNet18       | LeNet5         | None           |
    |-------------------|----------------|----------------|----------------|
    | ResNet18          | 52:47 (94.12%) | 27:34 (78.24%) | 27:11 (89.27%) |
    | LeNet5            | 26:36 (62.63%) | 05:30 (59.95%) | 05:25 (57.83%) |
  • Training time per epoch:

    | Student \ Teacher | ResNet18   | LeNet5     | None       |
    |-------------------|------------|------------|------------|
    | ResNet18          | 50 s/epoch | 26 s/epoch | 25 s/epoch |
    | LeNet5            | 25 s/epoch | 4 s/epoch  | 4 s/epoch  |
  • Single forward pass only (ResNet18): 5 s/epoch, 2607 MiB of GPU memory

  • Single forward pass (ResNet18) while a second model (ResNet18) trains on the same GPU: 12 s/epoch, 5246 MiB

  • GPU memory usage (11019 MiB total on the card):

    | Student \ Teacher | ResNet18 | LeNet5   | None     |
    |-------------------|----------|----------|----------|
    | ResNet18          | 3475 MiB | 2653 MiB | 2651 MiB |
    | LeNet5            | 2649 MiB | 925 MiB  | 901 MiB  |
  • Training time per epoch (mutual learning):

    | Local \ Meme | ResNet18   | LeNet5     |
    |--------------|------------|------------|
    | ResNet18     | 50 s/epoch | 25 s/epoch |
    | LeNet5       | 25 s/epoch | 4 s/epoch  |
  • Mutual learning, two forward and two backward passes (both ResNet18): 50 s/epoch (see the sketch below)
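A rough sketch of one such mutual-learning step, following Deep Mutual Learning: each model trains on cross-entropy plus a KL term toward the other model's (detached) predictions, giving the two forward and two backward passes counted above. Function and variable names here are assumptions, not code from this repo.

```python
import torch.nn.functional as F

def mutual_step(local, meme, opt_local, opt_meme, x, y):
    # Two forward passes: one per model.
    logits_l, logits_m = local(x), meme(x)

    # Each loss = hard-label CE + KL toward the peer's softmax output.
    # detach() keeps each backward pass confined to its own model.
    loss_l = F.cross_entropy(logits_l, y) + F.kl_div(
        F.log_softmax(logits_l, dim=1),
        F.softmax(logits_m.detach(), dim=1),
        reduction="batchmean",
    )
    loss_m = F.cross_entropy(logits_m, y) + F.kl_div(
        F.log_softmax(logits_m, dim=1),
        F.softmax(logits_l.detach(), dim=1),
        reduction="batchmean",
    )

    # Two backward passes, matching the "2 forward 2 backward" count above.
    opt_local.zero_grad(); loss_l.backward(); opt_local.step()
    opt_meme.zero_grad(); loss_m.backward(); opt_meme.step()
```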
