- https://discord.com/channels/879548962464493619/1050420105613430794/1054800474349514753
- https://discord.com/channels/879548962464493619/1052622386064793660/1053321191072464896
- https://discord.com/channels/879548962464493619/1052622386064793660/1053323395107913738
- https://github.com/huggingface/community-events/tree/main/whisper-fine-tuning-event#deepspee
- https://huggingface.co/docs/transformers/v4.19.2/en/main_classes/deepspeed#deployment-in-notebooks
- https://discuss.huggingface.co/t/deepspeed-integration-with-trainer-in-colab-crashing-typeerror-object-init-takes-exactly-one-argument-the-instance-to-initialize/28255
- SpeechT5
- Whisper
- Wav2Vec2
- Random Seed (https://discuss.huggingface.co/t/fixing-the-random-seed-in-the-trainer-does-not-produce-the-same-results-across-runs/3442)
- Training Steps
- train_batch_size
- Training Speed
- Inference Speed
- Does DeepSpeed affect Model Performance too?