thevasudevgupta / gsoc-wav2vec2

GSoC'2021 | TensorFlow implementation of Wav2Vec2

Home Page:https://thevasudevgupta.github.io/gsoc-wav2vec2/assets/final_report

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Feedback on the fine-tuning notebook

sayakpaul opened this issue · comments

@vasudevgupta7 great work thus far. Here ate some more pointers.

"How to train TensorFlow saved-model with extra head", I suggest "Fine-tuning with an extra head".

"In this notebook, we will load the pre-trained wav2vec2 model from TFHub and will train it on LibriSpeech dataset by appending LM head over the top of our pre-trained model.", I suggest something like -

In this notebook, we will load the pre-trained wav2vec2 model from TFHub (should be a link to the model when available) and will train it on LibriSpeech dataset by appending LM head over the top of our pre-trained model. The underlying task is to ...

"You can also refer to this repositary for some more amazing tutorials on speech-related tasks. In case you encountered any bug in this notebook, please create an issue here."

Typo.

Additional feedback:

  • Let's try to wrap the training and evaluation loop as a subclassed model (tf.keras.Model). Let us know if you face any problems there.
  • Is it possible to load a few FLAC files in an Audio widget and play them for reference? See if this Colab Notebook helps.

Cc: @MorganR