Asunny2019 / 2023_AI_Academy_ASR

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

2023 AI Academy ASR

Jupyter Notebook PyTorch nVIDIA Python Markdown Colab Matplotlib NumPy

딥러닝을 이용한 음성인식 기초 실습

0. Tutorial

Open In Colab

Tutorial for python and data science packages

  • python review
  • numpy
  • matplotlib

1. Audio file handling

Open In Colab

Audio file handling using torchaudio

  • Load audio file(torchaudio.load)
  • Feature extraction(Mel-spectrogram, MFCC)

3. Audio Classification using MLP

Open In Colab

Audio MNIST classification using MLP(torch.Linear)

4. CTC

Open In Colab

Simple Exercise(model training using CTC loss) for Connectionist Temporal Classification

5. Whisper

Open In Colab

Exercise using OpenAI - Whisper and Gradio

6. E2E ASR model finetune with Nemo

Open In Colab

Quartznet Model finetune with Nemo(English to Korean)

7. WFST

Open In Colab

Exercise for WFST using k2

  • C,L,G transducer
  • composition, determinization

Libraries

About


Languages

Language:Jupyter Notebook 100.0%