Introductory Hands-On Exercises in Speech Recognition Using Deep Learning
Tutorial on Python and data science packages
- Python review
- numpy
- matplotlib
Audio file handling using torchaudio
- Loading audio files (torchaudio.load)
- Feature extraction (mel-spectrogram, MFCC)
Audio MNIST classification using an MLP (torch.nn.Linear)
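
A minimal sketch of what such an MLP classifier might look like, built from `torch.nn.Linear` layers. The feature shape (13 MFCCs by 50 frames), the hidden size, and the random tensors standing in for real Audio MNIST features and labels are assumptions, not the course's actual setup.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Assumed feature layout: 13 MFCCs x 50 frames per clip; 10 spoken-digit classes.
n_mfcc, n_frames, n_classes = 13, 50, 10

model = nn.Sequential(
    nn.Flatten(),                       # (batch, 13, 50) -> (batch, 650)
    nn.Linear(n_mfcc * n_frames, 128),  # hidden size 128 is arbitrary
    nn.ReLU(),
    nn.Linear(128, n_classes),          # one logit per digit
)

# Random tensors stand in for a batch of real features and labels.
features = torch.randn(8, n_mfcc, n_frames)
labels = torch.randint(0, n_classes, (8,))

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# One training step: forward pass, loss, backward pass, parameter update.
logits = model(features)
loss = loss_fn(logits, labels)
optimizer.zero_grad()
loss.backward()
optimizer.step()
print(logits.shape, loss.item())
```

Flattening discards the time structure of the features, which is acceptable for short, fixed-length digit clips but is one reason sequence losses such as CTC are used for longer utterances.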
Simple exercise on Connectionist Temporal Classification (model training using CTC loss)
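
To give a sense of how training with CTC loss is typically wired up, here is a small sketch using `torch.nn.CTCLoss`. The single linear layer, the random inputs and targets, and all sizes are placeholders assumed for illustration; a real exercise would use an acoustic model and transcribed audio.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Assumed toy sizes: 50 frames, batch of 4, 13 features per frame.
n_frames, batch, n_feats = 50, 4, 13
n_classes = 20   # index 0 is reserved for the CTC blank symbol
target_len = 10  # labels per utterance, shorter than the frame count

model = nn.Linear(n_feats, n_classes)  # stand-in for an acoustic model
ctc_loss = nn.CTCLoss(blank=0)
optimizer = torch.optim.Adam(model.parameters(), lr=0.01)

inputs = torch.randn(n_frames, batch, n_feats)
targets = torch.randint(1, n_classes, (batch, target_len))  # never the blank
input_lengths = torch.full((batch,), n_frames, dtype=torch.long)
target_lengths = torch.full((batch,), target_len, dtype=torch.long)

losses = []
for step in range(100):
    # CTCLoss expects log-probabilities shaped (time, batch, classes).
    log_probs = model(inputs).log_softmax(dim=-1)
    loss = ctc_loss(log_probs, targets, input_lengths, target_lengths)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    losses.append(loss.item())

print(f"first loss: {losses[0]:.3f}, last loss: {losses[-1]:.3f}")
```

No frame-level alignment is supplied: the loss sums over all alignments of the 10 target labels to the 50 frames, which is the point of CTC.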
Exercise using OpenAI Whisper and Gradio
Fine-tuning a QuartzNet model with NeMo (English to Korean)
Exercise on WFSTs using k2
- C, L, G transducers
- Composition, determinization
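
The composition operation can be sketched in plain Python for intuition. This is not k2's API: the dictionary representation, the `compose` helper, and the two toy transducers are hypothetical, the sketch assumes epsilon-free transducers with tropical (additive) weights, and determinization is not shown.

```python
from collections import deque


def compose(a, b):
    """Compose two epsilon-free transducers; weights add (tropical semiring).

    Each transducer is a dict with "start", "finals" (state -> weight) and
    "arcs" (state -> list of (ilabel, olabel, weight, next_state)).
    """
    start = (a["start"], b["start"])
    arcs, finals = {}, {}
    queue, seen = deque([start]), {start}
    while queue:
        state = queue.popleft()
        qa, qb = state
        arcs[state] = []
        # A pair state is final only if both component states are final.
        if qa in a["finals"] and qb in b["finals"]:
            finals[state] = a["finals"][qa] + b["finals"][qb]
        for in_a, out_a, w_a, next_a in a["arcs"].get(qa, []):
            for in_b, out_b, w_b, next_b in b["arcs"].get(qb, []):
                if out_a != in_b:
                    continue  # output of A must match input of B
                nxt = (next_a, next_b)
                arcs[state].append((in_a, out_b, w_a + w_b, nxt))
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
    return {"start": start, "finals": finals, "arcs": arcs}


# Toy transducer A: maps the sequence "a b" to "x y".
A = {
    "start": 0,
    "finals": {2: 0.0},
    "arcs": {0: [("a", "x", 0.5, 1)], 1: [("b", "y", 1.0, 2)]},
}
# Toy transducer B: a single looping state that rewrites x -> X and y -> Y.
B = {
    "start": 0,
    "finals": {0: 0.0},
    "arcs": {0: [("x", "X", 0.25, 0), ("y", "Y", 0.5, 0)]},
}

AB = compose(A, B)
for state, out_arcs in AB["arcs"].items():
    print(state, out_arcs)
```

The result maps "a b" directly to "X Y" with the weights of both machines summed along the path, the same idea used when chaining the C, L, and G transducers into one decoding graph. Real lexicon and grammar transducers contain epsilon labels, which need the additional handling that libraries such as k2 provide.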
PyTorch : pytorch/pytorch
NeMo : Nvidia/NeMo
TorchAudio : pytorch/audio
NumPy : numpy/numpy
matplotlib : matplotlib/matplotlib
Whisper : openai/whisper
gradio : gradio-app/gradio