Ye Jia's starred repositories
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
LxgwWenKai
An open-source Chinese font derived from Fontworks' Klee One. 一款开源中文字体,基于 FONTWORKS 出品字体 Klee One 衍生。
ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
stable-diffusion-webui-colab
stable diffusion webui colab
wavesurfer.js
Audio waveform player
DeepFilterNet
Noise supression using deep filtering
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
torchtitan
A native PyTorch Library for large model training
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
chinese_text_normalization
Chinese text normalization for speech processing
dataloader
The merlin dataloader lets you rapidly load tabular data for training deep leaning models with TensorFlow, PyTorch or JAX
NeMo-text-processing
NeMo text processing for ASR and TTS
Audio_Stuff
Notes about my experience in audio industry