Ye Jia's starred repositories
torchtitan
A native PyTorch Library for large model training
LxgwWenKai
An open-source Chinese font derived from Fontworks' Klee One. 一款开源中文字体,基于 FONTWORKS 出品字体 Klee One 衍生。
stable-diffusion-webui-colab
stable diffusion webui colab
ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
NeMo-text-processing
NeMo text processing for ASR and TTS
Audio_Stuff
Notes about my experience in audio industry
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
dataloader
The merlin dataloader lets you rapidly load tabular data for training deep leaning models with TensorFlow, PyTorch or JAX
wavesurfer.js
Audio waveform player
chinese_text_normalization
Chinese text normalization for speech processing
DeepFilterNet
Noise supression using deep filtering