Joshua Zhou's repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
asv-subtools
Open-source tools for speaker recognition
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
chatgpt-mac
ChatGPT for Mac, living in your menubar.
ColossalAI
Making big AI models cheaper, easier, and more scalable
ddia
Chinese translation of Designing Data-Intensive Applications (DDIA)
Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
FinanceDatabase
This is a database of 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.
g2p-seq2seq
G2P with Tensorflow
GPT2-chitchat
GPT2 for Chinese chitchat (implements DialoGPT's MMI**)
InstructTTS
The demo page of InstructTTS
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK (http://github.com/microsoft/dmtk) project of Microsoft.
llama
Inference code for LLaMA models
numpy-ml
Machine learning, in numpy
onssen
An open-source speech separation and enhancement library
open-speech-corpora
A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
scaper
A library for soundscape synthesis and augmentation
speech-denoising-wavenet
A neural network for end-to-end speech denoising
stable-diffusion
A latent text-to-image diffusion model
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
TorchScript
Load a TorchScript Model in C++ ~ JUSTIN MITCHΞLL
vae
A simple VAE and CVAE from Keras