Wei Xu's starred repositories
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
deepl-python
Official Python library for the DeepL language translation API.
PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
espnet_model_zoo
ESPnet Model Zoo
URLExtract
URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.
NeMo-speech-data-processor
A toolkit for processing speech data and creating speech datasets
CQT_pytorch
Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters
TimeStretching
Pytorch implementation of Time Stretching in Music using an Autoencoder Network