Wei Xu's starred repositories
Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
AnimateZero
Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"
textdistance
📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
python-diskcache
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
vocal-remover
Vocal Remover using Deep Neural Networks
svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
fast-align-audio
A fast python library for aligning similar audio snippets passed in as NumPy arrays
codecov-action
GitHub Action that uploads coverage to Codecov :open_umbrella:
insightface
State-of-the-art 2D and 3D Face Analysis Project
hugo-theme-next
Easily & powerful theme for Hugo engine.
hugo-PaperModX
A fast, clean, responsive Hugo theme.
pytest-mock
Thin-wrapper around the mock package for easier use with pytest
monkey-net
Animating Arbitrary Objects via Deep Motion Transfer
lip-movement-net
Speaker detection using a lip movement based RNN detector
generative-models
Generative Models by Stability AI
blind_watermark
Blind&Invisible Watermark ,图片盲水印,提取水印无须原图!