Othanse's starred repositories
MoviePilot
Automated management tool for NAS media libraries
InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
MoneyPrinterTurbo
Generate high-definition short videos with one click using large AI models (LLMs).
Stable-Diffusion-WebUI-TensorRT
TensorRT Extension for Stable Diffusion Web UI
sdwebuiapi
Python API client for AUTOMATIC1111/stable-diffusion-webui
Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
emotional-vits
Emotion-controllable speech synthesis model based on VITS that requires no emotion labels
Bert-VITS2
vits2 backbone with multilingual-bert
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
GPT-SoVITS
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!