Wathcet's repositories
AudioLDM2
Text-to-Audio/Music Generation
AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
ChatTTS
ChatTTS is a generative speech model for daily dialogue.
ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
facefusion
Next generation face swapper and enhancer
faceswap
Deepfakes Software For All
flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
free-programming-books
:books: Freely available programming books
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
IC-Light
More relighting!
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
nocobase
NocoBase is a scalability-first, open-source no-code/low-code platform for building business applications and enterprise solutions.
PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
snail-job
灵活,可靠和快速的分布式任务重试和分布式任务调度平台
stable-diffusion-webui
Stable Diffusion web UI
Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
StoryDiffusion
Create Magic Story!