HN410's starred repositories
gligen-gui
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
imgbrd-grabber
Very customizable imageboard/booru downloader with powerful filenaming features.
obsidian-tabs
Plugin for tabbed browsing in Obsidian
ConsistencyVC-voive-conversion
Uses a jointly trained speaker encoder with a consistency loss to achieve cross-lingual and expressive voice conversion
metavoice-src
Foundational model for human-like, expressive TTS
GPT-SoVITS
As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning)
vits2_pytorch
Unofficial VITS2 TTS implementation in PyTorch
speechbrain
A PyTorch-based Speech Toolkit
Aivis-Dataset
💠 Aivis: AI Voice Imitation System
Style-Bert-VITS2
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
FontDiffuser
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
Uni-ControlNet
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
animatediff-cli
A CLI utility/library for AnimateDiff Stable Diffusion generation
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.