RayDean

TechDing's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonAGPL-3.0140023 1070 7640

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT32932 200 1207

MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonMIT16213 134 377

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonNOASSERTION15204 296 341

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.

Language:C++NOASSERTION12874 142 424

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonApache-2.010741 125 217

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonBSD-3-Clause10371 104 146

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Language:Python10338 167 656

MoneyPrinter

Automate Creation of YouTube Shorts using MoviePy.

Language:PythonMIT10094 74 165

backgroundremover

Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.

Language:PythonMIT6669 49 136

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language:PythonNOASSERTION5469 55 87

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonApache-2.04517 61 181

Rope

GUI-focused roop

Language:PythonGPL-3.04383 960

MODNet

A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]

Language:PythonApache-2.03776 103 207

metahuman-stream

Real time interactive streaming digital human

Language:PythonApache-2.03516 43 242

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language:PythonApache-2.03091 37 150

AI-Vtuber

AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】直播中与观众实时互动或直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声；指令协同SD画图。

Language:PythonGPL-3.02850 28 159

social-auto-upload

自动化上传视频到社交媒体：抖音、小红书、视频号、tiktok、youtube、bilibili

Language:Python2191 18 42

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Language:PythonMIT1476 29 216

sd-wav2lip-uhq

Wav2Lip UHQ extension for Automatic1111

Language:PythonApache-2.01241 23 119

DisCo

[CVPR2024] DisCo: Referring Human Dance Generation in Real World

Language:PythonApache-2.01052 42 98

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language:PythonNOASSERTION1021 35 62