TechDing's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 140023 | Issues: 1070 | Issues: 7640

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 32932 | Issues: 200 | Issues: 1207

MoneyPrinterTurbo

Generate high-definition short videos with one click using AI large language models.

Language: Python | License: MIT | Stargazers: 16213 | Issues: 134 | Issues: 377

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language: Python | License: NOASSERTION | Stargazers: 15204 | Issues: 296 | Issues: 341

Waifu2x-Extension-GUI

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.

Language: C++ | License: NOASSERTION | Stargazers: 12874 | Issues: 142 | Issues: 424

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language: Python | License: Apache-2.0 | Stargazers: 10741 | Issues: 125 | Issues: 217

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language: Python | License: BSD-3-Clause | Stargazers: 10371 | Issues: 104 | Issues: 146

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.

MoneyPrinter

Automate Creation of YouTube Shorts using MoviePy.

Language: Python | License: MIT | Stargazers: 10094 | Issues: 74 | Issues: 165

backgroundremover

Background Remover lets you remove the background from images and video using AI, through a simple command line interface that is free and open source.

Language: Python | License: MIT | Stargazers: 6669 | Issues: 49 | Issues: 136

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language: Python | License: NOASSERTION | Stargazers: 5469 | Issues: 55 | Issues: 87

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 4517 | Issues: 61 | Issues: 181

Rope

GUI-focused roop

Language: Python | License: GPL-3.0 | Stargazers: 4383 | Issues: 96 | Issues: 0

MODNet

A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]

Language: Python | License: Apache-2.0 | Stargazers: 3776 | Issues: 103 | Issues: 207

metahuman-stream

Real time interactive streaming digital human

Language: Python | License: Apache-2.0 | Stargazers: 3516 | Issues: 43 | Issues: 242

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language: Python | License: Apache-2.0 | Stargazers: 3091 | Issues: 37 | Issues: 150

AI-Vtuber

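AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qianwen/kimi/ollama]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine. It uses TTS [edge-tts/VITS/elevenlabs/bark/bert-vits2/Reecho (睿声)] to voice its replies, with optional voice conversion via [so-vits-svc/DDSP-SVC]; commands can also trigger SD image generation.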

Language: Python | License: GPL-3.0 | Stargazers: 2850 | Issues: 28 | Issues: 159

social-auto-upload

Automatically upload videos to social media: Douyin, Xiaohongshu, WeChat Channels, tiktok, youtube, bilibili

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Language: Python | License: MIT | Stargazers: 1476 | Issues: 29 | Issues: 216

sd-wav2lip-uhq

Wav2Lip UHQ extension for Automatic1111

Language: Python | License: Apache-2.0 | Stargazers: 1241 | Issues: 23 | Issues: 119

DisCo

[CVPR2024] DisCo: Referring Human Dance Generation in Real World

Language: Python | License: Apache-2.0 | Stargazers: 1052 | Issues: 42 | Issues: 98

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language: Python | License: NOASSERTION | Stargazers: 1021 | Issues: 35 | Issues: 62

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Language: Python | License: MIT | Stargazers: 897 | Issues: 23 | Issues: 75

SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Language: Python | License: Apache-2.0 | Stargazers: 892 | Issues: 25 | Issues: 28

codefuse-devops-eval

Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.

Language: Python | License: NOASSERTION | Stargazers: 673 | Issues: 9 | Issues: 13

Easy-Wav2Lip

Colab for making Wav2Lip high quality and easy to use

Language: Jupyter Notebook | Stargazers: 608 | Issues: 11 | Issues: 61

VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Language: Python | License: MIT | Stargazers: 489 | Issues: 6 | Issues: 56

Vlogger

[CVPR2024] Make Your Dream A Vlog

Language: Python | License: Apache-2.0 | Stargazers: 410 | Issues: 10 | Issues: 15

Waifu2x-Extension

Image, GIF and Video enlarger/upscaler achieved with waifu2x and Anime4K. [NO LONGER UPDATED]

Language: Python | License: NOASSERTION | Stargazers: 167 | Issues: 8 | Issues: 15

modnet-entry

[MODNet-entry] An out-of-the-box portrait matting tool