aethia-dev's repositories
V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
t2v-turbo
Code repository for T2V-Turbo
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
StoryDiffusion
Create Magic Story!
PhotoMaker
PhotoMaker
upscayl
🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy.
video-subtitle-remover
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing output files at the original resolution. Runs locally; no third-party API required.
misskey
🌎 An interplanetary microblogging platform 🚀
GeminiProChat
Minimal web UI for GeminiPro.
gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
ComfyUI
The most powerful and modular stable diffusion GUI with a graph/nodes interface.
LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
AnimateDiff
Official implementation of AnimateDiff.
TTS
XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate
Fay
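Fay is a complete open-source project that includes the Fay controller and digital human models, which can be flexibly combined for different application scenarios: virtual streamer, live sales, product shopping guide, voice assistant, remote voice assistant, digital human interaction, digital human interviewer and psychological assessment, Jarvis, Her. This is an open-source project, not a product trial!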
GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
gallery
🖼️ The Answer of "How to save image or video to gallery in flutter"
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Rocket.Chat
The communications platform that puts data protection first.
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
cog-faceswap
Attempt at cog wrapper for faceswap with face enhancer
betterplayer
Bug fix version for betterplayer