aethia-dev's repositories

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Stargazers:0Issues:0Issues:0

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

License:NOASSERTIONStargazers:0Issues:0Issues:0

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

License:NOASSERTIONStargazers:0Issues:0Issues:0

t2v-turbo

Code repository for T2V-Turbo

Stargazers:0Issues:0Issues:0

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

License:Apache-2.0Stargazers:0Issues:0Issues:0

StoryDiffusion

Create Magic Story!

License:Apache-2.0Stargazers:0Issues:0Issues:0

PhotoMaker

PhotoMaker

License:NOASSERTIONStargazers:0Issues:0Issues:0

upscayl

🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy.

License:AGPL-3.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

video-subtitle-remover

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

License:Apache-2.0Stargazers:0Issues:0Issues:0

misskey

🌎 An interplanetary microblogging platform 🚀

License:AGPL-3.0Stargazers:0Issues:0Issues:0

GeminiProChat

Minimal web UI for GeminiPro.

License:MITStargazers:0Issues:0Issues:0

gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL

License:ISCStargazers:0Issues:0Issues:0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

License:Apache-2.0Stargazers:0Issues:0Issues:0

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Stargazers:0Issues:0Issues:0

buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

License:MITStargazers:0Issues:0Issues:0

Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

License:MPL-2.0Stargazers:0Issues:0Issues:0

ComfyUI

The most powerful and modular stable diffusion GUI with a graph/nodes interface.

License:GPL-3.0Stargazers:0Issues:0Issues:0

LLaVA

Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

License:Apache-2.0Stargazers:0Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

License:NOASSERTIONStargazers:0Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

License:Apache-2.0Stargazers:0Issues:0Issues:0

TTS

XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate

License:MPL-2.0Stargazers:0Issues:0Issues:0

Fay

Fay是一个完整的开源项目,包含Fay控制器及数字人模型,可灵活组合出不同的应用场景:虚拟主播、现场推销货、商品导购、语音助理、远程语音助理、数字人互动、数字人面试官及心理测评、贾维斯、Her。 开源项目,非产品试用!!!

Stargazers:0Issues:0Issues:0

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

License:NOASSERTIONStargazers:0Issues:0Issues:0

gallery

🖼️ The Answer of "How to save image or video to gallery in flutter"

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Rocket.Chat

The communications platform that puts data protection first.

License:NOASSERTIONStargazers:0Issues:0Issues:0

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

License:NOASSERTIONStargazers:0Issues:0Issues:0

cog-faceswap

Attempt at cog wrapper for faceswap with face enhancer

Stargazers:2Issues:0Issues:0

betterplayer

Bug fix version for betterplayer

License:Apache-2.0Stargazers:0Issues:0Issues:0