Beast code in Giters

Captain-1314's starred repositories

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language:PythonApache-2.0163500

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT3010600

AutoX

A UiAutomator on android, does not need root access(安卓平台上的JavaScript自动化工具)

Language:JavaScriptNOASSERTION698600

-Autox.js-

通过安装安卓端的autox.js，执行本项目的脚本，实现自动监测大麦，自动演唱会门票

Language:JavaScriptGPL-2.014600

damaihelper

支持大麦网，淘票票、缤玩岛等多个平台，演唱会演出抢票脚本

Language:HTMLAGPL-3.084200

RestoreFormerPlusPlus

[TPAMI2023] RestoreFormer++

Language:PythonApache-2.014800

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonNOASSERTION207300

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonNOASSERTION1459100

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonApache-2.0659900

StableSR

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language:PythonNOASSERTION202100

MGLD-VSR

Code for ECCV 2024 Paper "Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution"

Language:PythonNOASSERTION7000

BasicVSR_PlusPlus

Official repository of "BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment"

Language:PythonApache-2.057600

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Language:PythonNOASSERTION3520900

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language:PythonMIT42900

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonNOASSERTION1135800

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonApache-2.0619500

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03240600

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonAGPL-3.02683200

Cross-platform, customizable multimedia/video processing framework. With strong GPU acceleration, heterogeneous design, multi-language support, easy to use, multi-framework compatible and high performance, the framework is ideal for transcoding, AI inference, algorithm integration, live video streaming, and more.

Language:C++Apache-2.074300

Captain-1314

Captain-1314's starred repositories

EchoMimic

GPT-SoVITS

AutoX

-Autox.js-

DamaiHelper

damaihelper

RestoreFormerPlusPlus

MuseTalk

CodeFormer

modelscope

StableSR

GPEN

MGLD-VSR

BasicVSR_PlusPlus

GFPGAN

SLAM-LLM

SadTalker

video-retalking

TTS

ultralytics

bmf

RealSR

x264

DLVC

x264_saliency_mod

Fast-SRGAN

CoreML-Models

SR-LUT

jitsi-meet

mediasoup