Captain-1314

Captain-1314's starred repositories

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language: Python | License: Apache-2.0 | Stargazers: 1635 | Issues: 0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 30106 | Issues: 0

AutoX

A UiAutomator for Android that does not need root access (a JavaScript automation tool for the Android platform)

Language: JavaScript | License: NOASSERTION | Stargazers: 6986 | Issues: 0

-Autox.js-

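Install autox.js on an Android device and run this project's scripts to automatically monitor Damai (大麦) and automatically grab concert tickets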

Language: JavaScript | License: GPL-2.0 | Stargazers: 146 | Issues: 0

DamaiHelper

A ticket-grabbing script for concerts and shows on Damai (大麦网).

Language: Python | License: MIT | Stargazers: 252 | Issues: 0

damaihelper

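A ticket-grabbing script for concerts and shows that supports multiple platforms, including Damai (大麦网), Taopiaopiao (淘票票), and Binwandao (缤玩岛)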

Language: HTML | License: AGPL-3.0 | Stargazers: 842 | Issues: 0

RestoreFormerPlusPlus

[TPAMI2023] RestoreFormer++

Language: Python | License: Apache-2.0 | Stargazers: 148 | Issues: 0

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Language: Python | License: NOASSERTION | Stargazers: 2073 | Issues: 0

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language: Python | License: NOASSERTION | Stargazers: 14591 | Issues: 0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python | License: Apache-2.0 | Stargazers: 6599 | Issues: 0

StableSR

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language: Python | License: NOASSERTION | Stargazers: 2021 | Issues: 0

Language: Jupyter Notebook | Stargazers: 2372 | Issues: 0

MGLD-VSR

Code for ECCV 2024 Paper "Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution"

Language: Python | License: NOASSERTION | Stargazers: 70 | Issues: 0

BasicVSR_PlusPlus

Official repository of "BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment"

Language: Python | License: Apache-2.0 | Stargazers: 576 | Issues: 0

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Language: Python | License: NOASSERTION | Stargazers: 35209 | Issues: 0

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language: Python | License: MIT | Stargazers: 429 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | Stargazers: 11358 | Issues: 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python | License: Apache-2.0 | Stargazers: 6195 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 32406 | Issues: 0

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language: Python | License: AGPL-3.0 | Stargazers: 26832 | Issues: 0

bmf

Cross-platform, customizable multimedia/video processing framework. With strong GPU acceleration, a heterogeneous design, multi-language support, ease of use, multi-framework compatibility, and high performance, the framework is well suited to transcoding, AI inference, algorithm integration, live video streaming, and more.

Language: C++ | License: Apache-2.0 | Stargazers: 743 | Issues: 0

RealSR

Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model (ICCV 2019)

Language: MATLAB | Stargazers: 436 | Issues: 0

x264

x264 Git mirror

Language: C | License: GPL-2.0 | Stargazers: 287 | Issues: 0

Language: C++ | License: NOASSERTION | Stargazers: 62 | Issues: 0

x264_saliency_mod

A fork of x264 video encoder supporting custom saliency maps as an additional input to improve quality of salient objects.

Language: C | License: GPL-2.0 | Stargazers: 48 | Issues: 0

Fast-SRGAN

A Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps

Language: Python | License: MIT | Stargazers: 657 | Issues: 0

CoreML-Models

Converted CoreML Model Zoo.

Stargazers: 1242 | Issues: 0

Language: Python | Stargazers: 163 | Issues: 0

jitsi-meet

Jitsi Meet - Secure, Simple and Scalable Video Conferences that you can use as a standalone app or embed in your web application.

Language: TypeScript | License: Apache-2.0 | Stargazers: 22221 | Issues: 0

mediasoup

Cutting Edge WebRTC Video Conferencing

Language: C++ | License: ISC | Stargazers: 6072 | Issues: 0