Beast code in Giters

jasonwongw's starred repositories

InterpAny-Clearer

[ECCV2024] Clearer anytime frame interpolation & Manipulated interpolation of anything

Language:PythonMIT18500

Diff-HierVC

Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

Language:Python17900

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language:PythonMIT215000

IP_LAP

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Language:PythonApache-2.063700

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language:PythonMIT149700

Awesome-AITools

Collection of AI-related utilities. Welcome to submit issues and pull requests /收藏AI相关的实用工具，欢迎提交issues 或者pull requests

397500

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Language:Python91300

APISR

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Language:PythonGPL-3.077600

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonNOASSERTION202900

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language:PythonNOASSERTION217700

Video-Frame-Interpolation-Summary

Video Frame Interpolation Summary and Infer

Language:PythonApache-2.010400

frame-interpolation

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonApache-2.0276500

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonApache-2.0434500

ComfyUI_Workflows

A repository of well documented easy to follow workflows for ComfyUI

Apache-2.029500

Wav2Lip-GFPGAN

High quality Lip sync

Language:Python97100

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

Language:PythonMIT148600

digital_human_video_player

带HTTP API的数字人视频播放器，使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk，也可以用于播放本地视频

Language:PythonGPL-3.09300

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonMIT631000

Mindustry

The automation tower defense RTS

Language:JavaGPL-3.02176800

FashionMatrix

Fashion Matrix is dedicated to bridging various visual and language models and continuously refining its capabilities as a comprehensive fashion AI assistant. This project will continue to update new features and optimization effects.

Language:Jupyter NotebookMIT11600

AnimatedDrawings

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Language:PythonMIT1034800

Awesome-AIGC-Tutorials

Curated tutorials and resources for Large Language Models, AI Painting, and more.

MIT366400

awesome-aigc

A list of awesome AIGC works

CC0-1.053900

EmoTalk_release

This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"

Language:PythonNOASSERTION31300

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonMIT2147200

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

Language:Jupyter NotebookAGPL-3.0267400

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonApache-2.0465500

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Language:PythonGPL-3.0481200

jasonwongw