jasonwongw

jasonwongw's starred repositories

InterpAny-Clearer

[ECCV2024] Clearer anytime frame interpolation & Manipulated interpolation of anything

Language: Python | License: MIT | Stargazers: 185 | Issues: 0

Diff-HierVC

Official PyTorch implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

Language: Python | Stargazers: 179 | Issues: 0

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language: Python | License: MIT | Stargazers: 2150 | Issues: 0

IP_LAP

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Language: Python | License: Apache-2.0 | Stargazers: 637 | Issues: 0

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language: Python | License: MIT | Stargazers: 1497 | Issues: 0

Awesome-AITools

Collection of AI-related utilities. Issues and pull requests are welcome.

Stargazers: 3975 | Issues: 0
Language: Python | License: GPL-3.0 | Stargazers: 317 | Issues: 0

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Language: Python | Stargazers: 913 | Issues: 0

APISR

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Language: Python | License: GPL-3.0 | Stargazers: 776 | Issues: 0

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Language: Python | License: NOASSERTION | Stargazers: 2029 | Issues: 0

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language: Python | License: NOASSERTION | Stargazers: 2177 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 124 | Issues: 0

Video-Frame-Interpolation-Summary

Video frame interpolation summary and inference

Language: Python | License: Apache-2.0 | Stargazers: 104 | Issues: 0

frame-interpolation

FILM: Frame Interpolation for Large Motion (ECCV 2022).

Language: Python | License: Apache-2.0 | Stargazers: 2765 | Issues: 0

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 4345 | Issues: 0

ComfyUI_Workflows

A repository of well-documented, easy-to-follow workflows for ComfyUI

License: Apache-2.0 | Stargazers: 295 | Issues: 0

Wav2Lip-GFPGAN

High-quality lip sync

Language: Python | Stargazers: 971 | Issues: 0

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

Language: Python | License: MIT | Stargazers: 1486 | Issues: 0

digital_human_video_player

A digital human video player with an HTTP API. It uses the Gradio API to connect to Easy-Wav2Lip, Sadtalker, GeneFacePlusPlus, and MuseTalk, and can also be used to play local videos.

Language: Python | License: GPL-3.0 | Stargazers: 93 | Issues: 0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language: Python | License: MIT | Stargazers: 6310 | Issues: 0

Mindustry

The automation tower defense RTS

Language: Java | License: GPL-3.0 | Stargazers: 21768 | Issues: 0

FashionMatrix

Fashion Matrix is dedicated to bridging various visual and language models and continuously refining its capabilities as a comprehensive fashion AI assistant. This project will continue to update new features and optimization effects.

Language: Jupyter Notebook | License: MIT | Stargazers: 116 | Issues: 0

AnimatedDrawings

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Language: Python | License: MIT | Stargazers: 10348 | Issues: 0

Awesome-AIGC-Tutorials

Curated tutorials and resources for Large Language Models, AI Painting, and more.

License: MIT | Stargazers: 3664 | Issues: 0

awesome-aigc

A list of awesome AIGC works

License: CC0-1.0 | Stargazers: 539 | Issues: 0

EmoTalk_release

This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"

Language: Python | License: NOASSERTION | Stargazers: 313 | Issues: 0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language: Python | License: MIT | Stargazers: 21472 | Issues: 0

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 2674 | Issues: 0

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language: Python | License: Apache-2.0 | Stargazers: 4655 | Issues: 0

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Language: Python | License: GPL-3.0 | Stargazers: 4812 | Issues: 0