jasonwongw

0

followers

following

stars

jasonwongw's starred repositories

facefusion

Next generation face swapper and enhancer

Language:PythonNOASSERTION1683100

Emote-hack

Emote Portrait Alive - using ai to reverse engineer code from white paper. (abandoned)

Language:Python16800

Diffusion-SVC

Language:PythonMIT38800

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language:PythonMIT256600

avatar_ernerf

Just a suturing monster project.

3000

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonMIT247300

CBLUE

中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

Language:PythonApache-2.069700

AlphaPose

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Language:PythonNOASSERTION786600

AI-Song-Cover-RVC

All in One Version : Youtube WAV Download, Separating Vocal, Splitting Audio, Training, and Inference Using Google Colab

Language:Jupyter Notebook90300

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonApache-2.0220000

AI-Vtuber

AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】直播中与观众实时互动或直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声；指令协同SD画图。

Language:PythonGPL-3.0253100

Vlogger

[CVPR2024] Make Your Dream A Vlog

Language:PythonApache-2.039200

wav2lip-576x576

This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital human videos.

Language:PythonMIT4400

grok-1

Grok open release

Language:PythonApache-2.04920700

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonApache-2.03840400

vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Language:PythonMIT112700

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02089900

fay-android

app会常驻手机后台，你可以随时随地保持与Fay数字人的沟通。

Language:JavaGPL-3.02600

torch-ngp

A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.

Language:PythonMIT205500

train_your_own_sora

Language:PythonApache-2.017000

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language:PythonMIT412200

python_rtmpstream

python库，实现推送实时rtmp音视频流

Language:C++MIT6800

xtts-streaming-server

Language:PythonMPL-2.025900

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03223700

Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

Language:Python18500

xuniren

Language:HTMLMIT55500

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION3462000

DiffTalk

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

Language:Python42600

DFRF

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Language:PythonMIT33500

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language:PythonNOASSERTION112000