jasonwongw

jasonwongw's starred repositories

facefusion

Next generation face swapper and enhancer

Language: Python | License: NOASSERTION | Stargazers: 16831 | Issues: 0

Emote-hack

Emote Portrait Alive: using AI to reverse-engineer code from the white paper. (abandoned)

Language: Python | Stargazers: 168 | Issues: 0
Language: Python | License: MIT | Stargazers: 388 | Issues: 0

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language: Python | License: MIT | Stargazers: 2566 | Issues: 0

avatar_ernerf

Just a suturing monster project.

Stargazers: 30 | Issues: 0

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language: Python | License: MIT | Stargazers: 2473 | Issues: 0

CBLUE

CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark (a benchmark for Chinese medical information processing)

Language: Python | License: Apache-2.0 | Stargazers: 697 | Issues: 0

AlphaPose

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Language: Python | License: NOASSERTION | Stargazers: 7866 | Issues: 0

AI-Song-Cover-RVC

All in One Version : Youtube WAV Download, Separating Vocal, Splitting Audio, Training, and Inference Using Google Colab

Language: Jupyter Notebook | Stargazers: 903 | Issues: 0

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python | License: Apache-2.0 | Stargazers: 2200 | Issues: 0

AI-Vtuber

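AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达 (Wenda)/千问 (Qwen)/kimi/ollama]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on a local machine. It uses TTS [edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声 (Reecho)] to generate spoken replies, with optional voice conversion via [so-vits-svc/DDSP-SVC]; commands can also trigger image generation with SD.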

Language: Python | License: GPL-3.0 | Stargazers: 2531 | Issues: 0

Vlogger

[CVPR2024] Make Your Dream A Vlog

Language: Python | License: Apache-2.0 | Stargazers: 392 | Issues: 0

wav2lip-576x576

This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital human videos.

Language: Python | License: MIT | Stargazers: 44 | Issues: 0

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49207 | Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38404 | Issues: 0

vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Language: Python | License: MIT | Stargazers: 1127 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 20899 | Issues: 0

fay-android

The app stays resident in the phone's background, so you can keep communicating with the Fay digital human anytime, anywhere.

Language: Java | License: GPL-3.0 | Stargazers: 26 | Issues: 0

torch-ngp

A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.

Language: Python | License: MIT | Stargazers: 2055 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 170 | Issues: 0

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language: Python | License: MIT | Stargazers: 4122 | Issues: 0

python_rtmpstream

A Python library for pushing real-time RTMP audio/video streams.

Language: C++ | License: MIT | Stargazers: 68 | Issues: 0
Language: Python | License: MPL-2.0 | Stargazers: 259 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 32237 | Issues: 0

Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

Language: Python | Stargazers: 185 | Issues: 0
Language: HTML | License: MIT | Stargazers: 555 | Issues: 0

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | Stargazers: 34620 | Issues: 0

DiffTalk

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

Language: Python | Stargazers: 426 | Issues: 0

DFRF

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Language: Python | License: MIT | Stargazers: 335 | Issues: 0

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language: Python | License: NOASSERTION | Stargazers: 1120 | Issues: 0