tianlinzx's starred repositories

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Stargazers: 7020 | Issues: 0

chinese_speech_pretrain

chinese speech pretrained models

Language: Shell | Stargazers: 914 | Issues: 0

Awesome-Talking-Head-Synthesis

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

License: MIT | Stargazers: 501 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 0

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Language: Python | License: MIT | Stargazers: 1183 | Issues: 0

Portrait-Talker

Talking head animation

Language: Python | Stargazers: 28 | Issues: 0
Language: Python | Stargazers: 3 | Issues: 0

av_hubert

A self-supervised learning framework for audio-visual speech

Language: Python | License: NOASSERTION | Stargazers: 796 | Issues: 0

python-hls-stream

Minimal HLS streaming demo with dynamic marker support in Python

Language: Python | License: MIT | Stargazers: 18 | Issues: 0

fragmented-mpeg4-live-streaming

FastAPI + ffmpeg as a solution for natively supported live streaming in browsers

Language: Python | Stargazers: 5 | Issues: 0

PaddlePaddle-DeepSpeech

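Speech recognition implemented with PaddlePaddle, targeting Chinese speech recognition. The project is full-featured and recognizes speech well. It supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.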

Language: Python | License: Apache-2.0 | Stargazers: 641 | Issues: 0

PPASR

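End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to practical use, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular models: DeepSpeech2, Conformer, and Squeezeformer.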

Language: Python | License: Apache-2.0 | Stargazers: 777 | Issues: 0

kinit-fast-task

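This project is built on the FastAPI technology stack from the Python community. It aims to give anyone who needs to develop with FastAPI a suitable scaffold and to avoid duplicated work. The project also incorporates many parts of the FastAPI stack that can be used for reference.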

Language: Python | License: MIT | Stargazers: 3 | Issues: 0

vue-super-flow

Flowchart component based on Vue.

Language: Vue | License: MIT | Stargazers: 720 | Issues: 0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 15320 | Issues: 0

dlib

A toolkit for making real world machine learning and data analysis applications in C++

Language: C++ | License: BSL-1.0 | Stargazers: 13118 | Issues: 0

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language: Python | License: MIT | Stargazers: 838 | Issues: 0

articulated-animation

Code for the paper "Motion Representations for Articulated Animation"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1194 | Issues: 0

Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Language: Jupyter Notebook | License: MIT | Stargazers: 3351 | Issues: 0
Language: Python | License: MIT | Stargazers: 476 | Issues: 0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.

Language: Python | Stargazers: 1233 | Issues: 0

agentUniverse

agentUniverse is an LLM multi-agent framework that allows developers to easily build multi-agent applications. Through its community, developers can also exchange and share pattern practices across different domains.

Language: Python | License: Apache-2.0 | Stargazers: 106 | Issues: 0

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Language: Python | License: MIT | Stargazers: 72117 | Issues: 0

Easy-Wav2Lip

Colab for making Wav2Lip high quality and easy to use

Language: Jupyter Notebook | Stargazers: 416 | Issues: 0

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python | License: MIT | Stargazers: 7311 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | Stargazers: 10838 | Issues: 0

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.

Language: Python | Stargazers: 9516 | Issues: 0

digitalAvatarRealtime

Inference service based on DINet, for inference on video streams and video files

Stargazers: 12 | Issues: 0

DINet_optimized

An optimized pipeline for DINet that reduces inference latency by up to 60% 🚀. Kudos to the authors of the original repo for this amazing work.

Language: Python | Stargazers: 89 | Issues: 0

OpenFace

OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

Language: MATLAB | License: NOASSERTION | Stargazers: 6664 | Issues: 0