Cong Liang's starred repositories
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.
so-vits-svc
SoftVC VITS Singing Voice Conversion
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
T2I-Adapter
Lightweight adapters that add extra conditioning (e.g. sketch, depth, pose) to pretrained text-to-image diffusion models.
sherpa-onnx
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, WebSocket server/client, and C/C++, Python, Kotlin, C#, Go, Node.js, Java, Swift, Dart, JavaScript, and Flutter.
Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
emoca
Official repository accompanying the CVPR 2022 paper "EMOCA: Emotion Driven Monocular Face Capture and Animation". EMOCA takes a single image of a face as input and produces a 3D reconstruction, setting a new standard for reconstructing highly emotional in-the-wild images.
Awesome-Talking-Head-Synthesis
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
syncnet_python
Out of time: automated lip sync in the wild
VAD-python
Voice Activity Detector in Python
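As a generic illustration of the simplest approach to voice activity detection (energy thresholding per frame; this is a hypothetical sketch, not VAD-python's actual algorithm or API):

```python
# Minimal energy-based voice activity detector (stdlib only).
# A frame is labeled "speech" when its RMS energy exceeds a multiple
# of the quietest frame, which serves as a rough noise-floor estimate.
import math

def frame_rms(samples):
    """Root-mean-square energy of one frame of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def energy_vad(signal, frame_len=160, threshold=4.0):
    """Return one boolean per frame: True where energy suggests speech."""
    frames = [signal[i:i + frame_len]
              for i in range(0, len(signal) - frame_len + 1, frame_len)]
    energies = [frame_rms(f) for f in frames]
    noise_floor = min(energies) + 1e-9  # quietest frame approximates noise
    return [e / noise_floor > threshold for e in energies]

# Synthetic check: quiet noise-like signal followed by a loud tone.
silence = [0.001 * math.sin(0.1 * n) for n in range(800)]
tone = [0.5 * math.sin(0.2 * n) for n in range(800)]
decisions = energy_vad(silence + tone)
```

Real VADs (including WebRTC's and neural ones like pyannote's) use spectral features and smoothing rather than a single energy ratio, but the frame/threshold structure above is the common skeleton.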
awesome-faceReenactment
papers about Face Reenactment/Talking Face Generation
EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
DiffGesture
[CVPR 2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
sugar-wifi-conf
A BLE service on Raspberry Pi for Wi-Fi configuration and wireless control. Use a WeChat mini program to set up the Raspberry Pi's Wi-Fi connection and control it from anywhere.
DiffSpeaker
This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
Face_Landmark_Link
Creates Live Link app blendshape data, formatted as CSV, from video for facial motion capture.
AvatarWebKit
Web-first SDK that provides real-time ARKit-compatible 52 blend shapes from a camera feed, video or image at 60 FPS using ML models.