Cong Liang (lc150303)



Company: University of Science and Technology of China

Location: China


Cong Liang's starred repositories

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 54774 | Issues: 517 | Issues: 946

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python | License: Apache-2.0 | Stargazers: 36873 | Issues: 430 | Issues: 1641

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | Stargazers: 34022 | Issues: 316 | Issues: 424

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | License: BSD-3-Clause | Stargazers: 25203 | Issues: 223 | Issues: 453

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python | License: AGPL-3.0 | Stargazers: 24965 | Issues: 174 | Issues: 130

ChatPaper

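Use ChatGPT to summarize arXiv papers. Accelerates the whole research workflow: uses ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and replies to reviews.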

Language: Python | License: NOASSERTION | Stargazers: 18034 | Issues: 91 | Issues: 216

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language: Python | License: Apache-2.0 | Stargazers: 10629 | Issues: 123 | Issues: 207

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6391 | Issues: 60 | Issues: 78

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook | License: MIT | Stargazers: 5621 | Issues: 70 | Issues: 978

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4721 | Issues: 60 | Issues: 359

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python | License: NOASSERTION | Stargazers: 3955 | Issues: 48 | Issues: 841

T2I-Adapter

T2I-Adapter

Language: Python | License: Apache-2.0 | Stargazers: 3338 | Issues: 40 | Issues: 107

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language: C++ | License: Apache-2.0 | Stargazers: 2642 | Issues: 43 | Issues: 397

SysMocap

A real-time motion capture system for 3D virtual character animating.

Language: JavaScript | License: MPL-2.0 | Stargazers: 2443 | Issues: 35 | Issues: 55

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language: Python | License: MIT | Stargazers: 1618 | Issues: 26 | Issues: 174

emoca

Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild

Language: Python | License: NOASSERTION | Stargazers: 679 | Issues: 16 | Issues: 80

Awesome-Talking-Head-Synthesis

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

syncnet_python

Out of time: automated lip sync in the wild

Language: Python | License: MIT | Stargazers: 621 | Issues: 15 | Issues: 61

VAD-python

Voice Activity Detector in Python

DigiHuman

Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques

Language: C# | License: GPL-3.0 | Stargazers: 453 | Issues: 14 | Issues: 12

awesome-faceReenactment

papers about Face Reenactment/Talking Face Generation

EmoTalk_release

This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"

Language: Python | License: NOASSERTION | Stargazers: 313 | Issues: 11 | Issues: 28

DiffGesture

[CVPR 2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

Language: Python | License: GPL-3.0 | Stargazers: 219 | Issues: 12 | Issues: 24

sugar-wifi-conf

A BLE service on Raspberry Pi for Wi-Fi configuration and wireless control. Use a WeChat mini program to set up the Raspberry Pi's Wi-Fi connection and control the Raspberry Pi anytime, anywhere.

Language: JavaScript | License: GPL-3.0 | Stargazers: 130 | Issues: 11 | Issues: 9

DiffSpeaker

This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

Face_Landmark_Link

Creates Live Link app blendshape data formatted as CSV from video, for facial motion capture

Language: Python | License: Apache-2.0 | Stargazers: 115 | Issues: 5 | Issues: 5

GAU-alpha

A Transformer model based on the Gated Attention Unit (early preview version)

AvatarWebKit

Web-first SDK that provides real-time ARKit-compatible 52 blend shapes from a camera feed, video or image at 60 FPS using ML models.