helloliucc

helloliucc's starred repositories

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language: Python | License: NOASSERTION | Stargazers: 1191 | Issues: 0

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language: Python | License: Apache-2.0 | Stargazers: 2143 | Issues: 0

LivePortrait

Bring portraits to life!

Language: Python | License: NOASSERTION | Stargazers: 10898 | Issues: 0

Language: C++ | License: NOASSERTION | Stargazers: 4465 | Issues: 0

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Language: MATLAB | Stargazers: 833 | Issues: 0

metahuman-stream

Real-time interactive streaming digital human

Language: Python | License: Apache-2.0 | Stargazers: 3264 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8476 | Issues: 0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language: Python | License: MIT | Stargazers: 3824 | Issues: 0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language: C++ | License: NOASSERTION | Stargazers: 20032 | Issues: 0

flowframes

Flowframes Windows GUI for video interpolation using DAIN (NCNN) or RIFE (CUDA/NCNN)

Language: Python | License: GPL-3.0 | Stargazers: 1448 | Issues: 0

paper2gui

Convert AI papers to GUIs, making it easy and convenient for everyone to use cutting-edge artificial intelligence technology.

Language: Jupyter Notebook | License: MIT | Stargazers: 10123 | Issues: 0

fish-speech

Brand new TTS solution

Language: Python | License: NOASSERTION | Stargazers: 7272 | Issues: 0

Streamer-Sales

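Streamer-Sales (销冠, "Sales Champion"): a sales-streamer LLM 🛒🎁 that, given a product's features, presents the product in a way designed to stimulate the user's desire to buy. 🚀⭐ Includes a detailed data-generation pipeline❗ 📦 It also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️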

Language: Python | License: Apache-2.0 | Stargazers: 2207 | Issues: 0

chatgpt-web

A ChatGPT demo web page built with Express and Vue3

Language: Vue | License: MIT | Stargazers: 31188 | Issues: 0

libopenshot

OpenShot Video Library (libopenshot) is a free, open-source project dedicated to delivering high quality video editing, animation, and playback solutions to the world. API currently supports C++, Python, and Ruby.

Language: C++ | License: LGPL-3.0 | Stargazers: 1250 | Issues: 0

vidgear

A high-performance, cross-platform video processing Python framework power-packed with unique trailblazing features 🔥

Language: Python | License: Apache-2.0 | Stargazers: 3317 | Issues: 0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 8602 | Issues: 0

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language: Python | License: MIT | Stargazers: 43324 | Issues: 0

Agently-Daily-News-Collector

An open-source, LLM-based showcase of an automated daily news collection workflow, powered by the Agently AI application development framework.

Language: Python | License: Apache-2.0 | Stargazers: 408 | Issues: 0

Language: Python | Stargazers: 895 | Issues: 0

json-repair

🔧 Repair JSON! A solution for JSON anomalies from LLMs.

Language: Go | License: GPL-3.0 | Stargazers: 155 | Issues: 0

TypeChat

TypeChat is a library that makes it easy to build natural language interfaces using types.

Language: TypeScript | License: MIT | Stargazers: 8123 | Issues: 0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.

Language: Python | Stargazers: 2154 | Issues: 0

Language: Python | Stargazers: 113 | Issues: 0

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 29760 | Issues: 0

Vach

Real-time streaming talking head

Language: Python | Stargazers: 406 | Issues: 0

whip-go

Simple WHIP client for WebRTC streaming from any media source

Language: Go | License: MIT | Stargazers: 53 | Issues: 0

OBS-studio-webrtc

This is a fork of OBS-studio with generic support for WebRTC. It leverages the same WebRTC implementation most browsers use.

Language: C | License: GPL-2.0 | Stargazers: 585 | Issues: 0

AniTalker

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1334 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Language: Python | License: MIT | Stargazers: 5261 | Issues: 0