chaozhang's repositories
Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates technologies such as Whisper, Linly, Microsoft Speech Services, and the SadTalker talking head generation system. 🌟🔬
streamlit-webrtc
Real-time video and audio streams over the network, with Streamlit.
stable-diffusion.cpp
Stable Diffusion in pure C/C++
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
digital_human_video_player
A digital human video player with an HTTP API that connects to Easy-Wav2Lip, Sadtalker, and GeneFacePlusPlus through the Gradio API.
maid
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
chatppt
ChatPPT is powered by ChatGPT/Ollama and helps you generate PPT slides. It supports output in English and Chinese.
drawdb
Free, simple, and intuitive online database design tool and SQL generator.
gpt_sovits_python
Python wrapper for fast inference with GPT-SoVITS
twinny
The ultimate straightforward, locally or API-hosted AI code completion plugin for Visual Studio Code—like GitHub Copilot but completely free!
Steerable-Motion
A ComfyUI node for driving videos using batches of images.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
python_rtmpstream
A Python library for pushing real-time RTMP audio and video streams.
MiniGemini
Official implementation for Mini-Gemini
campus-imaotai
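Automatic daily reservations for the i茅台 app, with one-click Docker deployment (this project does not provide a finished product and uses an obsolete algorithm).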
ghost_sa
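open_server for sensorsdata. ghost_sa (鬼策) receives data reported by the 神策 (Sensors Data) SDK and provides mobile ad monitoring, off-site reading monitoring, short link creation and resolution, anti-crawling, access control and management, and user segmentation and recall.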
kimi-free-api
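🚀 A free-access service for the KIMI AI long-context large model. It supports high-speed streaming output, web search, long-document interpretation, image parsing, and multi-turn conversation, with zero-configuration deployment, multi-token support, and automatic cleanup of session traces.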
OpenDevin
🐚 OpenDevin: Code Less, Make More
llamafile
Distribute and run LLMs with a single file.
OpenSK
OpenSK is an open-source implementation for security keys written in Rust that supports both FIDO U2F and FIDO2 standards.
MoneyPrinterTurbo
Generate high-definition short videos with one click using large AI models.
ComfyUI
A powerful and modular stable diffusion GUI with a graph/nodes interface.
RAD-NeRF
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
ComfyUI-fastblend
fastblend for ComfyUI, plus other nodes I wrote for generating video: rebatch image, my openpose.
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
lora-scripts
LoRA training scripts for diffusion models, using kohya-ss's trainer.
Open-SD-WebUI-Launcher
Stable Diffusion WebUI launcher 绘梦