chaozhang's repositories
AFFiNE
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable and ready to use.
AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
BrushNet
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
chinese-independent-developer
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻**独立开发者项目列表 -- 分享大家都在做什么
ComfyUI-Video-Matting
A minimalistic implementation of Robust Video Matting (RVM) and BRAIAI-RVMBG v1.4 in ComfyUI
digital_human_video_player
带HTTP API的数字人视频播放器,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus
ER-nerf
主要写er-nerf从零到一所有部署过程
fish-speech
Brand new TTS solution
Flowise
Drag & drop UI to build your customized LLM flow
LangGPT
LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt,Language of GPT, 结构化提示词,结构化Prompt
Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
llama3-Chinese-chat
Llama3 中文仓库(聚合资料:各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、部署教程视频 & 文档)
llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
metahuman-stream
Real time streaming digital human based on nerf
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
openui
OpenUI let's you describe UI using your imagination, then see it rendered live.
OpenVoiceV2_Webui_resemble_enhance
基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能
Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
phidata
Memory, knowledge and tools for LLMs
piper
A fast, local neural text to speech system
ProgrammingVTuberLogos
High-quality PNGs for logos I made for fun
PyAV
Pythonic bindings for FFmpeg's libraries.
Scrapegraph-ai
Python scraper based on AI
stable-diffusion.cpp
Stable Diffusion in pure C/C++
StoryDiffusion
Create Magic Story!
streamlit-webrtc
Real-time video and audio streams over the network, with Streamlit.
SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs