zfbok's repositories
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
sd-webui-reactor
Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111, SD.Next, Cagliostro)
yidaRule
yida规则仓库
dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
tt-zhipin
头头直聘,仿Boss直聘实现。SpringCloud Alibaba 构建后端,React Native 构建移动端,Vue3.0 + Arco Design 构建管理后台,Hadoop + Flink 实现大数据体系。实现招聘、内容管理、IM即时通讯等业务。
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
OBS-RTX-SuperResolution
An OBS plugin to enable nVidia RTX Video Super Resolution, Upscaling, and Artifact Reduction as a filter.
vid2densepose
Convert your videos to densepose and use it on MagicAnimate
FaceStudio
Put Your Face Everywhere in Seconds.
WeChatMsg
提取微信聊天记录,将其导出成HTML、Word、CSV文档永久保存,对聊天记录进行分析生成年度聊天报告
LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
SillyTavern
LLM Frontend for Power Users.
RealtimeTTS
Converts text to speech in realtime by identifying sentence fragments for immediate auditory feedback. Ideal for applications requiring instant audio responses.
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
VideoCrafter
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
AVeryComfyNerd
ComfyUI related stuff and things
obs-ndi
NewTek NDI integration for OBS Studio
chinese-independent-developer
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻**独立开发者项目列表 -- 分享大家都在做什么
ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型