zfbok's repositories

awesome-generative-ai-guide

A one-stop repository for generative AI research updates, interview resources, notebooks, and much more!

License: MIT | Stargazers: 0 | Issues: 0
License: AGPL-3.0 | Stargazers: 0 | Issues: 0

InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

License: Apache-2.0 | Stargazers: 0 | Issues: 0

GPT-SoVITS

One minute of voice data is enough to train a good TTS model! (few-shot voice cloning)

License: MIT | Stargazers: 1 | Issues: 0

sd-webui-reactor

Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111, SD.Next, Cagliostro)

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

yidaRule

yida rule repository

License: MIT | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

dreamtalk

Official implementation of the paper "DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models"

License: MIT | Stargazers: 0 | Issues: 0

mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

License: MIT | Stargazers: 0 | Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

License: NOASSERTION | Stargazers: 0 | Issues: 0

tt-zhipin

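Toutou Zhipin, a clone of Boss Zhipin. The backend is built with SpringCloud Alibaba, the mobile app with React Native, the admin console with Vue3.0 + Arco Design, and the big-data stack with Hadoop + Flink. It covers recruitment, content management, IM instant messaging, and related features.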

License: Apache-2.0 | Stargazers: 0 | Issues: 0

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

License: Apache-2.0 | Stargazers: 0 | Issues: 0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

License: MIT | Stargazers: 0 | Issues: 0

OBS-RTX-SuperResolution

An OBS plugin that enables NVIDIA RTX Video Super Resolution, Upscaling, and Artifact Reduction as a filter.

License: GPL-2.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

vid2densepose

Convert your videos to DensePose and use them with MagicAnimate

License: MIT | Stargazers: 0 | Issues: 0

FaceStudio

Put Your Face Everywhere in Seconds.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

WeChatMsg

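Extracts WeChat chat history and exports it to HTML, Word, and CSV documents for permanent storage; analyzes the chat history to generate an annual chat report.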

License: GPL-3.0 | Stargazers: 0 | Issues: 0

LocalAIVoiceChat

Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.

License: NOASSERTION | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

License: MIT | Stargazers: 0 | Issues: 0

SillyTavern

LLM Frontend for Power Users.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

RealtimeTTS

Converts text to speech in realtime by identifying sentence fragments for immediate auditory feedback. Ideal for applications requiring instant audio responses.

Stargazers: 0 | Issues: 0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

License: Apache-2.0 | Stargazers: 0 | Issues: 0

VideoCrafter

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

Stargazers: 0 | Issues: 0

AVeryComfyNerd

ComfyUI related stuff and things

License: MIT | Stargazers: 0 | Issues: 0

obs-ndi

NewTek NDI integration for OBS Studio

License: GPL-2.0 | Stargazers: 0 | Issues: 0

chinese-independent-developer

👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻 List of independent developer projects -- sharing what everyone is working on

Stargazers: 0 | Issues: 0

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

License: NOASSERTION | Stargazers: 0 | Issues: 0