linrb685's starred repositories
awesome-adb
🍭 A complete guide to ADB usage
billd-live-server
The billd-live backend, built with Node.js + Koa2 + TypeScript
LivePortrait
Bring portraits to life!
SenseVoice
Multilingual Voice Understanding Model
wukong-robot
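wukong-robot is an open-source Chinese voice dialogue robot project developed and maintained by GitHub user wzpan and others; it lets interested developers quickly build a personalized smart speaker. Modular: feature plugins, speech recognition, speech synthesis, and the dialogue robot are all highly modular, and third-party plugins are maintained separately, which makes it easy to inherit from them and develop your own plugins. Chinese support: integrates Chinese speech recognition and speech synthesis technology from Baidu, iFLYTEK, Alibaba, Tencent, and others, and can be extended further. Dialogue robot support: supports a local dialogue robot based on AnyQ, and supports connecting to online dialogue robots such as Turing Robot and Emotibot. Global listening and offline wake-up: supports Muse brain-computer wake-up and contactless offline voice-command wake-up. Flexible configuration: supports customizing the robot's name and choosing the speech recognition and synthesis plugins. Smart home: supports integration with smart-home protocols such as mqtt and HomeAssistant, and supports voice control of smart appliances. Backend support: provides a companion admin backend for remote control, configuration changes, and log viewing. Open API: the backend's open API can be used to build richer functionality. Simple installation, with support for multiple platforms.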
LiveTalking
Real time interactive streaming digital human
Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies such as Whisper, Linly, Microsoft Speech Services, and the SadTalker talking-head generation system. 🌟🔬
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
pyvideotrans
Translate a video from one language to another and add dubbing, with support for API calls.
so-vits-svc-Deployment-Documents
So-VITS-SVC local deployment, training, inference, and usage help documentation
so-vits-models
A collection of models and applications related to so-vits-svc, TTS, SD, and LLMs, as well as models related to text, audio, images, and video.
fish-speech
Brand new TTS solution
so-vits-svc
SoftVC VITS Singing Voice Conversion
Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
TalkWithGemini
Deploy your private Gemini application for free with one click, supporting Gemini 1.5 Pro, Gemini 1.5 Flash, Gemini Pro and Gemini Pro Vision models.
OpenVoiceV2_Webui_resemble_enhance
A Chinese-language web UI that integrates OpenVoice and MeloTTS, with resemble_enhance audio enhancement added
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
ebsynth_utility
AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth.
sd-webui-prompt-all-in-one
An extension for sd-webui that aims to improve the user experience of the prompt/negative prompt input box. It offers a more intuitive and powerful input interface and provides automatic translation, history, and bookmarking features.
ColorSplitter
A CLI tool for splitting vocal timbres.