Beast code in Giters

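wukong-robot is an open-source Chinese voice dialogue robot project developed and maintained by GitHub user wzpan and others. It lets interested developers quickly build a personalized smart speaker.
- Modular. Feature plugins, speech recognition, speech synthesis, and the dialogue robot are all highly modular; third-party plugins are maintained separately, which makes it easy to inherit from them and develop your own plugins.
- Chinese support. Integrates Chinese speech recognition and speech synthesis from Baidu, iFLYTEK, Alibaba, Tencent, and others, and can be extended further.
- Dialogue robot support. Supports a local dialogue robot based on AnyQ, and can connect to online dialogue robots such as Tuling Robot and Emotibot.
- Global listening, offline wake-up. Supports Muse brain-computer wake-up and contact-free offline voice-command wake-up.
- Flexible and configurable. Supports customizing the robot's name and choosing the speech recognition and synthesis plugins.
- Smart home. Works with smart-home protocols such as mqtt and HomeAssistant, and supports voice control of smart appliances.
- Backend support. Provides a companion backend for remote control, configuration changes, and log viewing.
- Open API. The backend's open API can be used to build richer functionality.
- Simple installation, with support for multiple platforms.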

Language: Python | MIT | 4800

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | MIT | 930000

LiveTalking

Real time interactive streaming digital human

Language: Python | Apache-2.0 | 361900

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

Language: Python | MIT | 188500

TripoSR

Language: Python | MIT | 441500

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language: Python | 59600

pyvideotrans

Translate a video from one language to another and add dubbing; API calls are also supported.

Language: Python | GPL-3.0 | 1032700

so-vits-svc-Deployment-Documents

So-VITS-SVC Local Deployment/Training/Inference/Usage Help Document

Language: Jupyter Notebook | AGPL-3.0 | 65700

so-vits-models

A collection of models and applications related to so-vits-svc, TTS, SD, and LLMs, along with models for text, audio, images, and video.

MIT | 13400

fish-speech

Brand new TTS solution

Language: Python | NOASSERTION | 1328100

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python | AGPL-3.0 | 2560700

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model). We hope the open-source community will contribute to it.

Language: Python | MIT | 1132100

ChatTTS

A generative speech model for daily dialogue.

Language: Python | AGPL-3.0 | 3144100

TalkWithGemini

Deploy your private Gemini application for free with one click, supporting Gemini 1.5 Pro, Gemini 1.5 Flash, Gemini Pro and Gemini Pro Vision models.

Language: TypeScript | GPL-3.0 | 69600

OpenVoiceV2_Webui_resemble_enhance

A Chinese-language webui that integrates OpenVoice and Melotts, with resemble_enhance audio enhancement added.

Language: Python | MIT | 7800

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language: Python | MIT | 2373200

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | Apache-2.0 | 456500

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Language: Python | MIT | 150700

ebsynth_utility

AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth.

Language: Python | 123900

sd-webui-prompt-all-in-one

An extension for sd-webui that improves the prompt/negative prompt input box. It offers a more intuitive and powerful input interface, plus automatic translation, history, and bookmarking.

Language: Python | MIT | 278100

linrb685

linrb685's starred repositories

awesome-adb

billd-live-server

AniTalker

LivePortrait

EchoMimic

SenseVoice

DY-Data

wukong-robot