Beast code in Giters

jacksinofn's starred repositories

sherpa-ncnn-unity

Real-time, accurate Chinese-English bilingual speech recognition in Unity, built on the sherpa-ncnn framework.

Language: C# | License: Apache-2.0 | 21 | 0 | 0

sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

Language: C++ | License: Apache-2.0 | 948 | 0 | 0

subtitleedit

the subtitle editor :)

Language: C# | License: GPL-3.0 | 8051 | 0 | 0

api4sensevoice

API and WebSocket server for SenseVoice. It integrates enhanced features such as voice activity detection (VAD), real-time streaming recognition, and speaker verification.

Language: Python | 51 | 0 | 0

NarratoAI

Uses large AI models to automatically provide commentary for videos and edit them with a single click.

Language: Python | License: MIT | 11 | 0 | 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | 8422 | 0 | 0

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | License: MIT | 2933 | 0 | 0

MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Language: Python | License: Apache-2.0 | 3748 | 0 | 0

AliCTTransformerPunc

C# library for decoding CTTransformer punctuation models, which add punctuation to Chinese and English text

Language: C# | 7 | 0 | 0

revideo

Create Videos with Code

Language: TypeScript | License: MIT | 2068 | 0 | 0

SenseVoice

Multilingual Voice Understanding Model

Language: Python | License: NOASSERTION | 2126 | 0 | 0

AI-Vtuber

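AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达 (Wenda)/千问 (Qwen)/kimi/ollama]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine. It uses TTS [edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声] to generate spoken replies, with optional voice conversion via [so-vits-svc/DDSP-SVC]. Commands can also trigger SD (Stable Diffusion) image generation.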

Language: Python | License: GPL-3.0 | 2655 | 0 | 0