zhangsanfeng86's repositories
AMchat
AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics, and their solutions.
ASR-2Pass
ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR).
AudioClassification-Pytorch
A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.
awesome-mcp-servers
A collection of MCP servers.
camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeurIPS'2023) https://www.camel-ai.org
ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
funasr_seaco_paraformer_onnx_with_timestamp
Fixes the bug in FunASR where seaco-paraformer produces no timestamps after being exported to ONNX.
gdGPT
Train LLMs (bloom, llama, baichuan2-7b, chatglm3-6b) with DeepSpeed pipeline mode. Faster than zero/zero++/fsdp.
gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
JoyHallo
JoyHallo: Digital human model for Mandarin
KAN-TTS
KAN-TTS is a speech-synthesis training framework. Please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App: MNN-LLM-Android (./apps/Android/MnnLlmChat/README.md). MNN TaoAvatar Android - Local 3D Avatar Intelligence: apps/Android/Mnn3dAvatar/README.md
parler-tts
Inference and training library for high-quality TTS models.
qwen1.5-convertor
Export Qwen1.5 to ONNX or TFLite.
SenseVoice
Multilingual Voice Understanding Model
SenseVoice.cpp
Port of FunASR's SenseVoice model in C/C++
sherpa-onnx
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter
streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices.
VeOmni
VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework
vits_chinese
Best-practice TTS based on BERT and VITS with some Natural Speech features of Microsoft; supports ONNX streaming output!
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
wavesurfer
Python (jupyter notebook) wrapper for wavesurfer.js
YeAudio
Audio tools for Python