Yuekai Zhang's repositories
Triton-ASR-Client
ASR client for Triton ASR Service
ctc_decoder
A CTC decoder for both online and offline ASR models
InstructGLM
Supervised fine-tuning (SFT) of GLM models
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
espnet_onnx
ONNX wrapper for espnet inference models
FasterTransformer
Transformer related optimization, including BERT, GPT
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
lhotse
Tools for handling speech data in machine learning projects.
NeMo
NeMo: a toolkit for conversational AI
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit
gss
A simple package for Guided source separation (GSS)
langchain
⚡ Building applications with LLMs through composability ⚡
NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
sherpa
Streaming and non-streaming ASR server in Python
sherpa-onnx
Real-time speech recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
whisper
Robust Speech Recognition via Large-Scale Weak Supervision