yuekaizhang

followers

following

stars

@Nvidia

Shanghai, CN

https://scholar.google.com/citations?user=YGmuq3UAAAAJ&hl=en

Yuekai Zhang's repositories

minutes

Podcast Summarizer with LLM Technology

Language:Python15 20

Triton-ASR-Client

ASR client for Triton ASR Service

Language:PythonBSD-3-Clause11 2 3

Audio-Adversarial-Examples-Papers

8 30

ctc_decoder

A ctc decoder for both online and offline asr model

Language:C++200

icefall

Language:PythonApache-2.0100

InstructGLM

GLM model SFT

Language:PythonMIT100

tensorrt-hackthon-wenet

Language:CudaApache-2.01 10

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonApache-2.0000

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonBSD-2-Clause000

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0020

espnet_onnx

Onnx wrapper for espnet infrernce model

Language:PythonMIT000

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++Apache-2.0000

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language:CudaNOASSERTION000

lhotse

Tools for handling speech data in machine learning projects.

Language:PythonApache-2.0000

NeMo

NeMo: a toolkit for conversational AI

Language:PythonApache-2.0000

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.0000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Language:PythonMIT000

gss

A simple package for Guided source separation (GSS)

Language:PythonMIT000

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonMIT000

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language:PythonNOASSERTION000

riva-asrlib-decoder

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

Language:Python000

sherpa

Streaming and non-streaming ASR server in Python

Language:C++Apache-2.0000

sherpa-onnx

Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin

Language:C++Apache-2.0000

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:C++Apache-2.0000

wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Language:PythonApache-2.0000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT000

yuekaizhang

010