v-yunbin

followers

following

stars

wzy's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language:PythonAGPL-3.028195 168 408

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT27574 209 212

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.011052 202 2163

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookNOASSERTION10583 141 338

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonNOASSERTION9498 160 617

Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

Language:C++MPL-2.07809 84 218

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.07663 89 1627

LWM

Language:PythonApache-2.07029 66 68

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT6383 60 78

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonMIT6178 37 884

agentscope

Start building LLM-empowered multi-agent applications in an easier way.

Language:PythonApache-2.03738 28 101

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookBSD-2-Clause2514 46 154

whisper_real_time

Real time transcription with OpenAI Whisper.

Language:Python2115 29 48

whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language:PythonMIT1901 27 149

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonMIT1268 14 55

transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

Language:PythonGPL-3.0640 7 13

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language:PythonApache-2.0553 9 36

sql-eval

Evaluate the accuracy of LLM generated outputs

Language:PythonApache-2.0477 9 17

SwiftInfer

Efficient AI Inference & Serving

Language:PythonApache-2.0447 5 6

WeTextProcessing

Text Normalization & Inverse Text Normalization

Language:PythonApache-2.0426 11 105

ContextualSP

Multiple paper open-source codes of the Microsoft Research Asia DKI group

Language:PythonMIT369 16 33

graph_maker

Language:Jupyter NotebookApache-2.0354 5 2

ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Language:PythonApache-2.0225 7 19

LabelLLM

Language:TypeScriptApache-2.0212 6 14

MAC-SQL

MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL

Language:Python154 5 16

KeSpeech

The repo provides information about KeSpeech dataset.

NOASSERTION98 5 14

tagger_rewriter

对话改写介绍文章

Language:Python95 3 3

pythaiasr

Python Thai Automatic Speech Recognition

Language:PythonApache-2.059 6 11

keyword-spot

端到端语音唤醒工具箱，从模型训练到模型推理。

Language:PythonMIT5200

pai

极简 RPA 框架，包括 Server，Agent，Web，Schedule，DB 等组件

Language:PythonMIT3 10