wzy's starred repositories

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27099Issues:206Issues:204

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:NOASSERTIONStargazers:26977Issues:163Issues:346

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10851Issues:194Issues:2139

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10519Issues:139Issues:333

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:9289Issues:158Issues:581

Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

Language:C++License:MPL-2.0Stargazers:7681Issues:85Issues:216

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7387Issues:84Issues:1543
Language:PythonLicense:Apache-2.0Stargazers:6993Issues:65Issues:66

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6354Issues:60Issues:78

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:5905Issues:37Issues:843

agentscope

Start building LLM-empowered multi-agent applications in an easier way.

Language:PythonLicense:Apache-2.0Stargazers:2855Issues:22Issues:92

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:2412Issues:45Issues:150

whisper_real_time

Real time transcription with OpenAI Whisper.

whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language:PythonLicense:MITStargazers:1854Issues:27Issues:145

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Language:PythonLicense:MITStargazers:1419Issues:31Issues:76

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonLicense:MITStargazers:1250Issues:14Issues:54

transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

Language:PythonLicense:GPL-3.0Stargazers:620Issues:6Issues:11

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language:PythonLicense:Apache-2.0Stargazers:528Issues:9Issues:31

SwiftInfer

Efficient AI Inference & Serving

Language:PythonLicense:Apache-2.0Stargazers:447Issues:5Issues:6

sql-eval

Evaluate the accuracy of LLM generated outputs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:444Issues:8Issues:16

WeTextProcessing

Text Normalization & Inverse Text Normalization

Language:PythonLicense:Apache-2.0Stargazers:403Issues:10Issues:100

ContextualSP

Multiple paper open-source codes of the Microsoft Research Asia DKI group

Language:PythonLicense:MITStargazers:366Issues:16Issues:32
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:338Issues:5Issues:2

ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Language:PythonLicense:Apache-2.0Stargazers:223Issues:7Issues:19

MAC-SQL

MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL

tagger_rewriter

对话改写介绍文章

KeSpeech

The repo provides information about KeSpeech dataset.

pythaiasr

Python Thai Automatic Speech Recognition

Language:PythonLicense:Apache-2.0Stargazers:57Issues:6Issues:11

keyword-spot

端到端语音唤醒工具箱,从模型训练到模型推理。

Language:PythonLicense:MITStargazers:52Issues:0Issues:0

pai

极简 RPA 框架,包括 Server,Agent,Web,Schedule,DB 等组件

Language:PythonLicense:MITStargazers:3Issues:1Issues:0