Sundy1219's repositories
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
faster-whisper
Faster Whisper transcription with CTranslate2
minbpe
Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
awesome-chatgpt-prompts-zh
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
lantern
Lantern官方版本下载 蓝灯 翻墙 代理 科学上网 外网 加速器 梯子 路由 - Быстрый, надежный и безопасный доступ к открытому интернету - lantern proxy vpn censorship-circumvention censorship gfw accelerator پراکسی لنترن، ضدسانسور، امن، قابل اعتماد و پرسرعت
ColossalAI
Making large AI models cheaper, faster and more accessible
Prompt-Engineering-Guide
:octopus: Guides, papers, lecture, and resources for prompt engineering
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
FunASR
A Fundamental End-to-End Speech Recognition Toolkit
wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
tfrecord
TFRecord reader for PyTorch
TC-ResNet
Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR witch punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.
TCN
Sequence modeling benchmarks and temporal convolutional networks
pytorch_speech_features
A simple PyTorch wrapper for the original python-speech-features repository
fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
KWS_pytorch
Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM
lhotse
Tools for handling speech data in machine learning projects.
onnxmltools
ONNXMLTools enables conversion of models to ONNX
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
e2e_lfmmi
This is the implementation of paper CONSISTENT TRAINING AND DECODING FOR END-TO-END SPEECH RECOGNITIONUSING LATTICE-FREE MMI submitted to ICASSP2022