Sundy1219's repositories
awesome-chatgpt-prompts-zh
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
ColossalAI
Making large AI models cheaper, faster and more accessible
e2e_lfmmi
This is the implementation of paper CONSISTENT TRAINING AND DECODING FOR END-TO-END SPEECH RECOGNITIONUSING LATTICE-FREE MMI submitted to ICASSP2022
fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
faster-whisper
Faster Whisper transcription with CTranslate2
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
FunASR
A Fundamental End-to-End Speech Recognition Toolkit
KWS_pytorch
Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM
lantern
Lantern官方版本下载 蓝灯 翻墙 代理 科学上网 外网 加速器 梯子 路由 - Быстрый, надежный и безопасный доступ к открытому интернету - lantern proxy vpn censorship-circumvention censorship gfw accelerator پراکسی لنترن، ضدسانسور، امن، قابل اعتماد و پرسرعت
minbpe
Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
onnxmltools
ONNXMLTools enables conversion of models to ONNX
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR witch punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.
Prompt-Engineering-Guide
:octopus: Guides, papers, lecture, and resources for prompt engineering
pytorch_speech_features
A simple PyTorch wrapper for the original python-speech-features repository
TC-ResNet
Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
TCN
Sequence modeling benchmarks and temporal convolutional networks
tfrecord
TFRecord reader for PyTorch
tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
whisper
Robust Speech Recognition via Large-Scale Weak Supervision