Anlim's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
supervisor
Supervisor process control system for Unix (supervisord)
speechbrain
A PyTorch-based Speech Toolkit
noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
espnet_model_zoo
ESPnet Model Zoo
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
llama_index
LlamaIndex is a data framework for your LLM applications
pixel_ring
RGB LED library for ReSpeaker 4 Mic Array, ReSpeaker V2 & ReSpeaker USB 6+1 Mic Array
deeplearning_ai_books
deeplearning.ai(吴恩达老师的深度学习课程笔记及资源)
Coursera-ML-AndrewNg-Notes
吴恩达老师的机器学习课程个人笔记
whisper_mic
Project that allows one to use a microphone with OpenAI whisper.
faster-whisper
Faster Whisper transcription with CTranslate2
whisper.cpp
Port of OpenAI's Whisper model in C/C++
speech-to-text-wavenet
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.