MXuer's starred repositories
CMake-tutorial
CMake 官方教程----的翻译
speechmatics-python
Python library and CLI for Speechmatics
DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
Applied-Deep-Learning
Applied Deep Learning Course
GigaSpeech
Large, modern dataset for speech recognition
AEC_DeepModel
基于深度学习的声学回声消除基线代码
language-recognition
CNN to classify samples of voice recordings into the language that was spoken
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
voxceleb_trainer
In defence of metric learning for speaker recognition
pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。
cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
Chinese-speech-to-text
Chinese Speech To Text Using Wavenet
ctcdecode-pytorch
Python implementation of CTC beam search decoder + agnostic LM scorer
Punctuation_Transcription
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
MachineLearning
audio classification using lstm rnn
UrbanSoundClassification
Classifying daily sounds
Audio-Classification
Pytorch code for "Rethinking CNN Models for Audio Classification"