MXuer's starred repositories

conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Language: Python · License: Apache-2.0 · Stargazers: 920 · Issues: 0

CMake-tutorial

A translation of the official CMake tutorial

Language: CMake · License: NOASSERTION · Stargazers: 325 · Issues: 0

speechmatics-python

Python library and CLI for Speechmatics

Language: Python · License: MIT · Stargazers: 53 · Issues: 0

DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

Language: Python · License: MIT · Stargazers: 338 · Issues: 0

Applied-Deep-Learning

Applied Deep Learning Course

Stargazers: 3056 · Issues: 0

GigaSpeech

Large, modern dataset for speech recognition

Language: Shell · License: Apache-2.0 · Stargazers: 618 · Issues: 0

CountNet

Deep Neural Network for Speaker Count Estimation

Language: Python · License: MIT · Stargazers: 144 · Issues: 0

AEC_DeepModel

Baseline code for deep-learning-based acoustic echo cancellation

Language: Python · Stargazers: 120 · Issues: 0

language-recognition

CNN to classify samples of voice recordings into the language that was spoken

Language: Jupyter Notebook · Stargazers: 46 · Issues: 0

inaSpeechSegmenter

CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

Language: Python · License: MIT · Stargazers: 727 · Issues: 0

zeus

⚡️ zeus: Lightning Fast MCMC ⚡️

Language: Python · License: GPL-3.0 · Stargazers: 221 · Issues: 0

mic2midi

Whistle / Hum / Sing into a microphone, generate MIDI signals to drive a sequencer

Language: Python · Stargazers: 24 · Issues: 0

spleeter

Deezer source separation library including pretrained models.

Language: Python · License: MIT · Stargazers: 25422 · Issues: 0

moderncpp

Modern C++: Snippets and Examples

Language: C++ · License: MIT · Stargazers: 496 · Issues: 0

pytorch-kaldi-neural-speaker-embeddings

A lightweight neural speaker embedding extractor based on Kaldi and PyTorch.

Language: Perl · License: BSD-3-Clause · Stargazers: 135 · Issues: 0

voxceleb_trainer

In defence of metric learning for speaker recognition

Language: Python · License: MIT · Stargazers: 1016 · Issues: 0

OpenCC

Conversion between Traditional and Simplified Chinese

Language: C++ · License: Apache-2.0 · Stargazers: 8261 · Issues: 0

pycorrector

pycorrector is a toolkit for text error correction. It applies models such as KenLM, T5, MacBERT, ChatGLM3, and LLaMA to error-correction scenarios and works out of the box.

Language: Python · License: Apache-2.0 · Stargazers: 5385 · Issues: 0

BertPunc

SOTA punctuation restoration (e.g. for automatic speech recognition) deep learning model based on a pre-trained BERT model

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 179 · Issues: 0

cmu-thesis

Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling

Language: Python · License: MIT · Stargazers: 165 · Issues: 0

Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization

A two-stage polyphonic sound event detection and localization method for both SED and DOA.

Language: Python · Stargazers: 102 · Issues: 0

Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

Language: Python · License: Apache-2.0 · Stargazers: 156 · Issues: 0

ctcdecode-pytorch

Python implementation of CTC beam search decoder + agnostic LM scorer

Language: Python · Stargazers: 19 · Issues: 0

Punctuation_Transcription

A punctuation transcription model that automatically adds punctuation marks to one or more unpunctuated sentences.

Language: Python · Stargazers: 15 · Issues: 0

MachineLearning

Audio classification using an LSTM RNN

Language: Python · Stargazers: 3 · Issues: 0

UrbanSoundClassification

Classifying daily sounds

Language: Jupyter Notebook · Stargazers: 3 · Issues: 0

Audio-Classification

PyTorch code for "Rethinking CNN Models for Audio Classification"

Language: Python · Stargazers: 122 · Issues: 0