wht2020's repositories

DCA-PLDA

Discriminative Condition-Aware PLDA

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0

AESRC2020

Data preparation scripts, training pipeline, and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 0 | Issues: 0

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

CSASR_Challenge

Chinese-English code-switching speech recognition

Language: Shell | Stargazers: 0 | Issues: 0

DS-TDNN

Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch

Language: Python | Stargazers: 0 | Issues: 0

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER = 0.86 on Vox1_O when trained only on Vox2)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

kaldi

This is the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | Stargazers: 0 | Issues: 0

lihang-code

Code implementations for the book 《统计学习方法》 (Statistical Learning Methods)

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

License: MIT | Stargazers: 0 | Issues: 0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

pytorch-book

PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)

License: MIT | Stargazers: 0 | Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0

sort-google-scholar

Sorting Google Scholar search results based on the number of citations

Stargazers: 0 | Issues: 0

speaker-recognition-py3

Based on MFCC and GMM (speech recognition based on MFCC and Gaussian mixture models)

License: Apache-2.0 | Stargazers: 0 | Issues: 0

speech_dataset

A dataset for speech recognition

License: Apache-2.0 | Stargazers: 0 | Issues: 0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

License: MIT | Stargazers: 0 | Issues: 0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License: NOASSERTION | Stargazers: 0 | Issues: 0

voiceprint

A simple voiceprint model implemented with TensorFlow

Stargazers: 0 | Issues: 0

wespeaker

Research and Production Oriented Speaker Recognition Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0

zhvoice

Chinese voice corpus with clearer, more natural speech, comprising 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.

Stargazers: 0 | Issues: 0