Beast code in Giters

l2009312042's starred repositories

peoples-speech

The People’s Speech Dataset

Language:Jupyter NotebookApache-2.09400

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Language:Shell34700

cv-arxiv-daily

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Language:PythonApache-2.077100

charsiu

Charsiu: A neural phonetic aligner.

Language:Jupyter NotebookMIT25600

LibriPhrase

Recipe for LibriPhrase

Language:PythonMIT2200

rubberband

Official mirror of Rubber Band Library, an audio time-stretching and pitch-shifting library.

Language:C++GPL-2.053300

Inter-SubNet

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.

Language:PythonApache-2.08500

spiking-fullsubnet

Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.

Language:PythonMIT4300

small-footprint-keyword-spotting

Effective processing pipeline and advanced neural network architectures for small-footprint keyword spotting

Language:Python600

sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Language:PythonMIT40700

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT405700

RobustConformer

Robust speech recognition using teacher-student learning

Language:Python200

DPSL-ASR

Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"

Language:PythonApache-2.03400

sentencepiece_chinese_bpe

使用sentencepiece中BPE训练中文词表，并在transformers中进行使用。

Language:Python9300

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonNOASSERTION984100

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language:PythonMIT106500

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.0653100

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

MIT59500

l2009312042

l2009312042's starred repositories

peoples-speech

room-impulse-responses

cv-arxiv-daily

charsiu

LibriPhrase

ssl_noise-robust_kws

rubberband

Speech-Resources

Inter-SubNet

spiking-fullsubnet

small-footprint-keyword-spotting

sgmse

Amphion

RobustConformer

DPSL-ASR

sentencepiece_chinese_bpe

AudioGPT

SpeechT5

EmotiVoice

INTERSPEECH-2023-Papers

NKF-AEC

awesome_LLMs_interview_notes

algorithm-journey

leetcode-master

VALL-E-X

ego2022

KAN-TTS

whisperX

WavAugment

porcupine