Aworselife's starred repositories
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
GPT_API_free
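Free ChatGPT API key. Free ChatGPT API with GPT-4 API support (free). A free forwarding API for ChatGPT that is usable within mainland China, with a direct connection and no proxy required. It can be used with software and plugins such as ChatBox, greatly reducing API usage costs. Chat freely and without restrictions from within China.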
DCCRN-with-various-loss-functions
DCCRN with various loss functions
DNN-based-Speech-Enhancement-in-the-frequency-domain
DNN-based speech enhancement (SE) in the frequency domain using PyTorch. You can test some state-of-the-art networks using T-F masking or spectral mapping methods.
LRS3-For-Speech-Separation
Data generation script for the multi-modal speech separation task on the LRS3 dataset.
Calculate-SNR-SDR
Script to calculate SNR and SDR using Python
awesome-speech-enhancement
Speech enhancement / speech separation / sound source localization
IRM-based-Speech-Enhancement-using-LSTM
Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM
speech_separation
Includes some core functions and models for speech separation