Aworselife

0

followers

following

stars

Aworselife's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

ChatTTS

A generative speech model for daily dialogue.

Language:PythonAGPL-3.02864200

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonMIT354300

pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Language:PythonMIT479900

EaBNet

This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.

Language:Python7500

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

MIT61600

RapidASR

商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.

Language:C++MIT47100

COSPA

Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement

Language:PythonApache-2.03000

deep-non-linear-filter

Language:Python4100

GPT_API_free

Free ChatGPT API Key，免费ChatGPT API，支持GPT4 API（免费），ChatGPT国内可用免费转发API，直连无需代理。可以搭配ChatBox等软件/插件使用，极大降低接口使用成本。国内即可无限制畅快聊天。

Language:PythonMIT1969200

TAC

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Language:Python24300

Neural-mask-estimation

Language:Python3600

Speech-Enhancement-Mi

Language:Python200

JAECBF

Language:Python5100

AISHELL-4

Language:PythonApache-2.011300

DCCRN-with-various-loss-functions

DCCRN with various loss functions

Language:PythonMIT8900

DNN-based-Speech-Enhancement-in-the-frequency-domain

DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping method.

Language:PythonMIT5000

FRA-RIR

100

LRS3-For-Speech-Separation

Multi-modal speech separation task data generation script on LRS3 data set.

Language:MATLABMIT7500

Calculate-SNR-SDR

Script to calculate SNR and SDR using python

Language:Python8600

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

1400

nn-irm

A Simple DNN-IRM estimator for speech enhancement

Language:Python500

IRM-based-Speech-Enhancement-using-LSTM

Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM

Language:PythonMIT11100

speech_separation

Include some core functions and model to handle speech separation

MIT500

EECS498-007

Language:Jupyter Notebook1800

996.ICU

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

NOASSERTION26956700