v-perfilev / speech_denoiser

Speech denoising app leveraging PyTorch

Speech Denoiser

This app utilizes PyTorch to denoise human speech, distinguishing it from background noise.

For training the model, speech datasets from Mozilla Common Voice and environmental sounds from UrbanSound8K were used.

Quick Start

Clone the repository:

git clone https://github.com/v-perfilev/speech_denoiser.git

Install the required packages:

pip install -r requirements.txt

Copy dataset with clean and noisy sound samples into the ../_datasets/ directory. To generate datasets you can use my another project https://github.com/v-perfilev/audio_dataset_handler.git.
Train the model by running the model_training.ipynb notebook.
Run the app:

python usage_example.py

Features

Real-time speech detection using a pretrained neural network model.
Supports multiple microphone inputs.
Lightweight and easy to deploy.

Requirements

ffmpeg (!!!)
numpy
matplotlib
torchaudio
pyaudio
soundfile
torch

About

Speech denoising app leveraging PyTorch

Languages

Language:Python 51.4%Language:Jupyter Notebook 48.6%