v-perfilev / speech_denoiser

Speech denoising app leveraging PyTorch

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Speech Denoiser

This app utilizes PyTorch to denoise human speech, distinguishing it from background noise.

For training the model, speech datasets from Mozilla Common Voice and environmental sounds from UrbanSound8K were used.

Quick Start

  1. Clone the repository:
git clone https://github.com/v-perfilev/speech_denoiser.git
  1. Install the required packages:
pip install -r requirements.txt
  1. Copy dataset with clean and noisy sound samples into the ../_datasets/ directory. To generate datasets you can use my another project https://github.com/v-perfilev/audio_dataset_handler.git.

  2. Train the model by running the model_training.ipynb notebook.

  3. Run the app:

python usage_example.py

Features

  • Real-time speech detection using a pretrained neural network model.
  • Supports multiple microphone inputs.
  • Lightweight and easy to deploy.

Requirements

  • ffmpeg (!!!)
  • numpy
  • matplotlib
  • torchaudio
  • pyaudio
  • soundfile
  • torch

About

Speech denoising app leveraging PyTorch


Languages

Language:Python 51.4%Language:Jupyter Notebook 48.6%