There are 5 repositories under audio-segmentation topic.
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection
PyAnnote Voice Activity Detection (ONNX version)
This repository contains a web application that integrates with a music recommendation system, which leverages a dataset of 3,415 audio files, each lasting thirty seconds, utilising a Locality-Sensitive Hashing (LSH) implementation to determine rhythmic similarity, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.
Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.
pitch detection,CNN
Build a digital music library by downloading and segmenting youtube videos.
A useful tool to split speech WAV PCM files to fragments with use of energy signal minimums (speech pauses).
Automatic generation of speech dataset markup using Wav2Vec2 ASR models
Automatic annotation of timbre variation for monophonic musical instruments
AsegTool is a tool designed to generate a segmentation file that is usable within my other tool. 🌵
SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Music Source Separation web application using U-Net model with 2 main features: Audio Separation & Karaoke