audio-segmentation

There are 5 repositories under audio-segmentation topic.

autosub
BingLingGroup / autosub
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
subtitles substation-alpha cloud-speech-api voice-activity-detection audio-segmentation xfyun baidu-api xunfei-api
Language:Python 1971
amsehili / auditok
An audio/acoustic activity detection and audio segmentation tool
audio-activities audio-data audio-segmentation vad voice-activity-detection voice-detection
Language:Python 724
Appen / UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
speech-processing speech-annotation speech-recognition speaker-diarization speech-seperation gender-classification speaker-identification synthetic-speech-detection speech-transcription topic-detection audio-segmentation accent-detection
Language:Forth 99
mt-upc / SHAS
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
audio-segmentation speech-translation speech-to-text speech wav2vec2
Language:Python 36
nianlonggu / WhisperSeg
Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection
audio-segmentation transformer voice-activity-detection whisper animal-sound-detection whisperseg icassp2024
Language:Python 17
dangvansam / pyannote-onnx
PyAnnote Voice Activity Detection (ONNX version)
audio-segmentation audio-split audio-splitter onnx pyannote speech-activity-detection speech-separation vad voice-ac
Language:Jupyter Notebook 9
huzaifakhan04 / music-recommendation-web-application-based-on-rhythmic-similarity-using-locality-sensitive-hashing
This repository contains a web application that integrates with a music recommendation system, which leverages a dataset of 3,415 audio files, each lasting thirty seconds, utilising a Locality-Sensitive Hashing (LSH) implementation to determine rhythmic similarity, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.
ann approximate-nearest-neighbors big-data data-science flask-application locality-sensitive-hashing lsh music music-information-retrieval music-recommendation music-recommendation-system spotify web-application audio-processing audio-recommendation audio-segmentation cosine-distance machine-learning
Language:Jupyter Notebook 5
ina-foss / InaGVAD
Voice activity detection and speaker gender segmentation audiovisual corpus
audio-dataset audio-segmentation audiovisual-dataset benchmark corpus gender gender-bias gender-prediction gender-representation radio speaker-gender speech-activity-detection speech-corpus speech-dataset tv voice-activity-detection acoustic-diversity dataset
Language:Jupyter Notebook 4
Metiu-Metiu / Neural-Texture-Sound-synthesis---data-sets
Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.
audio-dataset-for-machine-learning audio-datasets audio-segmentation data-augmentation real-dataset synthetic-dataset synthetic-dataset-generation
4
yxlijun / solfege-segmentation
pitch detection,CNN
audio-segmentation cnn f0-detection solfege-segmentation
Language:Python 4
boromir674 / music-album-creator
Build a digital music library by downloading and segmenting youtube videos.
cli metadata music-library command-line-tool youtube youtube-downloader music music-metadata audio-processing audio-segmentation automation
Language:Python 3
dangrebenkin / speech_audio_separator
Spliting speech WAV PCM files to fragments with use of energy signal minimums (speech pauses).
audio-processing audio-segmentation
Language:Python 2
dangrebenkin / wav2vec2_speech_markuper
Automatic generation of speech dataset markup using Wav2Vec2 ASR models
forced-alignment speech-recognition speech-to-text audio-segmentation wav2vec2
Language:Python 2
LIMUNIMI / labelSignal
Automatic annotation of timbre variation for monophonic musical instruments
timbre audio-analysis audio audio-segmentation signal-processing sound-and-music-computing
Language:MATLAB 2
luuil / Tools
Our Little Tools
tensorflow savedmodel grpc audio-segmentation tensorflow-serving docker dockerfile svg2png locust
Language:Stylus 2
zqlsnr / speech-music-detection
tensorflow for speech-music-detection task，acc 96%+
audio-classification audio-segmentation speech-music-detection
Language:Python 2
ElHaban3ro / AsegTool
AsegTool is a tool designed to generate a segmentation file that is usable within my other tool. 🌵
audio-processing audio-segmentation video-processing video-segmentation
Language:JavaScript 1
mt-upc / SegAugment
SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
audio-segmentation data-augmentation speech-translation
Language:Python 1
nuvita97 / music-source-separation
Music Source Separation web application using U-Net model with 2 main features: Audio Separation & Karaoke
deep-neural-networks css fastapi python streamlit audio-segmentation unet-model
Language:Jupyter Notebook 1
radadiavasu / AudioAnalysis
Whole Audio Analysis with Python
audio-classification audio-segmentation diarization feature-extraction pyaudio-analysis pyaudio-processing python
Language:Python 0
0x7o / PyanNet
Training and using audio segmentation
audio-segmentation