There are 26 repositories under forced-alignment topic.
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Command line utility for forced alignment using Kaldi
🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Integrated speech toolset designed to be accessible to end-users. Fully open-source.
Timething is a library for aligning text transcripts with their audio recordings.
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions
scripts to align a given wave to its transcription using trained models by Kaldi
WriteMyVideo's purpose is to help people create videos quickly and easily by simply typing out the video’s script and a description of images to include in the video.
Gentle and praatio scripts for easy forced alignment
Workflow for forced alignment between languages
Align various Sanskrit texts and audio
A corpus that can be used to train English-to-Italian End-to-End Speech-to-Text Machine Translation models
Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners
Aligning a Japanese audio-book with its text and create Anki sentence cards with audio.
A simple Python wrapper to simplify working with WebRTC VAD and its rougher analogue based on RMS and ZCR (useful for processing audio recordings before using them with neural networks).
Speech to Text Alignment tool implemented with Python and Kaldi
Graphical utility for forced alignment using aeneas, an interactive audio player
A framework for generating labeled audio recordings of single-spoken keywords via automatic forced alignment.
Code for my master thesis at FHNW
A Weakly Supervised Forced Alignment for disluent speech
BERT-Text-Features for Tokenized Transcripts from P2FA.
Align various Sanskrit texts and audio
Flask implementation of Montreal Forced Aligner
Forced alignment decoder for Whisper.
Building Text-to-Speech Systems using OpenBible!
Automatic generation of speech dataset markup with use of Wav2Vec2 ASR models
jaeeadjuster: Japanese-accented English & English adjuster