forced-alignment

There are 26 repositories under the forced-alignment topic.

readbeyond / aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
speech alignment tts python linux macos windows nlp espeak espeak-ng festival cli dtw ffmpeg forced-alignment text audio srt smil text-to-speech
Language:Python 2389
MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
kaldi forced-alignment grapheme-to-phone pronunciation-dictionary acoustic-model python
Language:Python 1200
r4victor / syncabook
📖🎧 A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays)
audiobooks epub3 forced-alignment librivox ebooks
Language:HTML 227
mozilla / DSAlign
DeepSpeech based forced alignment tool
forced-alignment deepspeech
Language:Python 226
saurabhshri / CCAligner
🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
subtitles aligner subtitle-alignment closed-captions forced-alignment word-level-alignment transcription karaoke api cli phonetic-transcriptions speech-recognition pocketsphinx cpp google-summer-of-code gsoc-2017 gsoc ccextractor
Language:C++ 164
r4victor / afaligner
📈 A forced aligner intended for synchronization of narrated text
forced-alignment
Language:Python 77
echogarden-project / echogarden
Integrated speech toolset designed to be accessible to end-users. Fully open-source.
forced-alignment language-identification speech speech-alignment speech-recognition speech-synthesis speech-to-text speech-translation text-to-speech language-detection
Language:TypeScript 75
feldberlin / timething
Timething is a library for aligning text transcripts with their audio recordings.
audio forced-alignment alignment cli huggingface nlp python speech speech-recognition tts
Language:Jupyter Notebook 71
Telegram-Zalo / zac2022-lyric-alignment
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
deep-learning dynamic-programming forced-alignment pytorch wav2vec2 music-alignment vietnamese
Language:Python 66
ronggong / interspeech2018_submission01
Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions
beijing-opera singing-voice cnn keras hsmm hmm forced-alignment interspeech
Language:Python 46
amirharati / kaldi-alligner
Scripts to align a given WAV file to its transcription using models trained with Kaldi
kaldi asr kaldi-asr forced-alignment alignment
Language:Shell 32
joshchen984 / WriteMyVideo-Backend
WriteMyVideo's purpose is to help people create videos quickly and easily by simply typing out the video’s script and a description of images to include in the video.
python youtube video video-editing rq forced-alignment gentle
Language:Python 20
proger / uk
Phonograms and syntagms: processing tools
dataset-generation forced-alignment kaldi speech-recognition ukrainian-language hmm ukrainian
Language:Python 19
BayesForDays / gently
Gentle and praatio scripts for easy forced alignment
forced-alignment praat textgrid textgridtools speech-processing psycholinguistics phonetics phonology
18
jhdeov / interlingual-MFA
Workflow for forced alignment between languages
forced-alignment low-resource-languages montreal-forced-aligner multilingual-alignment cross-language cross-language-alignment
Language:Python 12
avinashvarna / audio_alignment
Align various Sanskrit texts and audio
read-along forced-alignment sanskrit
Language:Python 11
Giuseppe-Della-Corte / IESTAC
A corpus that can be used to train English-to-Italian End-to-End Speech-to-Text Machine Translation models
machine-translation speech-translation corpus parallel-corpus parallel-corpora end-to-end-machine-learning forced-alignment speech-processing mfcc-features bitext sentence-embeddings sentence-similarity statistical-machine-translation speech-recognition text-processing text-preprocessinig web-scraping named-entity-recognition audio-data sql-database
11
dcavar / ELAN2split
Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners
speech-recognition forced-alignment elan speech-corpus sox xerxes xml cpp11
Language:C++ 10
itsupera / audiobook_alignment
Align a Japanese audiobook with its text and create Anki sentence cards with audio.
forced-alignment python nlp japanese japanese-language anki anki-flashcards
Language:Python 10
Desklop / WebRTCVAD_Wrapper
A simple Python wrapper to simplify working with WebRTC VAD and its rougher analogue based on RMS and ZCR (useful for processing audio recordings before using them with neural networks).
vad vad-detection voice-activity-detection silence-suppression webrtc webrtc-tools dsp audio audio-processing webrtc-vad webrtcvad-wrapper forced-alignment python
Language:Python 9
zelaki / KaldiLongAligner
Speech to Text Alignment tool implemented with Python and Kaldi
forced-alignment
Language:Python 8
2017fandrei / ForcedAlignment
Graphical utility for forced alignment using aeneas, an interactive audio player
speech alignment python tkinter html javascript linux windows macos espeak ffmpeg forced-alignment text audio text-to-speech language-learning audio-player
Language:Python 6
michel-meneses / keyword-miner
A framework for generating labeled audio recordings of single-spoken keywords via automatic forced alignment.
keyword-spotting forced-alignment speech-processing
Language:Python 6
tiefenauer / ip9
Code for my master's thesis at FHNW
speech-recognition forced-alignment sequence-alignment nlp
Language:Python 6
zelaki / DisfluentFA
A Weakly Supervised Forced Alignment for disfluent speech
disfluency-detection forced-alignment interspeech2023
Language:Python 6
wxjiao / BERT-Text-Features
BERT-Text-Features for Tokenized Transcripts from P2FA.
text-features bert-embeddings forced-alignment p2fa
Language:Python 4
hrishikeshrt / audio_alignment
Align various Sanskrit texts and audio
sanskrit audio-alignment forced-alignment frontend read-along
Language:Python 3
Japan7 / yohane
Forced alignment for karaokes
forced-alignment karaoke pytorch aegisub songs
Language:Python 3
achen4290 / ForcedAligner
Flask implementation of Montreal Forced Aligner
flask montreal-forced-aligner forced-alignment
Language:Python 2
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
forced-alignment speech-recognition whisper asr
Language:Python 2
aiera-inc / gentle
gentle forced aligner
forced-alignment python cuda
Language:Python 1
bookbot-hive / OpenBible-TTS
Building Text-to-Speech Systems using OpenBible!
bible forced-alignment mms speech-synthesis text-to-spech openbible nlp swahili
Language:Jupyter Notebook 1
dangrebenkin / wav2vec2_speech_markuper
Automatic generation of speech dataset markup with use of Wav2Vec2 ASR models
forced-alignment speech-recognition speech-to-text audio-segmentation wav2vec2
Language:Python 1
scarletcho / evaluateFA
Evaluation script for a forced aligned TextGrid
forced-alignment evaluation
Language:Python 1
seanghay / kfa
A fast Khmer Forced Aligner powered by Wav2Vec2CTC and Phonetisaurus
alignment cambodia forced-alignment khmer wav2vec2
Language:Python 1
KiyotadaMori / jaeeadjuster
jaeeadjuster: Japanese-accented English & English adjuster
forced-alignment julius whisper japanese-accented
Language:Jupyter Notebook
