Christoph Minixhofer's repositories
alignments
Automatically creates/downloads alignments for multiple speech datasets, using pre-existing alignments were possible.
speech-collator
A collator for speech datasets with different batching strategies and attribute extraction.
minixc.github.io
My own website.
speech-datasets
Preprocessing pipeline for speech datasets.
charsiu
Charsiu: A neural phonetic aligner.
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
FastPitchesForCirrus
Deep Learning Examples
libriheavy-small
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context (only small split)
masked-prosody-modeling
A masked prosody model.
masked-prosody-modeling-evaluation
A masked prosody model & it's evaluation on downstream tasks.
miipher-3.9
Unofficial implementation of miipher
ml-template
Template for my machine learning projects.
Montreal-Forced-Aligner-3.12-fix
Command line utility for forced alignment using Kaldi
phonemizer-object
Simple text to phones converter for multiple languages - using an object instead of a function
vocex2
Vocex with whisper encoder and additional targets.
whisper-no-triton
Robust Speech Recognition via Large-Scale Weak Supervision (without triton)