Christoph Minixhofer's repositories
simple-back
A simple daily python backtester that works out of the box.
opensubtitles-dataloader
Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.
punctuation-iwslt2011
Huggingface datasets script for pre-processing punctuation annotation using IWSLT11 dataset.
DroughtLoader
Loads NASA POWER and World Harmonized Soil Database data for drought prediction.
something-something-webdesign
Some resources for web design.
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
droughted_scripts
Scripts to create a weather + drought dataset for the US.
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
global_phone_dataloader
Huggingface Dataloader for the Global Phone dataset.
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
lightning
Build and train PyTorch models and connect them to the ML lifecycle using Lightning App templates, without handling DIY infrastructure, cost management, scaling, and other headaches.
nnAudio
Audio processing by using pytorch 1D convolution network
phonecodes
python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.
streamlit-cece
Example Streamlit app that you can fork to test out share.streamlit.io
tts-for-asr
TTS for Low Resource ASR