Lennart Keller's repositories
roberta2longformer
Convert pretrained RoBERTa models to various long-document transformer models
memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), an attention network augmented with indexing and retrieval of memories using approximate nearest neighbors, in PyTorch
block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer in PyTorch
CharsiuG2P
Multilingual G2P in 100 languages
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
germancoref
Dutch coreference resolution and dialogue analysis using deterministic rules, adapted for German
IMSDB_EPub
Scrape movie scripts from imsdb.com and convert them to e-books
phonepiece
phone inventory library
rouge-score
Fork of Google's ROUGE score implementation
SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
sosap
🗣️ sosap(សូរសព្ទ) Python binding for Phonetisaurus
speechbrain
A PyTorch-based Speech Toolkit
stilometry_paper
Compare Realism and Romanticism
Stylesheets
TEI XSL Stylesheets
summary_align
align summaries with segments of novels
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
transphone
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
trident
Generic model training framework abolishing boilerplate
uroman
Universal Romanizer that can convert any Unicode script to Roman (Latin) script