aflueckiger's starred repositories
NOAH-Corpus
NOAH's Corpus: Part-of-Speech Tagging for Swiss German
PyMuPDF-Utilities
Demos, examples and utilities using PyMuPDF
lm-hackers
Hackers' Guide to Language Models
context-aware-word-vectors
Context aware word vectors
text-clustering
Easily embed, cluster and semantically label text datasets
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
latent-scope
A scientific instrument for investigating latent spaces
datamapplot
Creating beautiful plots of data maps
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
pytesseract
A Python wrapper for Google Tesseract