Michal Stefanik's starred repositories
llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
prefetch_generator
Simple package that makes your generator work in background thread
composable-sft
A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
prompterator
Iterate efficiently towards more effective prompts
rag-demystified
An LLM-powered advanced RAG pipeline built from scratch
CoT-Collection
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
text-to-motion-retrieval
Official code for reproducing results obtained in the short paper "Text-to-Motion Retrieval: Towards Joint Understanding of Human Motion Data and Natural Language", accepted at SIGIR 2023.
small-text
Active Learning for Text Classification in Python
pv211-utils
Utilities for the term project in the PV211 introduction to information retrieval course
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
matchmaker
Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch
genomic_benchmarks
Benchmarks for classification of genomic sequences
master-thesis
One Bit at a Time: Impact of Quantisation on Neural Machine Translation
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
docker-libreoffice-headless
Libreoffice as headless docker container