Oyvind Tafjord's repositories
MRQA-Shared-Task-2019
Resources for the MRQA 2019 Shared Task
allennlp-demo
code for demo.allennlp.org
deep_qa
A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)
drop-leaderboard-example
An example submission to the AllenAI DROP leaderboard
helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
mesh
Mesh TensorFlow: Model Parallelism Made Easier
pipeline
Library for building reproducible data pipelines to support experimentation
transformers
PyTorch version of Google AI's BERT model with script to load Google's pre-trained models