burtenshaw's repositories
agenta
The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.
argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
bs4_scraping
A quick introductory class on scraping with Beautiful Soup and wrangling scraped tables with pandas.
burtenshaw.github.io
Personal website for research, design, code, and life.
CCNLG_2019
Convert data from EasyChair for use with aclpub
See-Whence
Sequence classification base code, used for PhD thesis and SemEval 2020 sarcasm detection.
spanwijdte
Binary and multilabel toxic span detection in Dutch.
data-is-better-together
Let's build better datasets, together!
distilabel
⚗️ AI Feedback framework for scalable LLM alignment
kidscrawler
A child-safe web crawler
llm-autoeval
Automatically evaluate your LLMs in Google Colab
OpenNMT-py
Open-Source Neural Machine Translation in PyTorch http://opennmt.net/
orpo
Official repository for ORPO
share-lm
ShareLM is a Chrome extension that lets you share your open-source conversations
trl
Train transformer language models with reinforcement learning.