Leon Derczynski's repositories
hatespeechdata
Catalog of abusive language data (PLoS 2020)
entity_recognition
Framework for NER and other types of entity recognition, in Python
lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
autoredteam
autoredteam: code for training models that automatically red team other language models
generalised-brown
C++ implementation of Generalised Brown clustering and Python scripts for feature generation (AAAI 2016)
acl-anthology
Data and software for building the ACL Anthology.
acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
aclrollingreview
ACL Rolling Review website
CyberAgressionAdo-v1
Dataset of teen cyberbullying scenarios in French
garak-test
quality tests for llmsec failure mode detectors
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
mole-stance
MoLE: Cross-Domain Label-Adaptive Stance Detection
nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
Prompt-Engineering-Guide
🐙 Guide and resources for prompt engineering
rtd-tutorial-template
Template for the Read the Docs tutorial
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
www-project-top-10-for-large-language-model-applications
OWASP Foundation Web Repository