Ewanwong's repositories
lit
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
B-cos-v2
Official PyTorch implementation of improved B-cos models
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
self-debiasing
This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".
ToxificationReversal
Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)
open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
gpt4free
The official gpt4free repository | a collection of various powerful language models
bias-bench
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
ADEPT
Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).
cs230-code-examples
Code examples in PyTorch and TensorFlow for CS230
StereoSet
StereoSet: Measuring stereotypical bias in pretrained language models
eraserbenchmark
A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
Diffusion-LM
Diffusion-LM
research-method
Paper writing and resource sharing
controllable-nlg-biases
Framework for controlling demographic biases in NLG (using adversarial prompts)
PrefixTuning
Prefix-Tuning: Optimizing Continuous Prompts for Generation