leondz

NVIDIA · ITU Copenhagen

Copenhagen · Seattle

https://www.derczynski.com

@leonderczynski

Organizations

ITUnlp

Leon Derczynski's repositories

garak

LLM vulnerability scanner

Language: Python · License: Apache-2.0 · 1017 17 497

hatespeechdata

Catalog of abusive language data (PLoS 2020)

Language: Python · 292 22 11

entity_recognition

Framework for NER and other types of entity recognition, in Python

Language: TeX · License: Apache-2.0 · 68 18 20

lm_risk_cards

Risks and targets for assessing LLMs & LLM vulnerabilities

Language: Python · 21 50

autoredteam

autoredteam: code for training models that automatically red team other language models

Language: Python · License: MIT · 7 2 8

llmsec-site

7 20

generalised-brown

C++ implementation of Generalised Brown clustering and python scripts for feature generation (AAAI 2016)

Language: C++ · 2 20

acl-anthology

Data and software for building the ACL Anthology.

Language: Python · License: Apache-2.0 · 010

acl-style-files

Official style files for papers submitted to venues of the Association for Computational Linguistics

Language: TeX · 010

aclrollingreview

ACL Rolling Review website

Language: SCSS · License: MIT · 010

aclsigsec-web

020

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language: Python · License: MIT · 010

CyberAgressionAdo-v1

Dataset of teen cyberbullying scenarios in French

010

dagw-site

Language: HTML · License: MIT · 03 1

datasets

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language: Python · License: Apache-2.0 · 020

garak-test

quality tests for llmsec failure mode detectors

License: Apache-2.0 · 000

leondz

License: BSD-2-Clause · 020

llmsecurity

Language: SCSS · 020

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Language: Python · License: MIT · 010

mole-stance

MoLE: Cross-Domain Label-Adaptive Stance Detection

Language: Python · License: NOASSERTION · 010

nanoChatGPT

A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick

Language: Python · License: MIT · 010

nejlt-kickstart

Language: HTML · License: MIT · 020

Prompt-Engineering-Guide

🐙 Guide and resources for prompt engineering

License: MIT · 010

PyRIT

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.

Language: Python · License: MIT · 010

rtd-tutorial-template

Template for the Read the Docs tutorial

Language: Python · 010

Snowballed_Hallucination

010

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python · License: Apache-2.0 · 010

TrustGPT

Can We Trust Large Language Models?: A Benchmark for Responsible Large Language Models via Toxicity, Bias, and Value-alignment Evaluation

Language: Python · License: MIT · 010

vexillomesse

Language: HTML · License: GPL-3.0 · 020

www-project-top-10-for-large-language-model-applications

OWASP Foundation Web Repository

Language: HTML · License: NOASSERTION · 010