tianshuocong

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Language:PythonMIT20200

FigStep

Jailbreaking Large Vision-language Models via Typographic Visual Prompts

Language:PythonMIT5900

toxic-prompt

Language:Python1400

baadd

Code for Backdoor Attacks Against Dataset Distillation

Language:PythonApache-2.03000

cinic-10

A drop-in replacement for CIFAR-10.

Language:Jupyter NotebookMIT23200

TePA

[S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models

Language:Python1400

Lion

Code for "Lion: Adversarial Distillation of Proprietary Large Language Models (EMNLP 2023)"

Language:PythonMIT19300

MGTBench

Language:PythonMIT12700

Plot_Steal

Language:Python800

MLHospital

Language:PythonMIT4200

MART

Modular Adversarial Robustness Toolkit

Language:PythonBSD-3-Clause1600

PyTorch_CIFAR10

Pretrained TorchVision models on CIFAR10 dataset (with weights)

Language:PythonMIT62100

TransferAttackEval

Revisiting Transferable Adversarial Images (arXiv)

Language:Python11000

Targeted-Transfer

Simple yet effective targeted transferable attack (NeurIPS 2021)

Language:PythonMIT4700

DUA

The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization (CVPR 2022)

Language:Python5400

ML-Doctor

Code for ML Doctor

Language:PythonApache-2.07800

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonApache-2.02747900

lightning-hydra-template

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Language:Python384900