tml-epfl / llm-adaptive-attacks

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024]

Home Page:https://arxiv.org/abs/2404.02151

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

tml-epfl/llm-adaptive-attacks Stargazers