AnthenaMatrix / The-I-Exemption-Bypassing-LLM-Ethical-Filters

The "I" Exemption, is a curious behavior in some LLMs. We discover how these AI systems might shy away from directly assisting with unethical actions if you ask in the first person ("I"). But with a clever rephrase to a general scenario ("they"), they might spill the beans and explain the unethical method.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

AnthenaMatrix/The-I-Exemption-Bypassing-LLM-Ethical-Filters Stargazers