Zhang-Yihao / Adversarial-Representation-Engineering

Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.

Home Page:https://arxiv.org/abs/2404.13752

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Zhang-Yihao/Adversarial-Representation-Engineering Watchers