LoryPack / LLM-LieDetector

Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"

Home Page:https://openreview.net/forum?id=567BjxgaTp

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LoryPack/LLM-LieDetector Issues