Location: Berkeley, California
Home Page: danhendrycks.com
Twitter: @danhendrycks
JAILBREAK PROMPTS FOR ALL MAJOR AI MODELS
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
RuLES: a benchmark for evaluating rule-following in language models
PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures (CVPR 2022)
AI that uses genetic algorithms to beat the game 2048