SPY Lab (ethz-spylab)

Secure and Private AI research at ETH Zürich

Location: Switzerland

Home Page: https://spylab.ai

SPY Lab's repositories

rlhf_trojan_competition

Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.

Language: Python | License: Apache-2.0 | Stargazers: 96 | Issues: 5 | Issues: 5

rlhf-poisoning

Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"

Language: Python | License: Apache-2.0 | Stargazers: 35 | Issues: 0 | Issues: 0

diffusion_denoised_smoothing

Certified robustness "for free" using off-the-shelf diffusion models and classifiers

Language: Python | License: MIT | Stargazers: 32 | Issues: 2 | Issues: 3

Language: Python | License: MIT | Stargazers: 26 | Issues: 0 | Issues: 0

realistic-adv-examples

Code for the paper "Evading Black-box Classifiers Without Breaking Eggs" [SaTML 2024]

Language: Python | License: MIT | Stargazers: 19 | Issues: 2 | Issues: 1

satml-llm-ctf

Code used to run the platform for the LLM CTF colocated with SaTML 2024

Language: Python | License: MIT | Stargazers: 13 | Issues: 0 | Issues: 0

lm_memorization_data

Data for "Quantifying Memorization Across Neural Language Models"

License: Apache-2.0 | Stargazers: 6 | Issues: 0 | Issues: 1

lm-extraction-benchmark-data

Datasets for the SaTML 2023 competition on training data extraction

License: Apache-2.0 | Stargazers: 4 | Issues: 0 | Issues: 0

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

privacy

Library for training machine learning models with privacy for training data

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

misleading-privacy-evals

Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading"

Stargazers: 0 | Issues: 0 | Issues: 0