ReDS Lab (reds-lab)


Responsible Data Science Lab @ Virginia Tech | AI Security & Privacy & More

Twitter: @reds_lab_vt


ReDS Lab's repositories

Narcissus

The official implementation of the CCS'23 paper on the Narcissus clean-label backdoor attack: it takes only three images to poison a face recognition dataset in a clean-label way, achieving a 99.89% attack success rate.

Language: Python | License: MIT | Stargazers: 105 | Issues: 2 | Issues: 10

LAVA

This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR 2023).

Language: Python | License: MIT | Stargazers: 44 | Issues: 0 | Issues: 2

CLIP-MIA

This is an official repository for "Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study" (ICCV 2023).

Language: Jupyter Notebook | License: MIT | Stargazers: 20 | Issues: 0 | Issues: 2

Meta-Sift

The official implementation of the USENIX Security'23 paper "Meta-Sift": ten minutes or less to find a clean subset of 1,000 or more samples in a poisoned dataset.

Language: Python | Stargazers: 18 | Issues: 2 | Issues: 0

ASSET

This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning Paradigms." ASSET achieves state-of-the-art reliability in detecting poisoned samples across end-to-end supervised learning, self-supervised learning, and transfer learning.

Language: Python | License: MIT | Stargazers: 17 | Issues: 3 | Issues: 2

projektor

This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (NeurIPS 2023).

Language: Python | License: MIT | Stargazers: 13 | Issues: 0 | Issues: 0

Universal_Pert_Cert

This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calculate the certified robustness against universal perturbations (UAPs and backdoors) for a given trained model.

Language: Python | License: MIT | Stargazers: 12 | Issues: 3 | Issues: 1

BEEAR

This is the official GitHub repo for our paper "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models".

2d-shapley

This is an official repository for "2D-Shapley: A Framework for Fragmented Data Valuation" (ICML 2023).

Language: Jupyter Notebook | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 1

Nash-Meta-Learning

Official implementation of "Fairness-Aware Meta-Learning via Nash Bargaining." We explore hypergradient conflicts in one-stage meta-learning and their impact on fairness. Our two-stage approach uses Nash bargaining to mitigate conflicts, enhancing fairness and model performance simultaneously.

Language: Jupyter Notebook | Stargazers: 4 | Issues: 0 | Issues: 0

privmon

This is an official repository for "PrivMon: A Stream-Based System for Real-Time Privacy Attack Detection for Machine Learning Models" (RAID 2023).

Language: Python | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0

Knowledge-Enriched-DMI

The official implementation of the ICCV 2021 paper, "Knowledge-Enriched Distributional Model Inversion Attacks."

Language: Python | License: MIT | Stargazers: 3 | Issues: 0 | Issues: 0

I-BAU

The official implementation of the ICLR 2022 paper "Adversarial Unlearning of Backdoors via Implicit Hypergradient."

Language: Jupyter Notebook | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0

frequency-backdoor

The official implementation of the ICCV 2021 paper "Rethinking the Backdoor Attacks' Triggers: A Frequency Perspective."

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

preference-learning-with-rationales

This is the public repository for "Data-Centric Human Preference Optimization with Rationales."

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

dataselection

Projektor Website

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

reds-lab.github.io

Homepage portfolio of ReDS Lab projects

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0