Beast code in Giters

VLAA@UCSC's repositories

CLIPA

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Language:PythonApache-2.0289 13 11

RobustCNN

[ICLR 2023] This repository includes the official implementation our paper "Can CNNs Be More Robust Than Transformers?"

Language:PythonMIT143 4 1

Recap-DataComp-1B

This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"

107 5 14

DMAE

[CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"

Language:PythonNOASSERTION96 5 8

SwinMM

[MICCAI 2023] This repository includes the official implementation our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"

Language:Python96 4 8

HQ-Edit

HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing

Language:PythonNOASSERTION61 6 6

vllm-safety-benchmark

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"

Language:Python51 4 1

CRATE-alpha

This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"

Language:Python35 2 1

EVP

[TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"

Language:PythonMIT35 10

MicroDiffusion

[CVPR 2024] This repository includes the official implementation our paper "MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections"

Language:Python23 4 4

FedConv

[TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning"

Language:PythonMIT22 10

Image-Pretraining-for-Video

[ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".

Language:PythonMIT1901

Sight-Beyond-Text

This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"

Language:PythonApache-2.019 2 1

MixCon3D

[CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"

Language:Python18 2 2

AdvXL

[CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"

Language:Python16 2 1

Redteaming_Challenge

Language:Python6 10

AQA-Bench

Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability

Language:PythonMIT4 10

vit_cert

[ECCV 2022] This repository includes the official implementation our paper "ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers"

Language:Python300

Compress-Align

This repository includes the official implementation and dataset of our paper "Compress & Align: Curating Image-Text Data with Human Knowledge".

2 2 1

UCSC-VLAA.github.io

Language:HTML000