VLAA@UCSC (UCSC-VLAA)

Home Page: https://ucsc-vlaa.github.io/

VLAA@UCSC's repositories

CLIPA

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Language: Python | License: Apache-2.0 | Stargazers: 289 | Issues: 13 | Issues: 11

RobustCNN

[ICLR 2023] This repository includes the official implementation of our paper "Can CNNs Be More Robust Than Transformers?"

Language: Python | License: MIT | Stargazers: 143 | Issues: 4 | Issues: 1

Recap-DataComp-1B

This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"

DMAE

[CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"

Language: Python | License: NOASSERTION | Stargazers: 96 | Issues: 5 | Issues: 8

SwinMM

[MICCAI 2023] This repository includes the official implementation of our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"

HQ-Edit

HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing

Language: Python | License: NOASSERTION | Stargazers: 61 | Issues: 6 | Issues: 6

vllm-safety-benchmark

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"

CRATE-alpha

This repository includes the official implementation of our paper "Scaling White-Box Transformers for Vision"

EVP

[TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"

Language: Python | License: MIT | Stargazers: 35 | Issues: 1 | Issues: 0

MicroDiffusion

[CVPR 2024] This repository includes the official implementation of our paper "MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections"

FedConv

[TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning"

Language: Python | License: MIT | Stargazers: 22 | Issues: 1 | Issues: 0

Image-Pretraining-for-Video

[ECCV 2022] This repository includes the official implementation of our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".

Language: Python | License: MIT | Stargazers: 19 | Issues: 0 | Issues: 1

Sight-Beyond-Text

This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"

Language: Python | License: Apache-2.0 | Stargazers: 19 | Issues: 2 | Issues: 1

MixCon3D

[CVPR 2024] The official implementation of the paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"

AdvXL

[CVPR 2024] This repository includes the official implementation of our paper "Revisiting Adversarial Training at Scale"

AQA-Bench

Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability

Language: Python | License: MIT | Stargazers: 4 | Issues: 1 | Issues: 0

vit_cert

[ECCV 2022] This repository includes the official implementation of our paper "ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers"

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0

Compress-Align

This repository includes the official implementation and dataset of our paper "Compress & Align: Curating Image-Text Data with Human Knowledge".

Language: HTML | Stargazers: 0 | Issues: 0 | Issues: 0