Sangdoo Yun's starred repositories
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
consistencydecoder
Consistency Distilled Diff VAE
LLM-Reading-List
LLM papers I'm reading, mostly on inference and model compression
landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
ConceptBottleneck
Concept Bottleneck Models, ICML 2020
CLIP-Parrot-Bias
Parrot Captions Teach CLIP to Spot Text
WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"
Context-Memory
PyTorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
pause-transformer
Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount of time on any token
dual-teacher
Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"
imagenet-12k
ImageNet-12k subset of ImageNet-21k (fall11)
Neural-Relation-Graph
Official PyTorch implementation of "Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data" (NeurIPS'23)
STAI-tuned
Utility code from STAI (https://scalabletrustworthyai.github.io/)