hellbell

Sangdoo Yun's starred repositories

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.011132 75 458

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT6293 61 76

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookApache-2.04247 57 330

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonMIT2082 23 19

moco-v3

PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057

Language:PythonNOASSERTION1166 18 34

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language:PythonNOASSERTION1081 13 22

LLM-Reading-List

LLM papers I'm reading, mostly on inference and model compression

673 210

landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers

Language:PythonApache-2.0394 40 14

open_lm

A repository for research on medium sized language models.

Language:PythonMIT324 21 59

gisting

Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467

Language:PythonApache-2.0244 6 16

vision-language-models-are-bows

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Language:PythonMIT207 8 36

HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Language:PythonBSD-3-Clause186 4 10

ConceptBottleneck

Concept Bottleneck Models, ICML 2020

Language:PythonMIT159 5 13

SEARLE

[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion

Language:PythonNOASSERTION123 13 8

meru

Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023

Language:PythonNOASSERTION115 8 7

tcl

Official implementation of TCL (CVPR 2023)

Language:PythonMIT102 11 8

SRe2L

(NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original ImageNet-1K val set.

Language:Python94 8 9

lincir

Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)

Language:PythonNOASSERTION71 7 12

close

Language:PythonApache-2.051 7 4

CLIP-Parrot-Bias

Parrot Captions Teach CLIP to Spot Text

Language:PythonApache-2.051 3 2

WaffleCLIP

Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"

Language:PythonMIT48 3 2

Context-Memory

Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)

Language:PythonMIT44 4 2

pause-transformer

Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount of time on any token

Language:PythonMIT42 40