Sangdoo Yun (hellbell)

hellbell

Geek Repo

Company:Naver AI Lab

Location:Seoul, Korea

Home Page:https://sangdooyun.github.io

Github PK Tool:Github PK Tool

Sangdoo Yun's starred repositories

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:11132Issues:75Issues:458

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6293Issues:61Issues:76

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4247Issues:57Issues:330

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonLicense:MITStargazers:2082Issues:23Issues:19

moco-v3

PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057

Language:PythonLicense:NOASSERTIONStargazers:1166Issues:18Issues:34

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language:PythonLicense:NOASSERTIONStargazers:1081Issues:13Issues:22

LLM-Reading-List

LLM papers I'm reading, mostly on inference and model compression

landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers

Language:PythonLicense:Apache-2.0Stargazers:394Issues:40Issues:14

open_lm

A repository for research on medium sized language models.

Language:PythonLicense:MITStargazers:324Issues:21Issues:59

gisting

Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467

Language:PythonLicense:Apache-2.0Stargazers:244Issues:6Issues:16

vision-language-models-are-bows

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Language:PythonLicense:MITStargazers:207Issues:8Issues:36

HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Language:PythonLicense:BSD-3-ClauseStargazers:186Issues:4Issues:10

ConceptBottleneck

Concept Bottleneck Models, ICML 2020

Language:PythonLicense:MITStargazers:159Issues:5Issues:13

SEARLE

[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion

Language:PythonLicense:NOASSERTIONStargazers:123Issues:13Issues:8

meru

Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023

Language:PythonLicense:NOASSERTIONStargazers:115Issues:8Issues:7

tcl

Official implementation of TCL (CVPR 2023)

Language:PythonLicense:MITStargazers:102Issues:11Issues:8

SRe2L

(NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original ImageNet-1K val set.

lincir

Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)

Language:PythonLicense:NOASSERTIONStargazers:71Issues:7Issues:12
Language:PythonLicense:Apache-2.0Stargazers:51Issues:7Issues:4

CLIP-Parrot-Bias

Parrot Captions Teach CLIP to Spot Text

Language:PythonLicense:Apache-2.0Stargazers:51Issues:3Issues:2

WaffleCLIP

Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"

Language:PythonLicense:MITStargazers:48Issues:3Issues:2

Context-Memory

Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)

Language:PythonLicense:MITStargazers:44Issues:4Issues:2

pause-transformer

Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount of time on any token

Language:PythonLicense:MITStargazers:42Issues:4Issues:0

dual-teacher

Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"

Language:PythonLicense:NOASSERTIONStargazers:34Issues:1Issues:5

clippy

Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:33Issues:3Issues:1

imagenet-12k

ImageNet-12k subset of ImageNet-21k (fall11)

Language:PythonLicense:Apache-2.0Stargazers:17Issues:3Issues:0

Neural-Relation-Graph

Official PyTorch implementation of "Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data" (NeurIPS'23)

Language:PythonLicense:MITStargazers:15Issues:2Issues:0
Language:PythonLicense:Apache-2.0Stargazers:8Issues:6Issues:1

STAI-tuned

Utility code from STAI (https://scalabletrustworthyai.github.io/)

Language:Jupyter NotebookLicense:MITStargazers:7Issues:0Issues:0