JunHa Song (junha1125)

junha1125

Geek Repo

Company:KAIST

Location: South Korea

Home Page:https://junha1125.blogspot.com/

Github PK Tool:Github PK Tool

JunHa Song's starred repositories

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:3491Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:7041Issues:0Issues:0

pycocoevalcap

Python 3 support for the MS COCO caption evaluation tools

Language:PythonLicense:NOASSERTIONStargazers:295Issues:0Issues:0

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language:PythonLicense:MITStargazers:3472Issues:0Issues:0

minillm

MiniLLM is a minimal system for running modern LLMs on consumer-grade GPUs

Language:PythonLicense:MITStargazers:842Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18475Issues:0Issues:0

FiGCLIP

Official repository for FiGCLIP

Stargazers:7Issues:0Issues:0

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language:PythonLicense:NOASSERTIONStargazers:1355Issues:0Issues:0

Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

Stargazers:438Issues:0Issues:0

ViECap

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023

Language:PythonStargazers:139Issues:0Issues:0

recognize-anything

Codebase for the Recognize Anything Model (RAM)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:55Issues:0Issues:0

smallcap

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation

Language:Jupyter NotebookStargazers:85Issues:0Issues:0

CLIP_prefix_caption

Simple image captioning model

Language:Jupyter NotebookLicense:MITStargazers:1274Issues:0Issues:0

recognize-anything

Open-source and strong foundation image recognition models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2644Issues:0Issues:0

MIC

[CVPR23] Official Implementation of MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation

Language:PythonStargazers:256Issues:0Issues:0

corenet

CoreNet: A library for training deep neural networks

Language:PythonLicense:NOASSERTIONStargazers:6854Issues:0Issues:0

clip-distillation

Zero-label image classification via OpenCLIP knowledge distillation

Language:PythonLicense:NOASSERTIONStargazers:101Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10969Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:9Issues:0Issues:0

DreamLIP

[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions

Language:PythonStargazers:72Issues:0Issues:0
Language:PythonStargazers:96Issues:0Issues:0

Awesome-Mamba-Papers

Awesome Papers related to Mamba.

Stargazers:1012Issues:0Issues:0

home-robot

Mobile manipulation research tools for roboticists

Language:PythonLicense:MITStargazers:830Issues:0Issues:0

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language:PythonLicense:NOASSERTIONStargazers:509Issues:0Issues:0

RIPU

[CVPR 2022 Oral] Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic Segmentation https://arxiv.org/abs/2111.12940

Language:PythonLicense:MITStargazers:139Issues:0Issues:0

cs229-2018-autumn

All notes and materials for the CS229: Machine Learning course by Stanford University

Language:Jupyter NotebookStargazers:1619Issues:0Issues:0

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonLicense:Apache-2.0Stargazers:2678Issues:0Issues:0

BECoTTA

Code for "BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation [ICML2024]".

Language:PythonStargazers:25Issues:0Issues:0

PETL-ViT

[ICCV 2023] Binary Adapters, [AAAI 2023] FacT, [Tech report] Convpass

Language:PythonLicense:MITStargazers:164Issues:0Issues:0