Wonjae Kim (dandelin)

Company: @naver-ai

Location: South Korea

Home Page: http://wonjae.kim


Organizations
Deepest-Project
SNU-HCIL

Wonjae Kim's starred repositories

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 43889 | Issues: 295 | Issues: 621

MiniGPT-4

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | License: BSD-3-Clause | Stargazers: 24854 | Issues: 219 | Issues: 437

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.

Language: JavaScript | License: MIT | Stargazers: 16275 | Issues: 141 | Issues: 638

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 15941 | Issues: 152 | Issues: 1230

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 13401 | Issues: 112 | Issues: 355

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7793 | Issues: 92 | Issues: 343

consistency_models

Official repo for consistency models.

Language: Python | License: MIT | Stargazers: 5922 | Issues: 61 | Issues: 49

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 5638 | Issues: 72 | Issues: 514

Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

Language: Python | License: BSD-3-Clause | Stargazers: 1598 | Issues: 15 | Issues: 21

KoAlpaca

KoAlpaca: An open-source language model that understands Korean instructions

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1493 | Issues: 29 | Issues: 98

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 1414 | Issues: 32 | Issues: 192

scholarly

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!

Language: Python | License: Unlicense | Stargazers: 1223 | Issues: 26 | Issues: 271

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language: Python | License: MIT | Stargazers: 858 | Issues: 8 | Issues: 16

meerkat

Creative interactive views of any dataset.

Language: Python | License: Apache-2.0 | Stargazers: 808 | Issues: 15 | Issues: 83

Image2Paragraph

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Language: Python | License: Apache-2.0 | Stargazers: 758 | Issues: 11 | Issues: 28

rich-text-to-image

Rich-Text-to-Image Generation

Language: Python | License: MIT | Stargazers: 727 | Issues: 20 | Issues: 15

segment-anything-with-clip

Segment Anything combined with CLIP

Language: Python | License: Apache-2.0 | Stargazers: 297 | Issues: 1 | Issues: 4

Stable-DINO

[ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"

trident

A performance library for machine learning applications.

Language: Python | License: Apache-2.0 | Stargazers: 175 | Issues: 4 | Issues: 8

zozo-shift15m

SHIFT15M: Fashion-specific dataset for set-to-set matching with several distribution shifts

Language: Python | License: NOASSERTION | Stargazers: 161 | Issues: 67 | Issues: 106

Visual-LLaMA

Open LLaMA Eyes to See the World

lgssl

[CVPR 2023] Learning Visual Representations via Language-Guided Sampling

Language: Python | License: MIT | Stargazers: 140 | Issues: 2 | Issues: 4

plugins

AI plugins for apps like chatGPT :)

Language: JavaScript | Stargazers: 120 | Issues: 4 | Issues: 0

fuckvkeypad

A tool for bypassing virtual keyboards (vKeypad)

Language: Python | License: MIT | Stargazers: 50 | Issues: 1 | Issues: 0

ETA4LLMs

Calculates the expected time for training an LLM.

Language: Python | Stargazers: 38 | Issues: 1 | Issues: 0

Paint-Anything

An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.

Language: Python | License: Apache-2.0 | Stargazers: 33 | Issues: 3 | Issues: 1

FLM

Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)

Language: Python | Stargazers: 31 | Issues: 6 | Issues: 0

refid2bib

biorxiv, doi, and arxiv ids -> bibtex entry

Language: Python | License: BSD-3-Clause | Stargazers: 8 | Issues: 0 | Issues: 0

Language: Jupyter Notebook | License: MIT | Stargazers: 5 | Issues: 0 | Issues: 0