kyusonglee

Kyusong Lee's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonMIT165920 1553 2539

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION67176 558 707

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonApache-2.040276 393 1292

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonBSD-3-Clause25252 221 456

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonMIT19344 147 261

StableLM

StableLM: Stability AI Language Models

Language:Jupyter NotebookApache-2.015854 201 76

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Language:Jupyter NotebookGPL-3.013147 113 1816

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookMIT11336 96 340

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonMIT10122 65 105

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonApache-2.08184 73 401

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.07519 109 152

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Language:PythonApache-2.06164 68 247

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonMIT5635 46 292

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonMIT3538 31 253

make-sense

Free to use online tool for labelling photos. https://makesense.ai

Language:TypeScriptGPL-3.03073 53 187

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonApache-2.01881 24 88

chatglm_finetuning

chatglm 6b finetuning and alpaca finetuning

Language:Python1534 20 246

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language:PythonApache-2.01278 34 68

TinyGPT-V

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Language:PythonBSD-3-Clause1229 19 33

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Language:Python1131 14 14

CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1111 37 6

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonMIT887 9 17

CLIPasso

Language:Jupyter NotebookNOASSERTION823 14 28

attentions

PyTorch implementation of some attentions for Deep Learning Researchers.

Language:PythonMIT505 3 4

fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Language:Jupyter NotebookApache-2.0471 12 37

soho

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Language:Python206 10 13

fight-detection-surv-dataset

New generated dataset for fight detection in surveillance cameras.

MIT147 6 3

VL-CheckList

Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations.

Language:Python124 6 11

PB-OVD

A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels

Language:PythonBSD-3-Clause53 5 6

Install-Slurm

Install Slurm on CentOS-7 Virtual Cluster.

3100