Kyusong Lee (kyusonglee)

kyusonglee

Geek Repo

Company:SOCO AI

Location:Seattle

Home Page:soco.ai

Github PK Tool:Github PK Tool


Organizations
DialRC
soco-ai

Kyusong Lee's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:165920Issues:1553Issues:2539

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:67176Issues:558Issues:707

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40276Issues:393Issues:1292

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25252Issues:221Issues:456

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonLicense:MITStargazers:19344Issues:147Issues:261

StableLM

StableLM: Stability AI Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:15854Issues:201Issues:76

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:13147Issues:113Issues:1816

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:11336Issues:96Issues:340

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:10122Issues:65Issues:105

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8184Issues:73Issues:401

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7519Issues:109Issues:152

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Language:PythonLicense:Apache-2.0Stargazers:6164Issues:68Issues:247

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonLicense:MITStargazers:5635Issues:46Issues:292

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:3538Issues:31Issues:253

make-sense

Free to use online tool for labelling photos. https://makesense.ai

Language:TypeScriptLicense:GPL-3.0Stargazers:3073Issues:53Issues:187

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1881Issues:24Issues:88

chatglm_finetuning

chatglm 6b finetuning and alpaca finetuning

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language:PythonLicense:Apache-2.0Stargazers:1278Issues:34Issues:68

TinyGPT-V

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Language:PythonLicense:BSD-3-ClauseStargazers:1229Issues:19Issues:33

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonLicense:MITStargazers:887Issues:9Issues:17
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:823Issues:14Issues:28

attentions

PyTorch implementation of some attentions for Deep Learning Researchers.

Language:PythonLicense:MITStargazers:505Issues:3Issues:4

fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:471Issues:12Issues:37

soho

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

fight-detection-surv-dataset

New generated dataset for fight detection in surveillance cameras.

VL-CheckList

Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations.

PB-OVD

A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels

Language:PythonLicense:BSD-3-ClauseStargazers:53Issues:5Issues:6

Install-Slurm

Install Slurm on CentOS-7 Virtual Cluster.

Stargazers:31Issues:0Issues:0