Soonhwan-Kwon's starred repositories
Papers-on-Ternary-and-Binary-Networks
Papers and codes about Quantized Networks for easier survey and reference.
Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
data_tooling
Tools for managing datasets for governance and training.
languageid
Identifying the language of input text using character-level n-grams, with support for 45 languages
llm-foundry
LLM training code for Databricks foundation models
LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
ama_prompting
Ask Me Anything language model prompting
Image-Caption-Quality-Dataset
A dataset of crowdsourced ratings for machine-generated image captions
eccv-caption
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
watermark-detection
A repository containing datasets and tools to train a watermark classifier.
ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
sigir2020-tablesum
Summarizing and Exploring Tabular Data in Conversational Search (SIGIR '20)
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch