Seungwook Kim's starred repositories
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
threestudio
A unified framework for 3D content generation.
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
torchtitan
A native PyTorch Library for large model training
FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
gangealing
Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)
accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
MVDream-threestudio
3D generation code for MVDream
Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
threestudio-3dgs
3D Gaussian Splatting extension of threestudio.
FlowDiffusion_pytorch
Unofficial pytorch implementation of DDVM.
einops-exts
Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️
CorrespondentDream
Official PyTorch implementation of CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences (CVPR 2024 Poster)
Sparsifiner
Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"