Sukjun Hwang (sukjunhwang)

sukjunhwang

Geek Repo

Company:Carnegie Mellon University

Location:Pittsburgh, PA

Home Page:sukjunhwang.github.io

Github PK Tool:Github PK Tool


Organizations
ciplab

Sukjun Hwang's starred repositories

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:1276Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:20637Issues:0Issues:0
Language:PythonStargazers:191Issues:0Issues:0

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:11337Issues:0Issues:0

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1199Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:21909Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3653Issues:0Issues:0

attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Language:PythonLicense:MITStargazers:411Issues:0Issues:0

lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Language:PythonLicense:Apache-2.0Stargazers:1052Issues:0Issues:0

Score-Entropy-Discrete-Diffusion

[ICML 2024 Oral] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Language:PythonLicense:MITStargazers:212Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:17Issues:0Issues:0

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1060Issues:0Issues:0

resource-stream

CUDA related news and material links

License:MITStargazers:905Issues:0Issues:0

sam

SAM: Sharpness-Aware Minimization (PyTorch)

Language:PythonLicense:MITStargazers:1681Issues:0Issues:0

LongMamba

Some preliminary explorations of Mamba's context scaling.

Language:PythonStargazers:170Issues:0Issues:0

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:27247Issues:0Issues:0

SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Language:PythonLicense:MITStargazers:532Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:115Issues:0Issues:0

contrastors

Train Models Contrastively in Pytorch

Language:PythonLicense:Apache-2.0Stargazers:440Issues:0Issues:0

memray

Memray is a memory profiler for Python

Language:PythonLicense:Apache-2.0Stargazers:12705Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:4739Issues:0Issues:0

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language:PythonLicense:MITStargazers:161Issues:0Issues:0

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonLicense:MITStargazers:601Issues:0Issues:0

DCNv4

[CVPR 2024] Deformable Convolution v4

Language:PythonLicense:MITStargazers:386Issues:0Issues:0

mvit

Code Release for MViTv2 on Image Recognition.

Language:PythonLicense:Apache-2.0Stargazers:364Issues:0Issues:0

ml-cvnets

CVNets: A library for training computer vision networks

Language:PythonLicense:NOASSERTIONStargazers:1708Issues:0Issues:0

awesome-ssm-ml

Reading list for research topics in state-space models

License:MITStargazers:155Issues:0Issues:0

LLM-Training-Puzzles

What would you do with 1000 H100s...

Language:Jupyter NotebookLicense:MITStargazers:778Issues:0Issues:0

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookLicense:MITStargazers:5236Issues:0Issues:0

Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Language:Jupyter NotebookLicense:MITStargazers:2864Issues:0Issues:0