Hm Xiong's repositories

CRATE

Code for CRATE (Coding RAte reduction TransformEr).

Language: Python, License: MIT

CUDA-Learn-Note

🎉 CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated irregularly: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Language: Cuda, License: GPL-3.0

github-slideshow

A robot-powered training repository 🤖

Language: Ruby, License: MIT

llama2.c

Inference Llama 2 in one file of pure C

Language: C, License: MIT

pytorch-distributed-training

Simple tutorials on PyTorch DDP training

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embeddings.

License: Apache-2.0

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

License: Apache-2.0

hmxiong

Transformer-Series

OpenMMLabCamp

paper-reading

CRATE

CUDA-Learn-Note

GaLore

github-slideshow

hallow

llama2.c

ScanNet_Vis

Tarurs

pytorch-distributed-training

RWKV-LM

VILA