Hm Xiong (hmxiong)

hmxiong

Geek Repo

Company:Dalian University of Technology

Github PK Tool:Github PK Tool

Hm Xiong's repositories

Language:PythonStargazers:7Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:1Issues:0Issues:0

CRATE

Code for CRATE (Coding RAte reduction TransformEr).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CUDA-Learn-Note

🎉CUDA 笔记 / 大模型手撕CUDA / C++笔记,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

Language:CudaLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

github-slideshow

A robot powered training repository :robot:

Language:RubyLicense:MITStargazers:0Issues:0Issues:0

hallow

i just wanna learn deep learning

Stargazers:0Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

Language:CLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Tarurs

competition files

Stargazers:0Issues:0Issues:0

pytorch-distributed-training

Simple tutorials on Pytorch DDP training

Stargazers:0Issues:0Issues:0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

License:Apache-2.0Stargazers:0Issues:0Issues:0

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

License:Apache-2.0Stargazers:0Issues:0Issues:0