Leon Lu (gfvvz)

gfvvz

Geek Repo

Location:Shanghai, China

Github PK Tool:Github PK Tool

Leon Lu's repositories

Triton-Compiler

Triton Compiler related materials.

License:MITStargazers:25Issues:0Issues:0

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.

License:NOASSERTIONStargazers:1Issues:0Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

Stargazers:0Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:0Issues:0Issues:0

triton-shared

Shared Middle-Layer for Triton Compilation

Language:MLIRLicense:MITStargazers:0Issues:0Issues:0

Building-llama3-from-scratch

LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.

Stargazers:0Issues:0Issues:0

cmake_example

Example pybind11 module built with a CMake-based build system

License:NOASSERTIONStargazers:0Issues:0Issues:0

FlagGems

FlagGems is an operator library for large language models implemented in Triton Language.

License:Apache-2.0Stargazers:0Issues:0Issues:0

gfvvz.github.io

Build a Jekyll blog in minutes, without touching the command line.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

gpt-2-output-dataset

Dataset of GPT-2 outputs for research in detection, biases, and more

License:MITStargazers:0Issues:0Issues:0

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

License:MITStargazers:0Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Stargazers:0Issues:0Issues:0

llm-from-scratch

llama3 implementation one matrix multiplication at a time

License:MITStargazers:0Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:0Issues:0Issues:0

md-blogs

A blog where I write about research papers and blog posts I read.

Stargazers:0Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

License:MITStargazers:0Issues:0Issues:0

mlir-tutorial

Hands-On Practical MLIR Tutorial

License:Apache-2.0Stargazers:0Issues:0Issues:0

mojo

The Mojo Programming Language

License:NOASSERTIONStargazers:0Issues:0Issues:0

pytorch-transformer

Attention is all you need implementation

Stargazers:0Issues:0Issues:0

resource-stream

CUDA related news and material links

License:MITStargazers:0Issues:0Issues:0

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Stargazers:0Issues:0Issues:0

triton-cpu

An experimental CPU backend for Triton (https//github.com/openai/triton)

License:MITStargazers:0Issues:0Issues:0

Triton-Puzzles

Puzzles for learning Triton

License:Apache-2.0Stargazers:0Issues:0Issues:0

tvm-cn

TVM Documentation in Chinese Simplified / TVM 中文文档

Language:JavaScriptStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0