Beast code in Giters

alphaRGB's starred repositories

compiler-explorer

Run compilers interactively from your web browser and interact with the assembly

Language: TypeScript | License: BSD-2-Clause | 15520 250 3175

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | 5302 63 89

alpa

Training and serving large-scale neural networks with auto parallelization.

Language: Python | License: Apache-2.0 | 3004 45 295

GPTQ-for-LLaMa

4-bit quantization of LLaMA using GPTQ

Language: Python | License: Apache-2.0 | 2944 42 216

sun-panel

A server, NAS navigation panel, Homepage, browser homepage.

Language: Vue | License: MIT | 2185 11 168

tvm_mlir_learn

A collection of compiler learning resources.

Language: Python | 1896 35 4

Pytorch-Memory-Utils

PyTorch memory tracking code

Language: Python | 965 16 22

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

Language: C++ | License: Apache-2.0 | 766 35 231

kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

Language: HTML | License: NOASSERTION | 647 26 204

mlir-tutorial

MLIR For Beginners tutorial

Language: C++ | 613 16 16

tpu-mlir

Machine learning compiler based on MLIR for Sophgo TPU.

Language: C++ | License: NOASSERTION | 513 21 86

Gemini

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Language: Python | License: MIT | 385 13 8

depyf

depyf is a tool to help you understand and adapt to the PyTorch compiler, torch.compile.

Language: Python | License: MIT | 370 10 20

llama.onnx

LLaMA/RWKV ONNX models, quantization, and test cases

Language: Python | License: GPL-3.0 | 331 13 18

SCALE-Sim

Language: Python | License: MIT | 319 240

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language: Cuda | License: MIT | 315 3 8

scale-sim-v2

Repository to host and maintain scale-sim-v2 code

Language: Python | License: MIT | 187 4 61

mlir-tutorial

Hands-On Practical MLIR Tutorial

Language: C++ | License: Apache-2.0 | 181 2 3

DL_Compiler

Study Group of Deep Learning Compiler

152 160

TensorNVMe

A Python library that transfers PyTorch tensors between CPU and NVMe

Language: C++ | 82 6 2

wmma_extension

An extension library of WMMA API (Tensor Core API)

Language: Cuda | License: MIT | 76 7 4

calculon

Language: Python | License: Apache-2.0 | 75 5 5

fansfood

A Django-based website of cooking tutorials and food pictures

Language: Python | License: BSD-3-Clause | 75 0 0

export_llama_to_onnx

Export LLaMA to ONNX

Language: Python | License: MIT | 72 1 14

ShallowSpeed

Small-scale distributed training of sequential deep learning models, built on NumPy and MPI.

Language: Python | 67 3 0

chatglm-q

Another ChatGLM2 implementation for GPTQ quantization

Language: Python | License: MIT | 54 3 10

daily-accounting

A website built with Django to record income and expenses and show charts and statistics

Language: HTML | License: MIT | 36 3 1

onnxsim_large_model

Simplify large (>2GB) ONNX models

Language: Python | License: MIT | 32 1 8

llm-cost-estimator

Estimating hardware and cloud costs of LLMs and transformer projects

Language: TypeScript | License: MIT | 6 2 1

MyCudaCode

Some CUDA code written for practice

Language: Cuda | 2 0 0