alphaRGB

alphaRGB

Geek Repo

Company:XiDAIN

Location:Xi'an

Github PK Tool:Github PK Tool

alphaRGB's starred repositories

compiler-explorer

Run compilers interactively from your web browser and interact with the assembly

Language:TypeScriptLicense:BSD-2-ClauseStargazers:15520Issues:250Issues:3175

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5302Issues:63Issues:89

alpa

Training and serving large-scale neural networks with auto parallelization.

Language:PythonLicense:Apache-2.0Stargazers:3004Issues:45Issues:295

GPTQ-for-LLaMa

4 bits quantization of LLaMA using GPTQ

Language:PythonLicense:Apache-2.0Stargazers:2944Issues:42Issues:216

sun-panel

A server, NAS navigation panel, Homepage, browser homepage. | 一个服务器、NAS导航面板、Homepage、浏览器首页。

Language:VueLicense:MITStargazers:2185Issues:11Issues:168

tvm_mlir_learn

compiler learning resources collect.

Pytorch-Memory-Utils

pytorch memory track code

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

Language:C++License:Apache-2.0Stargazers:766Issues:35Issues:231

kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

Language:HTMLLicense:NOASSERTIONStargazers:647Issues:26Issues:204

mlir-tutorial

MLIR For Beginners tutorial

tpu-mlir

Machine learning compiler based on MLIR for Sophgo TPU.

Language:C++License:NOASSERTIONStargazers:513Issues:21Issues:86

Gemini

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Language:PythonLicense:MITStargazers:385Issues:13Issues:8

depyf

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Language:PythonLicense:MITStargazers:370Issues:10Issues:20

llama.onnx

LLaMa/RWKV onnx models, quantization and testcase

Language:PythonLicense:GPL-3.0Stargazers:331Issues:13Issues:18
Language:PythonLicense:MITStargazers:319Issues:24Issues:0

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language:CudaLicense:MITStargazers:315Issues:3Issues:8

scale-sim-v2

Repository to host and maintain scale-sim-v2 code

Language:PythonLicense:MITStargazers:187Issues:4Issues:61

mlir-tutorial

Hands-On Practical MLIR Tutorial

Language:C++License:Apache-2.0Stargazers:181Issues:2Issues:3

DL_Compiler

Study Group of Deep Learning Compiler

TensorNVMe

A Python library transfers PyTorch tensors between CPU and NVMe

wmma_extension

An extension library of WMMA API (Tensor Core API)

Language:CudaLicense:MITStargazers:76Issues:7Issues:4
Language:PythonLicense:Apache-2.0Stargazers:75Issues:5Issues:5

fansfood

一个基于django的美食制作教程和美食图片的网站

Language:PythonLicense:BSD-3-ClauseStargazers:75Issues:0Issues:0

export_llama_to_onnx

export llama to onnx

Language:PythonLicense:MITStargazers:72Issues:1Issues:14

ShallowSpeed

Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

Language:PythonStargazers:67Issues:3Issues:0

chatglm-q

Another ChatGLM2 implementation for GPTQ quantization

Language:PythonLicense:MITStargazers:54Issues:3Issues:10

daily-accounting

a web site made by django to record income and expenses, show charts and statistics / django做的小网站用来记录日常开支和展示图表

Language:HTMLLicense:MITStargazers:36Issues:3Issues:1

onnxsim_large_model

simplify >2GB large onnx model

Language:PythonLicense:MITStargazers:32Issues:1Issues:8

llm-cost-estimator

Estimating hardware and cloud costs of LLMs and transformer projects

Language:TypeScriptLicense:MITStargazers:6Issues:2Issues:1

MyCudaCode

练习的一些cuda代码

Language:CudaStargazers:2Issues:0Issues:0