alphaRGB

alphaRGB

Geek Repo

Company:XiDAIN

Location:Xi'an

Github PK Tool:Github PK Tool

alphaRGB's starred repositories

DL_Compiler

Study Group of Deep Learning Compiler

Stargazers:152Issues:0Issues:0

mlir-tutorial

MLIR For Beginners tutorial

Language:C++Stargazers:613Issues:0Issues:0

mlir-tutorial

Hands-On Practical MLIR Tutorial

Language:C++License:Apache-2.0Stargazers:180Issues:0Issues:0

alpa

Training and serving large-scale neural networks with auto parallelization.

Language:PythonLicense:Apache-2.0Stargazers:3004Issues:0Issues:0

tvm_mlir_learn

compiler learning resources collect.

Language:PythonStargazers:1896Issues:0Issues:0

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

Language:C++License:Apache-2.0Stargazers:765Issues:0Issues:0

Gemini

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Language:PythonLicense:MITStargazers:384Issues:0Issues:0

GPTQ-for-LLaMa

4 bits quantization of LLaMA using GPTQ

Language:PythonLicense:Apache-2.0Stargazers:2944Issues:0Issues:0

depyf

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Language:PythonLicense:MITStargazers:369Issues:0Issues:0

onnxsim_large_model

simplify >2GB large onnx model

Language:PythonLicense:MITStargazers:32Issues:0Issues:0

chatglm-q

Another ChatGLM2 implementation for GPTQ quantization

Language:PythonLicense:MITStargazers:54Issues:0Issues:0

llama.onnx

LLaMa/RWKV onnx models, quantization and testcase

Language:PythonLicense:GPL-3.0Stargazers:331Issues:0Issues:0

export_llama_to_onnx

export llama to onnx

Language:PythonLicense:MITStargazers:72Issues:0Issues:0

tpu-mlir

Machine learning compiler based on MLIR for Sophgo TPU.

Language:C++License:NOASSERTIONStargazers:506Issues:0Issues:0

sun-panel

A server, NAS navigation panel, Homepage, browser homepage. | 一个服务器、NAS导航面板、Homepage、浏览器首页。

Language:VueLicense:MITStargazers:2182Issues:0Issues:0

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language:CudaLicense:MITStargazers:313Issues:0Issues:0

ShallowSpeed

Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

Language:PythonStargazers:67Issues:0Issues:0

TensorNVMe

A Python library transfers PyTorch tensors between CPU and NVMe

Language:C++Stargazers:82Issues:0Issues:0

compiler-explorer

Run compilers interactively from your web browser and interact with the assembly

Language:TypeScriptLicense:BSD-2-ClauseStargazers:15508Issues:0Issues:0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5297Issues:0Issues:0

daily-accounting

a web site made by django to record income and expenses, show charts and statistics / django做的小网站用来记录日常开支和展示图表

Language:HTMLLicense:MITStargazers:36Issues:0Issues:0

fansfood

一个基于django的美食制作教程和美食图片的网站

Language:PythonLicense:BSD-3-ClauseStargazers:75Issues:0Issues:0

wmma_extension

An extension library of WMMA API (Tensor Core API)

Language:CudaLicense:MITStargazers:76Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:75Issues:0Issues:0

kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

Language:HTMLLicense:NOASSERTIONStargazers:644Issues:0Issues:0

Pytorch-Memory-Utils

pytorch memory track code

Language:PythonStargazers:965Issues:0Issues:0

scale-sim-v2

Repository to host and maintain scale-sim-v2 code

Language:PythonLicense:MITStargazers:187Issues:0Issues:0
Language:PythonLicense:MITStargazers:319Issues:0Issues:0

llm-cost-estimator

Estimating hardware and cloud costs of LLMs and transformer projects

Language:TypeScriptLicense:MITStargazers:6Issues:0Issues:0

MyCudaCode

练习的一些cuda代码

Language:CudaStargazers:2Issues:0Issues:0