alphaRGB

XiDAIN

Xi'an

alphaRGB's starred repositories

scale-sim-v2

Repository to host and maintain scale-sim-v2 code

Language: Python | License: MIT | 194 | 0 | 0

SCALE-Sim

Language: Python | License: MIT | 321 | 0 | 0

llm-cost-estimator

Estimating hardware and cloud costs of LLMs and transformer projects

Language: TypeScript | License: MIT | 6 | 0 | 0

MyCudaCode

Some CUDA code written for practice

Language: Cuda | 2 | 0 | 0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | 12716 | 0 | 0

OpenBLAS

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

Language: C | License: BSD-3-Clause | 6179 | 0 | 0

cuda_sgemm

Language: Cuda | 94 | 0 | 0

how-to-optimize-gemm

row-major matmul optimization

Language: C++ | License: GPL-3.0 | 566 | 0 | 0

how-to-optimize-gemm

Language: C | 1679 | 0 | 0

YHs_Sample

Yinghan's Code Sample

Language: Cuda | License: GPL-3.0 | 259 | 0 | 0

parallelformers

Parallelformers: An Efficient Model Parallelization Toolkit for Deployment

Language: Python | License: Apache-2.0 | 766 | 0 | 0

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python | License: Apache-2.0 | 2558 | 0 | 0

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Language: Cuda | License: Apache-2.0 | 763 | 0 | 0

tutorials

MONAI Tutorials

Language: Jupyter Notebook | License: Apache-2.0 | 1718 | 0 | 0

how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Language: Cuda | 1243 | 0 | 0

QvPlugin-Trojan

Use Trojan in Qv2ray, thanks to Trojan-Qt5 0.x

Language: C++ | License: GPL-3.0 | 348 | 0 | 0

hysteria

Hysteria is a powerful, lightning fast and censorship resistant proxy.

Language: Go | License: MIT | 13985 | 0 | 0

joplin

Joplin - the secure note taking and to-do app with synchronisation capabilities for Windows, macOS, Linux, Android and iOS.

Language: TypeScript | License: NOASSERTION | 44383 | 0 | 0

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Language: Python | License: GPL-3.0 | 42691 | 0 | 0

llama.cpp

LLM inference in C/C++

Language: C++ | License: MIT | 61911 | 0 | 0

netdata

The open-source observability platform everyone needs!

Language: C | License: GPL-3.0 | 69673 | 0 | 0

py12306

🚂 12306 ticket-booking assistant with support for clusters, multiple accounts, multi-task booking, and web page management

Language: Python | License: Apache-2.0 | 14250 | 0 | 0

12306

12306 smart ticket polling and booking

Language: Python | License: MIT | 33739 | 0 | 0

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | 12028 | 0 | 0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | 1753 | 0 | 0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | 1284 | 0 | 0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language: Python | License: BSD-3-Clause | 8194 | 0 | 0

maestro

An analytical cost model evaluating DNN mappings (dataflows and tiling).

Language: MATLAB | License: MIT | 171 | 0 | 0

profiler-workshop

Example code for profiler workshop

Language: Python | License: MIT | 24 | 0 | 0

GLM

GLM (General Language Model)

Language: Python | License: MIT | 3129 | 0 | 0