umiswing

umiswing

User data from Github https://github.com/umiswing

Company:NEU

Location:China

Home Page:https://umiswing.github.io/

GitHub:@umiswing


Organizations
AyakaGEMM
PaddlePaddle

umiswing's starred repositories

raylib

A simple and easy-to-use library to enjoy videogames programming

abseil-cpp

Abseil Common Libraries (C++)

Language:C++License:Apache-2.0Stargazers:15613Issues:597Issues:911

BitNet

Official inference framework for 1-bit LLMs

CppTemplateTutorial

中文的C++ Template的教学指南。与知名书籍C++ Templates不同,该系列教程将C++ Templates作为一门图灵完备的语言来讲授,以求帮助读者对Meta-Programming融会贯通。(正在施工中)

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language:SystemVerilogStargazers:7996Issues:72Issues:25

XiangShan

Open-source high-performance RISC-V processor

Language:ScalaLicense:NOASSERTIONStargazers:6204Issues:96Issues:501

perf-book

The book "Performance Analysis and Tuning on Modern CPU"

Language:TeXLicense:CC0-1.0Stargazers:2884Issues:77Issues:27

perf-ninja

This is an online course where you can learn and master the skill of low-level performance analysis and tuning.

eglot

A client for Language Server Protocol servers

Language:Emacs LispLicense:GPL-3.0Stargazers:2341Issues:37Issues:592

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:2157Issues:38Issues:42

spconv

Spatial Sparse Convolution Library

Language:PythonLicense:Apache-2.0Stargazers:1988Issues:23Issues:709

copilot.el

An unofficial Copilot plugin for Emacs.

Language:Emacs LispLicense:MITStargazers:1987Issues:44Issues:236

stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Language:C++License:Apache-2.0Stargazers:1799Issues:58Issues:566

ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

Language:CLicense:NOASSERTIONStargazers:1245Issues:85Issues:2105

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Language:C++License:MITStargazers:721Issues:6Issues:39

ventus-gpgpu

GPGPU processor supporting RISCV-V extension, developed with Chisel HDL

Language:ScalaLicense:MulanPSL-2.0Stargazers:710Issues:14Issues:33

duo-attention

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Language:PythonLicense:MITStargazers:435Issues:9Issues:14

Star-Attention

Efficient LLM Inference over Long Sequences

Language:PythonLicense:Apache-2.0Stargazers:365Issues:7Issues:4

dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

Language:CLicense:Apache-2.0Stargazers:237Issues:6Issues:34

PhoenixOS

Fast OS-level support for GPU checkpoint and restore

Language:C++License:Apache-2.0Stargazers:170Issues:10Issues:11

tccl

Thunder Research Group's Collective Communication Library

Language:C++License:NOASSERTIONStargazers:33Issues:3Issues:3

L-Mul

C implementation of the L-Mul f32/f16 multiplications from paper: https://arxiv.org/html/2410.00907

Language:CStargazers:27Issues:1Issues:0
Language:C++Stargazers:9Issues:2Issues:0