umiswing

User data from GitHub: https://github.com/umiswing

NEU

China

https://umiswing.github.io/

Organizations

AyakaGEMM

PaddlePaddle

umiswing's starred repositories

DeepSeek-V3

Language: Python | License: MIT | 92531 721 456

DeepSeek-R1

License: MIT | 86828 615 481

raylib

A simple and easy-to-use library to enjoy videogames programming

Language: C | License: Zlib | 25458 299 1939

abseil-cpp

Abseil Common Libraries (C++)

Language: C++ | License: Apache-2.0 | 15613 597 911

BitNet

Official inference framework for 1-bit LLMs

Language: C++ | License: MIT | 12813 133 107

CppTemplateTutorial

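A Chinese-language tutorial on C++ templates. Unlike the well-known book C++ Templates, this series teaches C++ templates as a Turing-complete language, aiming to help readers master metaprogramming. (Work in progress)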

Language: C++ | 10002 524 45

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language: SystemVerilog | 7996 72 25

XiangShan

Open-source high-performance RISC-V processor

Language: Scala | License: NOASSERTION | 6204 96 501

perf-book

The book "Performance Analysis and Tuning on Modern CPU"

Language: TeX | License: CC0-1.0 | 2884 77 27

perf-ninja

This is an online course where you can learn and master the skill of low-level performance analysis and tuning.

Language: C++ | 2879 112 43

eglot

A client for Language Server Protocol servers

Language: Emacs Lisp | License: GPL-3.0 | 2341 37 592

ThunderKittens

Tile primitives for speedy kernels

Language: Cuda | License: MIT | 2157 38 42

spconv

Spatial Sparse Convolution Library

Language: Python | License: Apache-2.0 | 1988 23 709

copilot.el

An unofficial Copilot plugin for Emacs.

Language: Emacs Lisp | License: MIT | 1987 44 236

stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Language: C++ | License: Apache-2.0 | 1799 58 566

Tencent-Hunyuan-Large

Language: Python | License: NOASSERTION | 1454 26 17

ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

Language: C | License: NOASSERTION | 1245 85 2105

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

License: Apache-2.0 | 1117 27 11

tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Language: C++ | License: MIT | 721 6 39

ventus-gpgpu

GPGPU processor supporting RISCV-V extension, developed with Chisel HDL

Language: Scala | License: MulanPSL-2.0 | 710 14 33

duo-attention

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Language: Python | License: MIT | 435 9 14

Star-Attention

Efficient LLM Inference over Long Sequences

Language: Python | License: Apache-2.0 | 365 7 4

dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

Language: C | License: Apache-2.0 | 237 6 34

PhoenixOS

Fast OS-level support for GPU checkpoint and restore

Language: C++ | License: Apache-2.0 | 170 10 11

cute-gemm

Language: C++ | 107 2 5

AttentionEngine

Language: Python | License: MIT | 49 4 1

tccl

Thunder Research Group's Collective Communication Library

Language: C++ | License: NOASSERTION | 33 3 3

L-Mul

C implementation of the L-Mul f32/f16 multiplications from paper: https://arxiv.org/html/2410.00907

Language: C | 27 10

ConvStencil

Language: Cuda | License: MIT | 25 3 4

HopperTest

Language: C++ | 9 20