Yang Wang's repositories
blue-rdma
RoCEv2 hardware implementation in Bluespec SystemVerilog
enso
Ensō is a high-performance streaming interface for NIC-application communication.
fast_pytorch_kmeans
A PyTorch implementation of the k-means clustering algorithm
hqq
Official implementation of Half-Quadratic Quantization (HQQ)
kotomamba
Mamba training library developed by kotoba technologies
KVQuant
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Latte
Latte: Latent Diffusion Transformer for Video Generation.
Mamba_SSM
A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)
microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats
mojo
The Mojo Programming Language
PolyLUT
PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial function learning to exploit the flexibility of the FPGA soft logic.
quanto
A PyTorch quantization toolkit
QUIK
Repository for the QUIK project, enabling the use of 4-bit kernels for generative inference
REST
REST: Retrieval-Based Speculative Decoding
switchboard
Communication framework for RTL simulation and emulation.
Vim
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
ViR
Official Repository for ViR: Towards Efficient Vision Retention Backbones
VMamba
VMamba: Visual State Space Models
vu13p-resource
Documentation and resources for a Chinese-made VU13P accelerator card
vu13p_corundum
Corundum work on the VU13P
xmir-patcher
Firmware patcher for Xiaomi routers