hanlinxuy's repositories
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
RWKV_UNKNOWN
An RWKV implementation in pure PyTorch with DeepSpeed.
Awesome-LLM-Inference
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, continuous batching, FlashAttention, PagedAttention, etc.
HASCO
Agile hardware-software co-design.
nn_vis
A project for processing and rendering neural networks to gain insight into a model's architecture and parameters through a decluttered representation.
numpy
The fundamental package for scientific computing with Python.
Pheno_HTM
A MadGraph controller with multicore processing in Python. Also includes an analysis interface using matplotlib.
PriconneScrapper
A Princess Connect! Re:Dive info bot for Discord.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
QuIP_for_rwkv
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
RPTQ4LLM
Reorder-based post-training quantization for large language models.
RWKV-Ouroboros
A project for real-time training of the RWKV model.
torchscale
Foundation Architecture for (M)LLMs