Beast code in Giters

Michael Mi's starred repositories

chatbot-ui

AI chat for every model.

Language: TypeScript | License: MIT | 27256 242 934

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | 11852 185 1270

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | 11449 98 386

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | 10661 67 664

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Python | License: Apache-2.0 | 7428 112 287

LookaheadDecoding

Language: Python | License: Apache-2.0 | 1030 11 55

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language: Cuda | License: Apache-2.0 | 752 13 62

Jinja2Cpp

Jinja2 C++ (and for C++) almost full-conformance template engine implementation

Language: C++ | License: MPL-2.0 | 478 17 133

nvbench

CUDA Kernel Benchmarking Library

Language: Cuda | License: Apache-2.0 | 439 18 89

ByteTransformer

Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052

Language: C++ | License: Apache-2.0 | 434 10 10

hal-9100

Edge full-stack LLM platform. Written in Rust.

Language: Rust | License: MIT | 361 11 79

calm

CUDA/Metal accelerated language model inference

Language: C | License: MIT | 335 90

pyglove

Manipulating Python Programs

Language: Python | License: Apache-2.0 | 320 6 25

ScaleLLM

A high-performance inference system for large language models, designed for production environments.

Language: C++ | License: Apache-2.0 | 315 15 65

run-clang-format

A wrapper script around clang-format, suitable for linting multiple files and for use in continuous integration

Language: Python | License: MIT | 235 7 22

cuda_hgemm

Several optimization methods for half-precision general matrix multiplication (HGEMM) using tensor cores with the WMMA API and MMA PTX instructions.

Language: Cuda | License: MIT | 228 4 11

MatmulTutorial

An Easy-to-understand TensorOp Matmul Tutorial

Language: C++ | License: Apache-2.0 | 212 8 9

sirius

A Plonkish folding framework for Incrementally Verifiable Computation (IVC).

Language: Rust | License: MIT | 105 5 81

langfun

Empower LLMs with Symbols.

Language: Python | License: Apache-2.0 | 84 5 1

cute-gemm

Language: C++ | 52 2 4

SA-Segment-Anything

Vision-oriented multimodal AI

Language: Jupyter Notebook | License: Apache-2.0 | 46 40

LLMBench

A library for validating and benchmarking LLM inference.

Language: Python | License: Apache-2.0 | 4 2 1