Beast code in Giters

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Language:PythonMIT000

llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

Language:HTML000

llumnix

Efficient and easy multi-instance LLM serving

Language:PythonApache-2.0000

lolcats

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Apache-2.0000

mem0

The memory layer for Personalized AI

Language:Python000

mllm

Fast Multimodal LLM on Mobile Devices

Language:C++MIT000

Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Language:CudaApache-2.0000

ONNXim

ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference

Language:C++MIT000

PromptIR

PromptIR: Prompting for All-in-One Blind Image Restoration [NeurIPS 2023]

Language:PythonNOASSERTION000

punica

Serving multiple LoRA finetuned LLM as one

Language:PythonApache-2.0000

swiftLLM

A tiny yet powerful LLM inference system tailored for researching purpose

Language:Python000

tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

Language:Python000

vidur

A large-scale simulation framework for LLM inference

Language:PythonMIT000