Foundation Model Inference (FMInference)

Foundation Model Inference

FMInference

Geek Repo

Inference Systems for Foundation Models

Github PK Tool:Github PK Tool

Foundation Model Inference's repositories

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonLicense:Apache-2.0Stargazers:9037Issues:105Issues:80

H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.