xlite-dev / Awesome-LLM-Inference

📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉

Repository from GitHub: https://github.com/xlite-dev/Awesome-LLM-Inference
