Beast code in Giters

Antti Puurula's starred repositories

linux

Linux kernel source tree

Language:CNOASSERTION175312 79590

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.023646 219 3619

:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

Language:C++MIT22125 165 772

FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Language:Jupyter NotebookMIT12729 243 104

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookMIT9745 84 247

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:Python9110 112 189

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonApache-2.08472 99 1227

axolotl

Go ahead and axolotl questions

Language:PythonApache-2.06856 50 597

skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Language:PythonApache-2.06324 71 1649

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonApache-2.03513 33 1133

CTranslate2

Fast inference engine for Transformer models

Language:C++MIT3090 57 663

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Language:PythonApache-2.02342 59 708

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonMIT2169 24 159

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonApache-2.02118 24 174