喵哩个咪's starred repositories

LowMemoryBP

The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"

Language: Python · License: MIT · Stars: 14 · Issues: 0

cron

A cron library for Go

Language: Go · License: MIT · Stars: 12905 · Issues: 0

Autofocus

Implementation of different autofocus functions in Python. The main goal is to efficiently obtain the maximal contrast between pixels

Stars: 15 · Issues: 0

CDAF

Contrast Detection Auto Focus

Language: Python · Stars: 2 · Issues: 0
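The two autofocus repos above rely on the same principle: a well-focused image has stronger local contrast than a defocused one, so a lens can be driven to the position that maximizes a contrast score. A minimal illustrative sketch of such a score (a simple squared-gradient measure; not code from either repository):

```python
def contrast_score(image):
    """Sum of squared differences between horizontally adjacent pixels.

    A sharper (better-focused) image has stronger local contrast, so in
    contrast-detection autofocus the lens position that maximizes this
    score is taken as the in-focus position.
    """
    score = 0
    for row in image:
        for a, b in zip(row, row[1:]):
            score += (b - a) ** 2
    return score

# The same edge, blurred vs. sharp: the sharp version scores higher.
blurred = [[0, 1, 2, 3, 4, 5]]
sharp   = [[0, 0, 0, 5, 5, 5]]
assert contrast_score(sharp) > contrast_score(blurred)
```

Real implementations typically use richer metrics (Laplacian variance, Tenengrad) over 2D neighborhoods, but the search loop is the same: sweep the focus motor, keep the position with the highest score.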

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python · License: Apache-2.0 · Stars: 1761 · Issues: 0

ao

PyTorch native quantization and sparsity for training and inference

Language: Python · License: BSD-3-Clause · Stars: 564 · Issues: 0

AutoGGUF

Automatically quantize GGUF models

Language: Python · License: Apache-2.0 · Stars: 109 · Issues: 0

ComfyUI-GGUF

GGUF Quantization support for native ComfyUI models

Language: Python · License: Apache-2.0 · Stars: 540 · Issues: 0

jsonformer

A Bulletproof Way to Generate Structured JSON from Language Models

Language: Jupyter Notebook · License: MIT · Stars: 4323 · Issues: 0

outlines

Structured Text Generation

Language: Python · License: Apache-2.0 · Stars: 8046 · Issues: 0
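jsonformer and outlines both implement structured generation by the same core trick: at every decoding step, tokens that would violate the target structure are masked out before sampling, so the output is valid by construction. A toy sketch of that idea (the grammar and "model scores" here are invented for illustration, not either library's real API):

```python
# Constrained decoding sketch: a tiny state machine plays the role of a
# JSON grammar, and only tokens it permits are eligible at each step.

def constrained_decode(score_fn, allowed_fn, start_state, max_steps=10):
    state, out = start_state, []
    for _ in range(max_steps):
        allowed = allowed_fn(state)        # tokens the grammar permits here
        if not allowed:
            break
        # pick the highest-scoring token among the allowed ones only
        token = max(allowed, key=lambda t: score_fn(out, t))
        out.append(token)
        state = token                      # toy grammar: state = last token
    return out

# Toy grammar generating the string {"a":<digit>}
grammar = {
    "START": ['{'], '{': ['"a"'], '"a"': [':'], ':': ['7', '8'],
    '7': ['}'], '8': ['}'],
}
# Toy "model": blindly prefers '8', yet the grammar still forces valid JSON.
scores = lambda out, tok: 2.0 if tok == '8' else 1.0

result = "".join(constrained_decode(scores, grammar.get, "START"))
assert result == '{"a":8}'
```

The real libraries do the masking over an LLM's full token vocabulary (outlines compiles regexes/JSON schemas into such state machines), but the guarantee is the same: every emitted sequence parses.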

ollama-copilot

Proxy that allows you to use Ollama as a coding copilot, like GitHub Copilot

Language: Go · Stars: 259 · Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python · License: BSD-3-Clause · Stars: 201 · Issues: 0
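The memory efficiency of flash-attention comes from computing softmax incrementally with a running maximum, so attention can be evaluated tile by tile without ever materializing the full score matrix. A minimal one-row sketch of that "online softmax" trick (illustration only, not the CUDA kernel):

```python
import math

def online_softmax(scores):
    """Single left-to-right pass: maintain a running max m and a running
    normalizer d, rescaling d whenever a new maximum appears."""
    m = float("-inf")   # running max (numerical stability)
    d = 0.0             # running normalizer
    for x in scores:
        m_new = max(m, x)
        d = d * math.exp(m - m_new) + math.exp(x - m_new)  # rescale old sum
        m = m_new
    return [math.exp(x - m) / d for x in scores]

def naive_softmax(scores):
    e = [math.exp(x) for x in scores]
    s = sum(e)
    return [v / s for v in e]

row = [1.0, 3.0, 2.0, 5.0]
assert all(abs(a - b) < 1e-12
           for a, b in zip(online_softmax(row), naive_softmax(row)))
```

Because the normalizer can be updated as new score tiles arrive, the kernel streams keys/values through fast on-chip memory instead of writing the O(n²) attention matrix to HBM — which is what makes the exact attention both fast and memory-efficient.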

flash-attention-2-builds

(Unofficial) Manual builds of wheels for https://github.com/Dao-AILab/flash-attention for Windows x64

License: BSD-3-Clause · Stars: 11 · Issues: 0

SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Language: Python · License: AGPL-3.0 · Stars: 1394 · Issues: 0

SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Language: Python · License: MIT · Stars: 624 · Issues: 0
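The "dense-and-sparse" idea in SqueezeLLM's title can be sketched simply: a few large-magnitude outlier weights are kept exact in a sparse structure, while the remaining dense part — whose range is now narrow — is quantized to a small integer grid. The threshold, bit-width, and uniform grid below are illustrative assumptions, not the paper's actual algorithm (which uses non-uniform, sensitivity-aware levels):

```python
def split_and_quantize(weights, outlier_threshold=1.0, bits=3):
    """Store outliers exactly; uniformly quantize the dense remainder."""
    levels = 2 ** bits - 1
    dense = [0.0 if abs(w) > outlier_threshold else w for w in weights]
    sparse = {i: w for i, w in enumerate(weights)
              if abs(w) > outlier_threshold}

    # Uniform quantization over the dense part's (now small) range.
    lo, hi = min(dense), max(dense)
    scale = (hi - lo) / levels or 1.0
    q = [round((w - lo) / scale) for w in dense]

    out = [lo + qi * scale for qi in q]
    for i, w in sparse.items():          # splice the exact outliers back in
        out[i] = w
    return out

w = [0.1, -0.2, 4.0, 0.05, -3.5, 0.3]    # two outliers: 4.0 and -3.5
restored = split_and_quantize(w)
assert restored[2] == 4.0 and restored[4] == -3.5   # outliers exact
assert max(abs(a - b) for a, b in zip(w, restored)) < 0.1
```

The payoff is that without the outliers the quantization range shrinks dramatically, so the same bit budget yields much smaller error on the bulk of the weights — the same motivation behind the KVQuant entry below, applied there to KV-cache activations.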

KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language: Python · Stars: 273 · Issues: 0

BlueLM

BlueLM (蓝心大模型): Open large language models developed by vivo AI Lab

Language: Python · License: NOASSERTION · Stars: 824 · Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python · License: Apache-2.0 · Stars: 25515 · Issues: 0

torch-bnb-fp4

Faster PyTorch bitsandbytes 4-bit FP4 nn.Linear ops

Language: Python · License: MIT · Stars: 22 · Issues: 0

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Language: Python · License: Apache-2.0 · Stars: 1495 · Issues: 0

BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Language: Python · License: MIT · Stars: 285 · Issues: 0

optimum-quanto

A PyTorch quantization backend for Optimum

Language: Python · License: Apache-2.0 · Stars: 735 · Issues: 0

ComfyUI-AutomaticCFG

If your image were a pizza and the CFG the temperature of your oven, this is a thermostat that ensures it is always cooked the way you want. Also adds a 30% speed increase. For ComfyUI / Stable Diffusion

Language: Python · Stars: 323 · Issues: 0

litgpt

20+ high-performance LLMs with recipes to pretrain, fine-tune, and deploy at scale.

Language: Python · License: Apache-2.0 · Stars: 9485 · Issues: 0