Qubitium's repositories
alpaca-lora
Instruct-tune LLaMA on consumer hardware
android-app
Official ProtonVPN Android app
flash-attention
Fast and memory-efficient exact attention
flashinfer
FlashInfer: Kernel Library for LLM Serving
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
auto-round
SOTA Weight-only Quantization Algorithm for LLMs
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization, with a 2x speedup during inference
C4_200M-synthetic-dataset-for-grammatical-error-correction
This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the dataset are described in more detail by Stahlberg and Kumar (2021) (https://www.aclweb.org/anthology/2021.bea-1.4/)
GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
GPTQ-triton
GPTQ inference Triton kernel
hyperDB
A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $35M cap.
llama.cpp
Port of Facebook's LLaMA model in C/C++
protonvpn-cli-ng
Linux command-line client for ProtonVPN. Written in Python.
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
the-algorithm
Source code for Twitter's Recommendation Algorithm
unsloth
5X faster, 60% less memory QLoRA finetuning
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ZeroTierOne
A Smart Ethernet Switch for Earth