sagorbrur

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Language:PythonApache-2.06579 71 1728

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonMIT6079 50 1011

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonBSD-3-Clause4010 46 560

Awesome-LLMOps

An awesome & curated list of best LLMOps tools for developers

Language:ShellCC0-1.03777 66 8

torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Language:PythonBSD-3-Clause3158 38 278

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Language:PythonMIT2518 48 164

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonApache-2.01959 44 120

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language:PythonApache-2.01870 34 1070

agentops

Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen

Language:PythonMIT1724 22 93

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

Apache-2.0832 4 2

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonBSD-3-Clause660 12 24

biniou

a self-hosted webui for 30+ generative ai

Language:PythonGPL-3.0451 11 23

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonApache-2.0440 33 5

llm_distillation_playbook

Best practices for distilling large language models.

Language:Jupyter Notebook376 120

awesome-tool-llm

173 4 2

IndicLLMSuite

A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages

Language:PythonMIT88 8 3

LLMEvaluation

A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods.

Language:HTML5000

sagorbrur

Sagor Sarker's starred repositories

LLaMA-Factory

LLMs-from-scratch

llama3

Qwen

llama3-from-scratch

gorilla

text-generation-inference

cudf

skypilot