rawsh

Robert Washbourne's starred repositories

nm-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonNOASSERTION22200

litgpt

Load, pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language:PythonApache-2.0804900

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonApache-2.0587600

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language:PythonApache-2.0551400

REMEDI

Inspecting and Editing Knowledge Representations in Language Models

Language:PythonMIT10200

gpt-prompt-engineer

Language:Jupyter NotebookMIT821100

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Language:PythonNOASSERTION1001600

dspy-rag-fastapi

FastAPI wrapper around DSPy

Language:PythonMIT15900

memory-compressed-attention

Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"

Language:PythonMIT7200

optimum-nvidia

Language:PythonApache-2.083100

snorkel

A system for quickly generating training data with weak supervision

Language:PythonApache-2.0574000

aphrodite-engine

PygmalionAI's large-scale inference engine

Language:PythonAGPL-3.073800

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonApache-2.057500

raptor

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Language:PythonMIT65300

neural-cherche

Neural Search

Language:PythonMIT31800

llama2-burn

Llama2 LLM ported to Rust burn

Language:RustMIT26300

burn

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.

Language:RustApache-2.0762400