Justin Reppert (reppertj)

reppertj

Geek Repo

Company:Elicit

Location:Austin, TX

Home Page:https://www.justinreppert.com/

Github PK Tool:Github PK Tool


Organizations
elicit
oughtinc
recursecenter

Justin Reppert's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:27872Issues:228Issues:4698

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:18815Issues:117Issues:530

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:18813Issues:172Issues:1363

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:17508Issues:142Issues:745

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language:PythonLicense:Apache-2.0Stargazers:10207Issues:92Issues:760
Language:PythonLicense:Apache-2.0Stargazers:9003Issues:121Issues:98

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:8335Issues:90Issues:1833

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:6856Issues:50Issues:597

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Language:PythonLicense:Apache-2.0Stargazers:6630Issues:70Issues:1744

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:4344Issues:35Issues:1398

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:3807Issues:34Issues:516

CTranslate2

Fast inference engine for Transformer models

Language:C++License:MITStargazers:3276Issues:57Issues:693

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Language:PythonLicense:MITStargazers:2529Issues:48Issues:165

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1971Issues:45Issues:125

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1835Issues:15Issues:30

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:1154Issues:39Issues:76

pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Language:PythonLicense:Apache-2.0Stargazers:608Issues:9Issues:60

FLARE

Forward-Looking Active REtrieval-augmented generation (FLARE)

Language:PythonLicense:MITStargazers:579Issues:7Issues:22

Long-Context

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.

Language:PythonLicense:Apache-2.0Stargazers:578Issues:13Issues:6

ranx

⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍

Language:PythonLicense:MITStargazers:444Issues:11Issues:58

torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

Language:PythonLicense:NOASSERTIONStargazers:328Issues:21Issues:180

clownfish

Constrained Decoding for LLMs against JSON Schema

Language:PythonLicense:MITStargazers:321Issues:6Issues:2

tensorizer

Module, Model, and Tensor Serialization/Deserialization

Language:PythonLicense:MITStargazers:180Issues:23Issues:47

magix

Supercharge huggingface transformers with model parallelism.

scirepeval

SciRepEval benchmark training and evaluation scripts

Language:PythonLicense:Apache-2.0Stargazers:67Issues:6Issues:17

nccl-tests

NVIDIA NCCL Tests for Distributed Training

learned-sparse-retrieval

Unified Learned Sparse Retrieval Framework

Language:PythonLicense:Apache-2.0Stargazers:58Issues:4Issues:4

qdrant-lib

Extract core logic from qdrant and make it available as a library.

Language:DockerfileLicense:MITStargazers:21Issues:19Issues:10

spark-on-k8s-images

Driver/Executor images for spark-operator

Language:ShellLicense:Apache-2.0Stargazers:5Issues:3Issues:0