mirceamironenco

Mircea Mironenco's starred repositories

h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Language: Python | License: Apache-2.0 | 10820 156 1061

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | 6697 59 137

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python | License: GPL-3.0 | 5559 78 141

llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

Language: Python | License: Apache-2.0 | 4070 40 114

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | 2570 36 131

leptonai

A Pythonic framework to simplify AI service building

Language: Python | License: Apache-2.0 | 2488 21 52

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python | License: Apache-2.0 | 2110 26 54

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language: Python | License: Apache-2.0 | 1688 29 208

training-operator

Distributed ML Training and Fine-Tuning on Kubernetes

Language: Go | License: Apache-2.0 | 1490 86 924

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language: Python | License: BSD-3-Clause | 1333 21 38

maxtext

A simple, performant and scalable Jax LLM!

Language: Python | License: Apache-2.0 | 1310 23 61

ThunderKittens

Tile primitives for speedy kernels

Language: Cuda | License: MIT | 1176 25 11

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language: Python | License: Apache-2.0 | 849 10 26

benchmark

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Language: Python | License: BSD-3-Clause | 796 226 843

OpenOOD

Benchmarking Generalized Out-of-Distribution Detection

Language: Python | License: MIT | 778 8 98

Triton-Puzzles

Puzzles for learning Triton

Language: Jupyter Notebook | License: Apache-2.0 | 733 6 7

quaterion

Blazing fast framework for fine-tuning similarity learning models

Language: Python | License: Apache-2.0 | 627 10 79

tensordict

TensorDict is a PyTorch-dedicated tensor container.

Language: Python | License: MIT | 606 28 87

llm-finetuning

Guide for fine-tuning Llama/Mistral/CodeLlama models and more

Language: Python | License: MIT | 444 6 19

pytorch_influence_functions

This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence Functions by Pang Wei Koh and Percy Liang.

Language: Python | License: NOASSERTION | 302 7 31

RLHF-Reward-Modeling

Recipes to train reward models for RLHF.

Language: Python | License: Apache-2.0 | 274 6 9

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to larger models with one parameter change (feature currently in alpha).

Language: Python | License: Apache-2.0 | 253 9 5
