Mircea Mironenco (mirceamironenco)

mirceamironenco

Geek Repo

Location:Amsterdam, Netherlands

Github PK Tool:Github PK Tool

Mircea Mironenco's starred repositories

h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Language:PythonLicense:Apache-2.0Stargazers:10820Issues:156Issues:1061

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6697Issues:59Issues:137

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonLicense:GPL-3.0Stargazers:5559Issues:78Issues:141

llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

Language:PythonLicense:Apache-2.0Stargazers:4070Issues:40Issues:114

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:2570Issues:36Issues:131

leptonai

A Pythonic framework to simplify AI service building

Language:PythonLicense:Apache-2.0Stargazers:2488Issues:21Issues:52

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2110Issues:26Issues:54

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language:PythonLicense:Apache-2.0Stargazers:1688Issues:29Issues:208

training-operator

Distributed ML Training and Fine-Tuning on Kubernetes

Language:GoLicense:Apache-2.0Stargazers:1490Issues:86Issues:924

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language:PythonLicense:BSD-3-ClauseStargazers:1333Issues:21Issues:38

maxtext

A simple, performant and scalable Jax LLM!

Language:PythonLicense:Apache-2.0Stargazers:1310Issues:23Issues:61

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:1176Issues:25Issues:11

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:849Issues:10Issues:26

benchmark

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Language:PythonLicense:BSD-3-ClauseStargazers:796Issues:226Issues:843

OpenOOD

Benchmarking Generalized Out-of-Distribution Detection

Language:PythonLicense:MITStargazers:778Issues:8Issues:98

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:733Issues:6Issues:7
Language:PythonLicense:Apache-2.0Stargazers:636Issues:16Issues:57

quaterion

Blazing fast framework for fine-tuning similarity learning models

Language:PythonLicense:Apache-2.0Stargazers:627Issues:10Issues:79

tensordict

TensorDict is a pytorch dedicated tensor container.

Language:PythonLicense:MITStargazers:606Issues:28Issues:87

llm-finetuning

Guide for fine-tuning Llama/Mistral/CodeLlama models and more

Language:PythonLicense:MITStargazers:444Issues:6Issues:19

pytorch_influence_functions

This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence Functions by Pang Wei Koh and Percy Liang.

Language:PythonLicense:NOASSERTIONStargazers:302Issues:7Issues:31

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonLicense:Apache-2.0Stargazers:274Issues:6Issues:9

hlb-gpt

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to larger models with one parameter change (feature currently in alpha).

Language:PythonLicense:Apache-2.0Stargazers:253Issues:9Issues:5

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonLicense:Apache-2.0Stargazers:229Issues:4Issues:41

awesome-deep-phenomena

A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...

seqax

seqax = sequence modeling + JAX

Language:PythonLicense:BSD-3-ClauseStargazers:108Issues:5Issues:1

ml-calibration

relplot: Utilities for measuring calibration and plotting reliability diagrams

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:77Issues:10Issues:0

linear_open_lm

A repository for research on medium sized language models.

Language:PythonLicense:MITStargazers:64Issues:0Issues:0

influence_analysis_papers

Influence Analysis and Estimation - Survey, Papers, and Taxonomy

License:MITStargazers:53Issues:9Issues:0

Awesome-ScalingLaws

A curated list of awesome resources dedicated to Scaling Laws for LLMs