hmzo's starred repositories

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonLicense:GPL-3.0Stargazers:60185Issues:470Issues:1344

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:52249Issues:384Issues:3300

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:16354Issues:113Issues:851

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:12721Issues:101Issues:512

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11700Issues:206Issues:2248

chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

Language:HTMLLicense:MITStargazers:8060Issues:90Issues:9

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6960Issues:74Issues:205

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:6126Issues:50Issues:1014

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4583Issues:50Issues:302

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4534Issues:108Issues:134

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:2383Issues:24Issues:169

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:2107Issues:21Issues:251

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1971Issues:44Issues:125

hyperlearn

2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1789Issues:89Issues:23

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonLicense:Apache-2.0Stargazers:1716Issues:24Issues:39
Language:PythonLicense:Apache-2.0Stargazers:1176Issues:18Issues:54

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:1153Issues:39Issues:76

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonLicense:Apache-2.0Stargazers:719Issues:19Issues:29

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language:PythonLicense:Apache-2.0Stargazers:663Issues:12Issues:30

LongBench

[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonLicense:MITStargazers:632Issues:6Issues:69

academy

Ray tutorials from Anyscale

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:580Issues:17Issues:27

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonLicense:MITStargazers:544Issues:24Issues:72

gpt_paper_assistant

GPT4 based personalized ArXiv paper assistant bot

Language:PythonLicense:Apache-2.0Stargazers:476Issues:6Issues:11

FuseAI

FuseAI Project

Language:PythonStargazers:440Issues:0Issues:0

Online-RLHF

A recipe for online RLHF and online iterative DPO.

NeuralFlow

Visualize the intermediate output of Mistral 7B

Language:PythonLicense:GPL-3.0Stargazers:306Issues:8Issues:4

InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Language:PythonLicense:Apache-2.0Stargazers:286Issues:9Issues:83