Beast code in Giters

hmzo's starred repositories

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonGPL-3.060185 470 1344

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonGPL-3.052249 384 3300

GPTs

leaked prompts of GPTs

28412 300 26

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.016354 113 851

mamba

Mamba SSM architecture

Language:PythonApache-2.012721 101 512

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.011700 206 2248

chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

Language:HTMLMIT8060 90 9

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language:Jupyter NotebookApache-2.06960 74 205

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonMIT6126 50 1014

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.04583 50 302

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonApache-2.04534 108 134

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonMIT2383 24 169

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonApache-2.02107 21 251

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonApache-2.01971 44 125

hyperlearn

2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.

Language:Jupyter NotebookApache-2.01789 89 23

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonApache-2.01716 24 39

simple-evals

Language:PythonMIT1590 26 9

megablocks

Language:PythonApache-2.01176 18 54

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonApache-2.01153 39 76

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:Python806 15 7

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonApache-2.0719 19 29

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language:PythonApache-2.0663 12 30

LongBench

[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonMIT632 6 69

academy

Ray tutorials from Anyscale

Language:Jupyter NotebookApache-2.0580 17 27

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonMIT544 24 72

gpt_paper_assistant

GPT4 based personalized ArXiv paper assistant bot

Language:PythonApache-2.0476 6 11

FuseAI

FuseAI Project

Language:Python44000

Online-RLHF

A recipe for online RLHF and online iterative DPO.

Language:Python383 18 21

NeuralFlow

Visualize the intermediate output of Mistral 7B

Language:PythonGPL-3.0306 8 4

InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Language:PythonApache-2.0286 9 83