Beast code in Giters

L's starred repositories

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookNOASSERTION139800

InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Language:PythonMIT22900

regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

Language:Jupyter NotebookMIT6600

RAGChecker

RAGChecker: A Fine-grained Framework For Diagnosing RAG

Language:PythonApache-2.019200

LASP

Linear Attention Sequence Parallelism (LASP)

Language:PythonMIT6100

Husky-v1

Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.

Language:Python29900

VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Language:PythonApache-2.066800

ms-swift

Use PEFT or Full-parameter to finetune 300+ LLMs or 60+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonApache-2.0300900

Megatron-Kwai

[USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism

Language:PythonNOASSERTION3800

ring-flash-attention

Ring attention implementation with flash attention

Language:Python49600

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

MIT153900

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonNOASSERTION458500

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.01884600

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonApache-2.0311900

claude-prompt-generator

Language:PythonApache-2.027600

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language:PythonMIT29200

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonApache-2.0449100

pipecat

Open Source framework for voice and multimodal conversational AI

Language:PythonBSD-2-Clause284900

ChatTTS

A generative speech model for daily dialogue.

Language:PythonAGPL-3.02950000

LooGLE

ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

Language:PythonMIT14100

long-context-attention

Sequence Parallel Attention for Long Context LLM Model Training and Inference

Language:Python26400

awesome-multi-modal-reinforcement-learning

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)

Apache-2.036600

ComplexBench

Language:PythonMIT3900

ChatEval

Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"

Language:PythonApache-2.021800

llm-autoeval

Automatically evaluate your LLMs in Google Colab

Language:PythonMIT49100

suri

Code for Suri: Multi-constraint instruction following for long-form text generation

Language:Python1500

Loong

[arxiv:2406.17419]Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA

Language:PythonApache-2.05300

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:Python70000

AutoIF

Language:PythonApache-2.017000

MACM

MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems

Language:Python5400