L's starred repositories

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1398Issues:0Issues:0

InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Language:PythonLicense:MITStargazers:229Issues:0Issues:0

regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

Language:Jupyter NotebookLicense:MITStargazers:66Issues:0Issues:0

RAGChecker

RAGChecker: A Fine-grained Framework For Diagnosing RAG

Language:PythonLicense:Apache-2.0Stargazers:192Issues:0Issues:0

LASP

Linear Attention Sequence Parallelism (LASP)

Language:PythonLicense:MITStargazers:61Issues:0Issues:0

Husky-v1

Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.

Language:PythonStargazers:299Issues:0Issues:0

VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Language:PythonLicense:Apache-2.0Stargazers:668Issues:0Issues:0

ms-swift

Use PEFT or Full-parameter to finetune 300+ LLMs or 60+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:3009Issues:0Issues:0

Megatron-Kwai

[USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism

Language:PythonLicense:NOASSERTIONStargazers:38Issues:0Issues:0

ring-flash-attention

Ring attention implementation with flash attention

Language:PythonStargazers:496Issues:0Issues:0

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License:MITStargazers:1539Issues:0Issues:0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:4585Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18846Issues:0Issues:0

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3119Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:276Issues:0Issues:0

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language:PythonLicense:MITStargazers:292Issues:0Issues:0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4491Issues:0Issues:0

pipecat

Open Source framework for voice and multimodal conversational AI

Language:PythonLicense:BSD-2-ClauseStargazers:2849Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:29500Issues:0Issues:0

LooGLE

ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

Language:PythonLicense:MITStargazers:141Issues:0Issues:0

long-context-attention

Sequence Parallel Attention for Long Context LLM Model Training and Inference

Language:PythonStargazers:264Issues:0Issues:0

awesome-multi-modal-reinforcement-learning

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)

License:Apache-2.0Stargazers:366Issues:0Issues:0
Language:PythonLicense:MITStargazers:39Issues:0Issues:0

ChatEval

Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"

Language:PythonLicense:Apache-2.0Stargazers:218Issues:0Issues:0

llm-autoeval

Automatically evaluate your LLMs in Google Colab

Language:PythonLicense:MITStargazers:491Issues:0Issues:0

suri

Code for Suri: Multi-constraint instruction following for long-form text generation

Language:PythonStargazers:15Issues:0Issues:0

Loong

[arxiv:2406.17419]Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA

Language:PythonLicense:Apache-2.0Stargazers:53Issues:0Issues:0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:PythonStargazers:700Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:170Issues:0Issues:0

MACM

MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems

Language:PythonStargazers:54Issues:0Issues:0