Yinpei Su's starred repositories

babilong

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:115Issues:0Issues:0

awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

License:MITStargazers:6268Issues:0Issues:0

natural-questions

Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.

Language:PythonLicense:Apache-2.0Stargazers:903Issues:0Issues:0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:7667Issues:0Issues:0

summary-of-a-haystack

Codebase accompanying the Summary of a Haystack paper.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:61Issues:0Issues:0

loft

LOFT: A 1 Million+ Token Long-Context Benchmark

License:Apache-2.0Stargazers:104Issues:0Issues:0

LongICLBench

Code and Data for "Long-context LLMs Struggle with Long In-context Learning"

Language:PythonLicense:MITStargazers:79Issues:0Issues:0

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:901Issues:0Issues:0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:PythonStargazers:582Issues:0Issues:0

RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Language:PythonLicense:Apache-2.0Stargazers:354Issues:0Issues:0

LEval

[ACL'24] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Language:PythonLicense:GPL-3.0Stargazers:312Issues:0Issues:0

Loong

[arxiv:2406.17419]Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA

Language:PythonLicense:Apache-2.0Stargazers:33Issues:0Issues:0

regular-investing-in-box

定投改变命运 —— 让时间陪你慢慢变富 https://onregularinvesting.com

Language:PythonStargazers:5569Issues:0Issues:0

CLongEval

CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models

Language:Jupyter NotebookLicense:MITStargazers:36Issues:0Issues:0
Language:PythonLicense:MITStargazers:1369Issues:0Issues:0

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Language:PythonLicense:Apache-2.0Stargazers:3755Issues:0Issues:0

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonLicense:MITStargazers:544Issues:0Issues:0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1320Issues:0Issues:0

LongAlign

LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation

Language:PythonLicense:Apache-2.0Stargazers:154Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10692Issues:0Issues:0

leptonai

A Pythonic framework to simplify AI service building

Language:PythonLicense:Apache-2.0Stargazers:2575Issues:0Issues:0

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

Stargazers:9040Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:34255Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:16174Issues:0Issues:0

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language:PythonLicense:Apache-2.0Stargazers:721Issues:0Issues:0

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonLicense:Apache-2.0Stargazers:2019Issues:0Issues:0

AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Language:PythonStargazers:1282Issues:0Issues:0

awesome-instruction-datasets

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

License:Apache-2.0Stargazers:442Issues:0Issues:0

InsTag

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

Stargazers:147Issues:0Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:13367Issues:0Issues:0