Yinpei Su's starred repositories
awesome-generative-ai-guide
A one-stop repository for generative AI research updates, interview resources, notebooks, and much more!
natural-questions
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning"
persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
regular-investing-in-box
Regular investing changes your destiny: let time help you grow wealthy slowly. https://onregularinvesting.com
LLMTest_NeedleInAHaystack
Doing simple retrieval from LLMs at various context lengths to measure accuracy
Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
LLMsPracticalGuide
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model resources
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
awesome-instruction-datasets
A collection of prompt and instruction datasets (awesome-prompt-datasets, awesome-instruction-dataset) for training chat LLMs such as ChatGPT.
Awesome-Chinese-LLM
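A collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, covering base models, domain-specific fine-tuning and applications, datasets, and tutorials.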