LeeSureman

Xiaonan Li's starred repositories

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.029008 341 267

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonMIT5721 65 142

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonMIT5517 33 778

InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

Language:PythonApache-2.05430 49 291

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Language:PythonApache-2.04525 49 261

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonApache-2.04406 77 87

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language:PythonApache-2.03997 40 385

chatgpt-prompts-for-academic-writing

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

2531 31 2

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:Python1930 29 122

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonApache-2.01432 26 24

awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案

MIT1062 17 6

factool

FacTool: Factuality Detection in Generative AI

Language:PythonApache-2.0773 10 28

AgentSims

AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.

Language:PythonMIT704 4 24

awesome-language-agents

List of language agents based on paper "Cognitive Architectures for Language Agents"

Language:TeX635 14 2

webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Language:PythonApache-2.0599 19 102

Sphere

Web-scale retrieval for knowledge-intensive NLP

Language:PythonNOASSERTION548 14 5

unifiedqa

UnifiedQA: Crossing Format Boundaries With a Single QA System

Language:PythonApache-2.0426 14 40

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language:PythonApache-2.0366 16 8

collie

Collaborative Training of Large Language Models in an Efficient Way

Language:PythonApache-2.0352 9 62

LEval

[ACL'24] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Language:PythonGPL-3.0294 4 10

LawCrimeMining

Law Crime Mining Based on Corpus build and content analysis by NLP methods. 基于领域语料库构建与NLP方法的裁判文书与犯罪案例文本挖掘项目

Language:Python286 17 2

Gentopia

Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.

Language:PythonMIT280 2 5

Everything-about-LLMs

A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gradually adding more topics.

Language:Jupyter Notebook173 70

code-indexer-loop

Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuously and efficiently updated.

Language:PythonApache-2.0165 40