Xiaonan Li (LeeSureman)

LeeSureman

Geek Repo

Company:Fudan Univeristy

Location:Shanghai

Home Page:https://scholar.google.com/citations?user=ldEcEjEAAAAJ&hl=en

Github PK Tool:Github PK Tool

Xiaonan Li's starred repositories

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29008Issues:341Issues:267

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonLicense:MITStargazers:5721Issues:65Issues:142

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:5517Issues:33Issues:778

InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

Language:PythonLicense:Apache-2.0Stargazers:5430Issues:49Issues:291

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Language:PythonLicense:Apache-2.0Stargazers:4525Issues:49Issues:261

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4406Issues:77Issues:87

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language:PythonLicense:Apache-2.0Stargazers:3997Issues:40Issues:385

chatgpt-prompts-for-academic-writing

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonLicense:Apache-2.0Stargazers:1432Issues:26Issues:24

awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

factool

FacTool: Factuality Detection in Generative AI

Language:PythonLicense:Apache-2.0Stargazers:773Issues:10Issues:28

AgentSims

AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.

Language:PythonLicense:MITStargazers:704Issues:4Issues:24

awesome-language-agents

List of language agents based on paper "Cognitive Architectures for Language Agents"

webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Language:PythonLicense:Apache-2.0Stargazers:599Issues:19Issues:102

Sphere

Web-scale retrieval for knowledge-intensive NLP

Language:PythonLicense:NOASSERTIONStargazers:548Issues:14Issues:5

unifiedqa

UnifiedQA: Crossing Format Boundaries With a Single QA System

Language:PythonLicense:Apache-2.0Stargazers:426Issues:14Issues:40

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language:PythonLicense:Apache-2.0Stargazers:366Issues:16Issues:8

collie

Collaborative Training of Large Language Models in an Efficient Way

Language:PythonLicense:Apache-2.0Stargazers:352Issues:9Issues:62

LEval

[ACL'24] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Language:PythonLicense:GPL-3.0Stargazers:294Issues:4Issues:10

LawCrimeMining

Law Crime Mining Based on Corpus build and content analysis by NLP methods. 基于领域语料库构建与NLP方法的裁判文书与犯罪案例文本挖掘项目

Gentopia

Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.

Language:PythonLicense:MITStargazers:280Issues:2Issues:5

Everything-about-LLMs

A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gradually adding more topics.

Language:Jupyter NotebookStargazers:173Issues:7Issues:0

code-indexer-loop

Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuously and efficiently updated.

Language:PythonLicense:Apache-2.0Stargazers:165Issues:4Issues:0

kgi-slot-filling

This is the code for our KILT leaderboard submissions (KGI + Re2G models).

Language:PythonLicense:Apache-2.0Stargazers:137Issues:7Issues:11

indexify

A retrieval and long term memory service for LLMs

Language:RustLicense:Apache-2.0Stargazers:122Issues:13Issues:40

EnvInteractiveLMPapers

Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW(🏄).

License:MITStargazers:119Issues:10Issues:0

ExpertQA

[Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers

Language:PythonLicense:MITStargazers:110Issues:5Issues:6

hagrid

A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution