2+c's starred repositories
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
awesome-ai-agents
A list of AI autonomous agents
Auto-GPT-Plugins
Plugins for Auto-GPT
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
awesome-RLHF
A curated list of reinforcement learning from human feedback resources (continually updated)
Awesome-GPTs
Curated list of awesome GPTs 👍.
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
ChinaTextbook
All PDF textbooks for primary school, middle school, high school, and university.
DeepSeek-LLM
DeepSeek LLM: Let there be answers
deep_learning_curriculum
Language model alignment-focused deep learning curriculum
PPO-for-Beginners
A simple and well-styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
datablations
Scaling Data-Constrained Language Models
BMPrinciples
A collection of phenomena observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future
Reinforcement-Learning-Papers
Related papers for reinforcement learning, including classic papers and latest papers in top conferences
Everything-about-LLMs
A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gradually adding more topics.
DeepLearningSystem
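AI Infra mainly refers to AI infrastructure, including full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.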
broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
pile_dedupe
Pile Deduplication Code