daizhongxiang's starred repositories
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
neural-tangents
Fast and Easy Infinite Neural Networks in Python
data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
bayesoptbook.github.io
Companion webpage for the book "Bayesian Optimization" by Roman Garnett
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
alpaca-chinese-dataset
Alpaca Chinese Dataset -- 中文指令微调数据集【人工+GPT4o持续更新】
awesome-fm4co
Recent research papers about Foundation Models for Combinatorial Optimization
awesome-rlhf
An index of algorithms for reinforcement learning from human feedback (rlhf))
LLM-Agent-Benchmark-List
A banchmark list for evaluation of large language models.