YifanXu (bcxyf123)

YifanXu's starred repositories

minChatGPT

A minimal example of aligning language models with RLHF, similar to ChatGPT

Language: Python | License: GPL-3.0 | Stargazers: 211 | Issues: 0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language: Python | License: Apache-2.0 | Stargazers: 2088 | Issues: 0

Awesome-Learning-Resource

A curated list of all kinds of learning resources, blogs, books, videos and so on.

License: GPL-3.0 | Stargazers: 291 | Issues: 0

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

Language: MDX | License: MIT | Stargazers: 48025 | Issues: 0

reinforcement-learning-implementation

Reinforcement Learning examples implementation and explanation

Language: Jupyter Notebook | License: MIT | Stargazers: 318 | Issues: 0

gym-navigation

A simulation of the robot navigation problem in Gymnasium.

Language: Python | License: GPL-3.0 | Stargazers: 12 | Issues: 0

gym-simplegrid

Simple Gridworld Gymnasium Environment

Language: Python | License: Apache-2.0 | Stargazers: 41 | Issues: 0

noetic_robots

A collection of tutorials to rescue robot simulations that are oldies but goodies to make them work in ROS Noetic

License: MIT | Stargazers: 8 | Issues: 0

awesome-exploration-rl

A curated list of awesome exploration RL resources (continually updated)

License: Apache-2.0 | Stargazers: 373 | Issues: 0

Awesome-LLM-RL

A comprehensive list of papers, codebases, and datasets on decision making using foundation models, including LLMs and VLMs.

Stargazers: 312 | Issues: 0

gpt_academic

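Provides a practical interactive interface for large language models such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports project analysis and self-explanation for Python, C++, and other codebases; translates and summarizes PDF/LaTeX papers; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.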

Language: Python | License: GPL-3.0 | Stargazers: 64366 | Issues: 0

unreal-map

Multiagent research environment toolbox based on Unreal Engine

Language: Python | Stargazers: 191 | Issues: 0

LUSR

Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation (AAAI 2021)

Language: Python | Stargazers: 24 | Issues: 0

hmp2g

Multiagent Reinforcement Learning Research Project

Language: Python | License: MIT | Stargazers: 113 | Issues: 0