Vance0124's repositories
Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization(TDPO)
RL-Solutions
强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition
DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
ucas-beamer
:scroll: UCAS Beamer (LaTeX)
Language:JavaScriptMIT000
ChatPaper
Use ChatGPT to summarize the arXiv papers.
Language:PythonNOASSERTION000
Language:Python000
google-research
Google Research
Language:Jupyter NotebookApache-2.0000
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language:PythonMIT000