Beast code in Giters

Vance0124's repositories

Reference implementation for Token-level Direct Preference Optimization(TDPO)

Language:PythonApache-2.074 1 3

强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition

Language:Jupyter NotebookGPL-3.0200

This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym

Language:PythonApache-2.0100

:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI

Language:PythonNOASSERTION100

:scroll: UCAS Beamer (LaTeX)

Language:TeXMIT100

Language:JavaScriptMIT000

Use ChatGPT to summarize the arXiv papers.

Language:PythonNOASSERTION000

Language:Python000

Google Research

Language:Jupyter NotebookApache-2.0000

Python Implementation of Reinforcement Learning: An Introduction

Language:PythonMIT000