Altrouge7's repositories

autoyunhun

Scripts that automate Awakening, Yuhun (soul) runs, Chi, and similar tasks

Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DSAC-T

DSAC-v2; DSAC; Distributional Soft Actor-Critic

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

intraday_trading_RL

A soft actor critic agent for autonomous trading in the intraday power market.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

mahjong-selfplay-RL

Deep reinforcement learning of mahjong self-play

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ReinforcementLearning

RL project for battery trading on the day-ahead market

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

SCDAC

State-conservative double actor-critic algorithm

Stargazers: 0 | Issues: 0 | Issues: 0

trl

Train transformer language models with reinforcement learning.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0