Altrouge7's repositories
autoyunhun
自动觉醒 御魂 痴等等的脚本
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:PythonApache-2.0000
DSAC-T
DSAC-v2; DASC; Distributional Soft Actor-Critic
Language:Python000
intraday_trading_RL
A soft actor critic agent for autonomous trading in the intraday power market.
Language:Python000
mahjong-selfplay-RL
Deep reinforcement learning of mahjong self-play
Language:Python000
pykan
Kolmogorov Arnold Networks
Language:Jupyter NotebookMIT000
ReinforcementLearning
RL project for battery trading on the day-ahead market
Language:Python000
SCDAC
state-conservative double actor-critcs algorithm
000
trl
Train transformer language models with reinforcement learning.
Apache-2.0000
Language:Jupyter Notebook000