Beast code in Giters

Altrouge7's repositories

自动觉醒御魂痴等等的脚本

Language:Jupyter Notebook1 10

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.0000

DSAC-v2; DASC; Distributional Soft Actor-Critic

Language:Python000

000

A soft actor critic agent for autonomous trading in the intraday power market.

Language:Python000

000

Deep reinforcement learning of mahjong self-play

Language:Python000

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT000

RL project for battery trading on the day-ahead market

Language:Python000

state-conservative double actor-critcs algorithm

000

Train transformer language models with reinforcement learning.

Apache-2.0000

Language:Jupyter Notebook000