Lukas Hermann's repositories
chem_informatics
chem_informatics
pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor-Critic (A2C), Proximal Policy Optimization (PPO), and Actor-Critic using Kronecker-Factored Trust Region (ACKTR), a scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation.
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
kor
LLM(😽)
lukashermann
Config files for my GitHub profile.
mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
pytorch-rl
Deep reinforcement learning with PyTorch and Visdom
spinningup
An educational resource to help anyone learn deep reinforcement learning.
starter-hugo-academic
🎓 Hugo Academic Theme: create an academic website. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
trl
Train transformer language models with reinforcement learning.