Beast code in Giters

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION000

curiosity_baselines

An open source reinforcement learning codebase with a variety of intrinsic curiosity methods implemented in PyTorch on top of rlpyt.

Language:PythonMIT000

deep_control

Deep Reinforcement Learning for Continuous Control in PyTorch

000

Dr-Jekyll-and-Mr-Hyde-The-Strange-Case-of-Off-Policy-Policy-Updates

Code for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates: https://arxiv.org/abs/2109.14727

Language:PythonMIT000

dreamerv2

Mastering Atari with Discrete World Models

Language:PythonMIT000

easy_experiments

A easy-to-modify tool for launching many experiments.

Language:PythonMIT010

explore_establish_exploit_llms

Language:Python000

fuzz4all

🌌️Fuzz4All: Universal Fuzzing with Large Language Models

CC-BY-4.0000

gdown

Download a large file from Google Drive (curl/wget fails because of the security notice).

Language:PythonMIT000

gym-maze

A customizable gym environment for maze/gridworld

Language:Python020

jaynes

A package for running ML training on SLURM, AWS, GCE, and physical boxes with or without docker

Language:Python000

jaynes-starter-kit

a starter-kit for jaynes, the cloud-agnostic launch library

Language:Python000

mrl

Language:PythonMIT000

multiworld

Multitask Environments for RL

Language:PythonNOASSERTION000

phi_gcn

Language:Python000

py-ttc

Python implementation of the vision-based direct methods of time-to-contact (TTC) estimation

Language:PythonMIT010

pythagora

Generate automated tests for your Node.js app via LLMs without developers having to write a single line of code.

Language:JavaScriptApache-2.0000

PythonLinearNonlinearControl

PythonLinearNonLinearControl is a library implementing the linear and nonlinear control theories in python.

Language:Python000

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonApache-2.0000

robovat

RoboVat: A unified toolkit for simulated and real-world robotic task environments.

Language:PythonMIT000

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.0000

vizdoomgym

OpenAI Gym wrapper for ViZDoom enviroments

Language:PythonMIT000

WNPG

implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies

Language:Python000