zhangwei hong's repositories

Language:PythonStargazers:0Issues:1Issues:0

ARS

An implementation of the Augmented Random Search algorithm

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

BCO

behavior cloning from observation

License:MITStargazers:0Issues:0Issues:0

BGRL

Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

CityLearn

Official reinforcement learning environment for demand response and load shaping

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

curiosity_baselines

An open source reinforcement learning codebase with a variety of intrinsic curiosity methods implemented in PyTorch on top of rlpyt.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

deep_control

Deep Reinforcement Learning for Continuous Control in PyTorch

Stargazers:0Issues:0Issues:0

Dr-Jekyll-and-Mr-Hyde-The-Strange-Case-of-Off-Policy-Policy-Updates

Code for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates: https://arxiv.org/abs/2109.14727

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dreamerv2

Mastering Atari with Discrete World Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

easy_experiments

A easy-to-modify tool for launching many experiments.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

fuzz4all

🌌️Fuzz4All: Universal Fuzzing with Large Language Models

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

gdown

Download a large file from Google Drive (curl/wget fails because of the security notice).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gym-maze

A customizable gym environment for maze/gridworld

Language:PythonStargazers:0Issues:2Issues:0

jaynes

A package for running ML training on SLURM, AWS, GCE, and physical boxes with or without docker

Language:PythonStargazers:0Issues:0Issues:0

jaynes-starter-kit

a starter-kit for jaynes, the cloud-agnostic launch library

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

multiworld

Multitask Environments for RL

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

py-ttc

Python implementation of the vision-based direct methods of time-to-contact (TTC) estimation

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pythagora

Generate automated tests for your Node.js app via LLMs without developers having to write a single line of code.

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PythonLinearNonlinearControl

PythonLinearNonLinearControl is a library implementing the linear and nonlinear control theories in python.

Language:PythonStargazers:0Issues:0Issues:0

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

robovat

RoboVat: A unified toolkit for simulated and real-world robotic task environments.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vizdoomgym

OpenAI Gym wrapper for ViZDoom enviroments

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

WNPG

implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies

Language:PythonStargazers:0Issues:0Issues:0