Minqi's repositories
learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
gym-minigrid
Minimalistic gridworld package for OpenAI Gym
minimax-updates
Efficient baselines for autocurricula in JAX.
Language: Python, License: Apache-2.0
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
scikit-learn
scikit-learn: machine learning in Python
tfjs-converter
Convert TensorFlow SavedModel and Keras models to TensorFlow.js