minqi

User data from Github https://github.com/minqi

followers

following

stars

Lucida Labs

Oxford, UK

Organizations

FLAIROx

lucidalabs

ucl-dark

uclnlp

Minqi's repositories

learning-to-communicate-pytorch

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Language:PythonApache-2.0357 15 2

hnatt

Train and visualize Hierarchical Attention Networks

Language:PythonMIT203 9 8

wordcraft

An environment for benchmarking commonsense agents

Language:Python29 20

alphazero

Generic implementation of AlphaZero

Language:PythonMIT7 20

PyMDP

Markov decision processes in Python

Language:PythonMIT5 10

procgen

Procgen Benchmark: Procedurally Generated Game-Like Gym Environments

Language:C++MIT1 20

auto-drac

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Language:PythonMIT010

awesome-open-ended

010

babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

Language:PythonBSD-3-Clause010

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT010

basicnn

Common neural networks in numpy

Language:PythonMIT010

carracingf1

020

cma_mae

A python implementation of Covariance Matrix Adaptation MAP-Annealing

Language:PythonMIT010

EGG

EGG: Emergence of lanGuage in Games

Language:Jupyter NotebookMIT020

facenet

Face recognition using Tensorflow

Language:PythonMIT010

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Language:PythonApache-2.0020

minimax-updates

Efficient baselines for autocurricula in JAX.

Language:PythonApache-2.0000

minqi.github.io

Language:JavaScript020

papers

010

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonMIT020

random-network-distillation

Code for the paper "Exploration by Random Network Distillation"

Language:Python010

scikit-learn

scikit-learn: machine learning in Python

Language:PythonBSD-3-Clause010

scipy

SciPy library main repository

Language:PythonBSD-3-Clause010

seq2seq

Example attention-seq2seq implementations.

Language:Python020

tfjs

A WebGL accelerated JavaScript library for training and deploying ML models.

Language:TypeScriptApache-2.0010

tfjs-converter

Convert TensorFlow SavedModel and Keras models to TensorFlow.js

Language:JavaScript020

ued

Open-Ended Autocurricula

Language:PythonMIT030

v139

Proceedings of ICML 2021

Language:TeX010

vae

VAE implementations

Language:Python030

vqvae

A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)

Language:Jupyter Notebook010