Minqi (minqi)

minqi

Geek Repo

Company:Lucida Labs

Location:Oxford, UK

Home Page:minch.co

Twitter:@minqijiang

Github PK Tool:Github PK Tool


Organizations
FLAIROx
lucidalabs
ucl-dark
uclnlp

Minqi's repositories

learning-to-communicate-pytorch

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:343Issues:16Issues:1

hnatt

Train and visualize Hierarchical Attention Networks

Language:PythonLicense:MITStargazers:202Issues:11Issues:8

wordcraft

An environment for benchmarking commonsense agents

Language:PythonStargazers:28Issues:3Issues:0

alphazero

Generic implementation of AlphaZero

Language:PythonLicense:MITStargazers:7Issues:3Issues:0

PyMDP

Markov decision processes in Python

Language:PythonLicense:MITStargazers:5Issues:2Issues:0

procgen

Procgen Benchmark: Procedurally Generated Game-Like Gym Environments

Language:C++License:MITStargazers:1Issues:3Issues:0

auto-drac

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Stargazers:0Issues:1Issues:0

babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

basicnn

Common neural networks in numpy

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Stargazers:0Issues:3Issues:0

cma_mae

A python implementation of Covariance Matrix Adaptation MAP-Annealing

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

EGG

EGG: Emergence of lanGuage in Games

Language:Jupyter NotebookLicense:MITStargazers:0Issues:3Issues:0

facenet

Face recognition using Tensorflow

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:0
Language:JavaScriptStargazers:0Issues:2Issues:0

neurogram

neurogram

Language:JavaScriptStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

random-network-distillation

Code for the paper "Exploration by Random Network Distillation"

Language:PythonStargazers:0Issues:2Issues:0

scikit-learn

scikit-learn: machine learning in Python

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

scipy

SciPy library main repository

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

seq2seq

Example attention-seq2seq implementations.

Language:PythonStargazers:0Issues:3Issues:0

tfjs

A WebGL accelerated JavaScript library for training and deploying ML models.

Language:TypeScriptLicense:Apache-2.0Stargazers:0Issues:1Issues:0

tfjs-converter

Convert TensorFlow SavedModel and Keras models to TensorFlow.js

Language:JavaScriptStargazers:0Issues:2Issues:0

ued

Open-Ended Autocurricula

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

v139

Proceedings of ICML 2021

Language:TeXStargazers:0Issues:2Issues:0

vae

VAE implementations

Language:PythonStargazers:0Issues:4Issues:0

vqvae

A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)

Language:Jupyter NotebookStargazers:0Issues:1Issues:0