qiumingming7@gmail.com's repositories
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
bayesianLSTM
Bayesian LSTM (Tensorflow)
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
catr
Image Captioning Using Transformer
CS285_Fa19_Deep_Reinforcement_Learning
My solutions to UC Berkeley CS285 (originally CS294-112, deeprlcourse) Fall 2019 assignments
DAFI
DAFI: Ensemble based data assimilation and field inversion, repository for internal development
Deep-RL-Policy-Search-for-MPC
This repo is related to Deep Policy search using MPC.
lab2d
A customisable 2D platform for agent-based AI research
LandmarkRecog
Google Landmark Retrieval Challenge
Low-light-Image-Enhancement-using-GAN
In this project, image taken in low lighting conditions, night time, or without much ambient light are converted into and enhanced image as if the image was taken with good lighting condition. Generative Adversarial Networks (GANs) is used to generate the enhanced image from scratch.
MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
Machine-Learning
讲解常见的机器学习算法
MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
Mathematics
数学知识点滴积累 矩阵 数值优化 神经网络反向传播 图优化 概率论 随机过程 卡尔曼滤波 粒子滤波 数学函数拟合
missing-semester-cn.github.io
the CS missing semester Chinese version
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
PILCO
Bayesian Reinforcement Learning in Tensorflow
PyDA
PyDA: A hands-on introduction to dynamical data assimilation with Python
pymarl
Python Multi-Agent Reinforcement Learning framework
pytorch-fm
Factorization Machine models in PyTorch
Research
novel deep learning research works with PaddlePaddle
safe_learning
Safe reinforcement learning with stability guarantees
sample-efficient-bayesian-rl
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
swa_gaussian
Code repo for "A Simple Baseline for Bayesian Uncertainty in Deep Learning"
thermoAI
Heating system control with Reinforcement Learning
VBCAR
Variational Bayesian Context-aware Representation for Grocery Recommendation