bututoubaobei

qiumingming7@gmail.com's repositories

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language: Python | License: MIT | 000

bayesianLSTM

Bayesian LSTM (Tensorflow)

Language: Python | 000

BCQ

Author's PyTorch implementation of BCQ for continuous and discrete actions

Language: Python | License: MIT | 000

BookCode_Edition1

Language: Jupyter Notebook | License: GPL-2.0 | 000

catr

Image Captioning Using Transformer

Language: Python | License: Apache-2.0 | 000

CS285_Fa19_Deep_Reinforcement_Learning

My solutions to UC Berkeley CS285 (originally CS294-112, deeprlcourse) Fall 2019 assignments

Language: Python | 000

DAFI

DAFI: Ensemble based data assimilation and field inversion, repository for internal development

Language: Jupyter Notebook | License: Apache-2.0 | 000

Deep-RL-Policy-Search-for-MPC

This repo is related to Deep Policy search using MPC.

Language: Python | 000

flask

Language: Python | 000

lab2d

A customisable 2D platform for agent-based AI research

Language: C++ | License: Apache-2.0 | 000

LandmarkRecog

Google Landmark Retrieval Challenge

Language: Python | License: MIT | 000

Low-light-Image-Enhancement-using-GAN

In this project, images taken in low-light conditions, at night, or without much ambient light are converted into enhanced images that look as if they were taken in good lighting. A Generative Adversarial Network (GAN) is used to generate the enhanced image from scratch.

Language: Jupyter Notebook | License: MIT | 000

MAAC

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

Language: Python | License: MIT | 000

Machine-Learning

Explanations of common machine learning algorithms

Language: Jupyter Notebook | 000

MARL-code-pytorch

Concise PyTorch implementations of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.

Language: Python | License: MIT | 000

Mathematics

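Accumulated notes on mathematics: matrices, numerical optimization, neural network backpropagation, graph optimization, probability theory, stochastic processes, Kalman filtering, particle filtering, and mathematical function fitting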

Language: MATLAB | 000

missing-semester-cn.github.io

Chinese version of the CS Missing Semester course

Language: CSS | License: NOASSERTION | 000

multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language: Python | License: MIT | 000

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language: Python | License: MIT | 000

particle-filter-tutorial

License: MIT | 000

PILCO

Bayesian Reinforcement Learning in Tensorflow

Language: Python | License: MIT | 000

PyDA

PyDA: A hands-on introduction to dynamical data assimilation with Python

Language: Python | 000

pymarl

Python Multi-Agent Reinforcement Learning framework

Language: Python | License: Apache-2.0 | 000

pytorch-fm

Factorization Machine models in PyTorch

Language: Python | License: MIT | 000

Research

Novel deep learning research work with PaddlePaddle

License: Apache-2.0 | 000

safe_learning

Safe reinforcement learning with stability guarantees

Language: Python | License: MIT | 000

sample-efficient-bayesian-rl

Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL

Language: Jupyter Notebook | License: MIT | 000

swa_gaussian

Code repo for "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

Language: Jupyter Notebook | License: BSD-2-Clause | 000

thermoAI

Heating system control with Reinforcement Learning

Language: HTML | 000

VBCAR

Variational Bayesian Context-aware Representation for Grocery Recommendation

Language: Jupyter Notebook | 000