wanghuimu's repositories
DRL4Recsys
Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
apple-store-helper
Apple Store iPhone预约助手
Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR and CVR prediction), Post Ranking, Multi-task Learning, Graph Neural Networks, Transfer Learning, Reinforcement Learning, Self-supervised Learning and so on.
Batch-Offline--RL-Paper-Lists
Paper Collection for Batch RL with brief introductions.
Deep-RL-Notes
A collection of comprehensive notes on Deep Reinforcement Learning, based on UC Berkeley's CS 285 (prev. CS 294-112)
DeepClustering
Methods and Implements of Deep Clustering
deeprl_network
multi-agent deep reinforcement learning for networked system control.
DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (https://arxiv.org/abs/2007.12322)
football-paris
The exact codes used by the team "liveinparis" at the kaggle football competition ranked 8th/1141
GroupIM
Code for GroupIM: A Mutual Information Maximization Framework for Neural Group Recommendation (SIGIR 2020)
gumbel_lstm
Experiments with binary LSTM using gumbel-sigmoid
HuimuWang
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
jd_seckill
京东茅台抢购,不支持其他商品!愿大家与黄牛站在同一个起跑线,公平的参与这场抢茅大赛。
LeetCodeAnimation
Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)
LIRD
Deep Reinforcement Learning for Movies Recommendation System
minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Multi-Agent-Coordination-Google-Football
Coordination between Deep RL Agents for Virtual Football
multiagent_gnn_policies
Learning multi-agent policies for flocking using graph neural networks
on-policy
This is the official implementation of Multi-Agent PPO.
ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
VBC
pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"