wanghuimu

wanghuimu

Geek Repo

Github PK Tool:Github PK Tool

wanghuimu's repositories

DRL4Recsys

Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems

Stargazers:1Issues:0Issues:0

Sparse-Reward-Algorithms

Implement many Sparse Reward algorithms in Gym Fetch environment

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

apple-store-helper

Apple Store iPhone预约助手

License:GPL-3.0Stargazers:0Issues:0Issues:0

Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising

Awesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR and CVR prediction), Post Ranking, Multi-task Learning, Graph Neural Networks, Transfer Learning, Reinforcement Learning, Self-supervised Learning and so on.

Language:PythonStargazers:0Issues:1Issues:0

Batch-Offline--RL-Paper-Lists

Paper Collection for Batch RL with brief introductions.

Stargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

damarl

Codes for Paper "Delay-Aware Multi-Agent Reinforcement Learning".

Language:PythonStargazers:0Issues:1Issues:0

Deep-RL-Notes

A collection of comprehensive notes on Deep Reinforcement Learning, based on UC Berkeley's CS 285 (prev. CS 294-112)

Stargazers:0Issues:0Issues:0

DeepClustering

Methods and Implements of Deep Clustering

Stargazers:0Issues:1Issues:0

deeprl_network

multi-agent deep reinforcement learning for networked system control.

Stargazers:0Issues:0Issues:0

DOP

Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (https://arxiv.org/abs/2007.12322)

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

football-paris

The exact codes used by the team "liveinparis" at the kaggle football competition ranked 8th/1141

License:MITStargazers:0Issues:0Issues:0

GroupIM

Code for GroupIM: A Mutual Information Maximization Framework for Neural Group Recommendation (SIGIR 2020)

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

gumbel_lstm

Experiments with binary LSTM using gumbel-sigmoid

License:MITStargazers:0Issues:0Issues:0

HuimuWang

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

License:MITStargazers:0Issues:0Issues:0

jd_seckill

京东茅台抢购,不支持其他商品!愿大家与黄牛站在同一个起跑线,公平的参与这场抢茅大赛。

License:GPL-3.0Stargazers:0Issues:0Issues:0

LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)

Language:JavaStargazers:0Issues:1Issues:0

LIRD

Deep Reinforcement Learning for Movies Recommendation System

Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

License:MITStargazers:0Issues:0Issues:0

Multi-Agent-Coordination-Google-Football

Coordination between Deep RL Agents for Virtual Football

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

multiagent_gnn_policies

Learning multi-agent policies for flocking using graph neural networks

License:NOASSERTIONStargazers:0Issues:0Issues:0

on-policy

This is the official implementation of Multi-Agent PPO.

License:MITStargazers:0Issues:0Issues:0

pymarl2

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ReAgent

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

RSPapers

A Curated List of Must-read Papers on Recommender System.

License:MITStargazers:0Issues:1Issues:0

StarCraft

Implementations of QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Stargazers:0Issues:0Issues:0

VBC

pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"

License:Apache-2.0Stargazers:0Issues:0Issues:0