ArronDZhang's starred repositories

rad

RAD: Reinforcement Learning with Augmented Data

Language:Jupyter NotebookStargazers:400Issues:0Issues:0

simsiam

PyTorch implementation of SimSiam https//arxiv.org/abs/2011.10566

Language:PythonLicense:NOASSERTIONStargazers:1136Issues:0Issues:0

simclr

SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4033Issues:0Issues:0

awesome-game-ai

Awesome Game AI materials of Multi-Agent Reinforcement Learning

License:MITStargazers:736Issues:0Issues:0

easy-rl

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8846Issues:0Issues:0

UPDeT

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

Language:PythonLicense:MITStargazers:127Issues:0Issues:0

RODE

Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is a scalable role-based multi-agent learning method which effectively discovers roles based on joint action space decomposition according to action effects, establishing a new state of the art on the StarCraft multi-agent benchmark.

Language:PythonLicense:Apache-2.0Stargazers:66Issues:0Issues:0

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language:PythonLicense:Apache-2.0Stargazers:589Issues:0Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33592Issues:0Issues:0

deep_bisim4control

Learning Invariant Representations for Reinforcement Learning without Reconstruction

Language:PythonLicense:NOASSERTIONStargazers:139Issues:0Issues:0

transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Language:PythonLicense:MITStargazers:13166Issues:0Issues:0

NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

Language:OpenEdge ABLStargazers:510Issues:0Issues:0

SEAL

SEAL (learning from Subgraphs, Embeddings, and Attributes for Link prediction). "M. Zhang, Y. Chen, Link Prediction Based on Graph Neural Networks, NeurIPS 2018 spotlight".

Language:CStargazers:568Issues:0Issues:0

awesome-deep-rl

For deep RL and the future of AI.

Language:HTMLLicense:MITStargazers:1406Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonLicense:MITStargazers:7684Issues:0Issues:0

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

Stargazers:3910Issues:0Issues:0

pytorch-DRL

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Language:PythonLicense:MITStargazers:519Issues:0Issues:0

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:PythonStargazers:1389Issues:0Issues:0

AIDungeon

Infinite adventures await!

Language:PythonLicense:MITStargazers:3184Issues:0Issues:0

awesome-reinforcement-learning

Learning Resources And Links Of Reinforcement Learning (updating)

Language:PythonLicense:MITStargazers:222Issues:0Issues:0

pytorch-original-transformer

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.

Language:Jupyter NotebookLicense:MITStargazers:975Issues:0Issues:0
Language:PythonStargazers:262Issues:0Issues:0

ijcai-2018

ijcai-2018 top1 solution

Language:Jupyter NotebookStargazers:436Issues:0Issues:0

design-pattern

Python3实现设计模式,致力于将设计模式的**应用在开发中。创建型模式有:简单工厂模式、工厂方法模式、抽象工厂模式、 建造者模式和单例模式;结构型模式:适配器模式、桥模式、组合模式、外观模式和代理模式;行为型模式:责任链模式、观察者模式、策略模式和模板方法模式。设计模式是对软件设计中普遍存在或反复出向的各种问题所提出的解决方案。每一个设计模式系统地被命名、解释和评价了面向对象系统中一个重要和重复出现的设计。

License:Apache-2.0Stargazers:136Issues:0Issues:0

netron

Visualizer for neural network, deep learning and machine learning models

Language:JavaScriptLicense:MITStargazers:27116Issues:0Issues:0

pytorch_geometric

Graph Neural Network Library for PyTorch

Language:PythonLicense:MITStargazers:20763Issues:0Issues:0

trax

Trax — Deep Learning with Clear Code and Speed

Language:PythonLicense:Apache-2.0Stargazers:8024Issues:0Issues:0

filter-grafting

Filter Grafting for Deep Neural Networks(CVPR 2020)

Language:PythonStargazers:140Issues:0Issues:0

magnetW

[已失效,不再维护]

Language:JavaScriptLicense:GPL-3.0Stargazers:11658Issues:0Issues:0

translations

🐼 Chinese translations for classic software development resources

License:NOASSERTIONStargazers:6824Issues:0Issues:0