HongdaZhang's repositories

datasets

A collection of datasets of ML problem solving

Language:RStargazers:0Issues:0Issues:0

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Diffusion_RL

This repo has the code and suplementary materials of our 2024 RAL submission.

Language:PythonStargazers:0Issues:0Issues:0

eat_tensorflow2_in_30_days

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

formation

ROS package for formation and rendezvous of multi-drone (T-Cyber 2020)

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

homework

Assignments for CS294-112.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

KaTeX

Fast math typesetting for the web.

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

MADDPG_torch

The code for maddpg using pytorch

Language:PythonStargazers:0Issues:0Issues:0

MAgent

A Platform for Many-agent Reinforcement Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ML-NLP

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Stargazers:0Issues:0Issues:0

Multi-Agent-Deep-Deterministic-Policy-Gradients

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Language:PythonStargazers:0Issues:0Issues:0

Multi-Agent-Reinforcement-Learning

PyTorch implementations of MADDPG, MAPPO (coming)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

nmea_navsat_driver

ROS package containing drivers for NMEA devices that can output satellite navigation data (e.g. GPS or GLONASS).

Language:PythonStargazers:0Issues:0Issues:0

off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

orbbec_competition

第四届3DV创新应用竞赛

License:GPL-3.0Stargazers:0Issues:0Issues:0

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Python

All Algorithms implemented in Python

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ray

A fast and simple framework for building and running distributed applications.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

rl-book

Source codes for the book "Reinforcement Learning: Theory and Python Implementation"

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

ROS-ENKI_robot_simulation

A framework for the development of new closed-loop AI algorithms

Language:C++Stargazers:0Issues:0Issues:0

smac

SMAC: The StarCraft Multi-Agent Challenge

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

StarCraft

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:PythonStargazers:0Issues:0Issues:0

tensorflow_study

tensorflow学习代码

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

WorldModels

An implementation of the ideas from this paper https://arxiv.org/pdf/1803.10122.pdf

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0