YanyueMa's repositories
tutorials
Machine learning tutorials
multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
sumolights
SUMO adaptive traffic signal control - DQN, DDPG, Webster's, Max-pressure, Self-Organizing Traffic Lights
design_patterns
Design patterns explained with diagrams
distributedRL_MAPF
Distributed RL/IL code for Multi-Agent Path Finding (MAPF)
AlgorithmsByPython
Algorithms / data structures / Python / Coding Interviews (剑指offer) / machine learning / LeetCode
CityFlow
A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario
deep-q-learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
Deep-QLearning-Agent-for-Traffic-Signal-Control
A framework where a deep Q-Learning Reinforcement Learning agent tries to choose the correct traffic light phase at an intersection to maximize the traffic efficiency.
PacmanDQN
Deep Reinforcement Learning in Pac-man
deeprl_network
Multi-agent deep reinforcement learning for networked system control.
deeprl_signal_control
Multi-agent deep reinforcement learning for large-scale traffic signal control.
pysc2
StarCraft II Learning Environment
uestc_Internet_plus_course_project
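Code and some reports for all the course projects and assignments from my university studies, including Computer Organization and Architecture, Computer Networks and Communication Technology, Comprehensive Course Design in Software Fundamentals, Comprehensive Course Design in Internet Software Development, Data Mining and Big Data Analysis, Time Series Analysis, Machine Learning, Data Structures and Algorithms, Introduction to Parallel Programming, Computer Operating Systems, and Computer Vision.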
WebServer_Comment
A web server written in C++11
Realtime-Action-Recognition
Multi-person real-time recognition (classification) of 9 actions based on human skeleton from OpenPose and a 0.5-second window.
sumo-rl
A simple interface to instantiate Reinforcement Learning environments with SUMO for Traffic Signal Control. Compatible with Gym Env from OpenAI and MultiAgentEnv from RLlib.
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
ELF
An End-To-End, Lightweight and Flexible Platform for Game Research
Coursera-ML-AndrewNg-Notes
Personal notes on Andrew Ng's machine learning course
Reinforcement-Learning-Algorithms-with-Python
Reinforcement Learning Algorithms with Python, Published by Packt
tf-pose-estimation
Deep Pose Estimation implemented using Tensorflow with Custom Architectures for fast inference.
ShortLink
A URL shortening service implemented in Go
ThreadPool
A thread pool implemented in C++11
CVE-2019-0708
Scanner PoC for CVE-2019-0708 RDP RCE vuln
airline
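Project description: a visualization system for flight routes, developed in Python. It mainly supports querying flight information, displaying route information, and displaying flight delay rates. Project responsibilities: developed the route display module in Python; published the paper "Design and Implementation of a Flight Data Visualization System" (《基于航班数据可视化系统的设计与实现》) in Intelligent Computer and Applications (《智能计算机与应用》), 2019, vol. 9, no. 3; and applied for the software copyright "Flight Information Visualization System Based on the ECharts Library" (《基于echarts库的航班信息可视化系统》), registration no. 2019R11S0348072.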
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
MoSTScenario
Monaco SUMO Traffic (MoST) Scenario
CVE-2019-7239
Nexus Repository Manager 3 Remote Code Execution without authentication < 3.15.0