Ove's repositories
chinese-independent-blogs
中文独立博客列表
zjuthesis-OCproposal-OverLeaf
海洋学院开题报告OverLeaf版本
epymarl_resco
EPyMARL codebase modified to operate the RESCO benchmark environments
awesome-reinforcement-learning-lib
GitHub's code repository is all you need
DQN_pytorch
Vanilla DQN, Double DQN, and Dueling DQN implemented in PyTorch
DRL
Deep Reinforcement Learning
GPTs
leaked prompts of GPTs
JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Multi-Commander
Multi & Single Agent Reinforcement Learning for Traffic Signal Control Problem
ove.github.io
MyBlog
PKD-for-BERT-Model-Compression
pytorch implementation for Patient Knowledge Distillation for BERT Model Compression
practicalAI
📚 A practical approach to learning and using machine learning.
pytorch-maddpg
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
sumolights
SUMO adaptive traffic signal control - DQN, DDPG, Webster's, Max-pressure, Self-Organizing Traffic Lights