know-nothing8's starred repositories
Stable-Alignment
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
llm_multiagent_debate
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
deeplearningbook-chinese
Deep Learning Book Chinese Translation
MachineLearningNotes
My personal notes
EQL-Pytorch
Equation learning method -- pytorch implementation
pytorch-cifar
95.47% on CIFAR10 with PyTorch
Ensemble-Pytorch
A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.
GMAN-PyTorch
Implementation of Graph Muti-Attention Network with PyTorch
RL-based-Graph2Seq-for-NQG
Code & data accompanying the ICLR 2020 paper "Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation"
pytorch_dkvmn
Pytorch implementation of DKVMN
R-transformer
Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.
deep-learning-nlp-rl-papers
Recent Deep Learning papers in NLU and RL
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
pytorch-dqn
Deep Q-Learning Network in pytorch (not actively maintained)
Youtube-Code-Repository
Repository for most of the code from my YouTube channel