Beast code in Giters

know-nothing8's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.02397800

Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Language:PythonNOASSERTION33400

llm_multiagent_debate

ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate

Language:Python31400

MATRIX

Implementation of the MATRIX framework (ICML 2024)

Language:Python3500

book

Language:C++41500

deeplearningbook-chinese

Deep Learning Book Chinese Translation

Language:TeX3546400

MachineLearningNotes

My personal notes

154600

EQL-Pytorch

Equation learning method -- pytorch implementation

Language:PythonMIT400

gncl

Language:Jupyter NotebookMIT300

pytorch-cifar

95.47% on CIFAR10 with PyTorch

Language:PythonMIT587400

Ensemble-Pytorch

A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.

Language:PythonBSD-3-Clause106000

GBDT

A simple GBDT in Python

Language:Python35100

GMAN-PyTorch

Implementation of Graph Muti-Attention Network with PyTorch

Language:Python13300

RL-based-Graph2Seq-for-NQG

Code & data accompanying the ICLR 2020 paper "Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation"

Language:PythonApache-2.012200

pytorch_dkvmn

Pytorch implementation of DKVMN

Language:Python2300

AKT

Language:PythonMIT9300

KT

Knowledge Tracing Models with PyTorch

Language:Python8600

ktm

Knowledge Tracing Machines: Factorization Machines for Knowledge Tracing

Language:Jupyter NotebookMIT13000

GKT

Graph-based Knowledge Tracing: Modeling Student Proficiency Using Graph Neural Network

Language:PythonMIT10500

Convolutional-Knowledge-Tracing

1800

serve

Serve, optimize and scale PyTorch models in production

Language:JavaApache-2.0407900

R-transformer

Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.

Language:Python22400

RNN-RL

Experiments with reinforcement learning and recurrent neural networks

Language:Python10900

deep-learning-nlp-rl-papers

Recent Deep Learning papers in NLU and RL

Language:Python29400

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

MIT100

RL-Adventure

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Language:Jupyter Notebook300500

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonMIT378200

pytorch-dqn

Deep Q-Learning Network in pytorch (not actively maintained)

Language:PythonMIT38400

Youtube-Code-Repository

Repository for most of the code from my YouTube channel

Language:Python85400

ekt

Language:Python5800