know-nothing8

know-nothing8

Geek Repo

Github PK Tool:Github PK Tool

know-nothing8's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23978Issues:0Issues:0

Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Language:PythonLicense:NOASSERTIONStargazers:334Issues:0Issues:0

llm_multiagent_debate

ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate

Language:PythonStargazers:314Issues:0Issues:0

MATRIX

Implementation of the MATRIX framework (ICML 2024)

Language:PythonStargazers:35Issues:0Issues:0
Language:C++Stargazers:415Issues:0Issues:0

deeplearningbook-chinese

Deep Learning Book Chinese Translation

Language:TeXStargazers:35464Issues:0Issues:0

MachineLearningNotes

My personal notes

Stargazers:1546Issues:0Issues:0

EQL-Pytorch

Equation learning method -- pytorch implementation

Language:PythonLicense:MITStargazers:4Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:3Issues:0Issues:0

pytorch-cifar

95.47% on CIFAR10 with PyTorch

Language:PythonLicense:MITStargazers:5874Issues:0Issues:0

Ensemble-Pytorch

A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.

Language:PythonLicense:BSD-3-ClauseStargazers:1060Issues:0Issues:0

GBDT

A simple GBDT in Python

Language:PythonStargazers:351Issues:0Issues:0

GMAN-PyTorch

Implementation of Graph Muti-Attention Network with PyTorch

Language:PythonStargazers:133Issues:0Issues:0

RL-based-Graph2Seq-for-NQG

Code & data accompanying the ICLR 2020 paper "Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation"

Language:PythonLicense:Apache-2.0Stargazers:122Issues:0Issues:0

pytorch_dkvmn

Pytorch implementation of DKVMN

Language:PythonStargazers:23Issues:0Issues:0
Language:PythonLicense:MITStargazers:93Issues:0Issues:0

KT

Knowledge Tracing Models with PyTorch

Language:PythonStargazers:86Issues:0Issues:0

ktm

Knowledge Tracing Machines: Factorization Machines for Knowledge Tracing

Language:Jupyter NotebookLicense:MITStargazers:130Issues:0Issues:0

GKT

Graph-based Knowledge Tracing: Modeling Student Proficiency Using Graph Neural Network

Language:PythonLicense:MITStargazers:105Issues:0Issues:0

serve

Serve, optimize and scale PyTorch models in production

Language:JavaLicense:Apache-2.0Stargazers:4079Issues:0Issues:0

R-transformer

Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.

Language:PythonStargazers:224Issues:0Issues:0

RNN-RL

Experiments with reinforcement learning and recurrent neural networks

Language:PythonStargazers:109Issues:0Issues:0

deep-learning-nlp-rl-papers

Recent Deep Learning papers in NLU and RL

Language:PythonStargazers:294Issues:0Issues:0

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

License:MITStargazers:1Issues:0Issues:0

RL-Adventure

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Language:Jupyter NotebookStargazers:3005Issues:0Issues:0

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonLicense:MITStargazers:3782Issues:0Issues:0

pytorch-dqn

Deep Q-Learning Network in pytorch (not actively maintained)

Language:PythonLicense:MITStargazers:384Issues:0Issues:0

Youtube-Code-Repository

Repository for most of the code from my YouTube channel

Language:PythonStargazers:854Issues:0Issues:0
Language:PythonStargazers:58Issues:0Issues:0