haotiansun14

High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | License: NOASSERTION | 000

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC

Language: Python | License: Apache-2.0 | 000

CSE6140-Fall-2022-Project-Minimum-Vertex-Cover

CSE6140 Fall 2022 Project: Minimum Vertex Cover

Language: Jupyter Notebook | 010

d2l-zh

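Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at 400 universities in 60 countries.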

Language: Python | License: Apache-2.0 | 000

DIG

A library for graph deep learning research

Language: Python | License: GPL-3.0 | 000

ElegantRL

Cloud-native Deep Reinforcement Learning. 🔥

License: NOASSERTION | 000

homework_fall2022

Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)

Language: Jupyter Notebook | 000

lihang-code

Code implementations for the book Statistical Learning Methods (《统计学习方法》)

000

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | 000

lvrep-rl

Language: Python | 000

rci-agent

A codebase for "Language Models can Solve Computer Tasks"

License: MIT | 000

ReAct

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Language: Jupyter Notebook | License: MIT | 000

ReAgent

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Language: Python | License: BSD-3-Clause | 000

reflexion

Reflexion: an autonomous agent with dynamic memory and self-reflection

License: MIT | 000

repeat_motion_segmentation

Segmenting a time series with repeating patterns using DTW matching

Language: Python | 000

rl-rep-page

Language: JavaScript | 010

rl_graph_generation

License: BSD-3-Clause | 000

score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

License: Apache-2.0 | 000

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language: Python | License: Apache-2.0 | 000

tianshou

An elegant PyTorch deep reinforcement learning library.

License: MIT | 000