Tim Wee's repositories

Language:JavaScriptStargazers:0Issues:2Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:GoStargazers:0Issues:2Issues:0
Language:GoStargazers:0Issues:2Issues:0
Language:Jupyter NotebookStargazers:0Issues:2Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

datasaurust

Blazingly fast implementation of the Datasaurus paper. Same Stats, Different Graphs.

Language:RustLicense:MITStargazers:0Issues:1Issues:0

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

Stargazers:0Issues:1Issues:0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

GPU-Puzzles

Solve puzzles. Learn CUDA.

License:MITStargazers:0Issues:0Issues:0

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

mini-lsm

A tutorial of building an LSM-Tree storage engine in a week!

Language:RustLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

openplayground

An LLM playground you can run on your laptop

Language:TypeScriptLicense:MITStargazers:0Issues:1Issues:0

pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

placemark

Placemark open source project

Language:TypeScriptLicense:MITStargazers:0Issues:1Issues:0

ppo-ewma

Code for the paper "Batch size invariance for policy optimization"

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language:PythonStargazers:0Issues:1Issues:0

rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

Reflected-Diffusion

[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)

Language:PythonStargazers:0Issues:1Issues:0

roomGPT

Upload a photo of your room to generate your dream room with AI.

Language:TypeScriptLicense:MITStargazers:0Issues:1Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

tkrzw

a set of implementations of DBM

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0
Language:ZigLicense:MITStargazers:0Issues:2Issues:0