Tim Wee's repositories

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Triton-Puzzles

Puzzles for learning Triton

License:Apache-2.0Stargazers:0Issues:0Issues:0

Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

mini-lsm

A tutorial of building an LSM-Tree storage engine in a week!

Language:RustLicense:Apache-2.0Stargazers:0Issues:0Issues:0

annotated-transformer

An annotated implementation of the Transformer paper.

License:MITStargazers:0Issues:0Issues:0

placemark

Placemark open source project

License:MITStargazers:0Issues:0Issues:0
Language:GoStargazers:0Issues:0Issues:0
Language:GoStargazers:0Issues:0Issues:0

python-mastery

Advanced Python Mastery (course by @dabeaz)

Stargazers:0Issues:0Issues:0

roomGPT

Upload a photo of your room to generate your dream room with AI.

License:MITStargazers:0Issues:0Issues:0

openplayground

An LLM playground you can run on your laptop

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

Transformer-Puzzles

Puzzles for exploring transformers

License:MITStargazers:0Issues:0Issues:0

pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

License:MITStargazers:0Issues:0Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

License:MITStargazers:0Issues:0Issues:0

Reflected-Diffusion

[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)

Stargazers:0Issues:0Issues:0

datasaurust

Blazingly fast implementation of the Datasaurus paper. Same Stats, Different Graphs.

License:MITStargazers:0Issues:0Issues:0

rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

License:MITStargazers:0Issues:0Issues:0

ppo-ewma

Code for the paper "Batch size invariance for policy optimization"

License:MITStargazers:0Issues:0Issues:0

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

License:MITStargazers:0Issues:0Issues:0

summarize-from-feedback

Code for "Learning to summarize from human feedback"

License:NOASSERTIONStargazers:0Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

License:MITStargazers:0Issues:0Issues:0

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC

License:Apache-2.0Stargazers:0Issues:0Issues:0

tkrzw

a set of implementations of DBM

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:ZigLicense:MITStargazers:0Issues:0Issues:0