Beast code in Giters

Tim Wee's repositories

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookMIT000

Triton-Puzzles

Puzzles for learning Triton

Apache-2.0000

Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Language:Jupyter NotebookMIT000

MuKoe

Apache-2.0000

mini-lsm

A tutorial of building an LSM-Tree storage engine in a week!

Language:RustApache-2.0000

annotated-transformer

An annotated implementation of the Transformer paper.

MIT000

placemark

Placemark open source project

MIT000

codecrafters-redis-go

Language:Go000

codecrafters-bittorrent-go

Language:Go000

python-mastery

Advanced Python Mastery (course by @dabeaz)

000

roomGPT

Upload a photo of your room to generate your dream room with AI.

MIT000

openplayground

An LLM playground you can run on your laptop

Language:TypeScriptMIT000

Transformer-Puzzles

Puzzles for exploring transformers

MIT000

pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Language:Jupyter NotebookMIT000

RL_learning

Language:Jupyter Notebook000

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Apache-2.0000

10kai_backend_fly

Language:PythonMIT000

10k-remix-frontend-fly

Language:JavaScript000

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

MIT000

spinningup

An educational resource to help anyone learn deep reinforcement learning.

MIT000

Reflected-Diffusion

[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)

000

datasaurust

Blazingly fast implementation of the Datasaurus paper. Same Stats, Different Graphs.

MIT000

rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

MIT000

ppo-ewma

Code for the paper "Batch size invariance for policy optimization"

MIT000

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

MIT000

summarize-from-feedback

Code for "Learning to summarize from human feedback"

NOASSERTION000

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

MIT000

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC

Apache-2.0000

tkrzw

a set of implementations of DBM

Apache-2.0000

ziglings-march2023

Language:ZigMIT000