Beast code in Giters

Tim Wee's repositories

10k-remix-frontend-fly

Language:JavaScript020

10kai_backend_fly

Language:PythonMIT020

codecrafters-bittorrent-go

Language:Go020

codecrafters-redis-go

Language:Go020

RL_learning

Language:Jupyter Notebook020

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT010

chat-10k

Language:PythonMIT010

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC

Language:PythonApache-2.0010

datasaurust

Blazingly fast implementation of the Datasaurus paper. Same Stats, Different Graphs.

Language:RustMIT010

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

010

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language:PythonApache-2.0010

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonNOASSERTION010

GPU-Puzzles

Solve puzzles. Learn CUDA.

MIT000

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Language:PythonMIT010

mini-lsm

A tutorial of building an LSM-Tree storage engine in a week!

Language:RustApache-2.0000

MuKoe

Apache-2.0000

openplayground

An LLM playground you can run on your laptop

Language:TypeScriptMIT010

pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Language:Jupyter NotebookMIT010

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

Language:PythonMIT010

placemark

Placemark open source project

Language:TypeScriptMIT010

ppo-ewma

Code for the paper "Batch size invariance for policy optimization"

Language:Jupyter NotebookMIT010

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language:Python010

rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Language:Jupyter NotebookMIT010

Reflected-Diffusion

[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)

Language:Python010

roomGPT

Upload a photo of your room to generate your dream room with AI.

Language:TypeScriptMIT010

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonMIT010

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language:PythonNOASSERTION010

Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Language:Jupyter NotebookMIT000

tkrzw

a set of implementations of DBM

Language:C++Apache-2.0010

ziglings-march2023

Language:ZigMIT020