Jingwei Zhang's repositories
pytorch-rl
Deep Reinforcement Learning with PyTorch & Visdom
annotated_latex_equations
Examples of how to create colorful, annotated equations in LaTeX using TikZ.
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
dynamics-aware-embeddings
Official implementation of DynE, Dynamics-aware Embeddings for RL
extending-jax
Extending JAX with custom C++ and CUDA code
get-started-with-JAX
The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.
gym-minigrid
Minimalistic gridworld package for OpenAI Gym
jax-rl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
makemore
The most accessible way of tinkering with a GPT - one hackable script
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
neural_chat
Code to support training, evaluating, and interacting with neural network dialog models, and training them with reinforcement learning. Code to deploy a web server that hosts the models live online is available at: https://github.com/asmadotgh/neural_chat_web
phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
plan2explore
Repository for the paper "Planning to Explore via Self-Supervised World Models"
pml-book
"Probabilistic Machine Learning" - a book series by Kevin Murphy
rl_games
RL implementations
sample-factory
High throughput asynchronous reinforcement learning
SciencePlots
Matplotlib styles for scientific plotting
softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
trax
Trax — Deep Learning with Clear Code and Speed
VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use Colab.