Ted Moskovitz's repositories
ConstrainedRL4LMs
A library for constrained RLHF.
directorv3
Mastering Diverse Domains through World Models
reinforcement_learning
My solutions to Denny Britz's short course on RL.
first_occupancy
A First Occupancy Representation for Reinforcement Learning
bayesian_modeling
A collection of simple Bayesian machine learning methods implemented on toy data.
Computational_Decipherment
Applying deep learning and other machine learning methods to the decipherment of ancient writing systems.
ConvRNN_Analysis
Analyze Biologically-Realistic Convolutional Recurrent Networks
GA_TSP
A simple genetic algorithm (GA) for solving the travelling salesman problem.
LambdaRepresentation
Lambda Representation for Diminishing Marginal Utility
SimpleCUDA
Simple Neural Network in CUDA
tvpo
An implementation of Total Variation Policy Optimization (TVPO)
DeepLearning_Thesis
A sample of code from my thesis at Princeton applying deep learning models to neural spike data.
Feedback_Alignment
Investigating biologically-plausible implementations of the backpropagation algorithm.
minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
PracticeCpp
Simple C++ Programs
SimplePPO
A Simple, Easily-Customizable, Fully Jitted PPO Implementation in Jax
TDSR_python
successor representation for RL