Ted Moskovitz (tedmoskovitz)

tedmoskovitz

Geek Repo

Github PK Tool:Github PK Tool

Ted Moskovitz's repositories

TOP

Implementation of Tactical Optimistic and Pessimistic value estimation

Language:PythonStargazers:21Issues:0Issues:0

WNPG

implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies

Language:PythonStargazers:10Issues:1Issues:0

ConstrainedRL4LMs

A library for constrained RLHF.

Language:PythonStargazers:7Issues:0Issues:0

directorv3

Mastering Diverse Domains through World Models

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

reinforcement_learning

My solutions to Denny Britz's short course on RL.

Language:Jupyter NotebookLicense:MITStargazers:2Issues:0Issues:0

first_occupancy

A First Occupancy Representation for Reinforcement Learning

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

SVGD

Stein Variational Gradient Descent

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

bayesian_modeling

A collection of simple Bayesian machine learning methods implemented on toy data.

Language:PythonStargazers:0Issues:0Issues:0

Computational_Decipherment

Applying deep learning and other machine learning methods to the decipherment of ancient writing systems.

Language:JavaStargazers:0Issues:0Issues:0

ConvRNN_Analysis

Analyze Biologically-Realistic Convolutional Recurrent Networks

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

GA_TSP

A simple genetic algorithm (GA) for solving the travelling salesman problem.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

LambdaRepresentation

Lambda Representation for Diminishing Marginal Utility

Language:PythonStargazers:0Issues:0Issues:0

SimpleCUDA

Simple Neural Network in CUDA

Language:CudaStargazers:0Issues:0Issues:0

tvpo

An implementation of Total Variation Policy Optimization (TVPO)

Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

DeepLearning_Thesis

A sample of code from my thesis at Princeton applying deep learning models to neural spike data.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Feedback_Alignment

Investigating biologically-plausible implementations of the backpropagation algorithm.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

minRLHF

A (somewhat) minimal library for finetuning language models with PPO on human feedback.

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

PracticeCpp

Simple C++ Programs

Language:C++Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

SimplePPO

A Simple, Easily-Customizable, Fully Jitted PPO Implementation in Jax

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

TDSR_python

successor representation for RL

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0