Beast code in Giters

Mahan's repositories

CS231

Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition

Language:Jupyter Notebook377 11 3

iLQG-MuJoCo

Iterative LQG for a couple of MuJoCo models

Language:C++59 1 1

Model-Based-RL

Model-based Policy Gradients

Language:Python31 2 1

UnsupervisedObjectDetection

Use your classification neural network for object detection and localization

Language:Python16 10

HJxB

Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)

Language:Python14 30

OBJET

OBJET: A Computer Vision Graphical Sandbox

Language:C++MIT13 20

Unity2OpenSim

Regenerate Movements of Unity w/ OpenSim

Language:Python7 10

TrajOpt-KneedWalker

Finding a stable limit cycle for a passive kneed walker using trajectory optimization.

Language:Jupyter Notebook4 30

S4RL

Offline RL with S4

Language:Python3 20

gail

A Frankenstein implementation of "Generative Adversarial Imitation Learning".

Language:Python200

DeepAnaglyph3D

End-to-end generation of good old red-cyan 3D images via CNNs

Language:Python1 10

DMPx

Dynamic Movement Primitives in JAX

Language:Python1 20

.files

config files for i3, doom emacs, vim, fish, etc.

Language:Emacs Lisp010

GCxEBM

goal-conditioned energy-based models

Language:Python020

go-rest

simple rest api in go

Language:Go020

ift6163_homeworks

Language:Jupyter Notebook010

JEM

Energy-based Option-like Discovery

Language:Python020

junk

all kinda shit here

Language:C++020

LearnOpenGL

recently started to dabble in OpenGL

Language:C020

length-generalization

Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023

Language:PythonMIT000

long-convs

long convolutions in jax

Language:Python010

MahanFathi

020

MAYO

Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch

Language:PythonMIT010

mjDS

Language:C++020

NxDP

Language:Jupyter Notebook020

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Apache-2.0000

s4

Structured state space sequence models

Language:Jupyter NotebookApache-2.0000

specssm

Spectral State-Space Models

Language:PythonMIT010

sub-q-learning

break up the Q-function in linear parts which correspond to subrewards

Language:Python020

trabrax

Trajectory Optimization on BRAX

020