sbhambr1

Siddhant Bhambri's repositories

camera_model_and_stereo_depth_sensing

Camera model and stereo depth sensing using OpenCV

Language:Python7 10

wordle_using_rollouts

This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertsekas, accepted at IEEE CoG 2023.

Language:Jupyter Notebook3 10

gpt4-testing-tom

Testing GPT4 completions for Theory-of-Mind (vs ChatGPT & text-davinci-003)

Language:Python1 10

60_Days_RL_Challenge

Learn Deep Reinforcement Learning in Depth in 60 days

Language:Jupyter NotebookMIT000

adversarial-robustness-toolbox

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

Language:PythonMIT000

cse536-xv6-os

Writing code for xv6 OS.

Language:AssemblyMIT000

FinRL

A Deep Reinforcement Learning Framework for Automated Trading in Quantitative Finance. NeurIPS 2020 & ICAIF 2021. 🔥

Language:Jupyter NotebookMIT000

ImageBind_testing

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION000

imitation-learning

Imitation learning algorithms

Language:PythonMIT000

learning-from-human-preferences

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

Language:PythonMIT000

lmql-playground

Playing with LMQL:https://lmql.ai

Language:Python010

MarkovGameSolvers

This is code for finding the minimax/nash/stackelberg strategy of players in Markov Games.

Language:PythonMIT000

MAS-Memory-Aware-Synapses

Memory Aware Synapses method implementation code

Language:Jupyter Notebook000

sbhambr1.github.io

Language:JavaScriptMIT010

StackelbergEquilibribumSolvers

Solves a Mixed Integer Linear Program to generate the Stacklberg Equilibrium of a General-sum (+Bayesian) Games.

Language:PythonMIT000

segment_anything_playground

Playing with SAM model by MetaAI

Language:Jupyter Notebook010

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonApache-2.0000

StatisticalML-Course

Language:Python010

symbolic_planning_and_rl

Spring 2021 - CSE 574 Project

Language:Python000

turtlebot3_simulations

Simulating TurtleBot3 in custom worlds & playing the evader-pursuer game.

Language:C++000

videos

Code for the manim-generated scenes used in 3blue1brown videos

Language:Python000

XAI-papers

MIT000