Siddhant Bhambri's repositories
camera_model_and_stereo_depth_sensing
Camera model and stereo depth sensing using OpenCV
wordle_using_rollouts
This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertsekas, accepted at IEEE CoG 2023.
gpt4-testing-tom
Testing GPT4 completions for Theory-of-Mind (vs ChatGPT & text-davinci-003)
60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
cse536-xv6-os
Writing code for xv6 OS.
FinRL
A Deep Reinforcement Learning Framework for Automated Trading in Quantitative Finance. NeurIPS 2020 & ICAIF 2021. 🔥
ImageBind_testing
ImageBind One Embedding Space to Bind Them All
imitation-learning
Imitation learning algorithms
learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
lmql-playground
Playing with LMQL:https://lmql.ai
MarkovGameSolvers
This is code for finding the minimax/nash/stackelberg strategy of players in Markov Games.
MAS-Memory-Aware-Synapses
Memory Aware Synapses method implementation code
StackelbergEquilibribumSolvers
Solves a Mixed Integer Linear Program to generate the Stacklberg Equilibrium of a General-sum (+Bayesian) Games.
segment_anything_playground
Playing with SAM model by MetaAI
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
symbolic_planning_and_rl
Spring 2021 - CSE 574 Project
turtlebot3_simulations
Simulating TurtleBot3 in custom worlds & playing the evader-pursuer game.
videos
Code for the manim-generated scenes used in 3blue1brown videos