Siddhant Bhambri (sbhambr1)

Company: Arizona State University

Twitter: @BhambriSiddhant

Siddhant Bhambri's repositories

camera_model_and_stereo_depth_sensing

Camera model and stereo depth sensing using OpenCV

Language: Python | Stargazers: 7 | Issues: 1 | Issues: 0

wordle_using_rollouts

This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertsekas, accepted at IEEE CoG 2023.

Language: Jupyter Notebook | Stargazers: 3 | Issues: 1 | Issues: 0

gpt4-testing-tom

Testing GPT4 completions for Theory-of-Mind (vs ChatGPT & text-davinci-003)

Language: Python | Stargazers: 1 | Issues: 1 | Issues: 0

60_Days_RL_Challenge

Learn Deep Reinforcement Learning in Depth in 60 days

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

adversarial-robustness-toolbox

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

cse536-xv6-os

Writing code for xv6 OS.

Language: Assembly | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

FinRL

A Deep Reinforcement Learning Framework for Automated Trading in Quantitative Finance. NeurIPS 2020 & ICAIF 2021. 🔥

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ImageBind_testing

ImageBind: One Embedding Space to Bind Them All

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

imitation-learning

Imitation learning algorithms

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

learning-from-human-preferences

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

lmql-playground

Playing with LMQL: https://lmql.ai

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

MarkovGameSolvers

Code for finding the minimax, Nash, or Stackelberg strategies of players in Markov Games.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MAS-Memory-Aware-Synapses

Memory Aware Synapses method implementation code

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

StackelbergEquilibribumSolvers

Solves a Mixed Integer Linear Program to generate the Stackelberg Equilibrium of General-sum (+Bayesian) Games.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

segment_anything_playground

Playing with the SAM model by Meta AI

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

symbolic_planning_and_rl

Spring 2021 - CSE 574 Project

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

turtlebot3_simulations

Simulating TurtleBot3 in custom worlds & playing the evader-pursuer game.

Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0

videos

Code for the manim-generated scenes used in 3blue1brown videos

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0