ucb

There are 0 repositories under the ucb topic.

wizardforcel / sicp-py-zh
:book: [Translation] UCB CS61a SICP Python
python sicp textbook ucb cs61a
Language:CSS 2232
apachecn / cs61b-textbook-zh
:book: [Translation] UCB CS61b Data Structures in Java
ucb cs61b textbook data-structures java
Language:HTML 345
czahie / CS61A
Structure and Interpretation of Computer Programs
ucb python scheme data-structure sqlite
Language:Python 335
Kivy-CN / data8-textbook-zh
:book: [Translation] UCB DATA8 Computational and Inferential Thinking
data8 python statistics textbook ucb
Language:HTML 326
alison-carrera / mabalgs
:bust_in_silhouette: Multi-Armed Bandit Algorithms Library (MAB) :cop:
algorithm arm contextual-bandits mab monte-carlo montecarlo-simulation multi-armed-bandit rank ranked-mab ranking-algorithm reinforcement-learning reinforcement-learning-algorithms reward simulation thompson-sampling ucb
Language:Python 132
apachecn / ds100-textbook-zh
:book: [Translation] UCB DS100 Principles and Techniques of Data Science
python data-analysis machine-learning textbook ucb ds100
Language:JavaScript 117
apachecn / sicp-py-zh
:book: [Translation] UCB CS61a SICP in Python, Chinese edition
cs61a lecture-notes python sicp ucb
Language:CSS 84
xuyanshi / cs61a-2022
CS 61A: Structure and Interpretation of Computer Programs, Fall 2022, UC Berkeley
cs61a study ucb ucberkeley
Language:Python 56
zjsyhjh / ucb-cs61b
All projects for ucb-61b (spring 2014): http://www.cs.berkeley.edu/~jrs/61b/index.html
data-structures ucb
Language:Java 40
OMerkel / UCThello
UCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
board-game othello mcts game ucb 2-player-strategy-game simulation upper-confidence-bounds abstract-game ai ai-players artificial-intelligence entertainment mobile mobile-app mobile-game monte-carlo-tree-search perfect-information uct
Language:JavaScript 25
csfive / home
🐭 A self-study guide for hopeless computer science students
cmu computer-science cs csdiy csfive harvard mit self-learning stanford ucb
23
v-i-s-h / MAB.jl
A Julia Package for providing Multi Armed Bandit Experiments
reinforcement-learning reinforcement-learning-algorithms julia-language multi-arm-bandits julia thompson-sampling mab bandit-experiments ucb exp julialang julia-package
Language:Julia 21
akshaykhadse / reinforcement-learning
Implementations of basic concepts under the Reinforcement Learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.
reinforcement-learning reinforcement-learning-excercises reinforcement-learning-analysis multi-armed-bandits multiarm-bandit markovian-epidemic-processes mdps ucb ucb1 kl-divergence epsilon-greedy thompson-sampling linear-programming howards-pi policy-iteration policy-evaluation batch-switching randomised-algorithms randomized-policy-iteration
Language:Python 17
idanmoradarthas / MutiArmedBandit-DeepLearning
Multi-armed bandit algorithm with tensorflow and 11 policies
tensorflow deep-reinforcement-learning epsilon ucb softmax python3 multi-armed-bandit
Language:Python 14
ishank-juneja / Correlated-AoI-Bandits
Author's implementation of the paper Correlated Age-of-Information Bandits.
multi-armed-bandit ucb thompson-sampling age-of-information aoi correlated-multi-armed-bandits correlated-arms aoi-regret
Language:Python 14
zjsyhjh / ucb-cs186
All projects for ucb-cs186 (fall 2013); see the course website (https://sites.google.com/site/cs186fall2013) for details.
database ucb
Language:Java 14
csfive / CS61A
🚧
cs cs61a python sicp ucb
13
annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
reinforcement-learning ucb bandits thompson-sampling
Language:Python 10
OMerkel / Oware
Oware and Ouril - traditional African Mancala games with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
mcts monte-carlo-tree-search ucb upper-confidence-bounds uct ai artificial-intelligence board-game game entertainment mobile abstract-game perfect-information 2-player-strategy-game mancala-game mancala oware ouril mobile-app mobile-game
Language:HTML 10
rudrajit1729 / Machine-Learning-Codes-And-Templates
Codes and templates for ML algorithms created, modified and optimized in Python and R.
datascience feature-extraction feature-selection regression-models regression-algorithms classification-algorithims kmeans-clustering hierarchical-clustering apriori-algorithm eclat-algorithm ucb thompson-sampling nlp-machine-learning ann cnn-classification kfold-cross-validation dimensionality-reduction xgboost-model parameter-tuning
Language:Python 9
csfive / CS61B
🚧
21sp algorithm cs61b java ucb ucberkeley
8
BigBobAtBerkeley / CS70
CS70 Homework and Discussion Solutions
70 and berkeley cs discrete mathematics probability theory uc ucb
7
ChillyHigh / CS61A-CN
A mirror website for CS61A Fall 2020 with Chinese translation.
chinese-translation cs61a mirror python ucb
Language:HTML 6
Suchetaaa / CS747-Assignments
Foundations Of Intelligent Learning Agents (FILA) Assignments
reinforcement-learning multi-armed-bandits bellman-equation linear-programming howards-pi bootstrapping monte-carlo sarsa-learning windy-gridworld temporal-differencing-learning intelligent-learning-agents ucb thompson-sampling kl-ucb
Language:Python 6
MaxenceGiraud / ucb-nonstationary
On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems
discounted-ucb multi-armed-bandits non-stationary-bandit sliding-ucb ucb
Language:Python 5
OMerkel / Alquerque
Alquerque - a 2 player abstract strategic perfect information traditional board game with computer AI option.
board-game game checkers draughts mcts monte-carlo-tree-search ucb upper-confidence-bounds uct ai artificial-intelligence 2-player-strategy-game perfect-information deterministic-game ai-players mobile mobile-app mobile-game entertainment
Language:JavaScript 5
OMerkel / FourInARow3D
3 dimensional Four in a Row game with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short).
mcts monte-carlo-tree-search ucb upper-confidence-bounds uct ai artificial-intelligence board-game game entertainment mobile abstract-game perfect-information 2-player-strategy-game mobile-app mobile-game
Language:JavaScript 5
Ralami1859 / Stochastic-Multi-Armed-Bandit
Implementation of 9 multi-armed bandit algorithms for the stationary stochastic environment
stochastic-bandit-algorithms ucb thompson-sampling kl-ucb bayes-ucb moss
Language:MATLAB 5
woctezuma / puissance4
AI for the game "Connect Four". Available on PyPI.
puissance4 puissance-4 connect4 connect4-game connect-4 connect-four upper-confidence-bounds monte-carlo-tree-search monte-carlo tree-search artificial-intelligence game-ai artificial-intelligence-algorithms game-artificial-intelligence ai-players ai-opponents ai-bots ai-agents uct ucb
Language:Python 5
amaitammar / Hex-Game
Python implementation of the Hex game with AI based on MC and MCTS methods. Interactive mode with pygame.
ai hex game python reinforcement-learning ucb
Language:Python 4
nicoleorzan / Multi-armed-bandit-RL
C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)
multi-armed-bandits reinforcement-learning softmax-policy bernoulli-bandit gaussian-bandit softmax ucb bandit-algorithms
Language:C++ 4
salimandre / Monte-Carlo-Tree-Search
We implemented Monte Carlo Tree Search (MCTS) from scratch and successfully applied it to the game of Tic-Tac-Toe.
mcts monte-carlo-tree-search reinforcement-learning tic-tac-toe-game upper-confidence-bound ucb graphics
Language:Python 4
BigBobAtBerkeley / CS170
CS 170 Homework Solutions
170 algorithms and cs efficient problems solutions intractable berkeley uc ucb
3
mknbv / zamburak
Bandit algorithms in OCaml
bandit-algorithms adversarial-bandit ucb exp3 trading stochastic-bandit ocaml
Language:OCaml 3
Sagarnandeshwar / Bandit_Algorithms
Reinforcement Learning (COMP 579) Project
bandit-algorithms bernoulli-distribution epsilon-greedy exploration-exploitation reinforcement-learning thompson-sampling ucb
Language:Jupyter Notebook 3
SanketAgrawal / ReinforcementLearning
Chapter-wise implementation and analysis of all the algorithms in Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto
reinforcement-learning python-3 k-armed-bandit ucb epsilon-greedy gradient-bandit optimistic-inital-values artificial-intelligence
Language:Jupyter Notebook 3
