Ruben Solozabal (rubensolozabal)

rubensolozabal

Geek Repo

Location:Bilbao

Github PK Tool:Github PK Tool

Ruben Solozabal's starred repositories

TensorFlow-Examples

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:43248Issues:2054Issues:233

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language:Jupyter NotebookLicense:MITStargazers:20175Issues:862Issues:155

transformer

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Language:PythonLicense:Apache-2.0Stargazers:4190Issues:110Issues:159

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:4069Issues:62Issues:944

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:4053Issues:109Issues:536

minigo

An open-source implementation of the AlphaGoZero algorithm

Language:C++License:Apache-2.0Stargazers:3437Issues:150Issues:321

higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3266Issues:79Issues:1

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Language:PythonLicense:MITStargazers:2767Issues:49Issues:39
Language:PythonLicense:MITStargazers:2397Issues:75Issues:175

powerful-gnns

How Powerful are Graph Neural Networks?

Language:PythonLicense:MITStargazers:1147Issues:26Issues:23

attention-learn-to-route

Attention based model for learning to solve different routing problems

Language:Jupyter NotebookLicense:MITStargazers:1026Issues:22Issues:52

BCQ

Author's PyTorch implementation of BCQ for continuous and discrete actions

Language:PythonLicense:MITStargazers:580Issues:7Issues:14

DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

Language:PythonLicense:Apache-2.0Stargazers:549Issues:14Issues:108

optnet

OptNet: Differentiable Optimization as a Layer in Neural Networks

Language:PythonLicense:Apache-2.0Stargazers:485Issues:28Issues:6

undreamt

Unsupervised Neural Machine Translation

Language:PythonLicense:GPL-3.0Stargazers:471Issues:21Issues:17

robo-gym

An open source toolkit for Distributed Deep Reinforcement Learning on real and simulated robots.

Language:PythonLicense:MITStargazers:386Issues:21Issues:53

RoseTTAFold2NA

RoseTTAFold2 protein/nucleic acid complex prediction

Language:PythonLicense:MITStargazers:301Issues:15Issues:98

deep-rl

Collection of Deep Reinforcement Learning algorithms

Language:PythonLicense:MITStargazers:296Issues:13Issues:17

RLcycle

A library for ready-made reinforcement learning agents and reusable components for neat prototyping

Language:PythonLicense:MITStargazers:295Issues:13Issues:6

cpo

Constrained Policy Optimization

L2D

Official implementation of paper "Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning"

ibm3202

Google Colab Tutorials for IBM3202

Language:Jupyter NotebookLicense:MITStargazers:223Issues:14Issues:3

puzzle_cube

Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search

Language:PythonLicense:MITStargazers:91Issues:12Issues:1

JSPLIB

Benchmark instances for job-shop scheduling problem

minimal-marl

Minimal implementation of multi-agent reinforcement learning algorithms

Language:PythonLicense:MITStargazers:45Issues:3Issues:4

Actor_CriticPointer_Network-TSP

Tensorflow implementation of an Actor Critic algorithm using a Pointer Network to solve the TSP (algorithm from Neural Combinatorial Optimization with Reinforcement Learning, Bello et al, 2016)

Language:Jupyter NotebookStargazers:40Issues:2Issues:2

ITU-Challenge-ML5G-PHY-RL

Scripts for the "ITU-ML5G-PS-006: ML5G-PHY-Reinforcement learning: scheduling and resource allocation"

Language:PythonLicense:MITStargazers:26Issues:7Issues:1

betazero

Tabula Rasa Tic-Tac-Toe

Language:PythonLicense:MITStargazers:10Issues:0Issues:0

AI4Bio-Reading-List

Must-read papers on AI for Biology

ADP_LP

LP approach for Approximate Dynamic Programming.

Language:PythonLicense:MITStargazers:7Issues:1Issues:0