georgepsh's repositories

ModelDistillation

BiLSTM Distillation with BERT for Sequence Classification

Language: Jupyter Notebook · Stargazers: 1 · Issues: 0

MountainCarContinuous-v0_DDGP

DDPG solution for the MountainCarContinuous problem

Language: Jupyter Notebook · Stargazers: 1 · Issues: 0

dqn-pytorch

DQN to play Atari Pong

Language: Python · Stargazers: 0 · Issues: 0

NeuralStyleTransfer

Neural Style Transfer PyTorch Implementation

Language: Jupyter Notebook · Stargazers: 0 · Issues: 0

expert

Expert-augmented actor-critic

Language: Python · License: NOASSERTION · Stargazers: 0 · Issues: 0

GIQA

PyTorch implementation of Generated Image Quality Assessment

Stargazers: 0 · Issues: 0

MLBD

Materials for "Machine Learning on Big Data" course

Stargazers: 0 · Issues: 0

MountainCar-v0_DQN

DQN solution for OpenAI's MountainCar-v0 problem

Language: Jupyter Notebook · Stargazers: 0 · Issues: 0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

License: MIT · Stargazers: 0 · Issues: 0

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

License: Apache-2.0 · Stargazers: 0 · Issues: 0

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Language: Python · License: MIT · Stargazers: 0 · Issues: 0