Andrei Kvasov (kvas7andy)

kvas7andy

Geek Repo

Company:The HKUST

Location:Hong Kong, Israel

Github PK Tool:Github PK Tool

Andrei Kvasov's repositories

kdd_project_2018

Team 20 KDD course project 2018 "Fine-Tuning strategy for classification based on transfer & active learning"

Language:PythonStargazers:6Issues:0Issues:0

bm3il

Bayesian Multi-type Mean Field Multi-agent Imitation Learning

Language:PythonStargazers:3Issues:0Issues:0

unreal

Reinforcement learning with unsupervised auxiliary tasks

Language:PythonLicense:NOASSERTIONStargazers:2Issues:3Issues:0

CyberBattleSim_Web

Version of CyberBattleSim https://github.com/microsoft/CyberBattleSim with extended funcitonality for training RL agents attacks on web applications

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

maml_rl

Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ai-deadlines

:alarm_clock: AI conference deadline countdowns

Language:HTMLStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

CyberBattleSim

An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.

License:MITStargazers:0Issues:0Issues:0

deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

drl_berkley_course

Lecture notes & Assignments of the CS294-112 course on Deep Reinforcement Learning in UC Berkley

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

License:NOASSERTIONStargazers:0Issues:0Issues:0

HowToTrainYourMAMLPytorch

The original code for the paper "How to train your MAML" along with a replication of the original "Model Agnostic Meta Learning" (MAML) paper in Pytorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MA-AIRL

Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.

Stargazers:0Issues:0Issues:0

MAGAIL

Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

maml

Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

minos

MINOS: Multimodal Indoor Simulator

Language:PythonLicense:MITStargazers:0Issues:3Issues:0
License:MITStargazers:0Issues:0Issues:0

multiagent-gail_wsjeon

multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ngsim_env

Learning human driver models from NGSIM data with imitation learning.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

olympics2021

Updated with World Bank Data Olympics dataset and NEW PCP coordinates plot notebook

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

papers

Research papers outline and analysis

Stargazers:0Issues:0Issues:0

polygon-pascalvoc-writer

For generating Pascal VOC XML image annotation files. Supports polygon & bounding-boxes.

License:MITStargazers:0Issues:0Issues:0

Practical_RL

A course in reinforcement learning in the wild

Language:Jupyter NotebookLicense:UnlicenseStargazers:0Issues:0Issues:0

ROMA

Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tensorboard

TensorFlow's Visualization Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0