MarcyLee

Mengxi Li's starred repositories

gitignore

A collection of useful .gitignore templates

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonApache-2.031933 472 17921

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonMIT7590 93 719

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language:PythonMIT4184 68 558

modAL

A modular active learning framework for Python

Language:PythonMIT2167 42 142

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonApache-2.01765 29 130

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language:PythonMIT1569 152 67

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Language:Jupyter NotebookMIT1491 28 30

Ipopt

COIN-OR Interior Point Optimizer IPOPT

Language:C++NOASSERTION1342 42 582

deep-active-learning

Deep Active Learning

Language:PythonMIT780 16 15

simple_canvas_game

Quick tutorial on how to make a simple HTML5 Canvas game

Language:JavaScript508 49 2

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonMIT421 13 20

softgym

SoftGym is a set of benchmark environments for deformable object manipulation.

Language:C++BSD-3-Clause255 11 41

marLo

Multi Agent Reinforcement Learning using MalmÖ

Language:PythonMIT241 10 43

ewc.pytorch

An implementation of EWC with PyTorch

Language:Jupyter Notebook225 5 4

robovat

RoboVat: A unified toolkit for simulated and real-world robotic task environments.

Language:PythonMIT67 17 2

Multi-agent-reinforcement-learning

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Language:PythonMIT63 3 7

Dynamic-Movement-Primitives-and-Imitation-Learning-Robotics

Dynamic movement primitives (DMPs) are a method of trajectory control/planning from Stefan Schaal’s lab. Complex movements have long been thought to be composed of sets of primitive action ‘building blocks’ executed in sequence and \ or in parallel, and DMPs are a proposed mathematical formalization of these primitives. The difference between DMPs and previously proposed building blocks is that each DMP is a nonlinear dynamical system. The basic idea is that you take a dynamical system with well specified, stable behavior and add another term that makes it follow some interesting trajectory as it goes about its business. The DMP differential equations (Transformation System, Canonical System, Non-linear Function) realize a general way of generating point-to-point movements. Imitation learning using linear regression is performed to compute the weight factor W from a demonstrated trajectory dataset, given by a teacher. The quality of the imitation is evaluated by comparing the training data with the data generated by the DMP.

Language:MATLAB47 2 1

MarcyLee

Mengxi Li's starred repositories

gitignore

ray

tianshou

sacred

modAL

pymarl

maddpg

continual-learning

Ipopt

active-learning

deep-active-learning

simple_canvas_game

pytorch-trpo

mbbl

softgym

marLo

ewc.pytorch

urdf_tutorial

multiagent-gail

robovat

Multi-agent-reinforcement-learning

tianshou-docs-zh_CN

Dynamic-Movement-Primitives-and-Imitation-Learning-Robotics

jponttuset.github.io

gym-adv

SAIL

blender-rope-sim

fernandomayer.github.io

panda-env

Mean-Value-Coordinates-for-Closed-Triangular-Mesh