MarcyLee

Mengxi Li's starred repositories

Mean-Value-Coordinates-for-Closed-Triangular-Mesh

Applications of Mean Value Coordinates for Closed Triangular Mesh

Language:C++Apache-2.0800

ewc.pytorch

An implementation of EWC with PyTorch

Language:Jupyter Notebook23300

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Language:Jupyter NotebookMIT155000

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonMIT782300

softgym

SoftGym is a set of benchmark environments for deformable object manipulation.

Language:C++BSD-3-Clause27000

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonMIT43100

panda-env

pybullet simulated environment for panda robots, similar structure to gym

Language:PythonMIT700

SAIL

Code for Paper "State Alignment-based Imitation Learning". Under maintenance

Language:PythonMIT1600

Ipopt

COIN-OR Interior Point Optimizer IPOPT

Language:C++NOASSERTION140800

Dynamic-Movement-Primitives-and-Imitation-Learning-Robotics

Dynamic movement primitives (DMPs) are a method of trajectory control/planning from Stefan Schaal’s lab. Complex movements have long been thought to be composed of sets of primitive action ‘building blocks’ executed in sequence and \ or in parallel, and DMPs are a proposed mathematical formalization of these primitives. The difference between DMPs and previously proposed building blocks is that each DMP is a nonlinear dynamical system. The basic idea is that you take a dynamical system with well specified, stable behavior and add another term that makes it follow some interesting trajectory as it goes about its business. The DMP differential equations (Transformation System, Canonical System, Non-linear Function) realize a general way of generating point-to-point movements. Imitation learning using linear regression is performed to compute the weight factor W from a demonstrated trajectory dataset, given by a teacher. The quality of the imitation is evaluated by comparing the training data with the data generated by the DMP.

Language:MATLAB4700

robovat

RoboVat: A unified toolkit for simulated and real-world robotic task environments.

Language:PythonMIT6700

modAL

A modular active learning framework for Python

Language:PythonMIT220100

deep-active-learning

Deep Active Learning

Language:PythonMIT80200

active-learning

Language:PythonApache-2.0111500

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language:PythonMIT161100

Multi-agent-reinforcement-learning

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Language:PythonMIT6300

marLo

Multi Agent Reinforcement Learning using MalmÖ

Language:PythonMIT24500

fernandomayer.github.io

Source code of personal webpage

Language:SCSS700

jponttuset.github.io

My personal webpage

Language:HTMLGPL-2.04200

gym-adv

Gym environments modified with adversarial agents

Language:Python3500

simple_canvas_game

Quick tutorial on how to make a simple HTML5 Canvas game

Language:JavaScript50700

urdf_tutorial

Language:CMake22700

mbbl

Language:Python38500

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language:PythonMIT422500

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonApache-2.0182900

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonApache-2.03327300

gitignore

A collection of useful .gitignore templates

CC0-1.016149500

multiagent-gail

Language:PythonMIT8000