Mengxi Li's starred repositories

Mean-Value-Coordinates-for-Closed-Triangular-Mesh

Applications of Mean Value Coordinates for Closed Triangular Mesh

Language:C++License:Apache-2.0Stargazers:8Issues:0Issues:0

ewc.pytorch

An implementation of EWC with PyTorch

Language:Jupyter NotebookStargazers:233Issues:0Issues:0

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Language:Jupyter NotebookLicense:MITStargazers:1550Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonLicense:MITStargazers:7823Issues:0Issues:0

tianshou-docs-zh_CN

天授中文文档

Language:TeXStargazers:55Issues:0Issues:0

softgym

SoftGym is a set of benchmark environments for deformable object manipulation.

Language:C++License:BSD-3-ClauseStargazers:270Issues:0Issues:0
Language:PythonStargazers:16Issues:0Issues:0

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonLicense:MITStargazers:431Issues:0Issues:0

panda-env

pybullet simulated environment for panda robots, similar structure to gym

Language:PythonLicense:MITStargazers:7Issues:0Issues:0

SAIL

Code for Paper "State Alignment-based Imitation Learning". Under maintenance

Language:PythonLicense:MITStargazers:16Issues:0Issues:0

Ipopt

COIN-OR Interior Point Optimizer IPOPT

Language:C++License:NOASSERTIONStargazers:1408Issues:0Issues:0

Dynamic-Movement-Primitives-and-Imitation-Learning-Robotics

Dynamic movement primitives (DMPs) are a method of trajectory control/planning from Stefan Schaal’s lab. Complex movements have long been thought to be composed of sets of primitive action ‘building blocks’ executed in sequence and \ or in parallel, and DMPs are a proposed mathematical formalization of these primitives. The difference between DMPs and previously proposed building blocks is that each DMP is a nonlinear dynamical system. The basic idea is that you take a dynamical system with well specified, stable behavior and add another term that makes it follow some interesting trajectory as it goes about its business. The DMP differential equations (Transformation System, Canonical System, Non-linear Function) realize a general way of generating point-to-point movements. Imitation learning using linear regression is performed to compute the weight factor W from a demonstrated trajectory dataset, given by a teacher. The quality of the imitation is evaluated by comparing the training data with the data generated by the DMP.

Language:MATLABStargazers:47Issues:0Issues:0

robovat

RoboVat: A unified toolkit for simulated and real-world robotic task environments.

Language:PythonLicense:MITStargazers:67Issues:0Issues:0

modAL

A modular active learning framework for Python

Language:PythonLicense:MITStargazers:2201Issues:0Issues:0

deep-active-learning

Deep Active Learning

Language:PythonLicense:MITStargazers:802Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1115Issues:0Issues:0

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language:PythonLicense:MITStargazers:1611Issues:0Issues:0

Multi-agent-reinforcement-learning

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Language:PythonLicense:MITStargazers:63Issues:0Issues:0

marLo

Multi Agent Reinforcement Learning using MalmÖ

Language:PythonLicense:MITStargazers:245Issues:0Issues:0

fernandomayer.github.io

Source code of personal webpage

Language:SCSSStargazers:7Issues:0Issues:0

jponttuset.github.io

My personal webpage

Language:HTMLLicense:GPL-2.0Stargazers:42Issues:0Issues:0

gym-adv

Gym environments modified with adversarial agents

Language:PythonStargazers:35Issues:0Issues:0

simple_canvas_game

Quick tutorial on how to make a simple HTML5 Canvas game

Language:JavaScriptStargazers:507Issues:0Issues:0
Language:CMakeStargazers:227Issues:0Issues:0
Language:PythonStargazers:385Issues:0Issues:0

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language:PythonLicense:MITStargazers:4225Issues:0Issues:0

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonLicense:Apache-2.0Stargazers:1829Issues:0Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:33273Issues:0Issues:0

gitignore

A collection of useful .gitignore templates

License:CC0-1.0Stargazers:161495Issues:0Issues:0
Language:PythonLicense:MITStargazers:80Issues:0Issues:0