Mengxi Li's starred repositories

gitignore

A collection of useful .gitignore templates

License:CC0-1.0Stargazers:159393Issues:3368Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:31933Issues:472Issues:17921

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonLicense:MITStargazers:7590Issues:93Issues:719

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language:PythonLicense:MITStargazers:4184Issues:68Issues:558

modAL

A modular active learning framework for Python

Language:PythonLicense:MITStargazers:2167Issues:42Issues:142

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonLicense:Apache-2.0Stargazers:1765Issues:29Issues:130

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language:PythonLicense:MITStargazers:1569Issues:152Issues:67

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.

Language:Jupyter NotebookLicense:MITStargazers:1491Issues:28Issues:30

Ipopt

COIN-OR Interior Point Optimizer IPOPT

Language:C++License:NOASSERTIONStargazers:1342Issues:42Issues:582
Language:PythonLicense:Apache-2.0Stargazers:1102Issues:34Issues:0

deep-active-learning

Deep Active Learning

Language:PythonLicense:MITStargazers:780Issues:16Issues:15

simple_canvas_game

Quick tutorial on how to make a simple HTML5 Canvas game

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonLicense:MITStargazers:421Issues:13Issues:20

softgym

SoftGym is a set of benchmark environments for deformable object manipulation.

Language:C++License:BSD-3-ClauseStargazers:255Issues:11Issues:41

marLo

Multi Agent Reinforcement Learning using MalmÖ

Language:PythonLicense:MITStargazers:241Issues:10Issues:43

ewc.pytorch

An implementation of EWC with PyTorch

Language:Jupyter NotebookStargazers:225Issues:5Issues:4

robovat

RoboVat: A unified toolkit for simulated and real-world robotic task environments.

Language:PythonLicense:MITStargazers:67Issues:17Issues:2

Multi-agent-reinforcement-learning

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Language:PythonLicense:MITStargazers:63Issues:3Issues:7

tianshou-docs-zh_CN

天授中文文档

Language:TeXStargazers:52Issues:7Issues:0

Dynamic-Movement-Primitives-and-Imitation-Learning-Robotics

Dynamic movement primitives (DMPs) are a method of trajectory control/planning from Stefan Schaal’s lab. Complex movements have long been thought to be composed of sets of primitive action ‘building blocks’ executed in sequence and \ or in parallel, and DMPs are a proposed mathematical formalization of these primitives. The difference between DMPs and previously proposed building blocks is that each DMP is a nonlinear dynamical system. The basic idea is that you take a dynamical system with well specified, stable behavior and add another term that makes it follow some interesting trajectory as it goes about its business. The DMP differential equations (Transformation System, Canonical System, Non-linear Function) realize a general way of generating point-to-point movements. Imitation learning using linear regression is performed to compute the weight factor W from a demonstrated trajectory dataset, given by a teacher. The quality of the imitation is evaluated by comparing the training data with the data generated by the DMP.

jponttuset.github.io

My personal webpage

Language:HTMLLicense:GPL-2.0Stargazers:41Issues:4Issues:2

gym-adv

Gym environments modified with adversarial agents

SAIL

Code for Paper "State Alignment-based Imitation Learning". Under maintenance

Language:PythonLicense:MITStargazers:16Issues:2Issues:0
Language:PythonStargazers:15Issues:0Issues:0

fernandomayer.github.io

Source code of personal webpage

Language:SCSSStargazers:8Issues:2Issues:0

panda-env

pybullet simulated environment for panda robots, similar structure to gym

Language:PythonLicense:MITStargazers:7Issues:3Issues:1

Mean-Value-Coordinates-for-Closed-Triangular-Mesh

Applications of Mean Value Coordinates for Closed Triangular Mesh

Language:C++License:Apache-2.0Stargazers:7Issues:0Issues:1