Ching-An Cheng (chinganc)

Ching-An Cheng's repositories

Language: Python | License: MIT | Stargazers: 3 | Issues: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 2 | Issues: 3 | Issues: 0

alfworld

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

autogen

Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ

Language: Jupyter Notebook | License: CC-BY-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Bullet-Safety-Gym

An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 0 | Issues: 0

CQL

Code for conservative Q-learning

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

d4rl

A benchmark for offline reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

garage

A toolkit for reproducible reinforcement learning research.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

hand_dapg

Repository to accompany RSS 2018 paper on dexterous hand manipulation

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

metaworld

An open source robotics benchmark for meta- and multi-task reinforcement learning

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DSRL

🔥 Datasets and env wrappers for offline safe reinforcement learning

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

IQL-PyTorch

A PyTorch implementation of Implicit Q-Learning

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

mj_envs

A collection of MuJoCo based environments.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mjrl

Reinforcement learning algorithms for MuJoCo tasks

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

Organized-LLM-Agents

Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source code for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Parrot_Paraphraser

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ray

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0