tianjunz's repositories

Language:PythonLicense:NOASSERTIONStargazers:35Issues:2Issues:3
Language:PythonStargazers:18Issues:4Issues:0

agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

awesome-deep-rl

For deep RL and the future of AI.

License:MITStargazers:0Issues:1Issues:0

azure-cli-cheatsheet

Azure CLI Cheatsheet

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:2Issues:0

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dreamerv2

Mastering Atari with Discrete World Models

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

GNNPapers

Must-read papers on graph neural networks (GNN)

Stargazers:0Issues:1Issues:0

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:TeXLicense:NOASSERTIONStargazers:0Issues:0Issues:0

marLo

Multi Agent Reinforcement Learning using MalmĂ–

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ml-agents

Unity Machine Learning Agents Toolkit

Language:C#License:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

ort

Accelerate PyTorch models with ONNX Runtime

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

overcooked_ai

A benchmark environment for fully cooperative multi-agent performance.

Language:JavaScriptLicense:MITStargazers:0Issues:1Issues:0

poet

ML model training for edge devices

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

python

Official Python client library for kubernetes

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:JavaScriptStargazers:0Issues:2Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:0Issues:0Issues:0