tianjunz

followers

following

stars

tianjunz's repositories

HIR

Language:Python158 5 2

TEMPERA

Language:Python40 3 4

NovelD

Language:PythonNOASSERTION35 2 3

MADE

Language:Python18 40

agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Language:PythonApache-2.0010

awesome-deep-rl

For deep RL and the future of AI.

MIT010

azure-cli-cheatsheet

Azure CLI Cheatsheet

000

c-planning

020

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonApache-2.0000

dreamerv2

Mastering Atari with Discrete World Models

Language:PythonMIT010

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookMIT000

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonNOASSERTION010

Learn_Prompting

Language:TeXNOASSERTION000

marLo

Multi Agent Reinforcement Learning using MalmÖ

Language:PythonMIT010

MemGPT

Create LLM agents with long-term memory and custom tools 📚🦙

Apache-2.0000

metaseq

Repo for external large-scale work

Language:PythonMIT010

ml-agents

Unity Machine Learning Agents Toolkit

Language:C#Apache-2.0010

my-offlinerl

Language:Python010

ort

Accelerate PyTorch models with ONNX Runtime

Language:PythonMIT000

overcooked_ai

A benchmark environment for fully cooperative multi-agent performance.

Language:JavaScriptMIT010

poet

ML model training for edge devices

Language:PythonApache-2.0000

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonApache-2.0010

python

Official Python client library for kubernetes

Language:PythonApache-2.0000

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonMIT010

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language:PythonMIT010

raft

010

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Language:PythonNOASSERTION010

tianjunz.github.io

Language:JavaScript020

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonMIT000

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Apache-2.0000