tianjunz's repositories
awesome-deep-rl
For deep RL and the future of AI.
azure-cli-cheatsheet
Azure CLI Cheatsheet
DeepSpeedExamples
Example models using DeepSpeed
guidance
A guidance language for controlling large language models.
MemGPT
Create LLM agents with long-term memory and custom tools 📚🦙
ort
Accelerate PyTorch models with ONNX Runtime
overcooked_ai
A benchmark environment for fully cooperative multi-agent performance.
poet
ML model training for edge devices
python
Official Python client library for Kubernetes
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs