Longtao Zheng's repositories
data-privacy
Preserve data privacy with k-anonymity (samarati & mondrian), differential privacy, federated learning, paillier homomorphic encryption, etc.
CurriculumMARL
Code of "Towards Skilled Population Curriculum for MARL" + Implementation of Curriculum MARL algorithms based on Ray
mappo-football
Multi-Agent PPO (MAPPO) with the Google Research Football environment.
parallel-computing-ustc
Experiments for the Parallel Computing course
Replica-Currency-Estimation
Python implementation of Replica Currency Estimation
formal-methods-ustc
Experiments for the Formal Methods course
numerical-analysis
Assignments for the Numerical Methods course at USTC
coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"
DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
muzero-general
MuZero
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs