chaovven

Chao Wen's repositories

PyRL - Reinforcement Learning Framework in PyTorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)

Language: Python | License: Apache-2.0 | 33 30

Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020

Language: Python | License: Apache-2.0 | 24 1 2

Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022

Language: Python | 1600

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook | License: Apache-2.0 | 000

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | License: MIT | 000

Inference code for CodeLlama models

Language: Python | License: NOASSERTION | 000

Language: CSS | 000

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | 000