Chao Wen (chaovven)

chaovven

Geek Repo

Location:Saarbrücken, Germany

Home Page:http://chaovven.github.io/

Github PK Tool:Github PK Tool

Chao Wen's repositories

PyRL

PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)

Language:PythonLicense:Apache-2.0Stargazers:33Issues:3Issues:0

SMIX

Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020

Language:PythonLicense:Apache-2.0Stargazers:24Issues:1Issues:2

maab

Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022

Language:PythonStargazers:16Issues:0Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

chaovven.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

codellama

Inference code for CodeLlama models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:CSSStargazers:0Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0