Yifan Zhu's starred repositories

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33930Issues:751Issues:1247

manim

A community-maintained Python framework for creating mathematical animations.

Language:PythonLicense:MITStargazers:21346Issues:137Issues:1498

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonLicense:Apache-2.0Stargazers:6854Issues:71Issues:587

awesome-mlss

🤖 Machine Learning Summer School deadlines

Language:JavaScriptLicense:MITStargazers:2619Issues:272Issues:32

weread2notion

将微信读书划线同步到Notion

attention-learn-to-route

Attention based model for learning to solve different routing problems

Language:Jupyter NotebookLicense:MITStargazers:1076Issues:23Issues:53

diffuser

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Language:PythonLicense:MITStargazers:830Issues:12Issues:62

JaxMARL

Multi-Agent Reinforcement Learning with JAX

Language:PythonLicense:Apache-2.0Stargazers:405Issues:10Issues:35

rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

Language:PythonLicense:MITStargazers:399Issues:8Issues:81

pgx

♟️ Vectorized RL game environments in JAX

Language:PythonLicense:Apache-2.0Stargazers:393Issues:7Issues:242

Imitating-Human-Behaviour-w-Diffusion

Code for ICLR 2023 paper "Imitating Human Behaviour with Diffusion Models"

Language:PythonLicense:MITStargazers:127Issues:8Issues:0

DeepACO

[NeurIPS 2023] DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization

Language:Jupyter NotebookLicense:MITStargazers:121Issues:4Issues:3

endless-memory-gym

Challenging Memory-based Deep Reinforcement Learning Agents

Language:PythonLicense:MITStargazers:82Issues:4Issues:4

learning-to-delegate

[NeurIPS 2021 Spotlight] Learning to Delegate for Large-scale Vehicle Routing

poppy

:hibiscus: Population-Based Reinforcement Learning for Combinatorial Optimization

Language:PythonLicense:Apache-2.0Stargazers:64Issues:7Issues:0

MARVIN

Uber's Multi-Agent Routing Value Iteration Network

Language:PythonLicense:NOASSERTIONStargazers:57Issues:5Issues:0

GFlowNet-CombOpt

PyTorch implementation for our NeurIPS 2023 spotlight paper "Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets".

AdaptDiffuser

[ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"

Language:PythonLicense:MITStargazers:46Issues:2Issues:4

fast_irl

Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.

Language:Jupyter NotebookStargazers:45Issues:4Issues:1
Language:PythonLicense:GPL-3.0Stargazers:30Issues:6Issues:1

UEyes-CHI2023

Code released for our CHI2023 paper "UEyes: Understanding Visual Saliency across User Interface Types"

Language:Jupyter NotebookStargazers:22Issues:1Issues:0

icrl

Inverse Constrained Reinforcement Learning (ICML 2021)

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:16Issues:2Issues:0

CAMA

Code for ICML2023 accepted paper: Complementary Attention for Multi-Agent Reinforcement Learning.

Language:PythonLicense:BSD-3-ClauseStargazers:10Issues:1Issues:1

Rewriting-By-Generating

code for paper: 'Rewriting by Generating: Learn Heuristics for Large-scale Vehicle Routing Problems'

Language:PythonLicense:MITStargazers:5Issues:2Issues:0

lori

Inferring Lexicographically-Ordered Rewards from Preferences

Language:PythonStargazers:3Issues:1Issues:0
Language:PythonStargazers:1Issues:3Issues:0