Haoyu Ma (Earthring)

Company: Tsinghua University

Location: Beijing

Home Page: earthring.github.io

Haoyu Ma's starred repositories

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

Omost

Your image is almost there!

Language: Python | License: Apache-2.0 | Stargazers: 7156 | Issues: 44 | Issues: 75

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

RecSysPapers

A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.

Language: Python | License: BSD-2-Clause | Stargazers: 1203 | Issues: 51 | Issues: 0

causal-learn

Causal Discovery in Python. It also includes (conditional) independence tests and score functions.

Language: Python | License: MIT | Stargazers: 1105 | Issues: 16 | Issues: 101

awesome-model-based-RL

A curated list of awesome model-based RL resources (continually updated)

clash-core

backup of clash core

Language: Go | License: GPL-3.0 | Stargazers: 837 | Issues: 9 | Issues: 2

hok_env

Honor of Kings AI Open Environment of Tencent

Language: Python | License: Apache-2.0 | Stargazers: 609 | Issues: 16 | Issues: 62

pgx

♟️ Vectorized RL game environments in JAX

Language: Python | License: Apache-2.0 | Stargazers: 383 | Issues: 7 | Issues: 242

sbx

SBX: Stable Baselines Jax (SB3 + Jax)

Language: Python | License: MIT | Stargazers: 314 | Issues: 9 | Issues: 24

diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.

Language: Python | License: MIT | Stargazers: 204 | Issues: 4 | Issues: 5

Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Language: Python | License: MIT | Stargazers: 187 | Issues: 3 | Issues: 12

WorldModelPapers

A collection of papers from the continuing line of work that started with World Models.

License: MIT | Stargazers: 125 | Issues: 10 | Issues: 0

CarDreamer

World Model based Autonomous Driving Platform in CARLA 🚗

Language: Python | License: NOASSERTION | Stargazers: 114 | Issues: 3 | Issues: 2

iVideoGPT

Official repo for "iVideoGPT: Interactive VideoGPTs are Scalable World Models", https://arxiv.org/abs/2405.15223

Language: Python | License: MIT | Stargazers: 58 | Issues: 4 | Issues: 5

Transolver

Code release for "Transolver: A Fast Transformer Solver for PDEs on General Geometries", ICML 2024 Spotlight. https://arxiv.org/abs/2402.02366

Language: Python | License: MIT | Stargazers: 57 | Issues: 5 | Issues: 3

Uni-RLHF-Platform

Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language: Python | License: MIT | Stargazers: 28 | Issues: 2 | Issues: 0

Clean-Offline-RLHF

Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language: Python | License: MIT | Stargazers: 27 | Issues: 2 | Issues: 3
Language: Python | License: MIT | Stargazers: 25 | Issues: 4 | Issues: 4

HarmonyDream

Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344

Language: Python | License: MIT | Stargazers: 21 | Issues: 6 | Issues: 0

Deductive-Beam-Search

[COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"

Language: Python | Stargazers: 16 | Issues: 3 | Issues: 0

granular

Fast dataset format and loader

Language: Python | License: MIT | Stargazers: 15 | Issues: 2 | Issues: 0

DocBench

DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems

Language: Python | Stargazers: 13 | Issues: 0 | Issues: 0

HelmFluid

Code release for "HelmFluid: Learning Helmholtz Dynamics for Interpretable Fluid Prediction", ICML 2024. https://arxiv.org/pdf/2310.10565

Language: Python | License: MIT | Stargazers: 11 | Issues: 4 | Issues: 0

Multi-Embedding

Code release for "On the Embedding Collapse When Scaling Up Recommendation Models" (ICML 2024)

Language: Python | License: MIT | Stargazers: 11 | Issues: 4 | Issues: 0

RigorLLM

Implementation for "RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content"

Language: Python | Stargazers: 5 | Issues: 0 | Issues: 0

SHINE

Official code for: "SHINE: Shielding Backdoors in Deep Reinforcement Learning"

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0

bridge_env

Contract Bridge Environment package in Python

Language: Python | License: MIT | Stargazers: 2 | Issues: 2 | Issues: 44