Earthring

Tsinghua University

Beijing

earthring.github.io

Haoyu Ma's starred repositories

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

MIT | 13049 113 49

ShiArthur03

Language: MATLAB | GPL-3.0 | 10369 32 1354

Omost

Your image is almost there!

Language: Python | Apache-2.0 | 7156 44 75

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | 7148 41 751

RecSysPapers

A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.

Language: Python | BSD-2-Clause | 1203 510

causal-learn

Causal Discovery in Python. It also includes (conditional) independence tests and score functions.

Language: Python | MIT | 1105 16 101

awesome-model-based-RL

A curated list of awesome model-based RL resources (continually updated)

Apache-2.0 | 859 35 1

clash-core

Backup of clash core

Language: Go | GPL-3.0 | 837 9 2

hok_env

Honor of Kings AI Open Environment of Tencent

Language: Python | Apache-2.0 | 609 16 62

pgx

♟️ Vectorized RL game environments in JAX

Language: Python | Apache-2.0 | 383 7 242

sbx

SBX: Stable Baselines Jax (SB3 + Jax)

Language: Python | MIT | 314 9 24

General-World-Models-Survey

MIT | 229 120

diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.

Language: Python | MIT | 204 4 5

Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Language: Python | MIT | 187 3 12

WorldModelPapers

A collection of papers from the ongoing line of work that started with World Models.

MIT | 125 100

CarDreamer

World-model-based autonomous driving platform in CARLA

Language: Python | NOASSERTION | 114 3 2

iVideoGPT

Official repo for "iVideoGPT: Interactive VideoGPTs are Scalable World Models", https://arxiv.org/abs/2405.15223

Language: Python | MIT | 58 4 5

Transolver

Code release for "Transolver: A Fast Transformer Solver for PDEs on General Geometries", ICML 2024 Spotlight. https://arxiv.org/abs/2402.02366

Language: Python | MIT | 57 5 3

Uni-RLHF-Platform

Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language: Python | MIT | 28 20

Clean-Offline-RLHF

Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language: Python | MIT | 27 2 3

TimeSiam

Language: Python | MIT | 25 4 4

HarmonyDream

Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344

Language: Python | MIT | 21 60

Deductive-Beam-Search

[COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"

Language: Python | 16 30

granular

Fast dataset format and loader

Language: Python | MIT | 15 20

DocBench

DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems

Language: Python | 1300

HelmFluid

Code release for "HelmFluid: Learning Helmholtz Dynamics for Interpretable Fluid Prediction", ICML 2024. https://arxiv.org/pdf/2310.10565

Language: Python | MIT | 11 40

Multi-Embedding

Code release for "On the Embedding Collapse When Scaling Up Recommendation Models" (ICML 2024)

Language: Python | MIT | 11 40

RigorLLM

Implementation for "RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content"

Language: Python | 500

SHINE

Official code for: "SHINE: Shielding Backdoors in Deep Reinforcement Learning"

Language: Python | 300

bridge_env

Contract Bridge Environment package in Python

Language: Python | MIT | 2 2 44