Haoyu Ma (Earthring)

Earthring

Geek Repo

Company:Tsinghua University

Location:Beijing

Home Page:earthring.github.io

Github PK Tool:Github PK Tool

Haoyu Ma's starred repositories

JaxTransformer

This repository demonstrates how to build a Decoder-Only Transformer with Multi-Query Attention in JAX.

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

Awesome-World-Model

Collect some World Models for Autonomous Driving papers.

Stargazers:430Issues:0Issues:0

RecSysPapers

推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.

Language:PythonLicense:BSD-2-ClauseStargazers:1245Issues:0Issues:0

bridge_env

Contract Bridge Environment package in Python

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.

Language:PythonLicense:MITStargazers:211Issues:0Issues:0

DocBench

DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems

Language:PythonStargazers:15Issues:0Issues:0

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

License:MITStargazers:13258Issues:0Issues:0

Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Language:PythonLicense:MITStargazers:192Issues:0Issues:0
Language:MATLABLicense:GPL-3.0Stargazers:10369Issues:0Issues:0

causal-learn

Causal Discovery in Python. It also includes (conditional) independence tests and score functions.

Language:PythonLicense:MITStargazers:1136Issues:0Issues:0

granular

Fast dataset format and loader

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)

License:Apache-2.0Stargazers:885Issues:0Issues:0

clash-core

backup of clash core

Language:GoLicense:GPL-3.0Stargazers:870Issues:0Issues:0
Language:PythonLicense:MITStargazers:27Issues:0Issues:0

Deductive-Beam-Search

[COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"

Language:PythonStargazers:18Issues:0Issues:0

CarDreamer

World Model based Autonomous Driving Platform in CARLA :car:

Language:PythonLicense:NOASSERTIONStargazers:131Issues:0Issues:0

Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:8678Issues:0Issues:0

SHINE

Official code for: "SHINE: Shielding Backdoors in Deep Reinforcement Learning"

Language:PythonStargazers:3Issues:0Issues:0

RigorLLM

Implementation for "RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content"

Language:PythonStargazers:9Issues:0Issues:0

Omost

Your image is almost there!

Language:PythonLicense:Apache-2.0Stargazers:7244Issues:0Issues:0

HarmonyDream

Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

HelmFluid

About code release of "HelmFluid: Learning Helmholtz Dynamics for Interpretable Fluid Prediction", ICML 2024. https://arxiv.org/pdf/2310.10565

Language:PythonLicense:MITStargazers:11Issues:0Issues:0

Transolver

About code release of "Transolver: A Fast Transformer Solver for PDEs on General Geometries", ICML 2024 Spotlight. https://arxiv.org/abs/2402.02366

Language:PythonLicense:MITStargazers:69Issues:0Issues:0

Uni-RLHF-Platform

Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language:PythonLicense:MITStargazers:29Issues:0Issues:0

Clean-Offline-RLHF

Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

Language:PythonLicense:MITStargazers:31Issues:0Issues:0

Multi-Embedding

About Code Release for "On the Embedding Collapse When Scaling Up Recommendation Models" (ICML 2024)

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

sbx

SBX: Stable Baselines Jax (SB3 + Jax)

Language:PythonLicense:MITStargazers:328Issues:0Issues:0

iVideoGPT

Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223

Language:PythonLicense:MITStargazers:60Issues:0Issues:0

hok_env

Honor of Kings AI Open Environment of Tencent

Language:PythonLicense:Apache-2.0Stargazers:629Issues:0Issues:0