zhimin-z

JIMMY ZHAO's starred repositories

MULTI-Benchmark

MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images

Language:PythonMIT2300

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language:PythonMIT326000

DreamMat

[SIGGRAPH2024] DreamMat: High-quality PBR Material Generation with Geometry- and Light-aware Diffusion Models

Language:PythonMIT16600

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Language:Jupyter NotebookMIT10500

puppeteer

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Language:PythonMIT10400

Skywork-MoE

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

8200

FlashST

[ICML'2024] "FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction"

Language:Python1800

vHeat

vHeat: Building Vision Models upon Heat Conduction

Language:Python7000

llm-latent-language

Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".

Language:Jupyter Notebook3100

MVSGaussian

MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

MIT10300

alphafold3-pytorch

Implementation of Alphafold 3 in Pytorch

Language:PythonMIT55200

Omost

Your image is almost there!

Language:PythonApache-2.0562200

UrbanGPT

[KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"

Language:Python13300

Fox

official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Language:Python5500

arithmetic

Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (2024)

Language:PythonMIT13200

M3Act

[CVPR2024] Learning from Synthetic Human Group Activities

NOASSERTION600

Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documentation/SubmittingPatches procedure for any of your improvements.

Language:CNOASSERTION5063200

zhimin-z

JIMMY ZHAO's starred repositories

MULTI-Benchmark

LMOps

DreamMat

RL4VLM

puppeteer

Skywork-MoE

FlashST

vHeat

llm-latent-language

MVSGaussian

alphafold3-pytorch

Omost

UrbanGPT

Fox

ChatGPT_DAN

arithmetic

M3Act

git

n8n

PowerToys

ChatTTS

spdx-licenses

GLM

LucaOne

UniDoorManip

EditWorld

Yuan2.0-M32

yolov10

octo

TeleSpeech-ASR