JIMMY ZHAO (zhimin-z)

zhimin-z

Geek Repo

Company:Queen's University

Location:Canada

Home Page:zhimin-z.github.io

Github PK Tool:Github PK Tool

JIMMY ZHAO's starred repositories

MULTI-Benchmark

MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images

Language:PythonLicense:MITStargazers:23Issues:0Issues:0

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language:PythonLicense:MITStargazers:3260Issues:0Issues:0

DreamMat

[SIGGRAPH2024] DreamMat: High-quality PBR Material Generation with Geometry- and Light-aware Diffusion Models

Language:PythonLicense:MITStargazers:166Issues:0Issues:0

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Language:Jupyter NotebookLicense:MITStargazers:105Issues:0Issues:0

puppeteer

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Language:PythonLicense:MITStargazers:104Issues:0Issues:0

Skywork-MoE

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

Stargazers:82Issues:0Issues:0

FlashST

[ICML'2024] "FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction"

Language:PythonStargazers:18Issues:0Issues:0

vHeat

vHeat: Building Vision Models upon Heat Conduction

Language:PythonStargazers:70Issues:0Issues:0

llm-latent-language

Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".

Language:Jupyter NotebookStargazers:31Issues:0Issues:0

MVSGaussian

MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

License:MITStargazers:103Issues:0Issues:0

alphafold3-pytorch

Implementation of Alphafold 3 in Pytorch

Language:PythonLicense:MITStargazers:552Issues:0Issues:0

Omost

Your image is almost there!

Language:PythonLicense:Apache-2.0Stargazers:5622Issues:0Issues:0

UrbanGPT

[KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"

Language:PythonStargazers:133Issues:0Issues:0

Fox

official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Language:PythonStargazers:55Issues:0Issues:0

ChatGPT_DAN

ChatGPT DAN, Jailbreaks prompt

Stargazers:5837Issues:0Issues:0

arithmetic

Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (2024)

Language:PythonLicense:MITStargazers:132Issues:0Issues:0

M3Act

[CVPR2024] Learning from Synthetic Human Group Activities

License:NOASSERTIONStargazers:6Issues:0Issues:0

git

Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documentation/SubmittingPatches procedure for any of your improvements.

Language:CLicense:NOASSERTIONStargazers:50632Issues:0Issues:0

n8n

Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.

Language:TypeScriptLicense:NOASSERTIONStargazers:41687Issues:0Issues:0

PowerToys

Windows system utilities to maximize productivity

Language:C#License:MITStargazers:106242Issues:0Issues:0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:20565Issues:0Issues:0

spdx-licenses

Tools for working with the SPDX license list and validating licenses.

Language:PHPLicense:MITStargazers:1390Issues:0Issues:0

GLM

GLM (General Language Model)

Language:PythonLicense:MITStargazers:3069Issues:0Issues:0

LucaOne

The LucaOne’s model code.

Language:PythonLicense:Apache-2.0Stargazers:80Issues:0Issues:0

UniDoorManip

This is the official repository of UniDoorManip: Learning Universal Door Manipulation Policy Over Large-scale and Diverse Door Manipulation Environments.

Language:PythonStargazers:37Issues:0Issues:0

EditWorld

EditWorld: Simulating World Dynamics for Instruction-Following Image Editing

Language:PythonStargazers:83Issues:0Issues:0

Yuan2.0-M32

Mixture-of-Experts (MoE) Language Model

Language:PythonLicense:Apache-2.0Stargazers:141Issues:0Issues:0

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonLicense:AGPL-3.0Stargazers:6905Issues:0Issues:0

octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Language:PythonLicense:MITStargazers:541Issues:0Issues:0
Language:PythonStargazers:329Issues:0Issues:0