ZhaoQiiii

ZhaoQiiii's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonAGPL-3.0139940 1069 7640

InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.

Language:TypeScriptApache-2.022883 201 2965

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.019386 159 1487

stable-diffusion-webui-colab

stable diffusion webui colab

Language:Jupyter NotebookUnlicense15571 189 353

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonMIT8776 63 209

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookApache-2.06868 97 708

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language:PythonNOASSERTION5467 55 87

dreamgaussian

[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation

Language:PythonMIT3872 46 149

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonApache-2.03751 33 509

DiffBIR

Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Language:PythonApache-2.03252 36 124

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonBSD-3-Clause3213 58 96

mmselfsup

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Language:PythonApache-2.03169 45 279

ceval

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Language:PythonMIT1608 15 81

Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Language:PythonBSD-2-Clause1306 29 156

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonMIT1139 43 26

SMPLer-X

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Language:PythonNOASSERTION969 22 71

PointLLM

[ECCV 2024 Oral] PointLLM: Empowering Large Language Models to Understand Point Clouds

Language:Python522 12 35

DISC-MedLLM

Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful medical response in end-to-end conversational healthcare services.

Language:PythonApache-2.0467 2 18

multimodal-garment-designer

This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023

Language:PythonNOASSERTION406 28 30

DreamLLM

[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation

Language:PythonApache-2.0377 17 22

Awesome

Github Trending榜高赞与趣味项目速览。主理人：同济子豪兄

376 140

MathGLM

Official Pytorch Implementation for MathGLM

Language:Python315 11 11

SAFMN

[ICCV 2023] Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution; runner-up method for the model complexity track in NTIRE2023 Efficient SR challenge

Language:Python255 6 56