ZhaoQiiii

ZhaoQiiii

Geek Repo

Company:Shanghai AI Lab

Location:Shanghai

Github PK Tool:Github PK Tool

ZhaoQiiii's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:139940Issues:1069Issues:7640

InvokeAI

InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.

Language:TypeScriptLicense:Apache-2.0Stargazers:22883Issues:201Issues:2965

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:19386Issues:159Issues:1487

stable-diffusion-webui-colab

stable diffusion webui colab

Language:Jupyter NotebookLicense:UnlicenseStargazers:15571Issues:189Issues:353

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8776Issues:63Issues:209

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6868Issues:97Issues:708

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language:PythonLicense:NOASSERTIONStargazers:5467Issues:55Issues:87

dreamgaussian

[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation

Language:PythonLicense:MITStargazers:3872Issues:46Issues:149

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:3751Issues:33Issues:509

DiffBIR

Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Language:PythonLicense:Apache-2.0Stargazers:3252Issues:36Issues:124

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonLicense:BSD-3-ClauseStargazers:3213Issues:58Issues:96

mmselfsup

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:3169Issues:45Issues:279

ceval

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Language:PythonLicense:MITStargazers:1608Issues:15Issues:81

Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Language:PythonLicense:BSD-2-ClauseStargazers:1306Issues:29Issues:156

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonLicense:MITStargazers:1139Issues:43Issues:26

SMPLer-X

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Language:PythonLicense:NOASSERTIONStargazers:969Issues:22Issues:71

PointLLM

[ECCV 2024 Oral] PointLLM: Empowering Large Language Models to Understand Point Clouds

DISC-MedLLM

Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful medical response in end-to-end conversational healthcare services.

Language:PythonLicense:Apache-2.0Stargazers:467Issues:2Issues:18

multimodal-garment-designer

This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023

Language:PythonLicense:NOASSERTIONStargazers:406Issues:28Issues:30

DreamLLM

[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation

Language:PythonLicense:Apache-2.0Stargazers:377Issues:17Issues:22

Awesome

Github Trending榜高赞与趣味项目速览。主理人:同济子豪兄

MathGLM

Official Pytorch Implementation for MathGLM

SAFMN

[ICCV 2023] Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution; runner-up method for the model complexity track in NTIRE2023 Efficient SR challenge

VIGC

AAAI 2024: Visual Instruction Generation and Correction

Language:PythonLicense:Apache-2.0Stargazers:86Issues:5Issues:14

vision-process-webui

💡💡💡awesome compute vision app in gradio

Language:PythonLicense:Apache-2.0Stargazers:41Issues:2Issues:1

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:18Issues:0Issues:0