ZuanGao (FaltingsA)

FaltingsA

Geek Repo

Company:USTC

Location:Hefei China

Github PK Tool:Github PK Tool

ZuanGao's starred repositories

leetcode-master

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5756Issues:46Issues:75

ComfyUI-Workflows-ZHO

我的 ComfyUI 工作流合集 | My ComfyUI workflows collection

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:4065Issues:53Issues:113

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonLicense:Apache-2.0Stargazers:3685Issues:37Issues:90

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Language:PythonLicense:MITStargazers:2432Issues:35Issues:29

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:1607Issues:26Issues:49

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:1358Issues:23Issues:56

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookLicense:MITStargazers:1303Issues:19Issues:57

DeepLearing-Interview-Awesome-2024

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目

LLMs_interview_notes

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Language:PythonLicense:Apache-2.0Stargazers:507Issues:13Issues:88

Awesome-Mixture-of-Experts-Papers

A curated reading list of research in Mixture-of-Experts(MoE).

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Language:Jupyter NotebookLicense:MITStargazers:420Issues:8Issues:24

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:403Issues:16Issues:40

MIC

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

DEADiff

[CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"

Language:PythonLicense:Apache-2.0Stargazers:194Issues:10Issues:15

LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

License:MITStargazers:172Issues:0Issues:0

Awesome-Layout-Generators

An awesome list of layout generation papers

Pyllusion

A Parametric Framework to Generate Visual Illusions using Python

Language:PythonLicense:MITStargazers:59Issues:8Issues:15

MEBOW

Code for "MEBOW: Monocular Estimation of Body Orientation In the Wild", CVPR 2020

Draw-and-Understand

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Language:PythonLicense:Apache-2.0Stargazers:47Issues:1Issues:3

IconQA

Data and code for NeurIPS 2021 Paper "IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning".

Vary-tiny-600k

Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)

Language:PythonStargazers:26Issues:0Issues:0

SSM

[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition

Language-Enhanced-CLIP-For-Multi-label-Image-Recognition

3rd Place, Visual Prompt Tuning Challenge @ CVPR 2023 HIT Workshop (2023)

Language:PythonLicense:MITStargazers:5Issues:2Issues:0
Language:PythonStargazers:1Issues:0Issues:0