xiuyanDL's starred repositories

sd-webui-model-downloader-cn

免梯子下载 civitai 上的模型

Language:PythonLicense:AGPL-3.0Stargazers:216Issues:0Issues:0

stable-diffusion

Latent Text-to-Image Diffusion

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:3692Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:9482Issues:0Issues:0

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language:PythonStargazers:1616Issues:0Issues:0

docformer

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

Language:PythonLicense:MITStargazers:250Issues:0Issues:0

lxmert

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

Language:PythonLicense:MITStargazers:921Issues:0Issues:0

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Language:PythonStargazers:1014Issues:0Issues:0

TGDoc

arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"

Language:PythonStargazers:9Issues:0Issues:0

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Stargazers:421Issues:0Issues:0

Decoupled-attention-network

Pytorch implementation for "Decoupled attention network for text recognition".

Language:PythonLicense:MITStargazers:309Issues:0Issues:0

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5260Issues:0Issues:0

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:3940Issues:0Issues:0

deit

Official DeiT repository

Language:PythonLicense:Apache-2.0Stargazers:3918Issues:0Issues:0

dessurt

Official implementation for Dessurt

Language:PythonLicense:MITStargazers:54Issues:0Issues:0

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language:PythonLicense:MITStargazers:11207Issues:0Issues:0

TCM

Turning a CLIP Model into a Scene Text Detector (CVPR2023)

Language:PythonLicense:NOASSERTIONStargazers:160Issues:0Issues:0

Bridging-Text-Spotting

(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.

Language:PythonLicense:NOASSERTIONStargazers:35Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:275Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:99Issues:0Issues:0

openseg.pytorch

The official Pytorch implementation of OCNet series and SegFix.

Language:PythonLicense:MITStargazers:1179Issues:0Issues:0

LangGPT

LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt,Language of GPT, 结构化提示词,结构化Prompt

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4156Issues:0Issues:0

SDSB

Simplified Diffusion Schrödinger Bridge

Language:PythonStargazers:29Issues:0Issues:0
Language:Jupyter NotebookStargazers:730Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:4363Issues:0Issues:0

ID-Aligner

Official implement of ID-Aligner

Stargazers:110Issues:0Issues:0

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4301Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:10783Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8548Issues:0Issues:0

V2ray-Configs

🛰️✨ Free V2ray Configs , Updating Every 10 minutes.

Language:PythonLicense:MITStargazers:3444Issues:0Issues:0

ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Language:PythonLicense:MITStargazers:350Issues:0Issues:0