xiuyanDL's starred repositories

sd-webui-model-downloader-cn

Download models from civitai without a proxy

Language: Python | License: AGPL-3.0 | 21600

stable-diffusion

Latent Text-to-Image Diffusion

Language: Jupyter Notebook | License: NOASSERTION | 369200

AnimateDiff

Official implementation of AnimateDiff.

Language: Python | License: Apache-2.0 | 948200

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language: Python | 161600

docformer

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

Language: Python | License: MIT | 25000

lxmert

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

Language: Python | License: MIT | 92100

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Language: Python | 101400

TGDoc

arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"

Language: Python | 900

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

42100

Decoupled-attention-network

PyTorch implementation for "Decoupled attention network for text recognition".

Language: Python | License: MIT | 30900

StoryDiffusion

Create Magic Story!

Language: Jupyter Notebook | License: Apache-2.0 | 526000

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language: Python | License: Apache-2.0 | 394000

deit

Official DeiT repository

Language: Python | License: Apache-2.0 | 391800

dessurt

Official implementation for Dessurt

Language: Python | License: MIT | 5400

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language: Python | License: MIT | 1120700

TCM

Turning a CLIP Model into a Scene Text Detector (CVPR2023)

Language: Python | License: NOASSERTION | 16000

Bridging-Text-Spotting

(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.

Language: Python | License: NOASSERTION | 3500

FSCE

Language: Python | License: Apache-2.0 | 27500

UReader

Language: Python | License: Apache-2.0 | 9900

openseg.pytorch

The official PyTorch implementation of OCNet series and SegFix.

Language: Python | License: MIT | 117900

LangGPT

LangGPT: Empowering everyone to become a prompt expert! 🚀 Structured Prompt, Language of GPT, structured prompts

Language: Jupyter Notebook | License: Apache-2.0 | 415600

SDSB

Simplified Diffusion Schrödinger Bridge

Language: Python | 2900

deepul

Language: Jupyter Notebook | 73000

sd-scripts

Language: Python | License: Apache-2.0 | 436300

ID-Aligner

Official implementation of ID-Aligner

11000

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language: Jupyter Notebook | License: Apache-2.0 | 430100

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python | License: MIT | 1078300

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | 854800

V2ray-Configs

🛰️✨ Free V2ray Configs, updated every 10 minutes.

Language: Python | License: MIT | 344400

ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Language: Python | License: MIT | 35000