bjtuln

0

followers

following

stars

bjtuln's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonAGPL-3.0138119 1061 7601

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonGPL-3.047117 365 2812

TaskMatrix

Language:PythonNOASSERTION34530 300 351

paper-reading

深度学习经典、新论文逐段精读

Apache-2.025723 7150

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.024677 192 3929

generative-models

Generative Models by Stability AI

Language:PythonMIT23801 254 292

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.021267 178 440

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause9380 97 636

BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

Language:HTMLApache-2.07774 108 440

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookApache-2.04821 61 366

sd-scripts

Language:PythonApache-2.04689 52 868

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Language:PythonApache-2.04241 58 894

TextRecognitionDataGenerator

A synthetic data generator for text recognition

Language:PythonMIT3188 63 246

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonApache-2.03134 26 129

Kolors

Kolors Team

Language:PythonApache-2.03067 27 94

clip-interrogator

Image to prompt with BLIP and CLIP

Language:PythonMIT2622 29 94

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language:PythonApache-2.02389 40 367

china_area

2024年**全国5级行政区划（省、市、县、镇、村）

GPL-3.02021 48 28

Restormer

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Language:PythonNOASSERTION1684 18 97

parseq

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

Language:PythonApache-2.0542 13 138

synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

Language:PythonMIT465 6 41

segmoe

Language:PythonApache-2.0396 6 23

MyArxiv

Arxiv个性化定制化模版，实现对特定领域的相关内容、作者与学术会议的有效跟进。

Language:CSSGPL-2.0223 4 5

NeRCo

[ICCV 2023] Implicit Neural Representation for Cooperative Low-light Image Enhancement

Language:Python213 5 16

HI-Diff

PyTorch code for our NeurIPS 2023 paper "Hierarchical Integration Diffusion Model for Realistic Image Deblurring"

Language:PythonApache-2.0151 6 19

character_set

ImageForensicsOSN

Language:PythonMIT94 3 16

traditional-chinese-text-recogn-dataset

繁體中文OCR文字識別數據集

Language:PythonApache-2.050 3 3

CV_papers_arxiv_daily

Daily feed of this day's research articles about Computer Vision published to https://arxiv.org.

Language:Python24 30

GlyphDraw2

GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models

Language:PythonMIT1900