bjtuln

bjtuln's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language: Python, License: AGPL-3.0, Stargazers: 138119, Issues: 1061, Issues: 7601

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language: Python, License: GPL-3.0, Stargazers: 47117, Issues: 365, Issues: 2812
Language: Python, License: NOASSERTION, Stargazers: 34530, Issues: 300, Issues: 351

paper-reading

Paragraph-by-paragraph close reading of classic and new deep learning papers

License: Apache-2.0, Stargazers: 25723, Issues: 715, Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python, License: Apache-2.0, Stargazers: 24677, Issues: 192, Issues: 3929

generative-models

Generative Models by Stability AI

Language: Python, License: MIT, Stargazers: 23801, Issues: 254, Issues: 292

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python, License: Apache-2.0, Stargazers: 21267, Issues: 178, Issues: 440

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook, License: BSD-3-Clause, Stargazers: 9380, Issues: 97, Issues: 636

BELLE

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)

Language: HTML, License: Apache-2.0, Stargazers: 7774, Issues: 108, Issues: 440

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 4821, Issues: 61, Issues: 366
Language: Python, License: Apache-2.0, Stargazers: 4689, Issues: 52, Issues: 868

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Language: Python, License: Apache-2.0, Stargazers: 4241, Issues: 58, Issues: 894

TextRecognitionDataGenerator

A synthetic data generator for text recognition

Language: Python, License: MIT, Stargazers: 3188, Issues: 63, Issues: 246

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python, License: Apache-2.0, Stargazers: 3134, Issues: 26, Issues: 129

Kolors

Kolors Team

Language: Python, License: Apache-2.0, Stargazers: 3067, Issues: 27, Issues: 94

clip-interrogator

Image to prompt with BLIP and CLIP

Language: Python, License: MIT, Stargazers: 2622, Issues: 29, Issues: 94

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language: Python, License: Apache-2.0, Stargazers: 2389, Issues: 40, Issues: 367

china_area

2024 ** nationwide five-level administrative divisions (province, city, county, town, village)

Restormer

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Language: Python, License: NOASSERTION, Stargazers: 1684, Issues: 18, Issues: 97

parseq

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

Language: Python, License: Apache-2.0, Stargazers: 542, Issues: 13, Issues: 138

synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

Language: Python, License: MIT, Stargazers: 465, Issues: 6, Issues: 41
Language: Python, License: Apache-2.0, Stargazers: 396, Issues: 6, Issues: 23

MyArxiv

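A personalized, customizable arXiv template for effectively following relevant content, authors, and academic conferences in specific fields.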

Language: CSS, License: GPL-2.0, Stargazers: 223, Issues: 4, Issues: 5

NeRCo

[ICCV 2023] Implicit Neural Representation for Cooperative Low-light Image Enhancement

HI-Diff

PyTorch code for our NeurIPS 2023 paper "Hierarchical Integration Diffusion Model for Realistic Image Deblurring"

Language: Python, License: Apache-2.0, Stargazers: 151, Issues: 6, Issues: 19

traditional-chinese-text-recogn-dataset

Traditional Chinese OCR text recognition dataset

Language: Python, License: Apache-2.0, Stargazers: 50, Issues: 3, Issues: 3

CV_papers_arxiv_daily

Daily feed of this day's research articles about Computer Vision published to https://arxiv.org.

Language: Python, Stargazers: 24, Issues: 3, Issues: 0

GlyphDraw2

GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models

Language: Python, License: MIT, Stargazers: 19, Issues: 0, Issues: 0