bjtuln

bjtuln's starred repositories

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3052 | Issues: 0

ComfyUI

The most powerful and modular Stable Diffusion GUI, API, and backend with a graph/nodes interface.

Language: Python | License: GPL-3.0 | Stargazers: 39218 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 18086 | Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 23604 | Issues: 0

clip-interrogator

Image to prompt with BLIP and CLIP

Language: Python | License: MIT | Stargazers: 2549 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9091 | Issues: 0

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

Language: Python | Stargazers: 1870 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 4385 | Issues: 0

CV_papers_arxiv_daily

Daily feed of the day's Computer Vision research articles published on https://arxiv.org.

Language: Python | Stargazers: 18 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 380 | Issues: 0

HI-Diff

PyTorch code for our NeurIPS 2023 paper "Hierarchical Integration Diffusion Model for Realistic Image Deblurring"

Language: Python | License: Apache-2.0 | Stargazers: 129 | Issues: 0

MyArxiv

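A personalized, customizable arXiv template for effectively following relevant content, authors, and academic conferences in specific fields.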

Language: CSS | License: GPL-2.0 | Stargazers: 210 | Issues: 0

NeRCo

[ICCV 2023] Implicit Neural Representation for Cooperative Low-light Image Enhancement

Language: Python | Stargazers: 211 | Issues: 0

parseq

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

Language: Python | License: Apache-2.0 | Stargazers: 514 | Issues: 0

paper-reading

Paragraph-by-paragraph close reading of classic and new deep learning papers

License: Apache-2.0 | Stargazers: 24769 | Issues: 0

Restormer

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Language: Python | License: NOASSERTION | Stargazers: 1608 | Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 134187 | Issues: 0

Language: Python | License: NOASSERTION | Stargazers: 34527 | Issues: 0

BELLE

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

Language: HTML | License: Apache-2.0 | Stargazers: 7665 | Issues: 0

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Language: Python | License: Apache-2.0 | Stargazers: 4162 | Issues: 0

Language: Python | License: MIT | Stargazers: 91 | Issues: 0

china_area

2024 ** national five-level administrative divisions (province, city, county, town, village)

License: GPL-3.0 | Stargazers: 1931 | Issues: 0

traditional-chinese-text-recogn-dataset

Traditional Chinese OCR text recognition dataset

Language: Python | License: Apache-2.0 | Stargazers: 49 | Issues: 0

synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

Language: Python | License: MIT | Stargazers: 441 | Issues: 0

TextRecognitionDataGenerator

A synthetic data generator for text recognition

Language: Python | License: MIT | Stargazers: 3115 | Issues: 0

maxim-pytorch

[CVPR 2022 Oral] PyTorch re-implementation for "MAXIM: Multi-Axis MLP for Image Processing", with *training code*. Official Jax repo: https://github.com/google-research/maxim

Language: Python | License: Apache-2.0 | Stargazers: 166 | Issues: 0

DocTr

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Language: Python | License: MIT | Stargazers: 337 | Issues: 0

code-server

VS Code in the browser

Language: TypeScript | License: MIT | Stargazers: 66237 | Issues: 0

FGVC-PIM

PyTorch implementation of "A Novel Plug-in Module for Fine-Grained Visual Classification", targeting the fine-grained visual classification task.

Language: Python | License: MIT | Stargazers: 181 | Issues: 0