bjtuln's starred repositories
clip-interrogator
Image to prompt with BLIP and CLIP
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
CV_papers_arxiv_daily
Daily feed of this day's research articles about Computer Vision published to https://arxiv.org.
paper-reading
深度学习经典、新论文逐段精读
stable-diffusion-webui
Stable Diffusion web UI
china_area
2024年**全国5级行政区划(省、市、县、镇、村)
traditional-chinese-text-recogn-dataset
繁體中文OCR文字識別數據集
synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
TextRecognitionDataGenerator
A synthetic data generator for text recognition
maxim-pytorch
[CVPR 2022 Oral] PyTorch re-implementation for "MAXIM: Multi-Axis MLP for Image Processing", with *training code*. Official Jax repo: https://github.com/google-research/maxim
code-server
VS Code in the browser