rkshuai's starred repositories
GPT-4V_Social_Media
GPT-4V(ision) as A Social Media Analysis Engine
CapsFusion
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
Dewarping-Document-Image-By-Displacement-Flow-Estimation
Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network
waveCorrection
OCR Document image deformation correction.复现阿里OCR皱巴巴文档图像形变矫正
TreeDecoder
A Tree-Structured Decoder for Image-to-Markup Generation
seq2seq-layout-analysis
end2end layout analysis based seq2seq
DBnet-lite.pytorch
A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization
Document-Image-Dewarping
Document Image Dewarping
CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
deocclusion
Code for our CVPR 2020 work.