rkshuai's starred repositories

GPT-4V_Social_Media

GPT-4V(ision) as A Social Media Analysis Engine

Stargazers:27Issues:0Issues:0

CapsFusion

[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale

Language:PythonStargazers:173Issues:0Issues:0

MMBench

Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"

License:Apache-2.0Stargazers:96Issues:0Issues:0

BlueLM

BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab

Language:PythonLicense:NOASSERTIONStargazers:775Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:57879Issues:0Issues:0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License:MITStargazers:5486Issues:0Issues:0

minigpt4.cpp

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

Language:C++License:MITStargazers:547Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2866Issues:0Issues:0

SuperCLUE

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

Stargazers:2632Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:34519Issues:0Issues:0

DocTr

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Language:PythonLicense:MITStargazers:334Issues:0Issues:0

yapf

A formatter for Python files

Language:PythonLicense:Apache-2.0Stargazers:13665Issues:0Issues:0

movenet

Un-official implementation of MoveNet from Google

Language:PythonLicense:MITStargazers:98Issues:0Issues:0

Dewarping-Document-Image-By-Displacement-Flow-Estimation

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network

Language:PythonLicense:MITStargazers:152Issues:0Issues:0

waveCorrection

OCR Document image deformation correction.复现阿里OCR皱巴巴文档图像形变矫正

Language:PythonStargazers:82Issues:0Issues:0

TreeDecoder

A Tree-Structured Decoder for Image-to-Markup Generation

Language:PythonStargazers:92Issues:0Issues:0

seq2seq-layout-analysis

end2end layout analysis based seq2seq

Language:PythonStargazers:132Issues:0Issues:0

DBnet-lite.pytorch

A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization

Language:PythonStargazers:105Issues:0Issues:0

Document-Image-Dewarping

Document Image Dewarping

Stargazers:305Issues:0Issues:0

cddod

Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020"

Language:PythonLicense:MITStargazers:44Issues:0Issues:0
Language:C++Stargazers:250Issues:0Issues:0

Table-OCR

Recognize tables from images and restore them into word.

Language:C++License:GPL-3.0Stargazers:268Issues:0Issues:0

qaida

Large scale font independent printed Urdu text data set

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:49Issues:0Issues:0

CVPR2024-Papers-with-Code

CVPR 2024 论文和开源项目合集

Stargazers:16251Issues:0Issues:0

deocclusion

Code for our CVPR 2020 work.

Language:PythonLicense:Apache-2.0Stargazers:771Issues:0Issues:0

AdelaiDet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Language:PythonLicense:NOASSERTIONStargazers:3327Issues:0Issues:0

DocProj

Document Rectification and Illumination Correction using a Patch-based CNN

Language:PythonLicense:MITStargazers:314Issues:0Issues:0

LaTeX_OCR

:gem: 数学公式识别 Math Formula OCR

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:468Issues:0Issues:0

E2E-MLT

E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text

Language:C++License:MITStargazers:289Issues:0Issues:0

DB

A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".

Language:PythonStargazers:2022Issues:0Issues:0