rkshuai's repositories
chromium_org
android5.0的chromium源码
TIES_DataGeneration
Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019)
Awesome-LLMs-on-device
Awesome LLMs on Device: A Comprehensive Survey
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
CapsFusion
CapsFusion: Rethinking Image-Text Data at Scale
chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & more LLMs
Dewarping-Document-Image-By-Displacement-Flow-Estimation
Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network
DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
llama.cpp
Port of Facebook's LLaMA model in C/C++
minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
MMBench
Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"
seq2seq-ocr-analysis
end2end layout analysis based seq2seq
stable-diffusion
A latent text-to-image diffusion model
stable-diffusion-webui
Stable Diffusion web UI
stablediffusion-infinity
Outpainting with Stable Diffusion on an infinite canvas
TaiSu
TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)
Text2Poster-ICASSP-22
Official implementation of the ICASSP-2022 paper "Text2Poster: Laying Out Stylized Texts on Retrieved Images"
torch-fidelity
High-fidelity performance metrics for generative models in PyTorch
TreeDecoder
A Tree-Structured Decoder for Image-to-Markup Generation
VisCPM
Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
visual-chatgpt
VisualChatGPT
waveCorrection
OCR Document image deformation correction.复现阿里OCR皱巴巴文档图像形变矫正