rkshuai's repositories

chromium_org

android5.0的chromium源码

TIES_DataGeneration

Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019)

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers:0Issues:0Issues:0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License:MITStargazers:0Issues:0Issues:0

Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

CapsFusion

CapsFusion: Rethinking Image-Text Data at Scale

Stargazers:0Issues:0Issues:0

chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B & more LLMs

Language:C++License:MITStargazers:0Issues:0Issues:0

Dewarping-Document-Image-By-Displacement-Flow-Estimation

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

DocTr

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

License:Apache-2.0Stargazers:0Issues:0Issues:0

llama.cpp

Port of Facebook's LLaMA model in C/C++

Language:CLicense:MITStargazers:0Issues:0Issues:0

minigpt4.cpp

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

Language:C++License:MITStargazers:0Issues:0Issues:0

MMBench

Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"

License:Apache-2.0Stargazers:0Issues:0Issues:0

movenet

Un-official implementation of MoveNet from Google

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

seq2seq-ocr-analysis

end2end layout analysis based seq2seq

Language:PythonStargazers:0Issues:1Issues:0

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

stablediffusion-infinity

Outpainting with Stable Diffusion on an infinite canvas

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

TaiSu

TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Text2Poster-ICASSP-22

Official implementation of the ICASSP-2022 paper "Text2Poster: Laying Out Stylized Texts on Retrieved Images"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

torch-fidelity

High-fidelity performance metrics for generative models in PyTorch

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

TreeDecoder

A Tree-Structured Decoder for Image-to-Markup Generation

Language:PythonStargazers:0Issues:1Issues:0

VisCPM

Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Language:PythonStargazers:0Issues:0Issues:0

visual-chatgpt

VisualChatGPT

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

waveCorrection

OCR Document image deformation correction.复现阿里OCR皱巴巴文档图像形变矫正

Language:PythonStargazers:0Issues:1Issues:0

yapf

A formatter for Python files

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0