zero is not none's starred repositories
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
kimi-free-api
🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。
PytorchOCR
基于Pytorch的OCR工具库,支持常用的文字检测和识别算法
Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
json-repair
🔧 Repair JSON!Solution for JSON Anomalies from LLMs.
attribute-control
Fine-Grained Subject-Specific Attribute Expression Control in T2I Models
Composition-Stable-Diffusion
Image Composition via Stable Diffusion
Document-Layout-Analysis
Object Detection Model for Scanned Documents
General-Documents-Layout-parser
通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser
InstructionGPT-4
InstructionGPT-4
color-clustering
A tool to perform K-means clustering analysis of the colors in an image.
Scence-Text-Recognition-With-YOLOv8-and-CRNN
This is an implementation of YOLOv8 and CRNN network for Scene Text Recognition task