zero is not none's starred repositories
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
kimi-free-api
🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。
Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
gligen-gui
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
PytorchOCR
基于Pytorch的OCR工具库,支持常用的文字检测和识别算法
Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
attribute-control
Fine-Grained Subject-Specific Attribute Expression Control in T2I Models
json-repair
🔧 Repair JSON!Solution for JSON Anomalies from LLMs.
Composition-Stable-Diffusion
Image Composition via Stable Diffusion
Document-Layout-Analysis
Object Detection Model for Scanned Documents
General-Documents-Layout-parser
通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser
color-clustering
A tool to perform K-means clustering analysis of the colors in an image.
Scence-Text-Recognition-With-YOLOv8-and-CRNN
This is an implementation of YOLOv8 and CRNN network for Scene Text Recognition task