xcxhy's starred repositories
MindSearch
🔍 An LLM-based multi-agent framework for building a web search engine (like Perplexity.ai Pro and SearchGPT)
YOLOv10-Document-Layout-Analysis
YOLOv10 trained on the DocLayNet dataset.
PaddleDetection
Object detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking, and real-time multi-person keypoint detection.
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
DUP-ocropy
Python-based tools for document analysis and OCR
websockets
Library for building WebSocket servers and clients in Python
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
deepdoctection
A Repo For Document AI
llama_parse
Parse files for optimal RAG
pdfcrowd-python
A Python client library for the Pdfcrowd HTML to PDF API
OmniFusion
OmniFusion — a multimodal model to communicate using text and images
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output