zhangwei730's starred repositories
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
ERNIE-Layout-Pytorch
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
vrdu
We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datasets that represent several challenges: rich schema including diverse data types, complex templates, and diversity of layouts within a single document type.
MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
llama3-from-scratch
llama3 implementation one matrix multiplication at a time