berooo's repositories
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
awesome-document-understanding
A curated list of resources on Document Understanding (DU).
baichuan-7B
A large-scale 7B pretrained language model developed by BaiChuan-Inc.
CAN
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).
ChineseNLPCorpus
Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.
cord
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DocBank
DocBank: A Benchmark Dataset for Document Layout Analysis
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
ERNIE-Layout-Pytorch
An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.
GitHub520
:kissing_heart: Makes you "love" GitHub by fixing broken images and slow loading when you access it. (No installation required.)
GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
insightface
State-of-the-art 2D and 3D Face Analysis Project
LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
ml-cvnets
CVNets: A library for training computer vision networks
MNBVC
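MNBVC (Massive Never-ending BT Vast Chinese corpus) is an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, script lines, forum posts, wiki, classical poetry, song lyrics, product descriptions, jokes, embarrassing stories, chat logs, and more.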
nougat
Implementation of Nougat: Neural Optical Understanding for Academic Documents
open-llms
📋 A list of open LLMs available for commercial use.
open-mllms
Open LLMs for multimodal tasks
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
source-han-sans
Source Han Sans | 思源黑体 | 思源黑體 | 思源黑體 香港 | 源ノ角ゴシック | 본고딕
TabRecSet
A large-scale dataset of camera-taken images for table detection and recognition.
tabula
Tabula is a tool for liberating data tables trapped inside PDF files
UIE
Unified Structure Generation for Universal Information Extraction
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
WanJuan1.0
WanJuan 1.0 multimodal corpus