berooo

berooo's repositories

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

License: MIT

awesome-document-understanding

A curated list of resources on the Document Understanding (DU) topic

baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

License: Apache-2.0

CAN

When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).

License: MIT

ChineseNLPCorpus

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

cord

CORD: A Consolidated Receipt Dataset for Post-OCR Parsing

License: CC-BY-4.0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

License: Apache-2.0

DocBank

DocBank: A Benchmark Dataset for Document Layout Analysis

License: Apache-2.0

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

License: Apache-2.0

ERNIE-Layout-Pytorch

An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.

License: MIT

GitHub520

:kissing_heart: Makes you “love” GitHub by fixing broken images and slow loading when you visit the site. (No installation required.)

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Language: Python, License: MIT

i-Code

License: MIT

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python, License: MIT

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

License: MIT

layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

License: Apache-2.0

LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Language: Python, License: MIT

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

ml-cvnets

CVNets: A library for training computer vision networks

Language: Python, License: NOASSERTION

MNBVC

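MNBVC (Massive Never-ending BT Vast Chinese corpus) is a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, and even text written in "Martian script" (火星文). It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki pages, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.

License: MIT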

nougat

Implementation of Nougat: Neural Optical Understanding for Academic Documents

License: MIT

open-llms

📋 A list of open LLMs available for commercial use.

License: Apache-2.0

open-mllms

Open LLMs for multimodal tasks

License: Apache-2.0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

License: Apache-2.0

source-han-sans

Source Han Sans | 思源黑体 | 思源黑體 | 思源黑體香港 | 源ノ角ゴシック | 본고딕

License: NOASSERTION

TabRecSet

A large scale camera-taken table detection and recognition dataset.

tabula

Tabula is a tool for liberating data tables trapped inside PDF files

License: MIT

UIE

Unified Structure Generation for Universal Information Extraction

Language: Python

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

License: Apache-2.0

WanJuan1.0

WanJuan 1.0 multimodal corpus

License: CC-BY-4.0