zhangwei730

zhangwei730

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

zhangwei730's starred repositories

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:5564Issues:0Issues:0

doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Language:PythonLicense:Apache-2.0Stargazers:3222Issues:0Issues:0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:24010Issues:0Issues:0

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Language:PythonLicense:Apache-2.0Stargazers:2460Issues:0Issues:0

SwinDocSegmenter

[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation

Language:PythonLicense:Apache-2.0Stargazers:62Issues:0Issues:0

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language:C++License:Apache-2.0Stargazers:1070Issues:0Issues:0

PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

Language:PythonStargazers:11627Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:6851Issues:0Issues:0

ERNIE-Layout-Pytorch

An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.

Language:PythonLicense:MITStargazers:94Issues:0Issues:0

vrdu

We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datasets that represent several challenges: rich schema including diverse data types, complex templates, and diversity of layouts within a single document type.

Stargazers:67Issues:0Issues:0

MegaParse

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Language:PythonLicense:Apache-2.0Stargazers:199Issues:0Issues:0

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:857Issues:0Issues:0

CDLA

CDLA: A Chinese document layout analysis (CDLA) dataset

Language:PythonStargazers:207Issues:0Issues:0

TableBank

TableBank: A Benchmark Dataset for Table Detection and Recognition

License:Apache-2.0Stargazers:979Issues:0Issues:0

DocLayNet

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

License:NOASSERTIONStargazers:190Issues:0Issues:0
Stargazers:83Issues:0Issues:0

UPOCR

Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)

Language:PythonStargazers:17Issues:0Issues:0
Language:PythonLicense:MITStargazers:46Issues:0Issues:0

Noi

🚀 Power Your World with AI - Explore, Extend, Empower.

Language:JavaScriptStargazers:5090Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:1646Issues:0Issues:0

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Language:GoLicense:MITStargazers:72537Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:59923Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:18895Issues:0Issues:0

MultimodalOCR

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Language:PythonStargazers:340Issues:0Issues:0

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language:PythonStargazers:1609Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:53834Issues:0Issues:0
Language:PythonStargazers:672Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)

Language:PythonLicense:Apache-2.0Stargazers:1537Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8520Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:10115Issues:0Issues:0