Beast code in Giters

zhongguogu's repositories

DocParser

Language: Python, License: MIT

borderless_tbls_detection

Language: Python

detectron2

Detectron2 for Document Layout Analysis

License: Apache-2.0

Document-Layout-Analysis

Tools for extracting figures, tables, text, and more from a PDF document.


Document_QA

A simplified demo similar to ChatPDF


dothinking.github.io

Thinking and writing


easytable

Small table drawing library built upon Apache PDFBox

License: MIT

Event-Extraction

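Event extraction from legal judgment documents and its applications, including word segmentation, part-of-speech tagging, named entity recognition, event element extraction, and verdict prediction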


examples

Examples for https://github.com/therecipe/qt


go-openai

OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go

License: Apache-2.0

gpt4-pdf-chatbot-langchain

GPT4 & LangChain Chatbot for large PDF docs


HanLP

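Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text classification and clustering, pinyin conversion, Simplified-Traditional Chinese conversion, natural language processing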

License: Apache-2.0

ilovepdf

Telegram bot that helps you convert images to PDF, PDF to images, and 45+ file formats to PDF, with more features coming soon.

License: Apache-2.0

krill

Improved HTML output for Tika extraction

Language: Java, License: Apache-2.0

layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

License: Apache-2.0

Lhy_Machine_Learning

Slides and assignments for Hung-yi Lee's Spring 2021 machine learning course

Language: Jupyter Notebook

open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

License: MIT

openai-java

OpenAI GPT-3 API client in Java

License: MIT

paper-reading

Paragraph-by-paragraph close reading of classic and recent deep learning papers

License: Apache-2.0

pdf-parser

A PDF parser that can extract paragraphs, tables, and pictures

License: Apache-2.0

pdf-table

Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV

Language: Java, License: MIT

pdf-unstamper

Remove textual watermarks of any font, any encoding, and any language with pdf-unstamper now!

Language: Java, License: GPL-3.0

PDFBOX


pdfGPT

A parsing service for very long PDFs, based on the OpenAI API


PST-table

A new approach to table structure parsing (a new approach to table recognition)


py-pdf-parser

A Python tool to help extract information from structured PDFs.

Language: Python, License: MIT

SPLERGE

Deep Splitting and Merging for Table Structure Decomposition

Language: Python

tessdoc

Tesseract documentation


testarea-pdfbox2

Test area for public PDFBox v2 issues on Stack Overflow, etc.

License: Apache-2.0

wechat-chatgpt

Use ChatGPT on WeChat via wechaty
