zhongguogu's repositories
detectron2
Detectron2 for Document Layout Analysis
Document-Layout-Analysis
Tools for extracting figures, tables, text, and more from a PDF document.
Document_QA
A simplified demo similar to ChatPDF
dothinking.github.io
Thinking and writing
easytable
Small table drawing library built upon Apache PDFBox
Event-Extraction
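Event extraction from legal judgment documents and its applications, including word segmentation, part-of-speech tagging, named entity recognition, event element extraction, and verdict prediction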
examples
Examples for https://github.com/therecipe/qt
go-openai
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go
gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
HanLP
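Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing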
ilovepdf
Telegram bot that helps you convert images to PDF, PDF to images, and 45+ file formats to PDF; more features coming soon.
krill
Improved HTML output for Tika extraction
layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
Lhy_Machine_Learning
Slides and assignments for Hung-yi Lee's Spring 2021 machine learning course
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
openai-java
OpenAI GPT-3 API client in Java
paper-reading
Paragraph-by-paragraph close reading of classic and recent deep learning papers
pdf-parser
A PDF parser that can extract paragraphs, tables, and pictures
pdf-table
Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV
pdf-unstamper
Remove textual watermarks of any font, any encoding, and any language with pdf-unstamper now!
pdfGPT
A parsing service for very long PDFs, built on the OpenAI API
PST-table
A new approach to table structure parsing (a new approach to table recognition)
py-pdf-parser
A Python tool to help extract information from structured PDFs.
SPLERGE
Deep Splitting and Merging for Table Structure Decomposition
tessdoc
Tesseract documentation
testarea-pdfbox2
Test area for public PDFBox v2 issues on Stack Overflow, etc.
wechat-chatgpt
Use ChatGPT on WeChat via Wechaty