huzhiguan's starred repositories

hugo-coder

A minimalist blog theme for hugo.

Language:HTMLLicense:MITStargazers:2653Issues:0Issues:0

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8436Issues:0Issues:0

Best-Practice-for-Building-A-Startup-in-Delaware-with-Tech-Tools

美国华人技术创业的快速公司/银行/税务/投资操作白皮书

License:GPL-3.0Stargazers:625Issues:0Issues:0

starship

☄🌌️ The minimal, blazing-fast, and infinitely customizable prompt for any shell!

Language:RustLicense:ISCStargazers:42934Issues:0Issues:0

DAVAR-Lab-OCR

OCR toolbox from Davar-Lab

Language:PythonLicense:Apache-2.0Stargazers:721Issues:0Issues:0

deep-text-recognition-benchmark

Text recognition (optical character recognition) with deep learning methods, ICCV 2019

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3676Issues:0Issues:0

PClean

A domain-specific probabilistic programming language for scalable Bayesian data cleaning

Language:JuliaStargazers:216Issues:0Issues:0

alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud

Language:JavaLicense:Apache-2.0Stargazers:6730Issues:0Issues:0

docnet

DocNET is as fast PDF editing and reading library for modern .NET applications

Language:C#License:MITStargazers:437Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19110Issues:0Issues:0

awesome-cs-books

🔥 经典编程书籍大全,涵盖:计算机系统与网络、系统架构、算法与数据结构、前端开发、后端开发、移动开发、数据库、测试、项目与团队、程序员职业修炼、求职面试等

Stargazers:17286Issues:0Issues:0

OCR_DataSet

收集并整理有关OCR的数据集并统一标注格式,以便实验需要

Language:PythonStargazers:850Issues:0Issues:0

ocr-open-dataset

list all open dataset about ocr.

License:Apache-2.0Stargazers:100Issues:0Issues:0

xLights

xLights is a sequencer for Lights. xLights has usb and E1.31 drivers. You can create sequences in this object oriented program. You can create playlists, schedule them, test your hardware, convert between different sequencers.

Language:C++License:GPL-3.0Stargazers:530Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:40258Issues:0Issues:0

arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

Language:C++License:Apache-2.0Stargazers:13892Issues:0Issues:0

pdffigures2

Given a scholarly PDF, extract figures, tables, captions, and section titles.

Language:ScalaLicense:Apache-2.0Stargazers:562Issues:0Issues:0

v2ray-core

A platform for building proxies to bypass network restrictions.

Language:GoLicense:MITStargazers:44987Issues:0Issues:0

grobid

A machine learning software for extracting information from scholarly documents

Language:JavaLicense:Apache-2.0Stargazers:3256Issues:0Issues:0

free-books

互联网上的免费书籍

Stargazers:14661Issues:0Issues:0

elsa-core

A .NET workflows library

Language:C#License:MITStargazers:6051Issues:0Issues:0

SwiftOCR

Fast and simple OCR library written in Swift

Language:SwiftLicense:Apache-2.0Stargazers:4605Issues:0Issues:0

camelot

A Python library to extract tabular data from PDFs

Language:PythonLicense:MITStargazers:2764Issues:0Issues:0

camelot

Camelot: PDF Table Extraction for Humans

Language:PythonLicense:NOASSERTIONStargazers:3606Issues:0Issues:0

freki

Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)

Language:PythonLicense:MITStargazers:20Issues:0Issues:0

sciencebeam-parser

A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools together to generate a full XML document.

Language:PythonLicense:MITStargazers:294Issues:0Issues:0

architect-awesome

后端架构师技术图谱

Stargazers:59445Issues:0Issues:0

zipkin4net

A .NET client library for Zipkin

Language:C#License:Apache-2.0Stargazers:341Issues:0Issues:0

AspNetCore.Diagnostics.HealthChecks

Enterprise HealthChecks for ASP.NET Core Diagnostics Package

Language:C#License:Apache-2.0Stargazers:3983Issues:0Issues:0

bloomrpc

Former GUI client for gRPC services. No longer maintained.

Language:TypeScriptLicense:LGPL-3.0Stargazers:9012Issues:0Issues:0