616 (bravet)

bravet

Geek Repo

Location:Thailand

Github PK Tool:Github PK Tool

616's starred repositories

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:17447Issues:143Issues:743

aimoneyhunter

ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English version for more insights.

novel

Notion-style WYSIWYG editor with AI-powered autocompletion.

Language:TypeScriptLicense:Apache-2.0Stargazers:12671Issues:45Issues:232

OpenSearch

🔎 Open source distributed and RESTful search engine.

Language:JavaLicense:Apache-2.0Stargazers:9609Issues:141Issues:5681

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8827Issues:63Issues:213

RTranslator

Open source real-time translation app for Android that runs locally

Language:C++License:Apache-2.0Stargazers:6583Issues:50Issues:55

pdfminer.six

Community maintained fork of pdfminer - we fathom PDF

Language:PythonLicense:MITStargazers:5848Issues:117Issues:701

pytesseract

A Python wrapper for Google Tesseract

Language:PythonLicense:Apache-2.0Stargazers:5783Issues:110Issues:362

OpenPDF

OpenPDF is a free Java library for creating and editing PDF files, with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull-requests and bugreports to this GitHub repository.

Language:JavaLicense:NOASSERTIONStargazers:3529Issues:74Issues:499

qpdf

qpdf: A content-preserving PDF document transformer

Language:C++License:Apache-2.0Stargazers:3397Issues:69Issues:720

table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

Language:PythonLicense:MITStargazers:2223Issues:39Issues:141

unstract

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

Language:PythonLicense:AGPL-3.0Stargazers:2206Issues:18Issues:21

pikepdf

A Python library for reading and writing PDF, powered by QPDF

Language:PythonLicense:MPL-2.0Stargazers:2140Issues:37Issues:427

itext-java

iText for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText can be a boon to nearly every workflow.

Language:JavaLicense:NOASSERTIONStargazers:1980Issues:90Issues:0

telegram-sms

An SMS-forwarding Robot Running on Your Android Device.

Language:JavaLicense:BSD-3-ClauseStargazers:1670Issues:29Issues:13

excalibur

A web interface to extract tabular data from PDFs

Language:HTMLLicense:MITStargazers:1566Issues:38Issues:129

StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Language:PythonLicense:MITStargazers:885Issues:12Issues:13
Language:PythonLicense:Apache-2.0Stargazers:504Issues:23Issues:117

boxable

Boxable is a library that can be used to easily create tables in pdf documents.

Language:JavaLicense:Apache-2.0Stargazers:331Issues:24Issues:212

fast-retry

高性能百万级任务重试框架

Language:JavaLicense:Apache-2.0Stargazers:114Issues:2Issues:2

unstructured-python-client

A Python client for the Unstructured hosted API

Language:PythonLicense:MITStargazers:79Issues:18Issues:27

ph-pdf-layout

Java library for creating fluid page layouts with Apache PDFBox. Supporting multi-page tables, different page layouts etc.

Language:JavaLicense:Apache-2.0Stargazers:60Issues:11Issues:35

unstructured.PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:29Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3Issues:2Issues:0

unstructured.pytesseract

A Python wrapper for Google Tesseract

Language:PythonLicense:Apache-2.0Stargazers:3Issues:1Issues:0

rpa-xvfb

RPA Framework in Docker, with xvfb

Language:RobotFrameworkStargazers:2Issues:1Issues:0

base-images

Store Dockerfiles and Packer configs for images to use as a base to build upon

Language:ShellLicense:Apache-2.0Stargazers:2Issues:3Issues:1

unstructured.Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

Language:C++License:Apache-2.0Stargazers:2Issues:2Issues:0