maminge's starred repositories

mindsdb

The platform for building AI from enterprise data

Language: Python | License: NOASSERTION | Stargazers: 26007 | Issues: 0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT | Stargazers: 14728 | Issues: 0

ImHex

🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.

Language: C++ | License: GPL-2.0 | Stargazers: 42282 | Issues: 0

mdout

A command-line tool written in Go that converts Markdown to PDF, based on headless Chrome; simple, reliable, easy to install, customizable, and easy to extend

Language: Go | License: Apache-2.0 | Stargazers: 86 | Issues: 0

marker-api

Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.

Language: Python | License: GPL-3.0 | Stargazers: 658 | Issues: 0

omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Language: Python | License: GPL-3.0 | Stargazers: 4683 | Issues: 0

grobid

A machine learning software for extracting information from scholarly documents

Language: Java | License: Apache-2.0 | Stargazers: 3343 | Issues: 0

javapdf

🍣 100 Java e-books: technical books in PDF (take pride in downloading and reading them, not in merely liking and bookmarking)

Stargazers: 2024 | Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 9481 | Issues: 0

meilisearch

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

Language: Rust | License: MIT | Stargazers: 45896 | Issues: 0

Stirling-PDF

#1 Locally hosted web application that allows you to perform various operations on PDF files

Language: Java | License: GPL-3.0 | Stargazers: 37831 | Issues: 0

Awesome-Chinese-LLM

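A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are relatively cheap to train, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.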

Stargazers: 14144 | Issues: 0

Warp

Warp is a modern, Rust-based terminal with AI built in so you and your team can build great software, faster.

License: NOASSERTION | Stargazers: 20615 | Issues: 0

chat-ollama

ChatOllama is an open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management.

Language: TypeScript | License: MIT | Stargazers: 2502 | Issues: 0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ | License: MIT | Stargazers: 7771 | Issues: 0

QAnything

Question and Answer based on Anything.

Language: Python | License: Apache-2.0 | Stargazers: 11048 | Issues: 0

drawio-desktop

Official electron build of draw.io

Language: JavaScript | License: Apache-2.0 | Stargazers: 49105 | Issues: 0

milvus

A cloud-native vector database, storage for next generation AI applications

Language: Go | License: Apache-2.0 | Stargazers: 28869 | Issues: 0

marker

Convert PDF to markdown quickly with high accuracy

Language: Python | License: GPL-3.0 | Stargazers: 15485 | Issues: 0

deepdoctection

A Repo For Document AI

Language: Python | License: Apache-2.0 | Stargazers: 2428 | Issues: 0

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 29221 | Issues: 0

OpenCLaP

Open Chinese Language Pre-trained Model Zoo

License: MIT | Stargazers: 978 | Issues: 0

text2vec

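text2vec, text to vector. A text vector representation tool that converts text into vector matrices; it implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.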

Language: Python | License: Apache-2.0 | Stargazers: 4340 | Issues: 0

stop-words

List of common stop words in various languages.

License: CC-BY-4.0 | Stargazers: 317 | Issues: 0

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | Stargazers: 90690 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 130539 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 14069 | Issues: 0

pdfminer.six

Community maintained fork of pdfminer - we fathom PDF

Language: Python | License: MIT | Stargazers: 5737 | Issues: 0

pdfminer

Python PDF Parser (Not actively maintained). Check out pdfminer.six.

Language: Python | License: MIT | Stargazers: 5236 | Issues: 0

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language: Python | License: MIT | Stargazers: 29482 | Issues: 0