xiaoymin's starred repositories

excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

Language:TypeScriptLicense:MITStargazers:75440Issues:389Issues:3297

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:59018Issues:1681Issues:2609

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:46963Issues:417Issues:86

minio

The Object Store for AI Data Infrastructure

Language:GoLicense:AGPL-3.0Stargazers:44947Issues:618Issues:7270

quill

Quill is a modern WYSIWYG editor built for compatibility and extensibility.

Language:TypeScriptLicense:BSD-3-ClauseStargazers:41713Issues:481Issues:3458

certbot

Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.

Language:PythonLicense:NOASSERTIONStargazers:31016Issues:754Issues:5285

Stirling-PDF

#1 Locally hosted web application that allows you to perform various operations on PDF files

Language:JavaLicense:GPL-3.0Stargazers:28549Issues:104Issues:628

editor.js

A block-style editor with clean JSON output

Language:TypeScriptLicense:Apache-2.0Stargazers:27313Issues:243Issues:1414

redisson

Redisson - Easy Valkey/Redis Java client and Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache ...

Language:JavaLicense:Apache-2.0Stargazers:22878Issues:881Issues:5113

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonLicense:Apache-2.0Stargazers:22516Issues:307Issues:959

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:12785Issues:98Issues:738

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:11992Issues:50Issues:125

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:11987Issues:96Issues:1018

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language:PythonLicense:MITStargazers:11852Issues:107Issues:852

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:8579Issues:73Issues:86

tabula

Tabula is a tool for liberating data tables trapped inside PDF files

Language:CSSLicense:MITStargazers:6570Issues:194Issues:0

pywin32

Python for Windows (pywin32) Extensions

python-docx

Create and modify Word documents with Python

Language:PythonLicense:MITStargazers:4293Issues:147Issues:1194

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:3407Issues:37Issues:220

canvas-editor

rich text editor by canvas/svg

Language:TypeScriptLicense:MITStargazers:3103Issues:56Issues:435

hjson

Hjson, a user interface for JSON

Language:HTMLLicense:MITStargazers:2629Issues:29Issues:107

spring-ai

An Application Framework for AI Engineering

Language:JavaLicense:Apache-2.0Stargazers:2320Issues:59Issues:349

tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Language:JavaLicense:Apache-2.0Stargazers:2223Issues:99Issues:0

chardet

Python character encoding detector

Language:PythonLicense:LGPL-2.1Stargazers:2105Issues:50Issues:138

poi

Mirror of Apache POI

Language:JavaStargazers:1865Issues:77Issues:0

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language:C++License:Apache-2.0Stargazers:1070Issues:26Issues:139

xmc.dspy

In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.

Language:PythonLicense:MITStargazers:318Issues:23Issues:8