xiaoymin's starred repositories

excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

Language:TypeScriptLicense:MITStargazers:74139Issues:383Issues:3243

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:58513Issues:1683Issues:2602

minio

The Object Store for AI Data Infrastructure

Language:GoLicense:AGPL-3.0Stargazers:44577Issues:614Issues:7233

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:43990Issues:391Issues:81

quill

Quill is a modern WYSIWYG editor built for compatibility and extensibility.

Language:TypeScriptLicense:BSD-3-ClauseStargazers:41632Issues:479Issues:3451

certbot

Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.

Language:PythonLicense:NOASSERTIONStargazers:30927Issues:753Issues:5280

editor.js

A block-style editor with clean JSON output

Language:TypeScriptLicense:Apache-2.0Stargazers:27069Issues:240Issues:1405

Stirling-PDF

#1 Locally hosted web application that allows you to perform various operations on PDF files

Language:JavaLicense:GPL-3.0Stargazers:26552Issues:104Issues:546

redisson

Redisson - Easy Redis Java client and Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache ...

Language:JavaLicense:Apache-2.0Stargazers:22802Issues:881Issues:5083

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonLicense:Apache-2.0Stargazers:22238Issues:305Issues:952

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:12451Issues:96Issues:715

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:11478Issues:93Issues:1006

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language:PythonLicense:MITStargazers:11260Issues:97Issues:803

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:8820Issues:46Issues:90

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:6888Issues:64Issues:67

tabula

Tabula is a tool for liberating data tables trapped inside PDF files

Language:CSSLicense:MITStargazers:6546Issues:194Issues:0

pywin32

Python for Windows (pywin32) Extensions

python-docx

Create and modify Word documents with Python

Language:PythonLicense:MITStargazers:4247Issues:148Issues:1185

canvas-editor

rich text editor by canvas/svg

Language:TypeScriptLicense:MITStargazers:3020Issues:54Issues:378

hjson

Hjson, a user interface for JSON

Language:HTMLLicense:MITStargazers:2629Issues:29Issues:107

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:Jupyter NotebookLicense:MITStargazers:2395Issues:31Issues:149

tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Language:JavaLicense:Apache-2.0Stargazers:2184Issues:97Issues:0

spring-ai

An Application Framework for AI Engineering

Language:JavaLicense:Apache-2.0Stargazers:2141Issues:58Issues:304

chardet

Python character encoding detector

Language:PythonLicense:LGPL-2.1Stargazers:2087Issues:50Issues:138

poi

Mirror of Apache POI

Language:JavaStargazers:1848Issues:77Issues:0

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language:C++License:Apache-2.0Stargazers:1013Issues:26Issues:131

xmc.dspy

In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.

Language:PythonLicense:MITStargazers:313Issues:23Issues:8