leo (leoChaoGlut)

leoChaoGlut

Geek Repo

Company:Alibaba

Location:HangZhou - China

Home Page:http://blog.csdn.net/lc0817

Github PK Tool:Github PK Tool

leo's starred repositories

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonLicense:MITStargazers:13891Issues:0Issues:0

gptpdf

Using GPT to parse PDF

Language:PythonLicense:MITStargazers:2564Issues:0Issues:0

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:15154Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:13347Issues:0Issues:0

omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Language:PythonLicense:GPL-3.0Stargazers:4543Issues:0Issues:0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++License:Apache-2.0Stargazers:9912Issues:0Issues:0

dataverse

The Universe of Data. All about data, data science, and data engineering

Language:PythonLicense:Apache-2.0Stargazers:472Issues:0Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:13918Issues:0Issues:0

latentbox

A collection of awesome-lists for AI, creativity and art. AI、创意和艺术领域的精选合集。https://latentbox.com

Language:TypeScriptLicense:NOASSERTIONStargazers:991Issues:0Issues:0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:1908Issues:0Issues:0

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Language:PythonLicense:MITStargazers:4289Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21057Issues:0Issues:0

ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

Language:TypeScriptLicense:MITStargazers:73636Issues:0Issues:0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonLicense:Apache-2.0Stargazers:6612Issues:0Issues:0

XuanYuan

轩辕:度小满中文金融对话大模型

Language:PythonStargazers:975Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38443Issues:0Issues:0

Llama-Chinese

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Language:PythonStargazers:13259Issues:0Issues:0

civitai

A repository of models, textual inversions, and more

Language:TypeScriptLicense:Apache-2.0Stargazers:5964Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:60097Issues:0Issues:0

tess4j

Java JNA wrapper for Tesseract OCR API

Language:JavaLicense:Apache-2.0Stargazers:1569Issues:0Issues:0

Aspose.OCR-for-Java

Aspose.OCR for Java Examples and Sample Projects

License:MITStargazers:42Issues:0Issues:0

jnativehook

Global keyboard and mouse listeners for Java.

Language:JavaLicense:NOASSERTIONStargazers:1725Issues:0Issues:0

reflections

Java runtime metadata analysis

Language:JavaLicense:WTFPLStargazers:4701Issues:0Issues:0

java-design-patterns

Design patterns implemented in Java

Language:JavaLicense:NOASSERTIONStargazers:88879Issues:0Issues:0

herddb

A JVM-embeddable Distributed Database

Language:JavaLicense:Apache-2.0Stargazers:312Issues:0Issues:0

Algorithms

A collection of algorithms and data structures

Language:JavaLicense:MITStargazers:16898Issues:0Issues:0

Java

All Algorithms implemented in Java

Language:JavaLicense:MITStargazers:57937Issues:0Issues:0

jcasbin

An authorization library that supports access control models like ACL, RBAC, ABAC in Java

Language:JavaLicense:Apache-2.0Stargazers:2370Issues:0Issues:0

OpenMLDB

OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.

Language:C++License:Apache-2.0Stargazers:1579Issues:0Issues:0

jenetics

Jenetics - Genetic Algorithm, Genetic Programming, Grammatical Evolution, Evolutionary Algorithm, and Multi-objective Optimization

Language:JavaLicense:Apache-2.0Stargazers:838Issues:0Issues:0