zhudongwork's starred repositories

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment across server, mobile, embedded and IoT devices)

Language: Python | License: Apache-2.0 | Stargazers: 40001 | Issues: 430 | Issues: 9074

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | Stargazers: 35117 | Issues: 280 | Issues: 2394

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | Stargazers: 34149 | Issues: 1065 | Issues: 1814

llama_index

LlamaIndex is a data framework for your LLM applications

Language: Python | License: MIT | Stargazers: 32994 | Issues: 235 | Issues: 4301

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Language: Python | License: Apache-2.0 | Stargazers: 22684 | Issues: 308 | Issues: 962

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 17708 | Issues: 158 | Issues: 1363

QAnything

Question and Answer based on Anything.

Language: Python | License: Apache-2.0 | Stargazers: 10396 | Issues: 94 | Issues: 319

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 9944 | Issues: 63 | Issues: 638

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 8872 | Issues: 74 | Issues: 90

vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Language: Python | License: MIT | Stargazers: 8779 | Issues: 56 | Issues: 237

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 3937 | Issues: 48 | Issues: 248

llama-hub

A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain

Language: Jupyter Notebook | License: MIT | Stargazers: 3421 | Issues: 46 | Issues: 212

inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language: Python | License: Apache-2.0 | Stargazers: 3326 | Issues: 36 | Issues: 855

kimi-free-api

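🚀 Reverse-engineered API for the KIMI AI long-text large language model, for free testing (specialty: interpreting and organizing long texts). Supports high-speed streaming output, agent conversations, web search, long-document interpretation, image OCR, and multi-turn dialogue, with zero-configuration deployment, multi-token support, and automatic cleanup of session traces.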

Language: TypeScript | License: GPL-3.0 | Stargazers: 3235 | Issues: 27 | Issues: 98

dataherald

Interact with your SQL database, Natural Language to SQL using LLMs

Language: Python | License: Apache-2.0 | Stargazers: 3188 | Issues: 24 | Issues: 40

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python | License: NOASSERTION | Stargazers: 2432 | Issues: 28 | Issues: 203

swift

ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Phi3-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 2075 | Issues: 19 | Issues: 570

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Language: Python | License: MIT | Stargazers: 1879 | Issues: 22 | Issues: 246

Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.

spider

Scripts and baselines for Spider: Yale complex and cross-domain semantic parsing and text-to-SQL challenge

Language: Python | License: Apache-2.0 | Stargazers: 746 | Issues: 29 | Issues: 93

gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook | License: MIT | Stargazers: 467 | Issues: 8 | Issues: 35

finetune-embedding

Fine-Tuning Embedding for RAG with Synthetic Data

Language: Jupyter Notebook | Stargazers: 430 | Issues: 4 | Issues: 5
Language: Python | License: Apache-2.0 | Stargazers: 210 | Issues: 4 | Issues: 16

Dive-into-OCR

“Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 175 | Issues: 3 | Issues: 1

Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

easy-rag

A quick start to RAG and private deployment

Language: Python | Stargazers: 94 | Issues: 0 | Issues: 0

aiops24-RAG-demo

Demo for the AIOPS24 challenge

Language: Shell | Stargazers: 48 | Issues: 0 | Issues: 8