zhudongwork's starred repositories

Language: Jupyter Notebook | License: MIT | Stargazers: 9190 | Issues: 0

PromptCBLUE

PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the Chinese medical domain

Language: Python | Stargazers: 305 | Issues: 0

MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Language: Python | License: Apache-2.0 | Stargazers: 2838 | Issues: 0

RapidLayout

Layout analysis for Chinese and English documents

Language: Python | License: Apache-2.0 | Stargazers: 73 | Issues: 0

openai-python

The official Python library for the OpenAI API

Language: Python | License: Apache-2.0 | Stargazers: 21494 | Issues: 0

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language: Python | License: MIT | Stargazers: 1924 | Issues: 0

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Language: Python | License: Apache-2.0 | Stargazers: 2347 | Issues: 0

Language: Python | Stargazers: 1438 | Issues: 0

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8553 | Issues: 0

SegFormer

Official PyTorch implementation of SegFormer

Language: Python | License: NOASSERTION | Stargazers: 2420 | Issues: 0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | License: Apache-2.0 | Stargazers: 1628 | Issues: 0

Language: Python | Stargazers: 262 | Issues: 0

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language: Python | License: NOASSERTION | Stargazers: 1642 | Issues: 0

RAGatouille

Easily use and train state-of-the-art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease of use, backed by research.

Language: Python | License: Apache-2.0 | Stargazers: 2573 | Issues: 0

MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Language: Python | License: AGPL-3.0 | Stargazers: 6783 | Issues: 0

Streamer-Sales

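Streamer-Sales (销冠, "Sales Champion"): an LLM for livestream sales hosts 🛒🎁 that presents a product based on its given features, framed to stimulate viewers' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that looks up real-time information on the web 🌐, and ASR (speech-to-text) 🎙️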

Language: Python | License: Apache-2.0 | Stargazers: 2061 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 18560 | Issues: 0

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python | License: NOASSERTION | Stargazers: 2826 | Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 9417 | Issues: 0

Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

License: Apache-2.0 | Stargazers: 199 | Issues: 0

finetune-embedding

Fine-Tuning Embedding for RAG with Synthetic Data

Language: Jupyter Notebook | Stargazers: 445 | Issues: 0

kimi-free-api

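🚀 Reverse-engineered API for the KIMI AI long-context LLM, for free-use testing [specialty: interpreting and organizing long texts]. Supports high-speed streaming output, agent conversations, web search, long-document interpretation, image OCR, and multi-turn dialogue, with zero-configuration deployment, multi-token support, and automatic cleanup of session traces.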

Language: TypeScript | License: GPL-3.0 | Stargazers: 3512 | Issues: 0

Dive-into-OCR

“Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 193 | Issues: 0

easy-rag

A quick start to RAG and private deployment

Language: Python | Stargazers: 104 | Issues: 0

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Language: Python | License: Apache-2.0 | Stargazers: 23125 | Issues: 0

aiops24-RAG-demo

Demo for the AIOPS24 challenge

Language: Shell | Stargazers: 51 | Issues: 0

Language: Jupyter Notebook | Stargazers: 238 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 237 | Issues: 0

gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook | License: MIT | Stargazers: 500 | Issues: 0

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4246 | Issues: 0