zhudongwork's starred repositories
PromptCBLUE
PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese
MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
RapidLayout
Analysis of Chinese and English layouts 中英文版面分析
openai-python
The official Python library for the OpenAI API
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️
Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
finetune-embedding
Fine-Tuning Embedding for RAG with Synthetic Data
kimi-free-api
🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。
Dive-into-OCR
“Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.
aiops24-RAG-demo
用于AIOPS24挑战赛的Demo