AI-Mou's repositories
litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
R2R
A framework for rapid development and deployment of production-ready RAG systems
super-rag
Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.
phospho
Text analytics for LLM apps. PostHog for prompts. Extract evaluations, intents and events from text messages. phospho leverages LLMs (OpenAI, MistralAI, Ollama, etc.)
lm-evaluation-harness
A framework for few-shot evaluation of language models.
AIlice
A lightweight AI Agent
so-large-lm
Theoretical foundations of large models
llm-on-openshift
Resources, demos, recipes, and more for working with LLMs on OpenShift with OpenShift AI or Open Data Hub.
bgpt
Beyond Language Models: Byte Models are Digital World Simulators
RAG-Survey
A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
agentscope
AgentScope: A Flexible yet Robust Multi-Agent Platform
promptbench
A unified evaluation framework for large language models
Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
llm-practical-test
A test of how different SOTA open-source models perform on various everyday tasks (mainly code generation).
ActiveRAG
This is the code repo for our paper "Revealing the Treasures of Knowledge via Active Learning".
datatunerx
Large language model fine-tuning capabilities based on cloud native and distributed computing.
LLMLingua
To speed up LLM inference and improve the model's perception of key information, LLMLingua compresses the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.
space
Unified storage framework for the entire machine learning lifecycle
minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
phidata
Build AI Assistants using function calling
juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
chat-ui
Open source codebase powering the HuggingChat app
chat_templates
Chat Templates for HuggingFace Large Language Models
ai-hub
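AI Hub is a service designed to provide access to multiple large language models, including ChatGPT, Baichuan, Zhipu, Hunyuan, MiniMax, and Moonshot. It aims to accumulate and manage effective model-calling prompts and to continuously test and evaluate these large language models.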
OpenCopilot
🤖 🔥 Siri, but for your own product. Ship an AI copilot for your product in minutes.
MiniCPM
MiniCPM-2B: An end-side LLM that outperforms Llama2-13B.
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.