AI-Mou's repositories

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

NOASSERTION

R2R

A framework for rapid development and deployment of production-ready RAG systems

MIT

super-rag

Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.

MIT

phospho

Text analytics for LLM apps. PostHog for prompts. Extract evaluations, intents and events from text messages. phospho leverages LLMs (OpenAI, MistralAI, Ollama, etc.)

Apache-2.0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

MIT

llm-on-openshift

Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.

Apache-2.0

bgpt

Beyond Language Models: Byte Models are Digital World Simulators

MIT

RAG-Survey

Collecting awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

agentscope

AgentScope: A Flexible yet Robust Multi-Agent Platform

Apache-2.0

promptbench

A unified evaluation framework for large language models

MIT

Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

llm-practical-test

A test of how different SOTA open-source models perform on various everyday tasks (mainly code generation).

ActiveRAG

This is the code repo for our paper "Revealing the Treasures of Knowledge via Active Learning".

MIT

datatunerx

Large language model fine-tuning capabilities based on cloud native and distributed computing.

Apache-2.0

LLMLingua

To speed up LLM inference and improve the model's perception of key information, LLMLingua compresses the prompt and KV cache, achieving up to 20x compression with minimal performance loss.

MIT

LWM

Apache-2.0

space

Unified storage framework for the entire machine learning lifecycle

Apache-2.0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

MIT

phidata

Build AI Assistants using function calling

MPL-2.0

juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Apache-2.0

lighteval

ppl.nn.llm

Apache-2.0

chat-ui

Open source codebase powering the HuggingChat app

Apache-2.0

chat_templates

Chat Templates for HuggingFace Large Language Models

ai-hub

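AI Hub is a service designed to connect to a variety of large language models, including ChatGPT, Baichuan, Zhipu, Hunyuan, MiniMax, and Moonshot. It aims to accumulate and manage effective model-calling prompts, and to continuously test and evaluate these large language models.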

Apache-2.0

OpenCopilot

🤖 🔥 Siri, but for your own product. Ship an AI copilot for your product in minutes.

MIT

MiniCPM

MiniCPM-2B: An end-side LLM that outperforms Llama2-13B.

Apache-2.0

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Apache-2.0

AIlice

so-large-lm