Tan Guofu's repositories
arena
A CLI for Kubeflow.
chatglm3-finetune
The easiest-to-use, zero-barrier chatglm3 & agent & langchain project
gateway
Manages Envoy Proxy as a Standalone or Kubernetes-based Application Gateway
devices
Device plugins for Volcano, e.g. GPU
frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
kserve
Standardized Serverless ML Inference Platform on Kubernetes
kuberay
A toolkit to run Ray applications on Kubernetes
KwaiAgents
A generalized information-seeking agent system with Large Language Models (LLMs).
langchain
⚡ Building applications with LLMs through composability ⚡
lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
llama.cpp
Port of Facebook's LLaMA model in C/C++
ollama-webui
ChatGPT-Style Web UI Client for Ollama 🦙
pdfs
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
prometheus-kafka-adapter
Use Kafka as a remote storage database for Prometheus (remote write only)
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Qwen-Explained
Line-by-line explanation of Qwen 14B and 7B
radondb-mysql-kubernetes
Open source, high-availability cluster based on MySQL
so-vits-svc
SoftVC VITS Singing Voice Conversion
text-generation-inference
Large Language Model Text Generation Inference
triton
Development repository for the Triton language and compiler
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs