LiuFeng's repositories
Awesome-LLM-Inference
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
calm
C(UDA) accelerated language model inference
CrownLabs
Kubernetes-based Remote Laboratories
cube-studio
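cube studio is an open-source, cloud-native, one-stop machine learning/deep learning AI platform. It supports SSO login, multi-tenancy and multiple project groups, data asset integration, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU, multi-cluster scheduling, edge computing, serverless, a labeling platform, automated labeling, dataset management, one-click fine-tuning of large models, LLMOps, private knowledge bases, and an AI app store. It also supports one-click model development/inference/fine-tuning, private deployment, Chinese domestic CPU/GPU/NPU chips, RDMA, and distributed pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano.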
cursusdb
Highly performant, secure-by-default, in-memory, distributed document-oriented database with an SQL-like query language, written in pure Go.
dashboard
General purpose dashboard for Dapr
fiftyone-docs-search
Search docs.voxel51.com with an LLM!
grpcbalance
grpc-go load balancing
HAMi
The OpenAIOS vGPU scheduler for Kubernetes originated from the OpenAIOS project and virtualizes GPU device memory.
kilo
Kilo is a multi-cloud network overlay built on WireGuard and designed for Kubernetes (k8s + wg = kg)
lakeFS
lakeFS - Data version control for your data lake | Git for data
limiters
Golang rate limiters for distributed applications
litefs
FUSE-based file system for replicating SQLite databases across a cluster of machines
llama-inference
experiments with inference on llama
llm-action
This project aims to share the technical principles behind large models along with hands-on experience.
my-ielts
IELTS vocabulary (雅思词汇真经), IELTS grammar, Listening 179, Reading 538 synonym substitutions, and more. Everything from preparing for my IELTS exam.
omnivore
Omnivore is a complete, open source read-it-later solution for people who like reading.
openai-benchmark
OpenAI benchmarking tool
qcloud-documents
Official Tencent Cloud documentation
roop
one-click face swap
skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
spegel
Stateless cluster local OCI registry mirror.
test-infra
Test infrastructure for the Kubernetes project.
TigerBot
TigerBot: A multi-language multi-task LLM
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
zot
zot - A production-ready vendor-neutral OCI-native container image/artifact registry (purely based on OCI Distribution Specification)