LiuFeng (imneov)

imneov

Geek Repo

Location:wuhan

Github PK Tool:Github PK Tool

LiuFeng's repositories

3k

3-k platform is for training LLMs

License:AGPL-3.0Stargazers:0Issues:0Issues:0
Language:GoStargazers:0Issues:1Issues:0

aistore

AIStore: scalable storage for AI applications

Language:GoLicense:MITStargazers:0Issues:0Issues:0

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

BentoML

The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0

calm

C(UDA) accelerated language model inference

Language:CLicense:MITStargazers:0Issues:1Issues:0

container-image-csi-driver

Kubernetes CSI driver for mounting image

Language:GoLicense:MITStargazers:0Issues:0Issues:0

dapr-cli

Command-line tools for Dapr.

Language:GoLicense:Apache-2.0Stargazers:0Issues:1Issues:0

duckdb

DuckDB is an in-process SQL OLAP Database Management System

License:MITStargazers:0Issues:0Issues:0

guidance

A guidance language for controlling large language models.

License:MITStargazers:0Issues:0Issues:0

HAMi

OpenAIOS vGPU scheduler for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory.

Language:GoLicense:Apache-2.0Stargazers:0Issues:1Issues:0

incus

Powerful system container and virtual machine manager

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kubeedge

Kubernetes Native Edge Computing Framework (project under CNCF)

Language:GoLicense:Apache-2.0Stargazers:0Issues:1Issues:0

LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ModelGuard

Guarding your models, Empowering your edge

License:Apache-2.0Stargazers:0Issues:0Issues:0

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Language:GoLicense:MITStargazers:0Issues:0Issues:0

openai-benchmark

OpenAI benchmarking tool

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

oscar

[mirror] Open source contributor agent architecture repo.

Language:GoLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

prompt-in-context-learning

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

License:MITStargazers:0Issues:0Issues:0

qcloud-documents

腾讯云官方文档

Language:HTMLStargazers:0Issues:1Issues:0

regclient

Docker and OCI Registry Client in Go and tooling using those libraries.

License:Apache-2.0Stargazers:0Issues:0Issues:0

spegel

Stateless cluster local OCI registry mirror.

Language:GoLicense:MITStargazers:0Issues:0Issues:0

TigerBot

TigerBot: A multi-language multi-task LLM

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

tower

Proxy for multiple Kubernetes cluster communication

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ttl.sh

An anonymous & ephemeral Docker image registry

License:Apache-2.0Stargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

zot

zot - A production-ready vendor-neutral OCI-native container image/artifact registry (purely based on OCI Distribution Specification)

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0