lipengyue's starred repositories
cuOpt-Resources
A collection of NVIDIA cuOpt samples and other resources
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
gpu-operator
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
clash-for-linux-backup
A Clash for Linux backup repository based on Clash Core
clash-for-AutoDL
A proxy setup adapted for AutoDL platform servers, using Clash as the proxy tool
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to run inference with any open-source language, speech-recognition, or multimodal model, whether in the cloud, on-premises, or even on your laptop.
ColossalAI
Making large AI models cheaper, faster and more accessible
transformers-benchmarks
Measure real Transformer TeraFLOPS on various GPUs
generative-ai-for-beginners
18 lessons to get started building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
k8s-device-plugin
NVIDIA device plugin for Kubernetes
Megatron-LM
Ongoing research training transformer models at scale
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A, plus a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama3 for WhatsApp & Messenger.
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Langchain-Chatchat
Langchain-Chatchat (formerly langchain-ChatGLM): a local-knowledge-based RAG and Agent application built on Langchain with LLMs such as ChatGLM, Qwen, and Llama
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
kubesphere
The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️
drawio-desktop
Official electron build of draw.io
LLM-quickstart
Quick Start for Large Language Models (Theoretical Learning and Practical Fine-tuning)