fredchen's repositories
clip-as-service
🏄 Embed/reason/rank images and sentences with CLIP models
DeepSpeedExamples
Example models using DeepSpeed
FasterTransformer
Transformer-related optimization, including BERT, GPT
juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
k8s-client-go
Go client for Kubernetes.
k8s-examples
Kubernetes application example tutorials
lsp-kubeutil
Kubernetes development utilities
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
stable-diffusion-webui
Stable Diffusion web UI
stable-diffusion-webui-docker
Easy Docker setup for Stable Diffusion with user-friendly UI
TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
triton
Development repository for the Triton language and compiler
Qwen-7B
The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.
text-generation-inference
Large Language Model Text Generation Inference
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
volcano
A Cloud Native Batch System (Project under CNCF)