fredchen's repositories

Language: Python | Stargazers: 0 | Issues: 0

clip-as-service

🏄 Embed/reason/rank images and sentences with CLIP models

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0

juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Language: Go | License: Apache-2.0 | Stargazers: 0 | Issues: 0

k8s-client-go

Go client for Kubernetes.

Language: Go | License: Apache-2.0 | Stargazers: 0 | Issues: 0

k8s-examples

Kubernetes application example tutorials

Language: Shell | License: Apache-2.0 | Stargazers: 0 | Issues: 0

lsp-kubeutil

Kubernetes development utilities

Language: Go | License: MIT | Stargazers: 0 | Issues: 0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0

stable-diffusion-webui-docker

Easy Docker setup for Stable Diffusion with user-friendly UI

Language: Shell | License: NOASSERTION | Stargazers: 0 | Issues: 0

TensorRT

NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0

Qwen-7B

The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.

License: NOASSERTION | Stargazers: 0 | Issues: 0

text-generation-inference

Large Language Model Text Generation Inference

License: NOASSERTION | Stargazers: 0 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

volcano

A Cloud Native Batch System (Project under CNCF)

License: Apache-2.0 | Stargazers: 0 | Issues: 0