bergwolf

Peng Tao's repositories

linux

Linux kernel source tree -- forked to track pNFS block client commits. Now used to track my patches for Lustre kernel client clean up. well, I'm using it to track my own nfs patches again... And now it is used to track my random staff...

Language:CNOASSERTION3 30

libvirt

Automatic read-only mirror of http://libvirt.org/git/?p=libvirt.git;a=summary

Language:CLGPL-2.11 20

kata-containers

Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workload isolation and security advantages of VMs. https://katacontainers.io/

Language:RustApache-2.0010

cinder

Language:PythonApache-2.0020

community

Language:PythonApache-2.0020

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++NOASSERTION010

DeepLearningSystem

Deep Learning System core principles introduction.

Language:Jupyter NotebookApache-2.0010

diod

Distributed I/O Daemon - a 9P file server

Language:CGPL-2.0020

firecracker

Secure and fast microVMs for serverless computing.

Language:RustApache-2.0020

grok-1

Grok open release

Apache-2.0000

iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

Language:C++Apache-2.0000

libfuse

The reference implementation of the Linux FUSE (Filesystem in Userspace) interface

Language:CNOASSERTION020

liburing

Language:CMIT010

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language:PythonBSD-2-Clause000

llm-inference-solutions

A collection of all available inference solutions for the LLMs

MIT000

lmquant

Language:PythonApache-2.0000

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

MIT000

moby

Moby Project - a collaborative project for the container ecosystem to assemble container-based systems

Language:GoApache-2.0020

MS-AMP

Microsoft Automatic Mixed Precision Library

MIT000

open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

Language:CNOASSERTION010

PancrePal-xiaoyibao

面向胰腺癌肿瘤患者的智能RAG平台

Language:PythonApache-2.0000

qemu

Official QEMU mirror. Please see http://wiki.qemu.org/Contribute/SubmitAPatch for how to submit changes to QEMU. Pull Requests are ignored.

Language:CNOASSERTION020

qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Language:PythonApache-2.0000

stablehlo

Backward compatible ML compute opset inspired by HLO/MHLO

Language:MLIRApache-2.0010

tag-runtime

🏃🏿‍♀️🏃🏽‍♀️🏃🏻‍♂️🕒CNCF Technical Advisory Group for Runtime

Apache-2.0000

ThunderKittens

Tile primitives for speedy kernels

MIT000

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Apache-2.0000

triton

Development repository for the Triton language and compiler

MIT000

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.0000

xla

A machine learning compiler for GPUs, CPUs, and ML accelerators

Language:C++Apache-2.0010