Antonin Stefanutti's repositories
scratch-node
Distroless Node.js Docker Images
metrics-aspectj
AspectJ integration for Dropwizard Metrics
metrics-cdi
CDI extension for Dropwizard Metrics
further-cdi
Going further with CDI presentation
spring-boot-camel-rest-jpa
Apache Camel REST / JPA Spring Boot example
custom-metrics-apiserver
Framework for implementing custom metrics support for Kubernetes
kube-schedulers
A performance workbench for Kubernetes batch schedulers and queue managers
kubernetes
Production-Grade Container Scheduling and Management
accelerate
A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
internal-acls
Repository used to maintain group ACLs used by Kubeflow developers
jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
kubeflow-website
Kubeflow Website
kuberay-llm-tuning
Fine Tuning LLMs with Ray on Kubernetes
Liger-Kernel
Efficient Triton Kernels for LLM Training
lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
nfs-server-alpine
A handy Alpine Linux based NFS Server image running NFS v4 only, over TCP on port 2049
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
sdk
Kubeflow SDK for ML Experience
training-operator
Distributed ML Training and Fine-Tuning on Kubernetes
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
tutorials
PyTorch tutorials.
warp
A Python framework for high performance GPU simulation and graphics