Carlos Mocholí's repositories
PyLaia-examples
A set of experiments using PyLaia on different datasets
AdventOfCode2016
My java solutions of the programming puzzles.
algorhythmHashCode
Repo to practice Google's HashCode problems
DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
faster-pytorch-blog
Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracy
ffcv
FFCV: Fast Forward Computer Vision (and other ML workloads!)
Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
lightning
Build and train PyTorch models and connect them to the ML lifecycle using Lightning App templates, without handling DIY infrastructure, cost management, scaling, and other headaches.
lightning-thunder
Source to source compiler for PyTorch. It makes PyTorch programs faster on single accelerators and distributed.
litdata
Blazingly fast, distributed streaming of training data from any cloud storage for training AI models
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Megatron-LM
Ongoing research training transformer models at scale
neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
stable-diffusion
A latent text-to-image diffusion model
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
toolbox
Essential guides and programming tools in my toolbox (with focus on ML Training)
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
xla
Enabling PyTorch on Google TPU