Ferdinand Mom (3outeille)

3outeille

User data from Github https://github.com/3outeille

Company:HuggingFace

Location:France

Home Page:3outeille.github.io

GitHub:@3outeille

Twitter:@FerdinandMom


Organizations
huggingface

Ferdinand Mom's repositories

Language:HTMLLicense:MITStargazers:2Issues:1Issues:0

kernel-builder

👷 Build compute kernels

Language:RustStargazers:1Issues:0Issues:0

minRF

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0

prime

prime is a framework for efficient, globally distributed training of AI models over the internet.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

License:Apache-2.0Stargazers:0Issues:0Issues:0

diloco_simple

torch implementation of diloco

Language:PythonStargazers:0Issues:0Issues:0

DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

License:MITStargazers:0Issues:0Issues:0

dust

A Nintendo DS emulator written in Rust for desktop devices and the web, with debugging features and a focus on accuracy

Language:RustLicense:GPL-3.0Stargazers:0Issues:0Issues:0

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

fms-fsdp

Demonstrate throughput of PyTorch FSDP

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

gpt-oss-recipes

Collection of scripts and notebooks for OpenAI's latest GPT OSS models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kernels

Load compute kernels from the Hub

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

nccl

Optimized primitives for collective multi-GPU communication

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

nccl-tests

NCCL Tests

Language:CudaLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

License:Apache-2.0Stargazers:0Issues:0Issues:0

picotron-deepseek

Minimalistic 4D-parallelism distributed training framework for education purpose

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

prime-rl

Decentralized RL Training at Scale

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

quack

A Quirky Assortment of CuTe Kernels

License:Apache-2.0Stargazers:0Issues:0Issues:0

ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

License:NOASSERTIONStargazers:0Issues:0Issues:0

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

veScale

A PyTorch Native LLM Training Framework

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0