Ilyas Moutawwakil (IlyasMoutawwakil)

IlyasMoutawwakil

Geek Repo

Company:@huggingface

Location:Paris, France

Home Page:ilyasmoutawwakil.github.io

Github PK Tool:Github PK Tool


Organizations
Chouafa
huggingface

Ilyas Moutawwakil's starred repositories

pytest-xdist

pytest plugin for distributed testing and loop-on-failures testing modes.

Language:PythonLicense:MITStargazers:1403Issues:0Issues:0

chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Language:PythonLicense:Apache-2.0Stargazers:138Issues:0Issues:0

lovely-tensors

Tensors, for human consumption

Language:Jupyter NotebookLicense:MITStargazers:1072Issues:0Issues:0

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:844Issues:0Issues:0

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49149Issues:0Issues:0

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1625Issues:0Issues:0

llm-swarm

Manage scalable open LLM inference endpoints in Slurm clusters

Language:PythonLicense:MITStargazers:193Issues:0Issues:0

docker-py

A Python library for the Docker Engine API

Language:PythonLicense:Apache-2.0Stargazers:6716Issues:0Issues:0

uv

An extremely fast Python package installer and resolver, written in Rust.

Language:RustLicense:Apache-2.0Stargazers:14690Issues:0Issues:0

rye

a Hassle-Free Python Experience

Language:RustLicense:MITStargazers:12176Issues:0Issues:0

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1759Issues:0Issues:0

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:946Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:11890Issues:0Issues:0

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language:PythonLicense:MITStargazers:471Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:4836Issues:0Issues:0

ruff

An extremely fast Python linter and code formatter, written in Rust.

Language:RustLicense:MITStargazers:28764Issues:0Issues:0

llama-cpp-python

Python bindings for llama.cpp

Language:PythonLicense:MITStargazers:7124Issues:0Issues:0

KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language:PythonStargazers:232Issues:0Issues:0

float8_experimental

This repository contains the experimental PyTorch native float8 training UX

Language:PythonLicense:BSD-3-ClauseStargazers:186Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3809Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:22281Issues:0Issues:0

amdsmi

AMD SMI

Language:C++License:MITStargazers:29Issues:0Issues:0

hqq

Official implementation of Half-Quadratic Quantization (HQQ)

Language:PythonLicense:Apache-2.0Stargazers:562Issues:0Issues:0

exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language:PythonLicense:MITStargazers:2664Issues:0Issues:0
Language:CudaLicense:MITStargazers:32Issues:0Issues:0

marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Language:PythonLicense:Apache-2.0Stargazers:443Issues:0Issues:0

codecarbon

Track emissions from Compute and recommend ways to reduce their impact on the environment.

Language:PythonLicense:MITStargazers:1008Issues:0Issues:0

optimum-habana

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Language:PythonLicense:Apache-2.0Stargazers:123Issues:0Issues:0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5352Issues:0Issues:0

gpu-benches

collection of benchmarks to measure basic GPU capabilities

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:171Issues:0Issues:0