Ilyas Moutawwakil (IlyasMoutawwakil)

IlyasMoutawwakil

Geek Repo

Company:@huggingface

Location:Paris, France

Home Page:ilyasmoutawwakil.github.io

Github PK Tool:Github PK Tool


Organizations
Chouafa
huggingface

Ilyas Moutawwakil's starred repositories

privateGPT

Interact with your documents using the power of GPT, 100% privately, no data leaks

Language:PythonLicense:Apache-2.0Stargazers:49730Issues:443Issues:992

uBlock

uBlock Origin - An efficient blocker for Chromium and Firefox. Fast and lean.

Language:JavaScriptLicense:GPL-3.0Stargazers:44283Issues:905Issues:3445

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

Language:LLVMLicense:NOASSERTIONStargazers:26537Issues:595Issues:71744

onnx

Open standard for machine learning interoperability

Language:PythonLicense:Apache-2.0Stargazers:17150Issues:436Issues:2733

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14750Issues:107Issues:939

candle

Minimalist ML framework for Rust

Language:RustLicense:Apache-2.0Stargazers:14164Issues:150Issues:598

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8373Issues:69Issues:193

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8277Issues:100Issues:1157

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7251Issues:84Issues:1488
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6871Issues:62Issues:175

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookLicense:MITStargazers:5277Issues:29Issues:27

diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3299Issues:101Issues:22

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:3176Issues:30Issues:327

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language:PythonLicense:MITStargazers:3164Issues:35Issues:352

TensorRT

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Language:PythonLicense:BSD-3-ClauseStargazers:2400Issues:69Issues:1418

text-embeddings-inference

A blazing fast inference solution for text embeddings models

Language:RustLicense:Apache-2.0Stargazers:2206Issues:27Issues:177

docquery

An easy way to extract information from documents

Language:PythonLicense:MITStargazers:1669Issues:24Issues:46

mteb

MTEB: Massive Text Embedding Benchmark

Language:PythonLicense:Apache-2.0Stargazers:1589Issues:8Issues:305

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language:C++License:Apache-2.0Stargazers:1576Issues:31Issues:605

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Language:PythonLicense:MITStargazers:1371Issues:11Issues:311

llm-vscode

LLM powered development for VSCode

Language:TypeScriptLicense:Apache-2.0Stargazers:1162Issues:22Issues:80

cuda-python

CUDA Python Low-level Bindings

Language:PythonLicense:NOASSERTIONStargazers:801Issues:30Issues:61

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language:PythonLicense:Apache-2.0Stargazers:645Issues:12Issues:29

quanto

A pytorch Quantization Toolkit

Language:PythonLicense:Apache-2.0Stargazers:613Issues:8Issues:65

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language:PythonLicense:MITStargazers:607Issues:17Issues:68
Language:PythonLicense:Apache-2.0Stargazers:514Issues:22Issues:21

NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Language:PythonLicense:MITStargazers:345Issues:11Issues:14

onnxscript

ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.

Language:PythonLicense:MITStargazers:239Issues:27Issues:464

scrape-open-llm-leaderboard

Scrape and export data from the Open LLM Leaderboard.

Language:PythonLicense:MITStargazers:35Issues:0Issues:0