Quentin Gallouédec (qgallouedec)


Company: @huggingface

Location: Lyon, France

Home Page: gallouedec.com

Twitter: @QGallouedec


Organizations
huggingface

Quentin Gallouédec's starred repositories

first-contributions

🚀✨ Help beginners to contribute to open source projects

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language: Python | License: Apache-2.0 | Stargazers: 31584 | Issues: 164 | Issues: 4611

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language: Python | License: Apache-2.0 | Stargazers: 18879 | Issues: 278 | Issues: 2858

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 13012 | Issues: 325 | Issues: 315

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8998 | Issues: 74 | Issues: 1038

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language: Rust | License: Apache-2.0 | Stargazers: 8811 | Issues: 121 | Issues: 967

starcoder

Home of StarCoder: fine-tuning & inference!

Language: Python | License: Apache-2.0 | Stargazers: 7227 | Issues: 69 | Issues: 142

PokemonRedExperiments

Playing Pokemon Red with Reinforcement Learning

Language: Jupyter Notebook | License: MIT | Stargazers: 6794 | Issues: 70 | Issues: 111

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Language: Python | License: Apache-2.0 | Stargazers: 5077 | Issues: 43 | Issues: 1008

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4422 | Issues: 49 | Issues: 288

ManimML

ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Library.

Language: Python | License: MIT | Stargazers: 2293 | Issues: 32 | Issues: 38

blog

Public repo for HF blog posts

Language: Jupyter Notebook | Stargazers: 2207 | Issues: 89 | Issues: 243

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language: Python | License: MIT | Stargazers: 2137 | Issues: 42 | Issues: 590

Arcade-Learning-Environment

The Arcade Learning Environment (ALE) -- a platform for AI research.

Language: C++ | License: GPL-2.0 | Stargazers: 2112 | Issues: 81 | Issues: 247

sample-factory

High throughput synchronous and asynchronous reinforcement learning

Language: Python | License: MIT | Stargazers: 781 | Issues: 18 | Issues: 99

awesome-deep-rl

A curated list of awesome Deep Reinforcement Learning resources.

License: MIT | Stargazers: 648 | Issues: 32 | Issues: 0

modular-rl

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 213 | Issues: 11 | Issues: 9

dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

Language: Python | License: MIT | Stargazers: 196 | Issues: 5 | Issues: 12

cleanba

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

Language: Python | License: NOASSERTION | Stargazers: 101 | Issues: 4 | Issues: 5

RL-X

A framework for Reinforcement Learning research.

Language: Python | License: MIT | Stargazers: 96 | Issues: 8 | Issues: 2

reincarnating_rl

[NeurIPS 2022] Open source code for reusing prior computational work in RL.

Language: Python | License: Apache-2.0 | Stargazers: 91 | Issues: 7 | Issues: 4

IQN-and-Extensions

PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER, Noisy layer, N-step bootstrapping, Dueling architecture and parallel env support.

Language: Jupyter Notebook | License: MIT | Stargazers: 77 | Issues: 4 | Issues: 6

prioritized_experience_replay

Prioritized Experience Replay implementation with proportional prioritization

Language: Python | License: MIT | Stargazers: 64 | Issues: 2 | Issues: 1

powderworld

Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions

Gato-A-Generalist-Agent

Minimal code for A Generalist Agent

karolos

An Open-Source Reinforcement Learning Framework for Robot-Task Environments

Language: Python | License: MIT | Stargazers: 20 | Issues: 6 | Issues: 9

torch-gato

PyTorch implementation of the Gato paper from DeepMind

Language: Jupyter Notebook | License: MIT | Stargazers: 13 | Issues: 0 | Issues: 0

stable-baselines3-contrib-maskable-recurrent-ppo

Combination of Maskable PPO and Recurrent PPO based on the sb3-contrib repository

Language: Python | License: MIT | Stargazers: 9 | Issues: 1 | Issues: 2