Clodéric Mars (cloderic)

cloderic

Geek Repo

Company:AI Redefined (makers of @cogment)

Location:Montréal, Quebec, Canada

Home Page:https://www.cloderic.com

Twitter:@cloderic

Github PK Tool:Github PK Tool


Organizations
cogment

Clodéric Mars's starred repositories

docusaurus

Easy to maintain open source documentation websites.

Language:TypeScriptLicense:MITStargazers:55753Issues:408Issues:3089

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34863Issues:342Issues:2738

yq

yq is a portable command-line YAML, JSON, XML, CSV, TOML and properties processor

ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Language:PythonLicense:Apache-2.0Stargazers:11098Issues:194Issues:1063

dagger

An engine to run your pipelines in containers

Language:GoLicense:Apache-2.0Stargazers:10941Issues:276Issues:2736

maigret

🕵️‍♂️ Collect a dossier on a person by username from thousands of sites

Language:PythonLicense:MITStargazers:10134Issues:90Issues:1169

oauth2-proxy

A reverse proxy that provides authentication with Google, Azure, OpenID Connect and many more identity providers.

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9348Issues:73Issues:1116

cog

Containers for machine learning

Language:PythonLicense:Apache-2.0Stargazers:7863Issues:68Issues:740

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonLicense:MITStargazers:6862Issues:39Issues:446

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:5367Issues:36Issues:182

SpacetimeDB

Multiplayer at the speed of light

Language:RustLicense:NOASSERTIONStargazers:4347Issues:27Issues:465

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:4187Issues:107Issues:554

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language:Jupyter NotebookLicense:MITStargazers:2586Issues:32Issues:57

PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Language:PythonLicense:NOASSERTIONStargazers:2564Issues:18Issues:371

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language:PythonLicense:MITStargazers:2223Issues:40Issues:606

imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Language:PythonLicense:MITStargazers:1274Issues:18Issues:340

taskipy

the complementary task runner for python

Language:PythonLicense:MITStargazers:476Issues:6Issues:32

SuperSuit

A collection of wrappers for Gymnasium and PettingZoo environments (being merged into gymnasium.wrappers and pettingzoo.wrappers

Language:PythonLicense:NOASSERTIONStargazers:451Issues:9Issues:79

ibc

Official implementation of Implicit Behavioral Cloning, as described in our CoRL 2021 paper, see more at https://implicitbc.github.io/

Language:PythonLicense:Apache-2.0Stargazers:306Issues:9Issues:15

rlmeta

RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

Language:PythonLicense:MITStargazers:284Issues:14Issues:13

gamecontroller.js

A JavaScript library that lets you handle, configure, and use gamepads and controllers on a browser, using the Gamepad API

Language:JavaScriptLicense:MITStargazers:245Issues:6Issues:20

buffrs

Modern protobuf package management

Language:RustLicense:Apache-2.0Stargazers:208Issues:14Issues:98

lamorel

Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).

Language:PythonLicense:MITStargazers:189Issues:5Issues:24
Language:PythonLicense:MITStargazers:100Issues:9Issues:96

cogment-verse

Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)

Language:PythonLicense:Apache-2.0Stargazers:78Issues:10Issues:93

cogment

Cogment platform, Cogment is the first open source platform designed to address the challenges of continuously training humans and AI together.

Language:GoLicense:Apache-2.0Stargazers:26Issues:2Issues:0

quart-sqlalchemy

Quart SQLAlchemy

Language:PythonLicense:MITStargazers:16Issues:2Issues:5

cogment-lab

A toolkit for practical Human-AI cooperation research

Language:PythonLicense:Apache-2.0Stargazers:13Issues:3Issues:5