Ariel Kwiatkowski (RedTachyon)

RedTachyon

Geek Repo

Location:Paris

Home Page:https://redtachyon.me

Github PK Tool:Github PK Tool


Organizations
Farama-Foundation

Ariel Kwiatkowski's repositories

coltra-rl

A modular implementation of PPO, and soon hopefully other algorithms.

CrowdAI

This will be a PhD thesis someday

Language:C#Stargazers:5Issues:0Issues:0

Ferry

WiP gRPC Gymnasium API

Language:RustStargazers:5Issues:1Issues:0

freewill-checker

Do you have free will? Project based on a blog I'll find later, and on a similar website I'll also find later.

Language:CSSLicense:MITStargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0
Language:PythonStargazers:1Issues:3Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

License:MITStargazers:0Issues:0Issues:0

AutoGPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

cogment-verse

Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)

License:Apache-2.0Stargazers:0Issues:0Issues:0

cpython

The Python programming language

License:NOASSERTIONStargazers:0Issues:0Issues:0

Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.

License:MITStargazers:0Issues:0Issues:0

EquationEditorPP

Put equations in Google Docs with the power of LaTeX and the simplicity of a graphical editor.

License:MITStargazers:0Issues:0Issues:0

Gymnasium

A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

keras

Deep Learning for humans

License:Apache-2.0Stargazers:0Issues:0Issues:0

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

License:MITStargazers:0Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

ml-agents

Unity Machine Learning Agents Toolkit

Language:C#License:NOASSERTIONStargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

popgym

Partially Observable Process Gym

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

Shimmy

An API conversion tool for popular external reinforcement learning environments

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

task-standard

METR Task Standard

Stargazers:0Issues:0Issues:0

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

License:MITStargazers:0Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

License:Apache-2.0Stargazers:0Issues:0Issues:0

TPU-Alignment

Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free

Stargazers:0Issues:0Issues:0