RedTachyon

followers

following

stars

Paris

https://redtachyon.me

Organizations

Farama-Foundation

Ariel Kwiatkowski's repositories

coltra-rl

A modular implementation of PPO, and soon hopefully other algorithms.

Language:Python26 3 14

CrowdAI

This will be a PhD thesis someday

Language:C#6 30

Ferry

WiP gRPC Gymnasium API

Language:Rust5 10

llm-zth

Language:Jupyter Notebook1 10

tutor-at-home

Language:Python1 30

redtachyonme

Language:JavaScriptMIT020

anterion

Open-source software engineer

MIT000

AutoGPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:JavaScriptMIT000

cogment-verse

Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)

Language:PythonApache-2.0000

Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.

MIT000

Gymnasium

A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)

Language:PythonMIT000

instructor

structured outputs for llms

Language:PythonMIT000

keras

Deep Learning for humans

Apache-2.0000

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

MIT000

LaVague

Copilot for web automation

Language:PythonApache-2.0000

llm.c

LLM training in simple, raw C/CUDA

MIT000

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT000

mincv

Language:TypeScriptMIT000

ml-agents

Unity Machine Learning Agents Toolkit

Language:C#NOASSERTION010

OSWorld

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Apache-2.0000

pong-wars

Language:HTML000

redtachyon

020

rentbusters

Language:Python010

Shimmy

An API conversion tool for popular external reinforcement learning environments

Language:PythonMIT000

SWE-agent

SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models

MIT000

task-standard

METR Task Standard

Language:TypeScript000

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.0000

TPU-Alignment

Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free

Language:Jupyter Notebook000

vimGPT

Browse the web with GPT-4V and Vimium

MIT000

wildcats-ai

This will one day be an actually working AI agent

Language:Jupyter Notebook000