Ariel Kwiatkowski (RedTachyon)

RedTachyon

Geek Repo

Location:Paris

Home Page:https://redtachyon.me

Github PK Tool:Github PK Tool


Organizations
Farama-Foundation

Ariel Kwiatkowski's repositories

coltra-rl

A modular implementation of PPO, and soon hopefully other algorithms.

CrowdAI

This will be a PhD thesis someday

Language:C#Stargazers:6Issues:3Issues:0

Ferry

WiP gRPC Gymnasium API

Language:RustStargazers:5Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:1Issues:0
Language:PythonStargazers:1Issues:3Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:2Issues:0

anterion

Open-source software engineer

License:MITStargazers:0Issues:0Issues:0

AutoGPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

cogment-verse

Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.

License:MITStargazers:0Issues:0Issues:0

Gymnasium

A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

instructor

structured outputs for llms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

keras

Deep Learning for humans

License:Apache-2.0Stargazers:0Issues:0Issues:0

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

License:MITStargazers:0Issues:0Issues:0

LaVague

Copilot for web automation

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

License:MITStargazers:0Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

ml-agents

Unity Machine Learning Agents Toolkit

Language:C#License:NOASSERTIONStargazers:0Issues:1Issues:0

OSWorld

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0
Stargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:1Issues:0

Shimmy

An API conversion tool for popular external reinforcement learning environments

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SWE-agent

SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models

License:MITStargazers:0Issues:0Issues:0

task-standard

METR Task Standard

Language:TypeScriptStargazers:0Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

TPU-Alignment

Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

vimGPT

Browse the web with GPT-4V and Vimium

License:MITStargazers:0Issues:0Issues:0

wildcats-ai

This will one day be an actually working AI agent

Language:Jupyter NotebookStargazers:0Issues:0Issues:0