joshbickett / multimodal-gamer

A framework to enable multimodal models to play games on a computer.

Multimodal Gamer

A framework to enable multimodal models to play games on a computer.

Using the same inputs and outputs as a human operator, the model plays a game!
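The idea of giving a model the same inputs and outputs as a human can be pictured as an observe, decide, act loop. The sketch below is only an illustration under stated assumptions: `play`, `capture_screen`, `choose_action`, and `press_key` are hypothetical names, not functions from this repository, and the stand-ins at the bottom replace a real screenshot, a real model call, and a real key press.

```python
from typing import Callable, List


def play(
    capture_screen: Callable[[], bytes],
    choose_action: Callable[[bytes], str],
    press_key: Callable[[str], None],
    max_steps: int = 3,
) -> List[str]:
    """Run a fixed number of observe -> decide -> act steps.

    All three callables are hypothetical placeholders for whatever the
    real framework uses to read the screen, query the model, and send input.
    """
    history: List[str] = []
    for _ in range(max_steps):
        frame = capture_screen()        # input: what a human would see
        action = choose_action(frame)   # decision: made by the model
        press_key(action)               # output: what a human would press
        history.append(action)
    return history


# Stand-ins so the sketch runs without a game, a model, or an API key.
pressed: List[str] = []
history = play(
    capture_screen=lambda: b"fake-frame",
    choose_action=lambda frame: "jump",
    press_key=pressed.append,
)
```

Passing the three steps in as callables keeps the loop independent of any particular screenshot library, model provider, or input device.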

Key Features

See Multimodal Gamer play Super Mario 64 on YouTube.

Quick install

Create venv

python3 -m venv env

Activate it (macOS/Linux)

source env/bin/activate

Install requirements

pip install -r requirements.txt

Add your OpenAI API key

export OPENAI_API_KEY=yourkeyhere
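Whether main.py checks for the key on startup is not stated here, so a small pre-flight check can save a failed run. The helper below is a hypothetical sketch, not part of the repository; it only assumes the variable name OPENAI_API_KEY from the step above.

```python
import os
from typing import Mapping


def has_openai_key(env: Mapping[str, str] = os.environ) -> bool:
    """Return True if OPENAI_API_KEY is set to a non-empty value.

    Hypothetical helper for illustration; it does not verify that the
    key is valid, only that something has been exported.
    """
    return bool(env.get("OPENAI_API_KEY", "").strip())


if __name__ == "__main__":
    if has_openai_key():
        print("OPENAI_API_KEY is set.")
    else:
        print("OPENAI_API_KEY is missing; export it before running main.py.")
```

The `env` parameter defaults to the real environment but accepts any mapping, which makes the check easy to try out with a plain dictionary.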

Run it

python main.py

License: MIT License


Languages

Language: Python 100.0%