LittleLittleCloud / GAIA

Beating the GAIA benchmark with Transformers Agents. πŸš€

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Beating GAIA with Transformers Agents πŸš€

This is the exact code used for our submission that scores #2 on the test set, #1 on the validation set.

GAIA leaderboard screenshot

Check out the current leaderboard here.

How to run tests?

First, install requirements:

pip install -r requirements.txt

Setup your secrets in a .envfile:

HUGGINGFACEHUB_API_TOKEN
SERPAPI_API_KEY
OPENAI_API_KEY
ANTHROPIC_API_KEY

And optionally if you want to use Anthropic models via AWS bedrock:

AWS_BEDROCK_ID
AWS_BEDROCK_KEY

Then run gaia.py to launch tests!

About

Beating the GAIA benchmark with Transformers Agents. πŸš€

License:Apache License 2.0


Languages

Language:Jupyter Notebook 68.9%Language:Python 31.1%