OSU-NLP-Group / SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Home Page: https://osu-nlp-group.github.io/SeeAct/
