There is 1 repository under the computer-using-agent topic.
The first open-source Artificial Narrow Intelligence generalist agentic framework: a Computer-Using Agent that fully operates graphical user interfaces (GUIs) using only natural language. It uses Visualization-of-Thought and Chain-of-Thought reasoning to elicit spatial reasoning and perception, and it plans, simulates, and emulates synthetic HID interactions.
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Mark web pages for use with vision-language models
A fully autonomous command-line computer-using agent based on ReAct principles
Use an LLM agent to automate ordering food and other items from Deliveroo, Uber Eats, DoorDash, etc.