retsamcam's repositories

AppAgent2

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

self-operating-computer

A framework to enable multimodal models to operate a computer.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0