1ssb / roomie

An embodied agent which tags which room you are in from the objects in the image.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Roomie: An embodied agent which tags which room you are in from the objects in the image using Automated Annotations & ChatGPT prompt engineering.

Requirements

Note

  1. Yolov8.pt and SAM_L.pt will be downloaded (automatically). You may use the base SAM model. The code is optimised for CUDA enabled GPU, change the parameters, as you deem fit. Make sure you have sufficient memory on disk.

  2. Images used for demo are scraped off the internet, the copyright lies with the creators.

Usage

  • Clone this repository: git clone https://github.com/1ssb/roomie.git
  • Make sure you have pip installed the above requirements. Be specifically careful about the python version. Select reattempt configuration based on the automated structure.
  • Make sure you replace the openai api id with your own in roomie.py.
  • Upload images to the ./images/ directory.

If you encounter any issues, feel free to open an issue on the repository or send an email to Subhransu.Bhattacharjee@anu.edu.au.

About

An embodied agent which tags which room you are in from the objects in the image.

License:MIT License


Languages

Language:Python 100.0%