GPTCaption is a tool that uses the ChatGPT API to generate captions that describe images. It streamlines captioning for batches of images supplied by URL.
This image showcases an example of the captioning capability of GPTCaption, demonstrating how the tool can provide contextually relevant descriptions for diverse types of images.
To utilize GPTCaption, follow these steps:
- Enter your OpenAI API key into the `settings.ini` file to authenticate your requests.
- Execute `main_gui.py` to launch the graphical user interface.
- Insert the URLs of the images you wish to caption into the designated field, one URL per line. The tool currently supports images in JPG, JPEG, and PNG formats.
- Optionally, modify the image analysis prompt to suit your needs.
- Click "Generate Captions" to start the captioning process.
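The URL input rule in the steps above (one URL per line, JPG/JPEG/PNG only) could be checked before any API call along these lines. This is a minimal sketch, not GPTCaption's actual code: the function name `split_image_urls` and the choice to judge the format by the file extension in the URL path are assumptions made for illustration.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Formats the README lists as supported.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def split_image_urls(raw_text):
    """Split pasted text into (accepted, rejected) URL lists.

    Hypothetical helper: blank lines are skipped, and a URL is accepted
    only if it is http(s) and its path ends in a supported extension.
    """
    accepted, rejected = [], []
    for line in raw_text.splitlines():
        url = line.strip()
        if not url:
            continue  # ignore empty lines between URLs
        parsed = urlparse(url)
        suffix = PurePosixPath(parsed.path).suffix.lower()
        if parsed.scheme in ("http", "https") and suffix in SUPPORTED_EXTENSIONS:
            accepted.append(url)
        else:
            rejected.append(url)
    return accepted, rejected
```

Checking the extension on the parsed path rather than on the raw string means a query string after the file name does not cause a valid image URL to be rejected.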
Before proceeding with the API calls, GPTCaption will present an estimate of the potential costs involved and request your confirmation to continue.
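A confirmation step of this kind can be sketched as follows. The README does not say how GPTCaption computes its estimate, so the flat `price_per_image_usd` parameter and the function names are purely hypothetical; real API costs depend on the model and on image size.

```python
def estimate_cost(image_count, price_per_image_usd):
    """Rough estimate, assuming a flat (hypothetical) price per image."""
    return image_count * price_per_image_usd


def confirm_run(image_count, price_per_image_usd, ask=input):
    """Show the estimate and return True only if the user agrees.

    `ask` defaults to input() so a GUI or a test can swap in its own prompt.
    """
    estimate = estimate_cost(image_count, price_per_image_usd)
    answer = ask(
        f"Captioning {image_count} image(s) may cost about ${estimate:.2f}. "
        "Continue? [y/N] "
    )
    return answer.strip().lower() in ("y", "yes")
```

Defaulting to "no" on an empty answer keeps an accidental key press from triggering paid API calls.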
The output, including all generated captions, will be organized in a folder named after the current date. Inside this folder, additional time-stamped subfolders are created for each use of the tool, with a consolidated caption file named identically to its corresponding folder.
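The folder layout described above might be built like this. The exact date and time formats and the `.txt` extension are not stated in this README, so the values below are assumptions, and `make_run_paths` is a hypothetical helper name.

```python
from datetime import datetime
from pathlib import Path


def make_run_paths(base_dir, now):
    """Return (run_dir, caption_file) for one run of the tool.

    Assumed layout: <base_dir>/<YYYY-MM-DD>/<HH-MM-SS>/<HH-MM-SS>.txt
    This only computes the paths; it does not create anything on disk.
    """
    date_dir = Path(base_dir) / now.strftime("%Y-%m-%d")
    run_dir = date_dir / now.strftime("%H-%M-%S")
    # The consolidated caption file shares its name with its folder.
    caption_file = run_dir / f"{run_dir.name}.txt"
    return run_dir, caption_file
```

A caller would typically pass `datetime.now()` and then run `run_dir.mkdir(parents=True, exist_ok=True)` before writing the caption file.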
GPTCaption is compatible with any image hosting service that offers public URL access to the uploaded images. For batch uploading (up to 1000 images), https://PostImages.org is recommended. Ensure you select "Direct Link" as the URL type for compatibility with GPTCaption.