COMP7404-Project-Flamingo

This is the implementation of a demo using Flamingo for few-shot image classification and simple OCR (Optical Character Recognition).

Quick Start

Input

Category 1:

Text input: An image of a cat named Hanbao 🍔 .

Category 2:

Text input: An image of a cat named Tuanzi 🍡 .

Test Case:

Text input: An image of a cat named

Output

An image of a cat named tuanzi.

Input

Text Input: This is the logo of {University Name}

Images Input:

Test Case:

Text Input: This is the logo of

Images Input:

Output This is the logo of the university of hong kong

To install related packages, run the following code to set up the environment.

pip install -r requirements.txt

Run Cats Classification

python flamingo.py --class_1_path=PATH_TO_HANBAO --class_2_path=PATH_TO_TUANZI --test_cases_path=PATH_TO_TESTCASES

Run Logo OCR

python flamingo_OCR.py --image_paths=PATH_TO_OCR_EXAMPLES