SkalskiP / fashion-assistant

Our idea is to combine the power of computer vision models and LLMs. We use YOLO, CLIP, and DINOv2 to extract high-level features from images. We then pass the prompt, along with the extracted features, to an LLM, which allows for advanced image dataset queries.
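
As a rough illustration of the last step, the sketch below shows one way extracted features could be combined with a user prompt before being sent to an LLM. It is not the repository's code. The `ImageFeatures` record, its field names, the `build_prompt` helper, and the sample values are all hypothetical, and the actual feature extraction and the LLM call are left out.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import List


@dataclass
class ImageFeatures:
    """Hypothetical per-image record; the field names are illustrative only."""
    image_id: str
    # Assumed to hold class labels from an object detector such as YOLO.
    detected_items: List[str] = field(default_factory=list)
    # Assumed to be picked by zero-shot classification with a model such as CLIP.
    dominant_color: str = ""
    # Assumed to come from clustering image embeddings such as DINOv2's.
    cluster_id: int = -1


def build_prompt(question: str, features: List[ImageFeatures]) -> str:
    """Serialize the features as JSON and place them next to the question."""
    payload = json.dumps([asdict(f) for f in features], indent=2)
    return (
        "You are given features extracted from an image dataset.\n"
        f"Features:\n{payload}\n\n"
        f"Question: {question}\n"
        "Answer using only the features above."
    )


# Made-up sample data, standing in for real model outputs.
features = [
    ImageFeatures("img_001", ["jacket", "jeans"], "blue", 0),
    ImageFeatures("img_002", ["dress"], "red", 1),
]
prompt = build_prompt("Which images contain a jacket?", features)
print(prompt)
```

The resulting string would then be sent to whichever LLM is in use. Keeping the features in a structured form such as JSON is one design choice among several; it lets the model answer dataset-level questions without seeing the images themselves.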
