0asa / autodistill-llava

LLaVA base model for use with Autodistill.

Home Page:https://docs.autodistill.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Autodistill LLaVA Module

This repository contains the code supporting the LLaVA base model for use with Autodistill.

LLaVA is a multi-modal language model with object detection capabilities. You can use LLaVA with autodistill for object detection. Learn more about LLaVA 1.5, the most recent version of LLaVA at the time of releasing this package.

Read the full Autodistill documentation.

Read the LLaVA Autodistill documentation.

Installation

To use CLIP with autodistill, you need to install the following dependency:

pip3 install autodistill-clip

Quickstart

from autodistill_llava import LLaVA

# define an ontology to map class names to our LLaVA prompt
# the ontology dictionary has the format {caption: class}
# where caption is the prompt sent to the base model, and class is the label that will
# be saved for that caption in the generated annotations
# then, load the model
base_model = LLaVA(
    ontology=CaptionOntology(
        {
            "a forklift": "forklift"
        }
    )
)
base_model.label("./context_images", extension=".jpeg")

License

This model is licensed under an Apache 2.0 License.

๐Ÿ† Contributing

We love your input! Please see the core Autodistill contributing guide to get started. Thank you ๐Ÿ™ to all our contributors!

About

LLaVA base model for use with Autodistill.

https://docs.autodistill.com

License:Apache License 2.0


Languages

Language:Python 89.9%Language:Makefile 10.1%