Autodistill LLaVA Module

This repository contains the code supporting the LLaVA base model for use with Autodistill.

LLaVA is a multimodal language model with object detection capabilities, and you can use it with Autodistill for object detection. LLaVA 1.5 was the most recent version of LLaVA when this package was released.

Read the full Autodistill documentation at https://docs.autodistill.com.

Read the LLaVA Autodistill documentation.

Installation

To use LLaVA with Autodistill, you need to install the following dependency:

pip3 install autodistill-llava

Quickstart

from autodistill_llava import LLaVA
from autodistill.detection import CaptionOntology

# define an ontology to map class names to our LLaVA prompt
# the ontology dictionary has the format {caption: class}
# where caption is the prompt sent to the base model, and class is the label that will
# be saved for that caption in the generated annotations
# then, load the model
base_model = LLaVA(
    ontology=CaptionOntology(
        {
            "a forklift": "forklift"
        }
    )
)
base_model.label("./context_images", extension=".jpeg")
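
To make the {caption: class} format concrete, here is a minimal plain-Python sketch of the mapping described in the comments above. It does not import Autodistill and is not how CaptionOntology is implemented; the helper names (captions_for_model, saved_labels, label_for) and the second ontology entry are hypothetical and exist only for illustration.

```python
# Illustrative stand-in for the {caption: class} ontology format.
# Assumption: this is NOT Autodistill's CaptionOntology, just a plain dict.
ontology = {
    "a forklift": "forklift",
    "a person wearing a hard hat": "worker",  # hypothetical extra entry
}


def captions_for_model(mapping):
    """Return the captions, i.e. the prompts sent to the base model."""
    return list(mapping.keys())


def saved_labels(mapping):
    """Return the unique class labels, in first-seen order."""
    return list(dict.fromkeys(mapping.values()))


def label_for(mapping, caption):
    """Look up the class label that is saved for a given caption."""
    return mapping[caption]


print(captions_for_model(ontology))
print(saved_labels(ontology))
print(label_for(ontology, "a forklift"))
```

The caption is what the model is asked to find, while the class is the shorter name that ends up in the generated annotations, so you can reword a prompt without renaming the label in your dataset.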

License

This model is licensed under an Apache 2.0 License.

🏆 Contributing

We love your input! Please see the core Autodistill contributing guide to get started. Thank you 🙏 to all our contributors!
