cjiang2 / robot_semantics

Implementation for Paper "Understanding Contexts Inside Joint Robot and Human Manipulation Tasks through Vision-Language Model with Ontology Constraints in a Video Streamline"