I want to ask about the output of your CLTR model

Question

VietPT3502 opened this issue a year ago

I see that `outputs['pred_logits']` has shape `(batch_size, num_queries, num_classes)`, but `outputs['pred_points']` has shape `(batch_size, num_queries, 3)`. What does the 3 stand for? And is `num_classes = 2`, meaning person head and background?

Cheng-Yen Yang · Answer 1 · Fri May 12 2023 13:00:56 GMT+0800 (China Standard Time)

# the `num_classes` naming here is somewhat misleading.
# it indeed corresponds to `max_obj_id + 1`, where max_obj_id
# is the maximum id for a class in your dataset. For example,
# COCO has a max_obj_id of 90, so we pass `num_classes` to be 91.
# As another example, for a dataset that has a single class with id 1,
# you should pass `num_classes` to be 2 (max_obj_id + 1).
# For more details on this, check the following discussion
# https://github.com/facebookresearch/detr/issues/108#issuecomment-650269223

Dingkang Liang · Answer 2 · Mon Oct 16 2023 21:28:15 GMT+0800 (China Standard Time)

The third value in `pred_points` is the KNN distance.
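
To tie the two answers together, here is a minimal sketch of how the outputs for one image might be read. It uses plain Python lists in place of tensors so it runs without PyTorch. Several things here are assumptions and not confirmed in this thread: that the first two values of each point are the x and y coordinates, that class index 1 is the head class (following the single-class, `num_classes = 2` example above), and that scores come from a sigmoid with a 0.5 threshold. `decode_queries` is a hypothetical helper, not a function from the CLTR repository.

```python
import math


def sigmoid(x):
    """Map a raw logit to a score in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def decode_queries(pred_logits, pred_points, head_class=1, threshold=0.5):
    """Decode the queries of ONE image (batch dimension already removed).

    pred_logits: num_queries rows of num_classes logits.
    pred_points: num_queries rows of 3 values.
    Assumption: the row layout is (x, y, knn_distance); the thread only
    confirms that the third value is the KNN distance.
    Assumption: index `head_class` holds the head logit, scored by sigmoid.
    """
    heads = []
    for logits, point in zip(pred_logits, pred_points):
        score = sigmoid(logits[head_class])
        if score >= threshold:
            x, y, knn_distance = point
            heads.append(
                {"x": x, "y": y, "knn_distance": knn_distance, "score": score}
            )
    return heads


# Made-up values for 3 queries with num_classes = 2.
pred_logits = [[-2.0, 3.0], [1.5, -4.0], [-1.0, 0.5]]
pred_points = [[0.25, 0.40, 0.03], [0.70, 0.10, 0.08], [0.55, 0.60, 0.05]]

heads = decode_queries(pred_logits, pred_points)
print(len(heads))  # number of queries kept as heads
for head in heads:
    print(head)
```

With real model outputs you would first index one image out of the batch, for example `outputs['pred_logits'][0]`, and check the repository's evaluation code for the actual scoring rule and threshold.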