PINTO0309 / PINTO_model_zoo

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.

Home Page:https://qiita.com/PINTO

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Explaination on The YoloX Body-Head-Hand output dimension?

kho-bluefrogrobotics opened this issue · comments

Issue Type

Documentation Feature Request

OS

Other

OS architecture

armv7

Programming Language

Other

Framework

TensorFlowLite

Model name and Weights/Checkpoints URL

YoloX Body-Head-Hand
https://github.com/PINTO0309/PINTO_model_zoo/tree/main/426_YOLOX-Body-Head-Hand

Description

First of all thanks and congratulation for your excellent work, on YOLO.

I have a question regarding the output size, though.
@PINTO0309 You say you limit the output to 20 boxes : How come we obtain a float32[60,7] output then?
shouldn't we get something like 20x [class, score, x1, y1, x2, y2] = 20 x 5 floats?

Relevant Log Output

No response

URL or source code for simple inference testing code

No response

As detailed in the README.

max output boxes per class

20 (Body) + 20 (Head) + 20 (Hand) = 60 boxes

[batch_num, classid, score, x1, y1, x2, y2] * 60 = 7 x 60 floats

image

image