packyan / PyTorch-YOLOv3-kitti

use yolov3 pytorch to train kitti

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Format for targets

manueldiaz96 opened this issue · comments

Hi!

I just wanted to ask about what is the final format of the targets when you pass them to the network for training?

From your code I understand that you read the GT from a txt file, but on datasets.py in line 96 you use np.readtxt but the labels begin with a string (see this example).

Could you please do me the favor of specifying what is the composition of the targets? I know that per image you limit the amount of boxes to 50, so the targets will have shape [batch_size, 50, 5]

Are they in the form [class, center_x_ratio, center_y_ratio, box_width_ratio, box_height_ratio]?

Where ratio means the value in pixels divided by the width of the image (for center_x and box_width) or by the height of the image (for center_y and box_height).

Thank you!