WongKinYiu / ScaledYOLOv4

Scaled-YOLOv4: Scaling Cross Stage Partial Network

single GPU in training

chiran7 opened this issue · comments

Dear Authors,

In the training instructions, it is mentioned that:
We use multiple GPUs for training. {YOLOv4-P5, YOLOv4-P6, YOLOv4-P7} use input resolution {896, 1280, 1536} for training respectively.

yolov4-p5

python -m torch.distributed.launch --nproc_per_node 4 train.py --batch-size 64 --img 896 896 --data coco.yaml --cfg yolov4-p5.yaml --weights '' --sync-bn --device 0,1,2,3 --name yolov4-p5
python -m torch.distributed.launch --nproc_per_node 4 train.py --batch-size 64 --img 896 896 --data coco.yaml --cfg yolov4-p5.yaml --weights 'runs/exp0_yolov4-p5/weights/last_298.pt' --sync-bn --device 0,1,2,3 --name yolov4-p5-tune --hyp 'data/hyp.finetune.yaml' --epochs 450 --resume

Can it be trained using a single GPU? If so, do we need to make any modifications to train.py?

Thank you for your time and consideration.

Use the following command:
python train.py --batch-size 64 --img 896 896 --data coco.yaml --cfg yolov4-p5.yaml --weights '' --device 0 --name yolov4-p5

Change the batch size and image dimensions as per your GPU configuration. For a GPU with 12 GB of memory, I'm using batch size 32 and image size 320 (a lower resolution will give poorer results).
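For example, with those settings the single-GPU run would look like this (the batch size and image size are simply what fits on a 12 GB card; adjust them for your hardware):

python train.py --batch-size 32 --img 320 320 --data coco.yaml --cfg yolov4-p5.yaml --weights '' --device 0 --name yolov4-p5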


Can you explain the different layers? For instance, the row [-1, 1, BottleneckCSP, [64]] passes only one argument, 64, but BottleneckCSP takes two channel arguments in common.py.
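For context, here is a rough sketch of how parse_model in models/yolo.py expands such a row (this is my reading of the YOLOv5-style code this repo uses, so treat the details as assumptions rather than the authors' answer):

# Rough sketch, not a verbatim copy of parse_model: a yaml row like
# [-1, 1, BottleneckCSP, [64]] only lists the *output* channels.
# parse_model prepends the input channels of the "from" layer and moves the
# repeat count into the module arguments, so the call becomes
# BottleneckCSP(c1, 64, 1), matching the two-channel signature in common.py.
f, n, args = -1, 1, [64]      # "from" index, repeats, yaml args for this row
ch = [3, 32]                  # output channels recorded for the layers built so far
c1, c2 = ch[f], args[0]       # c1 = 32 from the previous layer, c2 = 64 from the yaml
args = [c1, c2, *args[1:]]
args.insert(2, n)             # the repeat count becomes the module's n argument
print(args)                   # [32, 64, 1]  ->  BottleneckCSP(32, 64, 1)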

Also, if we want to train on custom data, is it enough to replace nc = 80, or do we also need to change the filter sizes before the yolo layers in Scaled-YOLOv4?
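For reference, my understanding (an assumption based on reading models/yolo.py, not an official answer) is that the Detect head derives its 1x1 conv channels from nc, so setting nc in the data yaml and the model yaml should be enough; there is no separate "filters=" value to edit before the yolo layers as there is with Darknet cfg files. A quick sketch of the arithmetic:

# Hedged sketch: output channels of each detection conv are computed from nc.
nc = 80               # number of classes; replace with your custom class count
na = 3                # anchors per detection scale -- check the 'anchors' list in the model yaml
no = na * (nc + 5)    # per-anchor outputs: 4 box coords + 1 objectness + nc class scores
print(no)             # 255 output channels per detection conv when nc = 80, na = 3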

Thank you for your time and consideration.
