In this repo, we will learn how to finetune and feature extract the torchvision models, all of which have been pretrained on the 1000-class Imagenet dataset. This repo will give an indepth look at how to work with modern CNN architectures, and will build an intuition for finetuning any PyTorch model. Since each model architecture is different, there is no boilerplate finetuning code that will work in all scenarios. Rather, the researcher must look at the existing architecture and make custom adjustments for each model.
# uncomment one of the CNN models in grad_cam_results.py and run model=models.vgg19(weights=None, num_classes=4)
# model = models.densenet161(weights=None, num_classes=4)# model = models.resnet50(weights=None, num_classes=4)# model = models.squeezenet1_0(weights=None, num_classes=4)# model = models.alexnet(weights=None, num_classes=4)# model = models.resnext101_64x4d(weights=None, num_classes=4)# model = models.googlenet(weights=None, num_classes=4)# model = models.mnasnet1_0(weights=None, num_classes=4)# model = models.regnet_x_16gf(weights=None, num_classes=4)# model = models.shufflenet_v2_x2_0(weights=None, num_classes=4)# model = models.efficientnet_b7(weights=None, num_classes=4)# model = models.efficientnet_v2_s(weights=None, num_classes=4)# model = models.vit_l_16(weights=None, num_classes=4)# model = models.swin_b(weights=None, num_classes=4)
P.S
Please give ⭐ to this repo if you find it useful. Stay tuned!
About
This repo is special for those who want to start learning computer vision related tasks such as image classification.