Model and weights do not fit together

by j35t3r - opened Oct 25, 2023

Oct 25, 2023

import timm
model = timm.create_model("hf_hub:edadaltocg/resnet18_svhn", pretrained=True)

   Rraise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for ResNet:
        size mismatch for conv1.weight: copying a param with shape torch.Size([64, 3, 3, 3]) from checkpoint, the shape in current model is torch.Size([64, 3, 7, 7]).

j35t3r

Oct 25, 2023

•

edited Oct 25, 2023

This seems to work for me:

        model = timm.create_model("hf_hub:edadaltocg/resnet18_svhn", pretrained=False)
        # override model
        model.conv1 = nn.Conv2d(3, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
        model.maxpool = nn.Identity()  # type: ignore
        model.fc = nn.Linear(512, 10)

        if args.mode == 'spatial':
            model.load_state_dict(
            torch.hub.load_state_dict_from_url(
                "https://huggingface.co/edadaltocg/resnet18_svhn/resolve/main/pytorch_model.bin",
                map_location="cpu", 
                file_name="resnet18_cifar10.pth",
                )
            )

More testing needed

edadaltocg

Owner Oct 25, 2023

Please load the model in the following manner as suggested in the model card.

import detectors
import timm

model = timm.create_model("resnet18_svhn", pretrained=True)

The conv1 layer implemented in timm is oriented towards larger images, in which the convolutional kernel has a size of 7x7. To deal with smaller images, I swapped this layer to have a kernel with sizes 3x3, as you can see in the error message. That's why loading from the default resnet18 model in timm fails, and you should import detectors to load the correct model architecture module.

Please see https://github.com/edadaltocg/detectors/blob/3555ebada096694a69d7f030691ca233ec288190/src/detectors/models/resnet.py#L116for the modifications made in the model architecture.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment