CNN practical notes

some notes on my training with CNN

Train CIFAR10

  • On 2019.12.6, I tried to use torchvision.models.resnet18 to train CIFAR10

  • Some findings in time sequential orders

  • Takeaway

    • weight decay is important, yet torch.optim disable it by default. set it to 1e-4 or 5e-4.

    • It's not preferrable to directly use torchvision.models or other pretrained model architectures on datasets other than ImageNet. That's what so called 'Hyperparameter tuning is important'.

Pretrained models

Last updated

Was this helpful?