CNN practical notes
some notes on my training with CNN

Train CIFAR10

  • On 2019.12.6, I tried to use torchvision.models.resnet18 to train CIFAR10
  • Some findings in time sequential orders
  • Takeaway
    • weight decay is important, yet torch.optim disable it by default. set it to 1e-4 or 5e-4.
    • It's not preferrable to directly use torchvision.models or other pretrained model architectures on datasets other than ImageNet. That's what so called 'Hyperparameter tuning is important'.

Pretrained models