Details on training hyperparameters

by egesko19 - opened Oct 7

Oct 7

Is there any way I can find details on how such high accuracies in Imagenet-1k is reached? I am trying to replicate it by training it from scratch to reach similar metrics, but I usually end up around 60% accuracy before validation loss starts to increase, which is a massive difference compared to the pretrained model that reaches around 90% accuracy on validation dataset.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment