/sci/ - Science & Math » Thread #13246674

63KiB, 956x369, 1_vq0qBTUFO27cJMPQo0h6xg.png

View Same Google iqdb SauceNAO

Anonymous Mon 07 Jun 07:38:51 2021 No.13246674 View Reply Original Report

Quoted By:

>try out mlp-mixer on cifar10 with some default params and no pretraining
>55% accuracy on test after 20 epochs
>double the width of the hidden layers, max out my gpu
>train for 200 epochs
>55% test accuracy
>wut
>reset the width, double the depth
>56% accuracy after 20 epochs
Since the loss was much lower for the more intensive second training attempt than the other attempts (by a factor of 10,) I assume it's an overfitting issue, but that seems like a very low accuracy to be overfitting at (compared to the potential of the architecture.) Is this just the best you can expect to do with cifar alone, or am I probably doing something wrong?

Capcode	All Only User Posts Only Moderator Posts Only Admin Posts Only Developer Posts
Show Posts	All Only With Images Only Without Images
Deleted Posts	All Only Deleted Posts Only Non-Deleted Posts
Ghost Posts	All Only Ghost Posts Only Non-Ghost Posts
Post Type	All Only Sticky Threads Only Opening Posts Only Reply Posts
Results	All Grouped By Threads
Order	Latest Posts First Oldest Posts First

Your latest searches