Skip to content

Latest commit

 

History

History
41 lines (25 loc) · 1.27 KB

Complexity.md

File metadata and controls

41 lines (25 loc) · 1.27 KB

This is quick evaluation of #filters impact on performance on ImageNet-2012.

The architecture is similar to CaffeNet, but has differences:

  1. Images are resized to small side = 128 for speed reasons.
  2. fc6 and fc7 layers have 2048 neurons instead of 4096.
  3. Networks are initialized with LSUV-init
  4. No LRN layers.

Default augmentation: random crop 128x128 from 144xN image, 50% random horizontal flip.

Network width

Name Accuracy LogLoss Comments
4sqrt(2)x wider 0.565 1.96 Start overfitting
4x wider 0.563 1.92 Still no overfitting %)
2sqrt(2)x wider 0.552 1.94
2x wider 0.533 2.04
sqrt(2)x wider 0.506 2.17
Default 0.471 2.36
sqrt(2)x narrower 0.460 2.41
2x narrower 0.416 2.68
2sqrt(2)x narrower 0.340 3.11 no group conv
2sqrt(2)x narrower 0.318 3.25
4x narrower 0.256 3.33

logs

CaffeNet128 test accuracy

CaffeNet128 test loss

CaffeNet128 train loss