How to train a deep neural network?
2021-08-14
1 min read
- large batch size
- Adam or SGD
- learning rate
- data auto augmentation
- ResNeSt > ResNet
- circle loss
- weight decay:
- WEIGHT_DECAY: 0.0005
- WEIGHT_DECAY_BIAS: 0.