The max-average pooling 'trick' doesn't have a theoretical basis to give better results that I know of. The reason it might be improving results could be due to the increased number of large crops from the image when using it.
A better solution would be to add the cropping scheme from BigSleep, which adds a bias for larger crops and improves results coherence significantly.
The max-average pooling 'trick' doesn't have a theoretical basis to give better results that I know of. The reason it might be improving results could be due to the increased number of large crops from the image when using it.
A better solution would be to add the cropping scheme from BigSleep, which adds a bias for larger crops and improves results coherence significantly.