Following is from ColorMAE:
ADE20k Semantic Segmentation. We employ UperNet [46] as our segmentation
model and perform end-to-end fine-tuning on the ADE20k [50] dataset
for 160k iterations with an image resolution of 512 × 512. The evaluation metric
used is the mean Intersection over Union (mIoU).
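
To make the metric concrete, here is a minimal sketch of how mIoU could be computed for one predicted label map against its ground truth. It is not taken from ColorMAE: the function name `mean_iou`, the `ignore_index` value of 255, and the choice to average only over classes that appear in the prediction or the ground truth are assumptions made for illustration. Benchmark toolkits commonly accumulate the per-class intersections and unions over the whole validation set before dividing, whereas this sketch handles a single pair of label maps.

```python
import numpy as np


def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean Intersection over Union for one pair of integer label maps.

    Hypothetical helper: assumes every predicted label lies in
    [0, num_classes) and that ignore_index marks unlabeled pixels.
    """
    # Flatten and widen the dtype so target * num_classes cannot overflow.
    pred = np.asarray(pred).ravel().astype(np.int64)
    target = np.asarray(target).ravel().astype(np.int64)

    # Drop pixels that carry no ground-truth label.
    valid = target != ignore_index
    pred, target = pred[valid], target[valid]

    # Confusion matrix: rows are ground-truth classes, columns are predictions.
    conf = np.bincount(
        target * num_classes + pred, minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)

    intersection = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - intersection

    # Average only over classes that occur in the prediction or ground truth.
    present = union > 0
    return float((intersection[present] / union[present]).mean())


if __name__ == "__main__":
    pred = np.array([[0, 0], [1, 1]])
    target = np.array([[0, 1], [1, 255]])  # last pixel is ignored
    print(mean_iou(pred, target, num_classes=2))
```

In the toy example each class has one correctly labeled pixel and a union of two pixels, so both per-class IoU values are 0.5. For ADE20k, `num_classes` would be 150.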
Following is from ColorMAE: