Fig. 3From: Self-supervised pre-training for joint optic disc and cup segmentation via attention-aware networkIllustration of the proposed multi-scale attention module. For each query image token pixel, it will match with its top-K potentially corresponding tokens. Afterwards, it will be updated by aggregating different sub-region representations using the multi-layer perceptron operationBack to article page