Fig. 4From: Self-supervised pre-training for joint optic disc and cup segmentation via attention-aware networkIllustration of the proposed aggregation attention module. The input tokens are first clustered into different groups. For each group, the self-attention operation is performed individually over the cluster centroid and cluster tokens. Ultimately, the updated cluster centroid and the group features are aggregated together to form a new feature vectorBack to article page