Abstract: A self-attention module is often used in image segmentation tasks such as facial part segmentation. Because the self-attention module weights the features at each position using the weighted ...