本文提出了一种用于语义分割的新型深度学习架构。拟议的全球和选择性注意网络(GSANet)具有Atrous空间金字塔池(ASPP),新颖的sparsemax全局注意和新颖的选择性注意,该注意部署了凝聚和扩散机制,可从提取的深层特征中聚合多尺度上下文信息。..

GSANet: Semantic Segmentation with Global and Selective Attention

This paper proposes a novel deep learning architecture for semantic segmentation. The proposed Global and Selective Attention Network (GSANet) features Atrous Spatial Pyramid Pooling (ASPP) with a novel sparsemax global attention and a novel selective attention that deploys a condensation and diffusion mechanism to aggregate the multi-scale contextual information from the extracted deep features.A selective attention decoder is also proposed to process the GSA-ASPP outputs for optimizing the softmax volume. We are the first to benchmark the performance of semantic segmentation networks with the low-complexity feature extraction network (FXN) MobileNetEdge, that is optimized for low latency on edge devices. We show that GSANet can result in more accurate segmentation with MobileNetEdge, as well as with strong FXNs, such as Xception. GSANet improves the state-of-art semantic segmentation accuracy on both the ADE20k and the Cityscapes datasets.