S3NAS: Fast NPU-aware Neural Architecture Search Methodology


As the application area of convolutional neural networks (CNNs) grows in embedded devices, it has become popular to use a hardware CNN accelerator, called a neural processing unit (NPU), to achieve higher performance per watt than CPUs or GPUs. Recently, automated neural architecture search (NAS) has emerged as the default technique for finding a state-of-the-art CNN architecture with higher accuracy than manually designed architectures for image classification. In this paper, we present a fast NPU-aware NAS methodology, called S3NAS, to find a CNN architecture with higher accuracy than existing ones under a given latency constraint. It consists of three steps: supernet design, Single-Path NAS for fast architecture exploration, and scaling. To widen the search space of the supernet structure, which consists of stages, we allow stages to have different numbers of blocks and blocks to have parallel layers of different kernel sizes. For fast neural architecture search, we apply a modified Single-Path NAS technique to the proposed supernet structure. In this step, we assume a shorter latency constraint than the required one to reduce the search space and the search time. The last step is to scale up the network maximally within the latency constraint. For accurate latency estimation, an analytical latency estimator is devised, based on a cycle-level NPU simulator that runs an entire CNN while accurately accounting for memory access overhead. With the proposed methodology, we find, in 3 hours using TPUv3, a network that achieves 82.72% top-1 accuracy on ImageNet with 11.66 ms latency. Code is released at https://github.com/cap-lab/S3NAS
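
The scaling step described above (enlarge the searched network as far as the latency constraint allows) can be illustrated with a minimal Python sketch. This is not the S3NAS implementation: the function names, the depth/width multiplier grid, and the toy latency model (including its 8.0 ms base value) are hypothetical, and using estimated latency as a proxy for "largest network" is an assumption made only for illustration. The actual method relies on an analytical estimator derived from a cycle-level NPU simulator.

```python
from typing import Callable, Iterable, Optional, Tuple

Scale = Tuple[float, float]  # (depth multiplier, width multiplier), both hypothetical


def scale_within_latency(
    candidates: Iterable[Scale],
    estimate_latency_ms: Callable[[float, float], float],
    latency_budget_ms: float,
) -> Optional[Scale]:
    """Pick the candidate whose estimated latency is largest while still
    fitting the budget. Returns None if no candidate fits."""
    best: Optional[Scale] = None
    best_latency = -1.0
    for depth_mult, width_mult in candidates:
        latency = estimate_latency_ms(depth_mult, width_mult)
        # Keep only candidates inside the budget, preferring the one
        # that uses the most of it.
        if latency <= latency_budget_ms and latency > best_latency:
            best, best_latency = (depth_mult, width_mult), latency
    return best


def toy_latency_ms(depth_mult: float, width_mult: float, base_ms: float = 8.0) -> float:
    """Made-up stand-in for a latency estimator: linear in depth,
    quadratic in width. The base value is arbitrary."""
    return base_ms * depth_mult * width_mult ** 2


# Small hypothetical grid of scaling factors.
grid = [(d, w) for d in (1.0, 1.1, 1.2) for w in (1.0, 1.1, 1.2)]
chosen = scale_within_latency(grid, toy_latency_ms, latency_budget_ms=11.66)
print(chosen)
```

In this sketch the search itself would be run beforehand under a tighter budget (here implicitly the unscaled 8.0 ms network), which mirrors the idea of searching with a shorter latency constraint and then spending the remaining headroom on scaling.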
