Third ArchEdge Workshop: Exploring the Design Space of Efficient Deep
Neural Networks
- URL: http://arxiv.org/abs/2011.10912v1
- Date: Sun, 22 Nov 2020 01:56:46 GMT
- Title: Third ArchEdge Workshop: Exploring the Design Space of Efficient Deep
Neural Networks
- Authors: Fuxun Yu, Dimitrios Stamoulis, Di Wang, Dimitrios Lymberopoulos, Xiang
Chen
- Abstract summary: This paper gives an overview of our ongoing work on the design space exploration of efficient deep neural networks (DNNs)
We cover two aspects: (1) static architecture design efficiency and (2) dynamic model execution efficiency.
We highlight several open questions that are poised to draw research attention in the next few years.
- Score: 14.195694804273801
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper gives an overview of our ongoing work on the design space
exploration of efficient deep neural networks (DNNs). Specifically, we cover
two aspects: (1) static architecture design efficiency and (2) dynamic model
execution efficiency. For static architecture design, different from existing
end-to-end hardware modeling assumptions, we conduct full-stack profiling at
the GPU core level to identify better accuracy-latency trade-offs for DNN
designs. For dynamic model execution, different from prior work that tackles
model redundancy at the DNN-channels level, we explore a new dimension of DNN
feature map redundancy to be dynamically traversed at runtime. Last, we
highlight several open questions that are poised to draw research attention in
the next few years.
Related papers
Err
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.