Email: eyeriss at mit dot edu
This tutorial describes methods to enable efficient processing for deep neural networks (DNNs), which is the cornerstone of many state-of-the-art image processing and computer vision algorithms. While
DNNs delivers best-in-class accuracy and quality of results, it comes at the cost of high computational complexity. Accordingly, designing efficient algorithms for deep neural networks is an important step towards enabling the wide deployment of DNNs in image processing and computer vision systems (e.g., autonomous vehicles, drones, robots, smartphones, wearables, Internet of Things, etc).
In this tutorial, we will provide a brief overview of DNNs, the various hardware platforms that support DNNs including CPU, GPU, FPGA and ASICs, and highlight important benchmarking/comparison metrics and design considerations. We will then describe recent techniques that reduce the computation cost of DNNs from both the hardware architecture and network algorithm perspective. We will also discuss how these techniques can be applied to different image processing and computer vision tasks.
Entire Tutorial [ slides ]
An overview paper based on the tutorial "Efficient Processing of Deep Neural Networks: A Tutorial and Survey" is available here.
@article{2017_dnn_piee,
title={Efficient processing of deep neural networks: A tutorial and survey},
author={Sze, Vivienne and Chen, Yu-Hsin and Yang, Tien-Ju and Emer, Joel},
journal={Proceedings of the IEEE},
year={2017}
}