Name | Description | Size | Format
---|---|---|---
 | | 173.6 KB | Adobe PDF
Abstract(s)
Due to the computational complexity of Convolutional Neural Networks (CNNs), high-performance platforms are generally considered for their execution. However, CNNs are very useful in embedded systems, and executing them right next to the source of data has many advantages, such as avoiding the need for data communication. In this paper, we propose an architecture for CNN inference (Lite-CNN) that can achieve high performance in low-density FPGAs. Lite-CNN adopts a fixed-point representation for both neurons and weights, which has already been shown to be sufficient for most CNNs. Also, with a simple and well-known dot-product reorganization, the number of multiplications is reduced by half. We show implementation results for 8-bit fixed-point on a ZYNQ7020 and extrapolate to other, larger FPGAs. Lite-CNN achieves 410 GOPs on a ZYNQ7020.
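The abstract does not spell out the dot-product reorganization it refers to, but the standard multiplication-halving identity for dot products pairs adjacent terms: x1*y1 + x2*y2 = (x1 + y2)*(x2 + y1) - x1*x2 - y1*y2. The products x1*x2 (between weights) can be precomputed offline, and the products y1*y2 (between activations) are shared across all filters, so each pair of output terms costs only one runtime multiplication. The sketch below illustrates this identity; the function name `paired_dot` is hypothetical and not from the paper, and this is only an assumed reading of which "known" reorganization is meant.

```python
def paired_dot(w, a):
    """Dot product of w and a (even length) using the pairing identity,
    with half the per-call multiplications once the pair-products
    of w (and of a, shared across filters) are precomputed."""
    assert len(w) == len(a) and len(w) % 2 == 0
    pairs = len(w) // 2
    # Precomputable offline: products between adjacent weights.
    ww = sum(w[2 * i] * w[2 * i + 1] for i in range(pairs))
    # Computed once per input, reusable across every filter.
    aa = sum(a[2 * i] * a[2 * i + 1] for i in range(pairs))
    # One multiplication per pair of terms at inference time.
    cross = sum((w[2 * i] + a[2 * i + 1]) * (w[2 * i + 1] + a[2 * i])
                for i in range(pairs))
    return cross - ww - aa
```

For example, `paired_dot([1, 2, 3, 4], [5, 6, 7, 8])` equals the plain dot product 1*5 + 2*6 + 3*7 + 4*8 = 70, but spends only two runtime multiplications on the cross terms instead of four.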
Keywords
Embedded computing; Deep learning; Convolutional neural network; Field-programmable gate array
Citation
VÉSTIAS, Mário; [et al.] – Lite-CNN: a high-performance architecture to execute CNNs in low density FPGAs. In: 28th International Conference on Field Programmable Logic and Applications. Dublin, Ireland, 2018, pp. 399-402.