Pruning Convolution Neural Network (SqueezeNet) for Efficient Hardware Deployment

dc.contributor.advisor: El-Sharkawy, Mohamed
dc.contributor.author: Gaikwad, Akash S.
dc.contributor.other: Rizkalla, Maher
dc.contributor.other: King, Brian
dc.date.accessioned: 2018-12-05T21:36:36Z
dc.date.available: 2018-12-05T21:36:36Z
dc.date.issued: 2018-12
dc.degree.date: 2018
dc.degree.discipline: Electrical & Computer Engineering
dc.degree.grantor: Purdue University
dc.degree.level: M.S.E.C.E.
dc.description: Indiana University-Purdue University Indianapolis (IUPUI)
dc.description.abstract: In recent years, deep learning models have become popular in real-time embedded applications, but their hardware deployment is complicated by limited resources such as memory, computational power, and energy. Recent research in deep learning focuses on reducing the model size of the Convolution Neural Network (CNN) through compression techniques such as architectural compression, pruning, quantization, and encoding (e.g., Huffman encoding). Network pruning is one of the most promising of these techniques. This thesis proposes three methods to prune a CNN (SqueezeNet) without introducing network sparsity in the pruned model, decreasing the model size without a significant drop in accuracy: (1) pruning based on the Taylor expansion of the change in the cost function, ΔC; (2) pruning based on the L2 normalization of activation maps; and (3) pruning based on a combination of methods 1 and 2. The proposed methods use these ranking criteria to rank the convolution kernels and prune the lowest-ranked filters, after which the SqueezeNet model is fine-tuned by backpropagation. Transfer learning is used to train SqueezeNet on the CIFAR-10 dataset. Results show that the proposed approach reduces the SqueezeNet model size by 72% without a significant drop in accuracy (the optimal pruning-efficiency result). Results also show that pruning based on the combination of the Taylor expansion of the cost function and the L2 normalization of activation maps achieves better pruning efficiency than either individual criterion, and that most of the pruned kernels come from the mid- and high-level layers. The pruned model is deployed on BlueBox 2.0 using RTMaps software, and its performance is evaluated.
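For illustration, the two ranking criteria named in the abstract, and their combination, can be sketched in PyTorch as below. This is a minimal sketch under stated assumptions: the function names, the per-layer L2 normalization of the scores, and the mixing weight alpha are illustrative choices, not details taken from the thesis.

import torch

def taylor_rank(activation, gradient):
    # Taylor-expansion criterion: |a * dC/da| averaged over the batch and
    # spatial dimensions gives one score per channel, a first-order
    # estimate of the change in cost (ΔC) if that filter is removed.
    score = (activation * gradient).abs().mean(dim=(0, 2, 3))
    return score / (score.norm(p=2) + 1e-8)  # normalize per layer for comparability

def l2_rank(activation):
    # Activation-map criterion: the L2 norm of each channel's feature
    # map, averaged over the batch; low-energy channels rank lowest.
    score = activation.pow(2).sum(dim=(2, 3)).sqrt().mean(dim=0)
    return score / (score.norm(p=2) + 1e-8)

def combined_rank(activation, gradient, alpha=0.5):
    # Combined criterion: weighted sum of the two normalized scores
    # (alpha = 0.5 is a hypothetical weight, not a value from the thesis).
    return alpha * taylor_rank(activation, gradient) + (1 - alpha) * l2_rank(activation)

Here activation and gradient are the (N, C, H, W) feature map of a convolution layer and the gradient of the cost with respect to it; filters with the lowest scores would be pruned and the model fine-tuned by backpropagation, matching the prune-then-fine-tune loop the abstract describes.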
dc.identifier.uri: https://hdl.handle.net/1805/17923
dc.identifier.uri: http://dx.doi.org/10.7912/C2/2489
dc.language.iso: en_US
dc.rights: Attribution 3.0 United States
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/us/
dc.subject: Convolution neural network
dc.subject: CNN
dc.subject: SqueezeNet
dc.subject: Pruning
dc.subject: L2 Normalization
dc.subject: CIFAR-10
dc.subject: Transfer learning
dc.subject: Coarse pruning
dc.subject: S32V234
dc.subject: Taylor expansion
dc.subject: RTMaps
dc.subject: BlueBox
dc.subject: Fine pruning
dc.subject: Model compression
dc.subject: Activation maps
dc.title: Pruning Convolution Neural Network (SqueezeNet) for Efficient Hardware Deployment
dc.type: Thesis
Files
Original bundle
Name: Thesis__Pruning_Convolution_Neural_Network__SqueezeNet__for_Efficient_Hardware_Deployment_.pdf
Size: 4.68 MB
Format: Adobe Portable Document Format
Description: Thesis on Pruning Convolution Neural Network (SqueezeNet) for Efficient Hardware Deployment
License bundle
Name: license.txt
Size: 1.99 KB
Format:
Description: Item-specific license agreed upon to submission