High Performance SqueezeNext: Real time deployment on Bluebox 2.0 by NXP

dc.contributor.author: Duggal, Jayan Kant
dc.contributor.author: El-Sharkawy, Mohamed
dc.contributor.department: Electrical and Computer Engineering, School of Engineering and Technology
dc.date.accessioned: 2024-01-08T19:09:21Z
dc.date.available: 2024-01-08T19:09:21Z
dc.date.issued: 2022-05-22
dc.description.abstract: Implementing and deploying DNNs on real-time embedded platforms is challenging because these environments are resource constrained. The High Performance SqueezeNext (HPS) architecture was proposed to deploy a tailor-made DNN on a real-time embedded platform whose computational and memory resources are limited compared with a CPU- or GPU-based system. We designed this architecture for deployment on Bluebox 2.0 by NXP and implemented it in the PyTorch framework. HPS was inspired by SqueezeNet and SqueezeNext, with further motivation from the MobileNet architectures. HPS achieved a model accuracy of 92.5% with a 2.62 MB model size at 16 seconds per epoch when trained on an NVIDIA-based GPU system. It was trained and tested on datasets such as CIFAR-10 and CIFAR-100 without transfer learning. The proposed architecture was then successfully deployed on Bluebox 2.0, a real-time system developed by NXP, with the assistance of RTMaps Remote Studio. The accuracy achieved exceeded that of existing CNN/DNN architectures such as alexnet_tf (82%), Maxout networks (90.65%), DCNN (89%), modified SqueezeNext (92.25%), Squeezed CNN (79.30%), MobileNet (76.7%) and an enhanced hybrid MobileNet (89.9%), with a better model size. The architecture was developed and improved through different optimizer implementations, hyperparameter tuning and tweaking, a no-transfer-learning approach and in-place activation functions, while maintaining decent accuracy.
dc.eprint.version: Final published version
dc.identifier.citation: Duggal, J. K., & El-Sharkawy, M. (2022). High Performance SqueezeNext: Real time deployment on Bluebox 2.0 by NXP. Advances in Science, Technology and Engineering Systems Journal, 7(3), 70–81. https://doi.org/10.25046/aj070308
dc.identifier.uri: https://hdl.handle.net/1805/37709
dc.language.iso: en_US
dc.publisher: ASTES
dc.relation.isversionof: 10.25046/aj070308
dc.relation.journal: Advances in Science, Technology and Engineering Systems Journal
dc.rights: Attribution-ShareAlike 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by-sa/4.0/
dc.source: Publisher
dc.subject: Bluebox 2.0
dc.subject: Convolution Neural Networks (CNNs)
dc.subject: Deep Learning
dc.subject: Deep Neural Networks (DNNs)
dc.subject: Modified SqueezeNext
dc.subject: Real-time deployment
dc.subject: SqueezeNext
dc.title: High Performance SqueezeNext: Real time deployment on Bluebox 2.0 by NXP
dc.type: Article
Files
Original bundle
Name: Duggal2022High-CCBYSA.pdf
Size: 846.56 KB
Format: Adobe Portable Document Format