Efficient Intelligence Towards Real-Time Precision Medicine With Systematic Pruning and Quantization

Date
2024-08
Language
American English
Degree
M.S.E.C.E.
Degree Year
2024
Department
Electrical & Computer Engineering
Grantor
Purdue University
Abstract

The widespread adoption of Convolutional Neural Networks (CNNs) in real-world applications, particularly on resource-constrained devices, is hindered by their computational complexity and memory requirements. This research investigates pruning and quantization techniques for optimizing CNNs that classify arrhythmias, using the MIT-BIH Arrhythmia Database. By combining magnitude-based pruning, regularization-based pruning, and filter map-based pruning with quantization at several bit-widths (8-bit, 4-bit, 2-bit, and 1-bit), the study aims to develop a more compact and efficient CNN model while maintaining high accuracy. The experimental results demonstrate that these techniques reduce model size and improve inference speed while maintaining accuracy, making the models suitable for devices with limited resources. The findings highlight the potential of these optimization techniques for real-time applications in mobile health monitoring and edge computing, paving the way for broader adoption of deep learning in resource-limited environments.
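
To make two of the techniques named in the abstract concrete, the following is a minimal NumPy sketch of magnitude-based pruning and symmetric uniform quantization applied to a single weight array. It is an illustration under stated assumptions, not the thesis's implementation: the function names `magnitude_prune` and `quantize_symmetric`, the per-tensor scaling, and the sign-times-mean rule used for the 1-bit case are all hypothetical choices made for this example.

```python
import numpy as np


def magnitude_prune(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude.

    Hypothetical helper: unstructured, per-tensor pruning.
    """
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)  # number of weights to remove
    if k == 0:
        return weights.copy()
    # k-th smallest magnitude; everything at or below it is pruned
    threshold = np.partition(flat, k - 1)[k - 1]
    mask = np.abs(weights) > threshold
    return weights * mask


def quantize_symmetric(weights, bits):
    """Quantize to `bits` bits and return the dequantized (float) values.

    Assumption: one scale per tensor. For bits == 1 the weights are
    binarized to sign(w) * mean(|w|); zeros stay zero.
    """
    if bits == 1:
        return np.sign(weights) * np.mean(np.abs(weights))
    qmax = 2 ** (bits - 1) - 1  # e.g. 127 for 8-bit, 7 for 4-bit, 1 for 2-bit
    max_abs = np.max(np.abs(weights))
    if max_abs == 0:
        return weights.copy()
    scale = max_abs / qmax
    q = np.clip(np.round(weights / scale), -qmax, qmax)  # integer grid
    return q * scale


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(size=(16, 3, 3))  # stand-in for one conv layer's weights
    pruned = magnitude_prune(w, sparsity=0.5)
    print("nonzero after pruning:", np.count_nonzero(pruned), "of", w.size)
    for bits in (8, 4, 2, 1):
        err = np.mean((pruned - quantize_symmetric(pruned, bits)) ** 2)
        print(f"{bits}-bit mean squared quantization error: {err:.6f}")
```

Running the script prints how many weights survive pruning and how the quantization error changes across the four bit-widths. In practice, pruning and low-bit quantization are usually followed by fine-tuning to recover accuracy, which this sketch omits.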

Description
IUPUI
Type
Thesis