Object Detection Using Vision Transformed EfficientDet

dc.contributor.advisor: El-Sharkawy, Mohamed A.
dc.contributor.author: Kar, Shreyanil
dc.contributor.other: King, Brian S.
dc.contributor.other: Rizkalla, Maher E.
dc.date.accessioned: 2023-10-02T13:41:31Z
dc.date.available: 2023-10-02T13:41:31Z
dc.date.issued: 2023-08
dc.degree.date: 2023
dc.degree.discipline: Electrical & Computer Engineering
dc.degree.grantor: Purdue University
dc.degree.level: M.S.E.C.E.
dc.description: Indiana University-Purdue University Indianapolis (IUPUI)
dc.description.abstract: This research presents a novel approach to object detection that integrates Vision Transformers (ViT) into the EfficientDet architecture. Computer vision, a subfield of artificial intelligence, focuses on the interpretation and analysis of visual data. Recent advances in deep learning, particularly convolutional neural networks (CNNs), have significantly improved the accuracy and efficiency of computer vision systems. Object detection, a widely studied application within computer vision, involves identifying and localizing objects in images. The ViT backbone, known for its success in image classification and natural language processing, uses self-attention to capture global dependencies in input images; however, its ability to capture fine-grained details and contextual information is limited. To address this limitation, the integration of ViT into the EfficientDet architecture, which is recognized for its efficiency and accuracy in object detection, is proposed. By combining the strengths of ViT and EfficientDet, the proposed integration enhances the network's ability to capture fine-grained details and contextual information, leveraging ViT's global dependency modeling alongside EfficientDet's efficient detection framework to achieve highly accurate and efficient performance. Widely used detection-related architectures in industry, such as RetinaNet, EfficientNet, and EfficientDet, rely primarily on convolution. Experimental evaluations were conducted on the PASCAL VOC 2007 and 2012 datasets, widely acknowledged benchmarks for object detection. The integrated ViT-EfficientDet model achieved a mean Average Precision (mAP) of 86.27% when tested on the PASCAL VOC 2007 dataset, demonstrating its accuracy. These results underscore the potential of the proposed integration for real-world applications. In conclusion, this research introduces a novel integration of Vision Transformers into the EfficientDet architecture, yielding significant improvements in object detection performance. By combining ViT's ability to capture global dependencies with EfficientDet's efficiency and accuracy, the proposed approach offers enhanced object detection capabilities. Future work may explore additional datasets and evaluate the framework across other computer vision tasks.
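The abstract describes a hybrid design in which a ViT encoder supplies globally attended features that are then fused and decoded by EfficientDet-style components (a BiFPN followed by class and box heads). The PyTorch snippet below is a minimal sketch of that idea only; all layer sizes, module names, and the simplified single-input BiFPN are illustrative assumptions and do not reproduce the thesis implementation.

```python
# Minimal sketch (assumptions throughout): ViT-style backbone -> BiFPN-like fusion
# -> per-level class/box heads, in the spirit of the ViT-EfficientDet hybrid.
import torch
import torch.nn as nn
import torch.nn.functional as F


class PatchEmbed(nn.Module):
    """Split the image into patches and project each patch to an embedding vector."""
    def __init__(self, img_size=512, patch_size=16, in_ch=3, dim=256):
        super().__init__()
        self.proj = nn.Conv2d(in_ch, dim, kernel_size=patch_size, stride=patch_size)

    def forward(self, x):
        x = self.proj(x)                        # (B, dim, H/16, W/16)
        return x.flatten(2).transpose(1, 2)     # (B, N, dim) token sequence


class ViTBackbone(nn.Module):
    """Transformer encoder that models global dependencies between patch tokens."""
    def __init__(self, dim=256, depth=4, heads=8, img_size=512, patch_size=16):
        super().__init__()
        self.grid = img_size // patch_size
        self.embed = PatchEmbed(img_size, patch_size, 3, dim)
        self.pos = nn.Parameter(torch.zeros(1, self.grid ** 2, dim))
        layer = nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, depth)

    def forward(self, x):
        tokens = self.encoder(self.embed(x) + self.pos)
        B, N, C = tokens.shape
        # Restore a 2-D feature map so convolutional detection heads can consume it.
        return tokens.transpose(1, 2).reshape(B, C, self.grid, self.grid)


class SimpleBiFPN(nn.Module):
    """Toy bidirectional fusion of three pyramid levels (EfficientDet-style)."""
    def __init__(self, dim=256):
        super().__init__()
        self.down = nn.Conv2d(dim, dim, 3, stride=2, padding=1)
        self.out = nn.ModuleList(nn.Conv2d(dim, dim, 3, padding=1) for _ in range(3))

    def forward(self, p3):
        p4 = self.down(p3)
        p5 = self.down(p4)
        # Top-down pass: upsample coarse maps and add them to the finer ones.
        p4 = p4 + F.interpolate(p5, size=p4.shape[-2:], mode="nearest")
        p3 = p3 + F.interpolate(p4, size=p3.shape[-2:], mode="nearest")
        return [conv(p) for conv, p in zip(self.out, (p3, p4, p5))]


class ViTEfficientDet(nn.Module):
    """Hybrid detector: ViT features -> BiFPN fusion -> class/box predictions."""
    def __init__(self, num_classes=20, num_anchors=9, dim=256):
        super().__init__()
        self.backbone = ViTBackbone(dim=dim)
        self.fpn = SimpleBiFPN(dim)
        self.cls_head = nn.Conv2d(dim, num_anchors * num_classes, 3, padding=1)
        self.box_head = nn.Conv2d(dim, num_anchors * 4, 3, padding=1)

    def forward(self, x):
        feats = self.fpn(self.backbone(x))
        return [(self.cls_head(f), self.box_head(f)) for f in feats]


if __name__ == "__main__":
    model = ViTEfficientDet(num_classes=20)      # 20 classes, as in PASCAL VOC
    outputs = model(torch.randn(1, 3, 512, 512))
    for cls_logits, box_deltas in outputs:
        print(cls_logits.shape, box_deltas.shape)
```

Training on PASCAL VOC 2007/2012 and mAP evaluation would sit on top of this skeleton (anchor assignment, focal/regression losses, and the VOC evaluation protocol), which the sketch deliberately omits.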
dc.identifier.uri: https://hdl.handle.net/1805/35935
dc.language.iso: en_US
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject: Convolutional Neural Networks
dc.subject: Pascal VOC
dc.subject: EfficientDet
dc.subject: Vision Transformer
dc.subject: Object Detection
dc.subject: Hybrid CNN
dc.subject: ViT-EfficientDet
dc.subject: Computer Vision
dc.subject: Artificial Intelligence
dc.subject: Machine Learning
dc.subject: Deep Learning
dc.subject: Object Classification
dc.subject: Deep Convolutional Neural Networks
dc.subject: PyTorch
dc.title: Object Detection Using Vision Transformed EfficientDet
dc.type: Thesis
Files

Original bundle (1 of 1)
Name: Shreyanil_Kar_Masters_Thesis.pdf
Size: 7.06 MB
Format: Adobe Portable Document Format

License bundle (1 of 1)
Name: license.txt
Size: 1.99 KB
Description: Item-specific license agreed upon to submission