NextDet: Efficient Sparse-to-Dense Object Detection with Attentive Feature Aggregation

Kalgaonkar, Priyank; El-Sharkawy, Mohamed

NextDet: Efficient Sparse-to-Dense Object Detection with Attentive Feature Aggregation

Files

Kalgaonkar2022NextDet-CCBY.pdf (3.51 MB)

Date

2022-11-28

Authors

Kalgaonkar, Priyank

El-Sharkawy, Mohamed

Language

American English

Department

Electrical and Computer Engineering, School of Engineering and Technology

Found At

MDPI

Abstract

Object detection is a computer vision task of detecting instances of objects of a certain class, identifying types of objects, determining its location, and accurately labelling them in an input image or a video. The scope of the work presented within this paper proposes a modern object detection network called NextDet to efficiently detect objects of multiple classes which utilizes CondenseNeXt, an award-winning lightweight image classification convolutional neural network algorithm with reduced number of FLOPs and parameters as the backbone, to efficiently extract and aggregate image features at different granularities in addition to other novel and modified strategies such as attentive feature aggregation in the head, to perform object detection and draw bounding boxes around the detected objects. Extensive experiments and ablation tests, as outlined in this paper, are performed on Argoverse-HD and COCO datasets, which provide numerous temporarily sparse to dense annotated images, demonstrate that the proposed object detection algorithm with CondenseNeXt as the backbone result in an increase in mean Average Precision (mAP) performance and interpretability on Argoverse-HD’s monocular ego-vehicle camera captured scenarios by up to 17.39% as well as COCO’s large set of images of everyday scenes of real-world common objects by up to 14.62%.

Keywords

CodnenseNeXt, object detection, PyTorch, deep learning, convolutional neural network

Cite As

Kalgaonkar, P., & El-Sharkawy, M. (2022). NextDet: Efficient Sparse-to-Dense Object Detection with Attentive Feature Aggregation. Future Internet, 14(12), Article 12. https://doi.org/10.3390/fi14120355

Journal

Future Internet

Rights

Attribution 4.0 International

Source

Publisher

Type

Article

Permanent Link

https://hdl.handle.net/1805/37305

DOI

https://doi.org/10.3390/fi14120355

Version

Final published version

Collections

Open Access Policy Articles
Department of Electrical and Computer Engineering Works

Full item page