Enhanced 3D Object Detection and Tracking in Autonomous Vehicles: An Efficient Multi-Modal Deep Fusion Approach

dc.contributor.advisor: El-Sharkawy, Mohamed
dc.contributor.author: Kalgaonkar, Priyank B.
dc.contributor.other: King, Brian S.
dc.contributor.other: Rizkalla, Maher E.
dc.contributor.other: Abdallah, Mustafa A.
dc.date.accessioned: 2024-09-03T12:58:48Z
dc.date.available: 2024-09-03T12:58:48Z
dc.date.issued: 2024-08
dc.degree.date: 2024
dc.degree.discipline: Electrical & Computer Engineering
dc.degree.grantor: Purdue University
dc.degree.level: Ph.D.
dc.description: IUPUI
dc.description.abstract: This dissertation delves into a significant challenge for Autonomous Vehicles (AVs): achieving efficient and robust perception under adverse weather and lighting conditions. Systems that rely solely on cameras face difficulties with visibility over long distances, while radar-only systems struggle to recognize features like stop signs, which are crucial for safe navigation in such scenarios. To overcome this limitation, this research introduces a novel deep camera-radar fusion approach using neural networks. This method ensures reliable AV perception regardless of weather or lighting conditions. Cameras, similar to human vision, are adept at capturing rich semantic information, whereas radars can penetrate obstacles like fog and darkness, similar to X-ray vision. The thesis presents NeXtFusion, an innovative and efficient camera-radar fusion network designed specifically for robust AV perception. Building on the efficient single-sensor NeXtDet neural network, NeXtFusion significantly enhances object detection accuracy and tracking. A notable feature of NeXtFusion is its attention module, which refines critical feature representation for object detection, minimizing information loss when processing data from both cameras and radars. Extensive experiments conducted on large-scale datasets such as Argoverse, Microsoft COCO, and nuScenes thoroughly evaluate the capabilities of NeXtDet and NeXtFusion. The results show that NeXtFusion excels in detecting small and distant objects compared to existing methods. Notably, NeXtFusion achieves a state-of-the-art mAP score of 0.473 on the nuScenes validation set, outperforming competitors like OFT by 35.1% and MonoDIS by 9.5%. NeXtFusion's excellence extends beyond mAP scores. It also performs well in other crucial metrics, including mATE (0.449) and mAOE (0.534), highlighting its overall effectiveness in 3D object detection. Visualizations of real-world scenarios from the nuScenes dataset processed by NeXtFusion provide compelling evidence of its capability to handle diverse and challenging environments.
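The abstract describes an attention module that reweights camera and radar features during fusion to minimize information loss. The dissertation's actual NeXtFusion architecture is not reproduced here; the PyTorch sketch below is a minimal, hypothetical illustration of attention-gated camera-radar feature fusion, and all module names, channel sizes, and the channel-gating scheme are assumptions made for illustration only.

```python
# Minimal sketch of attention-gated camera-radar feature fusion in PyTorch.
# Hypothetical stand-in for the kind of attention module the abstract
# describes; not the actual NeXtFusion implementation.
import torch
import torch.nn as nn

class CameraRadarFusion(nn.Module):
    """Fuses spatially aligned camera and radar feature maps using a
    learned per-channel attention gate (assumed design)."""

    def __init__(self, cam_channels: int = 256, radar_channels: int = 64,
                 fused_channels: int = 256):
        super().__init__()
        # Project both modalities into a shared channel space.
        self.cam_proj = nn.Conv2d(cam_channels, fused_channels, kernel_size=1)
        self.radar_proj = nn.Conv2d(radar_channels, fused_channels, kernel_size=1)
        # Channel attention: pool the concatenated features, then predict
        # a per-channel gate in [0, 1] that weights each modality.
        self.attention = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * fused_channels, fused_channels, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(fused_channels, fused_channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, cam_feat: torch.Tensor, radar_feat: torch.Tensor) -> torch.Tensor:
        cam = self.cam_proj(cam_feat)        # (B, C, H, W)
        radar = self.radar_proj(radar_feat)  # (B, C, H, W), assumed pre-aligned
        gate = self.attention(torch.cat([cam, radar], dim=1))  # (B, C, 1, 1)
        # The gate trades off modalities per channel, so information from the
        # more reliable sensor (e.g., radar in fog or darkness) dominates.
        return gate * cam + (1.0 - gate) * radar

if __name__ == "__main__":
    fusion = CameraRadarFusion()
    cam = torch.randn(2, 256, 32, 32)
    radar = torch.randn(2, 64, 32, 32)
    print(fusion(cam, radar).shape)  # torch.Size([2, 256, 32, 32])
```

A soft per-channel gate is one common way to blend modalities without discarding either stream outright; the dissertation itself should be consulted for how NeXtFusion's attention module is actually constructed.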
dc.identifier.uri: https://hdl.handle.net/1805/43087
dc.language.iso: en_US
dc.subject: Artificial Intelligence
dc.subject: AI
dc.subject: CNN
dc.subject: DNN
dc.subject: Neural Network
dc.subject: Object Detection
dc.subject: Sensor Fusion
dc.title: Enhanced 3D Object Detection and Tracking in Autonomous Vehicles: An Efficient Multi-Modal Deep Fusion Approach
dc.type: Thesis
Files

Original bundle
Name: PhD_Thesis.pdf
Size: 10.08 MB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 2.04 KB
Format: Item-specific license agreed upon at submission