Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01p5547v675
Title: RGB-Thermal Fusion for Improved Object Detection
Authors: McManamon, Brendan
Advisors: Heide, Felix
Department: Electrical and Computer Engineering
Class Year: 2023
Abstract: Object detection based on RGB images alone often suffers due to a lack of illumination or other environmental conditions, while thermal infrared cameras generally succeed in those exact challenging scenarios. In the realm of RGB-thermal sensor fusion, previous research has been conducted using a variety of network architectures and fusion techniques. Using a YOLOv7 architecture, this work uses pixel-level early fusion and ensemble late fusion to compare against single-sensor models, trained and evaluated on the Teledyne FLIR dataset. Results indicate that pixel-level fusion significantly outperforms RGB and thermal models while maintaining latency in the 8 ms range on a Tesla V100 with an overall mAP of 92.8%, while the ensemble approach performed below baseline and more than doubled latency.
URI: http://arks.princeton.edu/ark:/88435/dsp01p5547v675
Type of Material: Princeton University Senior Theses
Language: en
Appears in Collections:Electrical and Computer Engineering, 1932-2023

Files in This Item:
File Description SizeFormat 
MCMANAMON-BRENDAN-THESIS.pdf5.6 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.